OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-09 06:25:07 +00:00

Author	SHA1	Message	Date
Nik Everett	25119a7e78	Harden painless test against "fun" caching (#24077 ) The JVM caches `Integer` objects. This is known. A test in Painless was relying on the JVM not caching the particular integer `1000`. It turns out that when you provide `-XX:+AggressiveOpts` the JVM does cache `1000`, causing the test to fail when that is specified. This replaces `1000` with a randomly selected integer that we test to make sure isn't cached by the JVM. Hopefully this test is good enough. It relies on the caching not changing in between when we check that the value isn't cached and when we run the painless code. The cache now is a simple array but there is nothing preventing it from changing. If it does change in a way that thwarts this test then the test fail fail again. At least when that happens the next person can see the comment about how it is important that the integer isn't cached and can follow that line of inquiry. Closes #24041	2017-04-17 13:44:05 -04:00
Jason Tedor	972bdc09ee	Reject empty IDs When indexing a document via the bulk API where IDs can be explicitly specified, we currently accept an empty ID. This is problematic because such a document can not be obtained via the get API. Instead, we should rejected these requets as accepting them could be a dangerous form of leniency. Additionally, we already have a way of specifying auto-generated IDs and that is to not explicitly specify an ID so we do not need a second way. This commit rejects the individual requests where ID is specified but empty. Relates #24118	2017-04-15 10:36:03 -04:00
Jay Modi	30ab8739a6	Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23941 ) This commit makes closing a ReleasableBytesStreamOutput release the underlying BigArray so that we can use try-with-resources with these streams and avoid leaking memory by not returning the BigArray. As part of this change, the ReleasableBytesStreamOutput adds protection to only release the BigArray once. In order to make some of the changes cleaner, the ReleasableBytesStream interface has been removed. The BytesStream interface is changed to a abstract class so that we can use it as a useable return type for a new method, Streams#flushOnCloseStream. This new method wraps a given stream and overrides the close method so that the stream is simply flushed and not closed. This behavior is used in the TcpTransport when compression is used with a ReleasableBytesStreamOutput as we need to close the compressed stream to ensure all of the data is written from this stream. Closing the compressed stream will try to close the underlying stream but we only want to flush so that all of the written bytes are available. Additionally, an error message method added in the BytesRestResponse did not use a builder provided by the channel and instead created its own JSON builder. This changes that method to use the channel builder and in turn the bytes stream output that is managed by the channel. Note, this commit differs from 6bfecdf921a1941b48273d76551872df4062cfae in that it updates ReleasableBytesStreamOutput to handle the case of the BigArray decreasing in size, which changes the reference to the BigArray. When the reference is changed, the releasable needs to be updated otherwise there could be a leak of bytes and corruption of data in unrelated streams. This reverts commit afd45c14327cd0f8d155e5ac9740f48e8e39b09c, which reverted #23572.	2017-04-14 10:50:31 -04:00
Tim Brooks	ffaac5a08a	Simplify BulkProcessor handling and retry logic (#24051 ) This commit collapses the SyncBulkRequestHandler and AsyncBulkRequestHandler into a single BulkRequestHandler. The new handler executes a bulk request and awaits for the completion if the BulkProcessor was configured with a concurrentRequests setting of 0. Otherwise the execution happens asynchronously. As part of this change the Retry class has been refactored. withSyncBackoff and withAsyncBackoff have been replaced with two versions of withBackoff. One method takes a listener that will be called on completion. The other method returns a future that will been complete on request completion.	2017-04-13 14:48:52 -05:00
Nik Everett	e99f90fb46	Add more debugging information to rethrottles I'm still trying to track down failures like: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+dockeralpine-periodic/1180/console It looks like a task is hanging but I'm not sure why. So this adds more logging for next time.	2017-04-12 08:37:31 -04:00
Jason Tedor	653619079c	Skip two Painless branch tests on Windows This commit skips the two Painless tests EqualsTests#testBranchEqualsDefAndPrimitive and EqualsTests#testBranchNotEqualsDefAndPrimitive on Windows as the tests are repeatedly failing there.	2017-04-11 06:19:42 -04:00
Colin Goodheart-Smithe	0114f0061c	Removes version 2.x constants from Version (#24011 ) * Removes version 2.x constants from Version Closes #21887 * Addresses review comments	2017-04-11 08:31:22 +01:00
Luca Cavanna	2c545c064d	Move getProperty method out of MultiBucketsAggregation.Bucket interface (#23988 ) The getProperty method is an internal method needed to run pipeline aggregations and retrieve info by path from the aggs tree. It is not needed in the MultiBucketsAggregation.Bucket interface, which is returned to users running aggregations from the transport client. The method is moved to the InternalMultiBucketAggregation class as that's where it belongs.	2017-04-10 13:35:01 +02:00
Nik Everett	de6837b7ac	Fix throttled reindex_from_remote (#23953 ) reindex_from_remote was using `TimeValue#toString` to generate the scroll timeout which is bad because that generates fractional time values that are useful for people but bad for Elasticsearch which doesn't like to parse them. This switches it to using `TimeValue#getStringRep` which spits out whole time values. Closes to #23945 Makes #23828 even more desirable	2017-04-07 15:56:52 -04:00
Martijn van Groningen	3d9671a668	[PERCOLATOR] Allowing range queries with now ranges inside percolator queries. Before now ranges where forbidden, because the percolator query itself could get cached and then the percolator queries with now ranges that should no longer match, incorrectly will continue to match. By disabling caching when the `percolator` is being used, the percolator can now correctly support range queries with now based ranges. I think this is the right tradeoff. The percolator query is likely to not be the same between search requests and disabling range queries with now ranges really disabled people using the percolator for their use cases. Also fixed an issue that existed in the percolator fieldmapper, it was unable to find forbidden queries inside `dismax` queries. Closes #23859	2017-04-07 08:44:43 +02:00
Tim Brooks	5b1fbe5e6c	Decouple BulkProcessor from client implementation (#23373 ) This commit modifies the BulkProcessor to be decoupled from the client implementation. Instead it just takes a BiConsumer<BulkRequest, ActionListener<BulkResponse>> that executes the BulkRequest.	2017-04-05 12:12:43 -05:00
Jason Tedor	afd45c1432	Revert "Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 )" This reverts commit 6bfecdf921a1941b48273d76551872df4062cfae.	2017-04-04 20:33:51 -04:00
Jay Modi	6bfecdf921	Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 ) This commit makes closing a ReleasableBytesStreamOutput release the underlying BigArray so that we can use try-with-resources with these streams and avoid leaking memory by not returning the BigArray. As part of this change, the ReleasableBytesStreamOutput adds protection to only release the BigArray once. In order to make some of the changes cleaner, the ReleasableBytesStream interface has been removed. The BytesStream interface is changed to a abstract class so that we can use it as a useable return type for a new method, Streams#flushOnCloseStream. This new method wraps a given stream and overrides the close method so that the stream is simply flushed and not closed. This behavior is used in the TcpTransport when compression is used with a ReleasableBytesStreamOutput as we need to close the compressed stream to ensure all of the data is written from this stream. Closing the compressed stream will try to close the underlying stream but we only want to flush so that all of the written bytes are available. Additionally, an error message method added in the BytesRestResponse did not use a builder provided by the channel and instead created its own JSON builder. This changes that method to use the channel builder and in turn the bytes stream output that is managed by the channel.	2017-04-04 17:01:30 +01:00
Jason Tedor	3136ed1490	Rename random ASCII helper methods This commit renames the random ASCII helper methods in ESTestCase. This is because this method ultimately uses the random ASCII methods from randomized runner, but these methods actually only produce random strings generated from [a-zA-Z]. Relates #23886	2017-04-04 11:04:18 -04:00
Nik Everett	ebd74f09cf	Add extra debugging to reindex cancel tests Adds more diagnostics when reindex's cancel tests fail. It fails every once in a while and didn't have useful failure messages: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.3+multijob-unix-compatibility/os=amazon/86/consoleFull	2017-03-31 11:17:06 -04:00
Jim Ferenczi	f3a925fdbe	Fix reindex with a remote source on a version before 2.0.0 (#23805 ) Send the scroll id in the body as plain text when the remote version is before 2.0.0	2017-03-31 09:07:43 +02:00
Tim Brooks	5fa80a6521	Pass exception from sendMessage to listener (#23559 ) This commit changes the listener passed to sendMessage from a Runnable to a ActionListener. This change also removes IOException from the sendMessage signature. That signature is misleading as it allows implementers to assume an exception will be thrown in case of failure. That does not happen due to Netty's async nature.	2017-03-30 15:08:23 -05:00
Dimitris Athanasiou	34f116eae3	Require explicit query in _delete_by_query API (#23632 ) As the query of a search request defaults to match_all, calling _delete_by_query without an explicit query may result in deleting all data. In order to protect users against falling into that pitfall, this commit adds a check to require the explicit setting of a query. Closes #23629	2017-03-28 15:44:57 +01:00
Jim Ferenczi	0e95c90e9f	Upgrade to Lucene 6.5.0 (#23750 )	2017-03-27 15:57:54 +02:00
Ryan Ernst	4cb8a0100c	Build: Rewrite antlr regeneration in gradle (#23733 ) This change ports the regeneration of antlr parser/lexer into gradle (but does still take advantage of ant calls where appropriate).	2017-03-24 09:44:53 -07:00
Ryan Ernst	8c53555b28	Tests: Use local clone build of 5.x with bwc tests (#22946 ) The current rest backcompat tests, which run against a mixed cluster of 5.x and 6.0 nodes, depend on snapshot builds of 5.x. However, this has the potential for inconsistency that results in CI failures, and happens quite often, whenever some backcompat logic is added to 5.x, but the bwc test on master fails because the 5.x code has not yet been published as a snapshot. This change creates a git clone of the 5.x branch, builds the zip distribution, and ties that into gradle substitutions for the 5.x version.	2017-03-23 22:32:13 -07:00
AdityaJNair	63757efe9c	Remove DocumentMapper#parse(String index, String type, String id, BytesReference source) (#23706 ) Removed `parse(String index, String type, String id, BytesReference source)` in DocumentMapper.java and replaced all of its use in Test files with `parse(SourceToParse source)`. `parse(String index, String type, String id, BytesReference source)` was only used in test files and never in the main code so it was removed. All of the test files that used it was then modified to use `parse(SourceToParse source)` method that existing in DocumentMapper.java	2017-03-23 11:01:09 -04:00
Nik Everett	257a7d77ed	Painless: Fix regex lexer and error messages (#23634 ) Without this change, if write a script with multiple regexes sometimes the lexer will decide to look at them like one big regex and then some trailing garbage. Like this discuss post: https://discuss.elastic.co/t/error-with-the-split-function-in-painless-script/79021 ``` def val = /\\\\/.split(ctx._source.event_data.param17); if (val[2] =~ /\\./) { def val2 = /\\./.split(val[2]); ctx._source['user_crash'] = val2[0] } else { ctx._source['user_crash'] = val[2] } ``` The error message you get from the lexer is `lexer_no_viable_alt_exception` right after the second regex. With this change each regex is just a single regex like it ought to be. As a bonus, while looking into this issue I found that the error reporting for regexes wasn't very nice. If you specify an invalid pattern then you get an error marker on the start of the pattern with the JVM's regex error message which attempts to point you to the location in the regex but is totally unreadable in the JSON response. This change fixes the location to point to the appropriate spot inside the pattern and removes the portion of the JVM's error message that doesn't render well. It is no longer needed now that we point users to the appropriate spot in the pattern.	2017-03-22 15:56:17 -04:00
Nik Everett	bc65be2a65	Reindex: wait for cleanup before responding (#23677 ) Changes reindex and friends to wait until the entire request has been "cleaned up" before responding. "Clean up" in this context is clearing the scroll and (for reindex-from-remote) shutting down the client. Failures to clean up are still only logged, not returned to the user. Closes #23653	2017-03-21 15:33:39 -04:00
Jason Tedor	8dfb68cf1c	Upgrade to Netty 4.1.9 This commit upgrades the Netty dependencies from version 4.1.8 to version 4.1.9. This commit picks up a few bug fixes that impacted us: - Netty was incorrectly ignoring interfaces with self-assigned MAC addresses (e.g., instances running in Docker containers or on EC2) - incorrect handling of the Expect: 100-continue header Relates #23540	2017-03-11 18:28:31 -08:00
Daniel Mitterdorfer	6f7cd71e1f	Adjust default Netty receive predictor size to 64k (#23542 ) With this commit we change the default receive predictor size for Netty from 32kB to 64kB as our testing has shown that this leads to less allocations on smaller heaps like the default out of the box configuration and this value also works reasonably well for larger heaps. Closes #23185	2017-03-11 17:32:35 -08:00
Jason Tedor	8e09eca9a6	Mute Painless lambda tests on JDK 9 This commit mutes a ton of Painless lambda tests on JDK 9. This commit did not attempt to discover exactly which tests are failing, but instead just blanket muted all tests in LambdaTests, FunctionRefTests, and AugmentationTests. Relates #23473	2017-03-02 22:36:26 -05:00
Jay Modi	01502893eb	HTTP transport stashes the ThreadContext instead of the RestController (#23456 ) Previously, the RestController would stash the context prior to copying headers. However, there could be deprecation log messages logged and in turn warning headers being added to the context prior to the stashing of the context. These headers in the context would then be removed from the request and also leaked back into the calling thread's context. This change moves the stashing of the context to the HttpTransport so that the network threads' context isn't accidentally populated with warning headers and to ensure the headers added early on in the RestController are not excluded from the response.	2017-03-02 14:44:01 -05:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Nik Everett	2dcdaa1c9d	Mustache: don't extend AbstractComponent (#23419 ) Don't extend `AbstractComponent` in `MustacheScriptEngine` because it doesn't buy anything.	2017-03-01 14:54:27 -05:00
Ryan Ernst	019263d664	Revert "Internal: Change version constant names for already released versions (#23416 )" This reverts commit dc0e93ed6238e5df6c66206b9f31cdc162db166a.	2017-02-28 14:45:13 -08:00
Ryan Ernst	dc0e93ed62	Internal: Change version constant names for already released versions (#23416 ) We have many version constants in master that have already been released, but are still marked (by naming convention) as unreleased. This commit renames those version constants.	2017-02-28 13:05:44 -08:00
Tanguy Leroux	33eb6a13bf	Tests: Fix RemoteScrollableHitSourceTests With #23307, the expected exception is wrapped two times into a RuntimeException instead of being thrown directly.	2017-02-28 11:30:33 +01:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
javanna	9a2dba3036	[TEST] add support for binary responses to REST tests infra	2017-02-27 12:27:03 +01:00
javanna	dad025a6ad	[TEST] move test for binary field to specific test file that sets Content-Type header explicitly	2017-02-27 12:27:03 +01:00
Ryan Ernst	9df95def90	Build: Remove extra copies of netty license (#23361 ) The dependencyLicenses check has the ability to map multiple jar files to the same license file. However, netty was not taking advantage of this, and had duplicate copies of its license/notice files for each jar. This commit reduces the copies to one and uses the mapping feature.	2017-02-24 14:40:07 -08:00
Jason Tedor	f85a7aed37	Keep the pipeline handler queue small initially This commit sets the intial size of the pipeline handler queue small to prevent waste if pipelined requests are never sent. Since the queue will grow quickly if pipeline requests are indeed set, this should not be problematic. Relates #23335	2017-02-23 14:17:46 -05:00
sabi0	09b3c7f270	Do not create String instances in 'Strings' methods accepting StringBuilder (#22907 )	2017-02-23 10:57:34 -08:00
Christoph Büscher	8b1b152e91	Remove abstract InternalMetricsAggregation class (#23326 ) This class doesn't seem to do much other than to group together certain types of aggregations.	2017-02-23 18:03:40 +01:00
Jason Tedor	3e69c38dbd	Respect promises on pipelined responses When pipelined responses are sent to the pipeline handler for writing, they are not necessarily written immediately. They must be held in a priority queue until all responses preceding the given response are written. This means that when write is invoked on the handler, the promise that is attached to the write invocation will not necessarily be the promise associated with the responses that are written while the queue is drained. To address this, the promise associated with a pipelined response must be held with the response and then used when the channel context is actually written to. This was introduced when ensuring that the releasing promise is always chained through on write calls lest the releasing promise never be invoked. This leads to many failing test cases, so no new test cases are needed here. Relates #23317	2017-02-23 09:32:43 -05:00
Jason Tedor	6ca90a61a6	Relocate a comment in HttpPipeliningHandler This commit moves a comment in HttpPipeliningHandler as it makes more sense for this comment to be where the field that it is explaining is declared.	2017-02-22 20:51:18 -05:00
Jason Tedor	30f723d2b0	Add comments to HttpPipeliningHandler This commit adds some comments explaining the design of HttpPipeliningHandler.	2017-02-22 20:47:34 -05:00
Ryan Ernst	175bda64a0	Build: Rework integ test setup and shutdown to ensure stop runs when desired (#23304 ) Gradle's finalizedBy on tasks only ensures one task runs after another, but not immediately after. This is problematic for our integration tests since it allows multiple project's integ test clusters to be simultaneously. While this has not been a problem thus far (gradle 2.13 happened to keep the finalizedBy tasks close enough that no clusters were running in parallel), with gradle 3.3 the task graph generation has changed, and numerous clusters may be running simultaneously, causing memory pressure, and thus generally slower tests, or even failure if the system has a limited amount of memory (eg in a vagrant host). This commit reworks how integ tests are configured. It adds an `integTestCluster` extension to gradle which is equivalent to the current `integTest.cluster` and moves the rest test runner task to `integTestRunner`. The `integTest` task is then just a dummy task, which depends on the cluster runner task, as well as the cluster stop task. This means running `integTest` in one project will both run the rest tests, and shut down the cluster, before running `integTest` in another project.	2017-02-22 12:43:15 -08:00
Jason Tedor	708d11f54a	Ensure that releasing listener is called When sending a response to a client, we attach a releasing listener to the channel promise. If the client disappears before the response is sent, the releasing listener was never notified. The reason the listeners were never notified was due to a mistaken invocation of write and flush on the channel which has two overrides: one that takes an existing promise, and one that does not and instead creates a new promise. When the client disappears, it is this latter promise that is notified, which does not contain the releasing listener. This commit addreses this issue by invoking the override that passes our channel promise through. Relates #23310	2017-02-22 13:54:17 -05:00
javanna	594f00c582	Remove content type auto-detection from search templates Now that search templates always get converted to json, we don't need to try and auto-detect their content-type, which anyways didn't work as expected before given that only json was really working.	2017-02-22 16:20:53 +01:00
javanna	f2acf466aa	Convert script/template objects to json format Elasticsearch accepts multiple content-type formats, hence scripts can be stored/provided in json, yaml, cbor or smile. Yet the format that should be used internally is json. This is a problem mainly around search templates, as they only support json out of the four content-types, so instead of maintaining the content-type of the request we should rather convert the scripts/templates to json. Binary formats were not previously supported. If you stored a template in yaml format, you'd get back an error "No encoder found for MIME type [application/yaml]" when trying to execute it. With this commit the request content-type is independent from the template, which always gets converted to json internally. That is transparent to users and doesn't affect the content type of the response obtained when executing the template.	2017-02-22 16:20:53 +01:00
javanna	9391c6ffa9	Replace CustomMustacheFactory constant with same constant from Script (CONTENT_TYPE_OPTION)	2017-02-22 16:20:53 +01:00
Nik Everett	38d25a0369	Fix Painless's implementation of interfaces returning primitives (#23298 ) Fixes Painless to properly implement scripts that return primitives and void. Adds some simple tests that we emit sane opcodes and some other tests that we implement primitives as expected. Mostly this is just a fix following up from #22983 but there is one thing I did really worth talking about, I think. So, before this script Painless scripts could only ever return Object and they did would always return null for paths that didn't return any values. Now that they can return primitives the question is "what should Painless return from paths that don't return any values?" And I answered that with "whatever the JLS default value is". So 0/0L/0f/0d/false.	2017-02-21 17:10:55 -05:00
Martijn van Groningen	81d53470e7	percolator: add support for term extraction for MultiPhraseQuery	2017-02-21 21:10:55 +01:00

1 2 3 4 5 ...

3943 Commits