OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	257a7d77ed	Painless: Fix regex lexer and error messages (#23634 ) Without this change, if write a script with multiple regexes sometimes the lexer will decide to look at them like one big regex and then some trailing garbage. Like this discuss post: https://discuss.elastic.co/t/error-with-the-split-function-in-painless-script/79021 ``` def val = /\\\\/.split(ctx._source.event_data.param17); if (val[2] =~ /\\./) { def val2 = /\\./.split(val[2]); ctx._source['user_crash'] = val2[0] } else { ctx._source['user_crash'] = val[2] } ``` The error message you get from the lexer is `lexer_no_viable_alt_exception` right after the second regex. With this change each regex is just a single regex like it ought to be. As a bonus, while looking into this issue I found that the error reporting for regexes wasn't very nice. If you specify an invalid pattern then you get an error marker on the start of the pattern with the JVM's regex error message which attempts to point you to the location in the regex but is totally unreadable in the JSON response. This change fixes the location to point to the appropriate spot inside the pattern and removes the portion of the JVM's error message that doesn't render well. It is no longer needed now that we point users to the appropriate spot in the pattern.	2017-03-22 15:56:17 -04:00
Nik Everett	bc65be2a65	Reindex: wait for cleanup before responding (#23677 ) Changes reindex and friends to wait until the entire request has been "cleaned up" before responding. "Clean up" in this context is clearing the scroll and (for reindex-from-remote) shutting down the client. Failures to clean up are still only logged, not returned to the user. Closes #23653	2017-03-21 15:33:39 -04:00
Jason Tedor	8dfb68cf1c	Upgrade to Netty 4.1.9 This commit upgrades the Netty dependencies from version 4.1.8 to version 4.1.9. This commit picks up a few bug fixes that impacted us: - Netty was incorrectly ignoring interfaces with self-assigned MAC addresses (e.g., instances running in Docker containers or on EC2) - incorrect handling of the Expect: 100-continue header Relates #23540	2017-03-11 18:28:31 -08:00
Daniel Mitterdorfer	6f7cd71e1f	Adjust default Netty receive predictor size to 64k (#23542 ) With this commit we change the default receive predictor size for Netty from 32kB to 64kB as our testing has shown that this leads to less allocations on smaller heaps like the default out of the box configuration and this value also works reasonably well for larger heaps. Closes #23185	2017-03-11 17:32:35 -08:00
Jason Tedor	8e09eca9a6	Mute Painless lambda tests on JDK 9 This commit mutes a ton of Painless lambda tests on JDK 9. This commit did not attempt to discover exactly which tests are failing, but instead just blanket muted all tests in LambdaTests, FunctionRefTests, and AugmentationTests. Relates #23473	2017-03-02 22:36:26 -05:00
Jay Modi	01502893eb	HTTP transport stashes the ThreadContext instead of the RestController (#23456 ) Previously, the RestController would stash the context prior to copying headers. However, there could be deprecation log messages logged and in turn warning headers being added to the context prior to the stashing of the context. These headers in the context would then be removed from the request and also leaked back into the calling thread's context. This change moves the stashing of the context to the HttpTransport so that the network threads' context isn't accidentally populated with warning headers and to ensure the headers added early on in the RestController are not excluded from the response.	2017-03-02 14:44:01 -05:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Nik Everett	2dcdaa1c9d	Mustache: don't extend AbstractComponent (#23419 ) Don't extend `AbstractComponent` in `MustacheScriptEngine` because it doesn't buy anything.	2017-03-01 14:54:27 -05:00
Ryan Ernst	019263d664	Revert "Internal: Change version constant names for already released versions (#23416 )" This reverts commit `dc0e93ed62`.	2017-02-28 14:45:13 -08:00
Ryan Ernst	dc0e93ed62	Internal: Change version constant names for already released versions (#23416 ) We have many version constants in master that have already been released, but are still marked (by naming convention) as unreleased. This commit renames those version constants.	2017-02-28 13:05:44 -08:00
Tanguy Leroux	33eb6a13bf	Tests: Fix RemoteScrollableHitSourceTests With #23307, the expected exception is wrapped two times into a RuntimeException instead of being thrown directly.	2017-02-28 11:30:33 +01:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
javanna	9a2dba3036	[TEST] add support for binary responses to REST tests infra	2017-02-27 12:27:03 +01:00
javanna	dad025a6ad	[TEST] move test for binary field to specific test file that sets Content-Type header explicitly	2017-02-27 12:27:03 +01:00
Ryan Ernst	9df95def90	Build: Remove extra copies of netty license (#23361 ) The dependencyLicenses check has the ability to map multiple jar files to the same license file. However, netty was not taking advantage of this, and had duplicate copies of its license/notice files for each jar. This commit reduces the copies to one and uses the mapping feature.	2017-02-24 14:40:07 -08:00
Jason Tedor	f85a7aed37	Keep the pipeline handler queue small initially This commit sets the intial size of the pipeline handler queue small to prevent waste if pipelined requests are never sent. Since the queue will grow quickly if pipeline requests are indeed set, this should not be problematic. Relates #23335	2017-02-23 14:17:46 -05:00
sabi0	09b3c7f270	Do not create String instances in 'Strings' methods accepting StringBuilder (#22907 )	2017-02-23 10:57:34 -08:00
Christoph Büscher	8b1b152e91	Remove abstract InternalMetricsAggregation class (#23326 ) This class doesn't seem to do much other than to group together certain types of aggregations.	2017-02-23 18:03:40 +01:00
Jason Tedor	3e69c38dbd	Respect promises on pipelined responses When pipelined responses are sent to the pipeline handler for writing, they are not necessarily written immediately. They must be held in a priority queue until all responses preceding the given response are written. This means that when write is invoked on the handler, the promise that is attached to the write invocation will not necessarily be the promise associated with the responses that are written while the queue is drained. To address this, the promise associated with a pipelined response must be held with the response and then used when the channel context is actually written to. This was introduced when ensuring that the releasing promise is always chained through on write calls lest the releasing promise never be invoked. This leads to many failing test cases, so no new test cases are needed here. Relates #23317	2017-02-23 09:32:43 -05:00
Jason Tedor	6ca90a61a6	Relocate a comment in HttpPipeliningHandler This commit moves a comment in HttpPipeliningHandler as it makes more sense for this comment to be where the field that it is explaining is declared.	2017-02-22 20:51:18 -05:00
Jason Tedor	30f723d2b0	Add comments to HttpPipeliningHandler This commit adds some comments explaining the design of HttpPipeliningHandler.	2017-02-22 20:47:34 -05:00
Ryan Ernst	175bda64a0	Build: Rework integ test setup and shutdown to ensure stop runs when desired (#23304 ) Gradle's finalizedBy on tasks only ensures one task runs after another, but not immediately after. This is problematic for our integration tests since it allows multiple project's integ test clusters to be simultaneously. While this has not been a problem thus far (gradle 2.13 happened to keep the finalizedBy tasks close enough that no clusters were running in parallel), with gradle 3.3 the task graph generation has changed, and numerous clusters may be running simultaneously, causing memory pressure, and thus generally slower tests, or even failure if the system has a limited amount of memory (eg in a vagrant host). This commit reworks how integ tests are configured. It adds an `integTestCluster` extension to gradle which is equivalent to the current `integTest.cluster` and moves the rest test runner task to `integTestRunner`. The `integTest` task is then just a dummy task, which depends on the cluster runner task, as well as the cluster stop task. This means running `integTest` in one project will both run the rest tests, and shut down the cluster, before running `integTest` in another project.	2017-02-22 12:43:15 -08:00
Jason Tedor	708d11f54a	Ensure that releasing listener is called When sending a response to a client, we attach a releasing listener to the channel promise. If the client disappears before the response is sent, the releasing listener was never notified. The reason the listeners were never notified was due to a mistaken invocation of write and flush on the channel which has two overrides: one that takes an existing promise, and one that does not and instead creates a new promise. When the client disappears, it is this latter promise that is notified, which does not contain the releasing listener. This commit addreses this issue by invoking the override that passes our channel promise through. Relates #23310	2017-02-22 13:54:17 -05:00
javanna	594f00c582	Remove content type auto-detection from search templates Now that search templates always get converted to json, we don't need to try and auto-detect their content-type, which anyways didn't work as expected before given that only json was really working.	2017-02-22 16:20:53 +01:00
javanna	f2acf466aa	Convert script/template objects to json format Elasticsearch accepts multiple content-type formats, hence scripts can be stored/provided in json, yaml, cbor or smile. Yet the format that should be used internally is json. This is a problem mainly around search templates, as they only support json out of the four content-types, so instead of maintaining the content-type of the request we should rather convert the scripts/templates to json. Binary formats were not previously supported. If you stored a template in yaml format, you'd get back an error "No encoder found for MIME type [application/yaml]" when trying to execute it. With this commit the request content-type is independent from the template, which always gets converted to json internally. That is transparent to users and doesn't affect the content type of the response obtained when executing the template.	2017-02-22 16:20:53 +01:00
javanna	9391c6ffa9	Replace CustomMustacheFactory constant with same constant from Script (CONTENT_TYPE_OPTION)	2017-02-22 16:20:53 +01:00
Nik Everett	38d25a0369	Fix Painless's implementation of interfaces returning primitives (#23298 ) Fixes Painless to properly implement scripts that return primitives and void. Adds some simple tests that we emit sane opcodes and some other tests that we implement primitives as expected. Mostly this is just a fix following up from #22983 but there is one thing I did really worth talking about, I think. So, before this script Painless scripts could only ever return Object and they did would always return null for paths that didn't return any values. Now that they can return primitives the question is "what should Painless return from paths that don't return any values?" And I answered that with "whatever the JLS default value is". So 0/0L/0f/0d/false.	2017-02-21 17:10:55 -05:00
Martijn van Groningen	81d53470e7	percolator: add support for term extraction for MultiPhraseQuery	2017-02-21 21:10:55 +01:00
Nik Everett	9105672969	Allow painless to implement more interfaces (#22983 ) Generalizes three previously hard coded things in painless into generic concepts: 1. The "main method" is no longer hardcoded to: ``` public abstract Object execute(Map<String, Object> params, Scorer scorer, LeafDocLookup doc, Object value); ``` Instead Painless's compiler takes an interface and implements it. It looks like: ``` public interface SomeScript { // Argument names we expose to Painless scripts String[] ARGUMENTS = new String[] {"a", "b"}; // Method implemented by Painless script. Must be named execute but can have any parameters or return any value. Object execute(String a, int b); // Is the "a" argument used by the script? boolean uses$a(); } SomeScript script = scriptEngine.compile(SomeScript.class, null, "the_script_here", emptyMap()); Object result = script.execute("a", 1); ``` `PainlessScriptEngine` now compiles all scripts to the new `GenericElasticsearchScript` interface by default for compatibility with the rest of Elasticsearch until it is able to use this new ability. 2. `_score` and `ctx` are no longer hardcoded to be extracted from `#score` and `params` respectively. Instead Painless's default implementation of Elasticsearch scripts uses the `uses$_score` and `uses$ctx` methods to determine if it is used and gives them dummy values if they are not used. 3. Throwing the `ScriptException` is now handled by the Painless script itself. That way Painless doesn't have to leak the metadata that is required to build the fancy stack trace. And all painless scripts get the fancy stack trace.	2017-02-21 14:08:57 -05:00
Jack Conradson	fac2d954e3	Fix certain bad casts in Painless due to boxing/unboxing. (#23282 )	2017-02-21 10:23:27 -08:00
Daniel Mitterdorfer	0744a00001	Set network receive predictor size to 32kb (#23284 ) Previously we calculated Netty' receive predictor size for HTTP and transport traffic based on available memory and worker nodes. This resulted in a receive predictor size between 64kb and 512kb. In our benchmarks this leads to increased GC pressure. With this commit we set Netty's receive predictor size to 32kb. This value is in a sweet spot between heap memory waste (-> GC pressure) and effect on request metrics (achieved throughput and latency numbers). Closes #23185	2017-02-21 14:45:33 +01:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Jason Tedor	0a5917d182	Fix get HEAD requests Get HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23186	2017-02-15 13:07:29 -05:00
Ryan Ernst	79a1629f74	Fix line length	2017-02-14 21:23:21 -08:00
Jason Tedor	9e80e290d6	Add failing tests for expect header violations This commit adds unit tests for two cases where Elasticsearch violates expect header handling. These tests are marked as awaits fix. Relates #23173	2017-02-14 19:24:22 -05:00
Jason Tedor	673754b1d5	Fix get source HEAD requests Get source HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get source HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23151	2017-02-14 16:37:22 -05:00
Martijn van Groningen	cab43707dc	[percolator] Removed old 2.x bwc logic.	2017-02-14 22:17:17 +01:00
Simon Willnauer	aef0665ddb	Detach SearchPhases from AbstractSearchAsyncAction (#23118 ) Today all search phases are inner classes of AbstractSearchAsyncAction or one of it's subclasses. This makes unit testing of these classes practically impossible. This commit Extracts `DfsQueryPhase` and `FetchSearchPhase` or of the code that composes the actual query execution types and moves most of the fan-out and collect code into an `InitialSearchPhase` class that can be used to build initial search phases (phases that retry on shards). This will make modification to these classes simpler and allows to easily compose or add new search phases down the road if additional roundtrips are required.	2017-02-14 12:34:25 +01:00
Jason Tedor	5343b87502	Handle bad HTTP requests When Netty decodes a bad HTTP request, it marks the decoder result on the HTTP request as a failure, and reroutes the request to GET /bad-request. This either leads to puzzling responses when a bad request is sent to Elasticsearch (if an index named "bad-request" does not exist then it produces an index not found exception and otherwise responds with the index settings for the index named "bad-request"). This commit addresses this by inspecting the decoder result on the HTTP request and dispatching the request to a bad request handler preserving the initial cause of the bad request and providing an error message to the client. Relates #23153	2017-02-13 17:39:25 -05:00
Jay Modi	61e383813d	Make the version of the remote node accessible on a transport channel (#23019 ) This commit adds a new method to the TransportChannel that provides access to the version of the remote node that the response is being sent on and that the request came from. This is helpful for serialization of data attached as headers.	2017-02-13 15:15:57 -05:00
jaymode	d8d03f45c2	Fix communication with 5.3.0 nodes This commit fixes communication with 5.3.0 nodes to send XContentType to these nodes since #22691 was backported to the 5.3 branch.	2017-02-13 13:15:51 -05:00
Jason Tedor	0f21ed5b70	Fix template HEAD requests Template HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for template HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23130	2017-02-11 18:30:16 -05:00
Jason Tedor	a6158398dd	Fix index HEAD requests Index HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for index HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23112	2017-02-10 09:44:01 -05:00
Jason Tedor	7ac44656df	Fix alias HEAD requests Alias HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for alias HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23094	2017-02-10 09:19:35 -05:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Tanguy Leroux	e2e5937455	Use `typed_keys` parameter to prefix suggester names by type in search responses (#23080 ) This pull request reuses the typed_keys parameter added in #22965, but this time it applies it to suggesters. When set to true, the suggester names in the search response will be prefixed with a prefix that reflects their type.	2017-02-10 10:53:38 +01:00
Nik Everett	0250c7ab18	Fix reindex test after toString change Weakens the assertion on wait_for_active_shards so that we don't check the toString of the bulk request because it isn't important. Relates to #22900	2017-02-09 16:48:40 -05:00
Tim Brooks	a331405aff	Isolated SocketPermissions to Netty (#23057 ) Netty 4.1.8 wraps connect and accept operations in doPrivileged blocks. This means that we not need to give permissions to the entire transport module. Additionally this commit deletes the privileged socket channel and privileged server socket chanel.	2017-02-09 10:00:25 -06:00
Tanguy Leroux	3553522328	Add parameter to prefix aggs name with type in search responses (#22965 ) This pull request adds a new parameter to the REST Search API named `typed_keys`. When set to true, the aggregation names in the search response will be prefixed with a prefix that reflects the internal type of the aggregation. Here is a simple example: ``` GET /_search?typed_keys { "aggs": { "tweets_per_user": { "terms": { "field": "user" } } }, "size": 0 } ``` And the response: ``` { "aggs": { "sterms:tweets_per_user": { ... } } } ``` This parameter is intended to make life easier for REST clients that could parse back the prefix and could detect the type of the aggregation to parse. It could also be implemented for suggesters.	2017-02-09 11:19:04 +01:00
Tim Brooks	735e5b1983	Upgrade to Netty 4.1.8 (#23055 ) This commit upgrades the Netty dependency to version 4.1.8.Final.	2017-02-08 11:44:36 -06:00
Simon Willnauer	ecb01c15b9	Fold InternalSearchHits and friends into their interfaces (#23042 ) We have a bunch of interfaces that have only a single implementation for 6 years now. These interfaces are pretty useless from a SW development perspective and only add unnecessary abstractions. They also require lots of casting in many places where we expect that there is only one concrete implementation. This change removes the interfaces, makes all of the classes final and removes the duplicate `foo` `getFoo` accessors in favor of `getFoo` from these classes.	2017-02-08 14:40:08 +01:00
Tim Brooks	fcc568fd8d	Add methods requiring connect to forbidden apis (#22964 ) This is related to #22116. This commit adds calls that require SocketPermission connect to forbidden APIs. The following calls are now forbidden: - java.net.URL#openStream() - java.net.URLConnection#connect() - java.net.URLConnection#getInputStream() - java.net.Socket#connect(java.net.SocketAddress) - java.net.Socket#connect(java.net.SocketAddress, int) - java.nio.channels.SocketChannel#open(java.net.SocketAddress) - java.nio.channels.SocketChannel#connect(java.net.SocketAddress)	2017-02-07 14:41:50 -06:00
Boaz Leskes	ba06c14a97	TransportService.connectToNode should validate remote node ID (#22828 ) #22194 gave us the ability to open low level temporary connections to remote node based on their address. With this use case out of the way, actual full blown connections should validate the node on the other side, making sure we speak to who we think we speak to. This helps in case where multiple nodes are started on the same host and a quick node restart causes them to swap addresses, which in turn can cause confusion down the road.	2017-02-07 22:11:32 +02:00
Tim Brooks	27b7d9bd8d	Add FileSystemUtil method to read 'file:/' URLs (#23020 ) As part of #22116 we are going to forbid usage of api java.net.URL#openStream(). However in a number of places across the we use this method to read files from the local filesystem. This commit introduces a helper method openFileURLStream(URL url) to read files from URLs. It does specific validation to only ensure that file:/ urls are read. Additionlly, this commit removes unneeded method FileSystemUtil.newBufferedReader(URL, Charset). This method used the openStream () method which will soon be forbidden. Instead we use the Files.newBufferedReader(Path, Charset).	2017-02-07 10:24:22 -06:00
Jay Modi	c898e8ab83	Add support for newline delimited JSON Content-Type (#22947 ) This commit adds support for the newline delimited JSON Content-Type, which is how the bulk, multi-search, and multi-search template APIs expect data to be formatted. The `elasticsearch-js` client has also been using this content type for these types of requests. Closes #22943	2017-02-07 09:20:06 -05:00
Nik Everett	0d6e622242	Make dates be ReadableDateTimes in scripts (#22948 ) Instead of longs. If you want millis since epoch you can call doc.date_field.value.millis. Relates to #22875	2017-02-06 16:44:56 -05:00
Nicholas Knize	1c9fdfd1b3	Remove GeoPointFieldMapper abstraction In order to support the evolving GeoPoint encodings in Lucene 5 and 6, ES 2.x and 5.x implements an abstraction layer to the GeoPointFieldMapper classes. As of 5.x the geo_point field mapper settled on using Lucene's more performant LatLonPoint field type and deprecated all other encodings. In 6.0 all encodings except LatLonPoint have been removed rendering this abstraction layer useless. This commit removes the abstraction layer and renames the LatLonPointFieldMapper back to GeoPointFieldMapper to mantain consistency with ES field naming.	2017-02-06 14:17:21 -06:00
Adrien Grand	c8496fc4f4	Upgrade to Lucene 6.4.1. (#22978 )	2017-02-06 09:28:43 +01:00
Nik Everett	b0c9759441	Painless: Don't allow casting from void to def (#22969 ) Painless can cast anything into the magic type `def` but it really shouldn't try to cast nothing into `def`. That causes the byte code generation library to freak out a little. Closes #22908	2017-02-03 16:38:47 -05:00
Nik Everett	9ca871af7e	Test: weaken assertion in fix sliced reindex test This test was using initial count of slices instead of the count of unfinished slices to pick the expected throttle. Unfortunely due to race conditions the actual rethrottle count is between the two. So we weaken the assertion from "the new throttle is exactly X" to "the new throttle is between X and Y (inclusive)".	2017-02-03 13:00:49 -05:00
Tim Brooks	f70188ac58	Remove connect SocketPermissions from core (#22797 ) This is related to #22116. Core no longer needs `SocketPermission` `connect`. This permission is relegated to these modules/plugins: - transport-netty4 module - reindex module - repository-url module - discovery-azure-classic plugin - discovery-ec2 plugin - discovery-gce plugin - repository-azure plugin - repository-gcs plugin - repository-hdfs plugin - repository-s3 plugin And for tests: - mocksocket jar - rest client - httpcore-nio jar - httpasyncclient jar	2017-02-03 09:39:56 -06:00
Christoph Büscher	c33f894846	Fixing compilation problem in Eclipse (#22956 )	2017-02-03 16:16:51 +01:00
Nik Everett	18eb0827e6	Reindex: do not log when can't clear old scroll (#22942 ) Versions of Elasticsearch prior to 2.0 would return a scroll id even with the last scroll response. They'd then automatically clear the scroll because it is empty. When terminating reindex will attempt to clear the last scroll it received, regardless of the remote version. This quiets the warning when the scroll cannot be cleared for versions before 2.0. Closes #22937	2017-02-03 10:08:27 -05:00
Jason Tedor	9a0b216c36	Upgrade checkstyle to version 7.5 This commit upgrades the checkstyle configuration from version 5.9 to version 7.5, the latest version as of today. The main enhancement obtained via this upgrade is better detection of redundant modifiers. Relates #22960	2017-02-03 09:46:44 -05:00
Nik Everett	ea4eb06b0a	Test: Make update-by-query test more resilient `UpdateByQueryWhileModifyingTests#testUpdateWhileReindexing` runs update-by-query and concurrently updates, asserting that the update-by-query never reverts any changes made by the update. It is a smoke test for concurrent updates. Now, it expects to hit a certain number of version conflicts during the updates. This is normal as it is racing the update-by-query. We have a maximum number of failures we expect (10) and I'd never seen us come close until https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=sles/495/console This bumps the max failures from 10 to 50 and improves logging a bit. If we continue to see this failure then we have some other issue. Closes #22938	2017-02-03 09:18:26 -05:00
Jay Modi	7520a107be	Optionally require a valid content type for all rest requests with content (#22691 ) This change adds a strict mode for xcontent parsing on the rest layer. The strict mode will be off by default for 5.x and in a separate commit will be enabled by default for 6.0. The strict mode, which can be enabled by setting `http.content_type.required: true` in 5.x, will require that all incoming rest requests have a valid and supported content type header before the request is dispatched. In the non-strict mode, the Content-Type header will be inspected and if it is not present or not valid, we will continue with auto detection of content like we have done previously. The content type header is parsed to the matching XContentType value with the only exception being for plain text requests. This value is then passed on with the content bytes so that we can reduce the number of places where we need to auto-detect the content type. As part of this, many transport requests and builders were updated to provide methods that accepted the XContentType along with the bytes and the methods that would rely on auto-detection have been deprecated. In the non-strict mode, deprecation warnings are issued whenever a request with body doesn't provide the Content-Type header. See #19388	2017-02-02 14:07:13 -05:00
Nik Everett	ce8e042b66	Reindex: fix reindex-from-remote from <2.0 (#22931 ) In 5.2 we stopped sending the source parameter if the user didn't specify it. This was a mistake as versions before 2.0 look like they don't always include the `_source`. This is because reindex requests some metadata fields. Anyway, now we say `"_source": true` if there isn't a `_source` configured in the reindex request. Closes #22893	2017-02-02 11:46:24 -05:00
Nik Everett	73bf29072f	Painless: Fix def invoked qualified method refs (#22918 ) We were incorrectly resolving qualified method references at run time when invoked on `def`. This lead to errors like `The struct with name [org] has not been defined.` when attempting ``` doc.date.dates.stream().map( org.joda.time.ReadableDateTime::centuryOfEra ).collect(Collectors.toList()) ```	2017-02-02 10:15:03 -05:00
Nik Everett	dacc150934	Expose multi-valued dates to scripts and document painless's date functions (#22875 ) Implemented by wrapping an array of reused `ModuleDateTime`s that we grow when needed. The `ModuleDateTime`s are reused when we move to the next document. Also improves the error message returned when attempting to modify the `ScriptdocValues`, removes a couple of allocations, and documents that the date functions are available in Painless. Relates to #22162	2017-02-01 21:57:07 -05:00
Jack Conradson	3d2626c4c6	Change Namespace for Stored Script to Only Use Id (#22206 ) Currently, stored scripts use a namespace of (lang, id) to be put, get, deleted, and executed. This is not necessary since the lang is stored with the stored script. A user should only have to specify an id to use a stored script. This change makes that possible while keeping backwards compatibility with the previous namespace of (lang, id). Anywhere the previous namespace is used will log deprecation warnings. The new behavior is the following: When a user specifies a stored script, that script will be stored under both the new namespace and old namespace. Take for example script 'A' with lang 'L0' and data 'D0'. If we add script 'A' to the empty set, the scripts map will be ["A" -- D0, "A#L0" -- D0]. If a script 'A' with lang 'L1' and data 'D1' is then added, the scripts map will be ["A" -- D1, "A#L1" -- D1, "A#L0" -- D0]. When a user deletes a stored script, that script will be deleted from both the new namespace (if it exists) and the old namespace. Take for example a scripts map with {"A" -- D1, "A#L1" -- D1, "A#L0" -- D0}. If a script is removed specified by an id 'A' and lang null then the scripts map will be {"A#L0" -- D0}. To remove the final script, the deprecated namespace must be used, so an id 'A' and lang 'L0' would need to be specified. When a user gets/executes a stored script, if the new namespace is used then the script will be retrieved/executed using only 'id', and if the old namespace is used then the script will be retrieved/executed using 'id' and 'lang'	2017-01-31 13:27:02 -08:00
Nik Everett	2e48fb8294	Move delete by query helpers into core (#22810 ) This moves the building blocks for delete by query into core. This should enabled two thigns: 1. Plugins other than reindex to implement "bulk by scroll" style operations. 2. Plugins to directly call delete by query. Those plugins should be careful to make sure that task cancellation still works, but this should be possible. Notes: 1. I've mostly just moved classes and moved around tests methods. 2. I haven't been super careful about cohesion between these core classes and reindex. They are quite interconnected because I wanted to make the change as mechanical as possible. Closes #22616	2017-01-27 16:09:18 -05:00
Nik Everett	8a2d424d68	Generate reference links for painless API (#22775 ) Adds "Appending B. Painless API Reference", a reference of all classes and methods available from Painless. Removes links to java packages because they contain methods that we don't expose and don't contain methods that we do expose (the ones in Augmentation). Instead this generates a list of every class and every exposed method using the same type information available to the interpreter/compiler/whatever-we-call-it. From there you can jump to the relevant docs. Right now you build all the asciidoc files by running ``` gradle generatePainlessApi ``` These files are expected to be committed because we build the docs without running `gradle`. Also changes the output of `Debug.explain` so that it is easy to search for the class in the generated reference documentation. You can also run it in an IDE safely if you pass the path to the directory in which to generate the docs as the first parameter. It'll blow away the entire directory an recreate it from scratch so be careful. And then you can build the docs by running something like: ``` ../docs/build_docs.pl --out ../built_docs/ --doc docs/reference/index.asciidoc --open ``` That is, if you have checked out https://github.com/elastic/docs in `../docs`. Wait a minute or two and your browser will pop open in with all of Elasticsearch's reference documentation. If you go to `http://localhost:8000/painless-api-reference.html` you can see this list. Or you can get there by following the links to `Modules` and `Scripting` and `Painless` and then clicking the link in the paragraphs below titled `Appendix B. Painless API Reference`. I like having these in asciidoc because we can deep link to them from the rest of the guide with constructs like `<<painless-api-reference-Object-hashCode-0>>` and `<<painless-api-reference->>` and we get link checking. Then the only brittle link maintenance bit is the link generation for javadoc. Which sucks. But I think it is important that we link to the methods directly so they are easy to find. Relates to #22720	2017-01-26 10:39:19 -05:00
Tim Brooks	719e75bb3f	Add repository-url module and move URLRepository (#22752 ) This is related to #22116. URLRepository requires SocketPermission connect. This commit introduces a new module called "repository-url" where URLRepository will reside. With the new module, permissions can be removed from core.	2017-01-25 17:09:25 -06:00
Tal Levy	e9a68b3287	fix date-processor to a new default year for every new pipeline execution. (#22601 ) Beforehand, the DateProcessor constructs its joda pattern formatter during processor construction. This led to newly ingested documents being defaulted to the year that the pipeline was constructed, not that of processing. Fixes #22547.	2017-01-25 15:09:07 -08:00
Chris Earle	f0f75b187a	Support Preemptive Authentication with RestClient (#21336 ) This adds the necessary `AuthCache` needed to support preemptive authorization. By adding every host to the cache, the automatically added `RequestAuthCache` interceptor will add credentials on the first pass rather than waiting to do it after _each_ anonymous request is rejected (thus always sending everything twice when basic auth is required).	2017-01-24 11:34:05 -05:00
Luca Cavanna	47c0e13a3b	Stop returning "es." internal exception headers as http response headers (#22703 ) move "es." internal headers to separate metadata set in ElasticsearchException and stop returning them as response headers Closes #17593 * [TEST] remove ESExceptionTests, move its methods to ElasticsearchExceptionTests or ExceptionSerializationTests	2017-01-24 16:12:45 +01:00
Nik Everett	28cfc533e2	Generate javadoc jar for painless's public API (#22704 ) The simplest way to do that is to move the public API into a new package and generate javadoc for that package.	2017-01-23 17:16:20 -05:00
Jim Ferenczi	e48bc2eed7	Add field collapsing for search request (#22337 ) * Add top hits collapsing to search request The field collapsing is done with a custom top docs collector that "collapse" search hits with same field value. The distributed aspect is resolve using the two passes that the regular search uses. The first pass "collapse" the top hits, then the coordinating node merge/collapse the top hits from each shard. ``` GET _search { "collapse": { "field": "category", } } ``` This change also adds an ExpandCollapseSearchResponseListener that intercepts the search response and expands collapsed hits using the CollapseBuilder#innerHit} options. The retrieval of each inner_hits is done by sending a query to all shards filtered by the collapse key. ``` GET _search { "collapse": { "field": "category", "inner_hits": { "size": 2 } } } ```	2017-01-23 16:33:51 +01:00
Tim Brooks	a4ac29c005	Add single static instance of SpecialPermission (#22726 ) This commit adds a SpecialPermission constant and uses that constant opposed to introducing new instances everywhere. Additionally, this commit introduces a single static method to check that the current code has permission. This avoids all the duplicated access blocks that exist currently.	2017-01-21 12:03:52 -06:00
Jim Ferenczi	8028578305	Upgrade to Lucene 6.4.0 (#22724 ) * Upgrade to Lucene 6.4.0 `ValueSource`s are now converted to `DoubleValueSource`s using the Lucene adapter made for the migration to the new API in 6.4.0.	2017-01-21 04:48:01 +01:00
Nik Everett	6265ef1c1b	Deguice rest handlers (#22575 ) There are presently 7 ctor args used in any rest handlers: * `Settings`: Every handler uses it to initialize a logger and some other strange things. * `RestController`: Every handler registers itself with it. * `ClusterSettings`: Used by `RestClusterGetSettingsAction` to render the default values for cluster settings. * `IndexScopedSettings`: Used by `RestGetSettingsAction` to get the default values for index settings. * `SettingsFilter`: Used by a few handlers to filter returned settings so we don't expose stuff like passwords. * `IndexNameExpressionResolver`: Used by `_cat/indices` to filter the list of indices. * `Supplier<DiscoveryNodes>`: Used to fill enrich the response by handlers that list tasks. We probably want to reduce these arguments over time but switching construction away from guice gives us tighter control over the list of available arguments. These parameters are passed to plugins using `ActionPlugin#initRestHandlers` which is expected to build and return that handlers immediately. This felt simpler than returning an reference to the ctors given all the different possible args. Breaks java plugins by moving rest handlers off of guice.	2017-01-20 11:48:51 -05:00
Tim Brooks	bc16162d21	Remove accept SocketPermissions from core (#22622 ) This is related to #22116. Core no longer needs SocketPermission accept. This permission is relegated to the transport-netty4 module and (for tests) to the mocksocket jar.	2017-01-20 09:27:45 -06:00
Nik Everett	22f1c9fa0f	Remove @header we no longer need	2017-01-19 11:44:13 -05:00
Nik Everett	bb83c283bb	Make lexer abstract	2017-01-19 11:41:50 -05:00
Nik Everett	dbb4a2ca6c	Move lexer hacks to EnhancedPainlessLexer This "feels" nicer. Less classes at least.	2017-01-19 11:23:16 -05:00
Nik Everett	e2da6a8ee5	Improve painless's javadocs Hopefully useful references.	2017-01-19 11:04:08 -05:00
Tim Brooks	a10aa8aade	Add TestWithDependenciesPlugin to build (#22646 ) This commit adds a MessyRestTestPlugin to the gradle build. It extends StandaloneRestTestPlugin. The main piece of functionality that it adds is to copy plugin-metadata from dependencies into the generated-resources for the current test source. This is necessary to ensure that permissions for dependencies are applied when running the tests. A current limitation is that the permissions are applied differently than in the distribution sources. When permissions are granted to all depedencies for a module or plugin, the permissions are granted to all dependencies on the classpath for tests besides a few hardcoded exclusions: - es core - es test framework - lucene test framework - randomized runner - junit library	2017-01-19 09:43:53 -06:00
Nik Everett	3ce41a0e15	Painless: Add augmentation to string for base 64 (#22665 ) We don't want to expose `String#getBytes` which is required for `Base64.getEncoder.encode` to work because we're worried about character sets. This adds `encodeBase64` and `decodeBase64` methods to `String` in Painless that are duals of one another such that: `someString == someString.encodeBase64().decodeBase64()`. Both methods work with the UTF-8 encoding of the string. Closes #22648	2017-01-19 09:31:45 -05:00
Nik Everett	ee5f8c4522	Consolidate some reindex utility classes (#22666 ) Everything that extended `AbstractAsyncBulkByScrollAction` also extended `AbstractAsyncBulkIndexByScrollAction` so this removes `AbstractAsyncBulkIndexByScrollAction`, merging it into `AbstractAsyncBulkByScrollAction`.	2017-01-18 16:58:39 -05:00
Nik Everett	1fe74a6b4b	Better error when can't auto create index (#22488 ) Changes the error message when `action.auto_create_index` or `index.mapper.dynamic` forbids automatic creation of an index from `no such index` to one of: * `no such index and [action.auto_create_index] is [false]` * `no such index and [index.mapper.dynamic] is [false]` * `no such index and [action.auto_create_index] contains [-<pattern>] which forbids automatic creation of the index` * `no such index and [action.auto_create_index] ([all patterns]) doesn't match` This should make it more clear why there is `no such index`. Closes #22435	2017-01-18 15:18:32 -05:00
Simon Willnauer	24e2847af2	Streamline foreign stored context restore and allow to perserve response headers (#22677 ) Today we do not preserve response headers if they are present on a transport protocol response. While preserving these headers is not always desired, in the most cases we should pass on these headers to have consistent results for depreciation headers etc. yet, this hasn't been much of a problem since most of the deprecations are detected early ie. on the coordinating node such that this bug wasn't uncovered until #22647 This commit allow to optionally preserve headers when a context is restored and also streamlines the context restore since it leaked frequently into the callers thread context when the callers context wasn't restored again.	2017-01-18 16:17:54 +01:00
Igor Motov	500548fcda	Remove taskManager.registerChildTask Instead of forcing each task to register all nodes where its children are running, this commit runs cancellation on all nodes. The task cancellation operation doesn't run too frequently, so this optimization doesn't seem to be worth additional complexity of the interface.	2017-01-17 18:07:31 -05:00
Ali Beyad	e2977889b8	Allow comma delimited array settings to have a space after each entry (#22591 ) Previously, certain settings that could take multiple comma delimited values would pick up incorrect values for all entries but the first if each comma separated value was followed by a whitespace character. For example, the multi-value "A,B,C" would be correctly parsed as ["A", "B", "C"] but the multi-value "A, B, C" would be incorrectly parsed as ["A", " B", " C"]. This commit allows a comma separated list to have whitespace characters after each entry. The specific settings that were affected by this are: cluster.routing.allocation.awareness.attributes index.routing.allocation.require.* index.routing.allocation.include.* index.routing.allocation.exclude.* cluster.routing.allocation.require.* cluster.routing.allocation.include.* cluster.routing.allocation.exclude.* http.cors.allow-methods http.cors.allow-headers For the allocation filtering related settings, this commit also provides validation of each specified entry if the filtering is done by _ip, _host_ip, or _publish_ip, to ensure that each entry is a valid IP address. Closes #22297	2017-01-17 08:51:04 -06:00
Tanguy Leroux	f5542ed47f	Simplify ElasticsearchException rendering as a XContent (#22611 ) This commit tries to simplify the way ElasticsearchException are rendered to xcontent. It adds some documentation and renames and merges some methods. Current behavior is preserved, the goal is to be more readable and centralize everything in the ElasticsearchException class.	2017-01-17 15:44:49 +01:00
Tim Brooks	16a76d9bc0	Remove blocking TCP clients and servers (#22639 ) This commit removes the option to use the blocking variants of the TCP transport server, TCP transport client, or http server.	2017-01-16 18:38:51 -06:00
Simon Willnauer	f30b1f82ee	Remove HttpServer and HttpServerAdapter in favor of a simple dispatch method (#22636 ) Today we have quite some abstractions that are essentially providing a simple dispatch method to the plugins defining a `HttpServerTransport`. This commit removes `HttpServer` and `HttpServerAdaptor` and introduces a simple `Dispatcher` functional interface that delegate to `RestController` by default. Relates to #18482	2017-01-16 21:06:08 +01:00
javanna	a8a13bb46f	replace custom functional interface with CheckedFunction in percolate module	2017-01-16 13:57:58 +01:00
Alexander Reelsen	f6ee6e420b	Indexing: Add shard id to indexing operation listener (#22606 ) The IndexingOperationListener interface did not provide any information about the shard id when a document was indexed. This commit adds the shard id as the first parameter to all methods in the IndexingOperationListener.	2017-01-16 09:08:16 +01:00
Tim Brooks	f4270f9914	Wrap netty accept/connect ops with doPrivileged (#22572 ) This is related to #22116. netty channels require socket `connect` and `accept` privileges. Netty does not currently wrap these operations with `doPrivileged` blocks. These changes extend the netty channels and wrap calls to the relevant super methods in doPrivileged blocks.	2017-01-13 14:27:09 -06:00
Zachary Tong	18fdc39b8c	Increase visibility of doExecute so it can be used directly (#22614 )	2017-01-13 09:42:02 -05:00
Nik Everett	baed02bbe2	Whitelist some ScriptDocValues in painless (#22600 ) Without this whitelist painless can't use ip or binary doc values. Closes #22584	2017-01-12 15:26:09 -05:00
Jason Tedor	126efea56c	Upgrade to Netty 4.1.7 This commit upgrades the Netty dependency to version 4.1.7.Final, picking up some important bug fixes. Relates #22587	2017-01-12 10:58:21 -05:00
javanna	64c3212fdb	Remove ParseFieldMatcher usages from IndexSettings	2017-01-12 14:43:35 +01:00
javanna	8072f168a3	Remove ParseFieldMatcher usages from QueryParseContext	2017-01-12 14:43:35 +01:00
Luca Cavanna	0f7d52df68	Remove some more ParseFieldMatcher usages (#22571 )	2017-01-12 10:04:10 +01:00
Nik Everett	25a5f1869a	Improve error message when reindex-from-remote gets bad json (#22536 ) Adds a message about how the remote is unlikely to be Elasticsearch. This isn't as good as including the whole message from the remote but we can't do that because we are stream parsing it and we don't want to mark the whole request. Closes #22330	2017-01-11 12:55:23 -05:00
Jack Conradson	0c694b3d19	Update loop counter to be higher (1000000) instead of (10000).	2017-01-11 09:22:24 -08:00
Nik Everett	abb7d7841f	Remove SearchRequestParsers (#22538 ) It is empty now that we've moved all the parsing into `namedObject`.	2017-01-11 10:28:14 -05:00
Luca Cavanna	0f391336f5	Clean up SearchShardTarget (#22468 ) * unify shard target setter * Remove indexText member from SearchShardTarget * Remove duplicated indexName getter from SearchShardTarget * Remove duplicated shardId getter from SearchShardTarget * Remove duplicated nodeIde getter from SearchShardTarget * Rename SearchShardTarget#nodeIdText getter to getNodeIdText * Remove unused InternalSearchHit#internalSourceRef unused method * Remove unused InternalSearchHit#internalHighlightFields unused method * Make SearchShardTarget members final	2017-01-11 10:08:31 +01:00
Nik Everett	b71b8acf59	Remove ClusterService from ctors in reindex (#22539 ) Moves fetching the local node id into `NodeClient` which is a fairly useful place to put it so you can generate task ids from `NodeClient#executeLocally`.	2017-01-10 18:26:06 -05:00
Nik Everett	d50f96e122	Remove InternalAggregation.Type (#22511 ) It is no longer needed. It used to contain a lot of strings used by serialization but those have since been removed. Now it is just another thing to pass around that we don't really need.	2017-01-10 11:57:19 -05:00
Nik Everett	78bb56671e	Fix reindex from remote clearing scroll (#22525 ) Reindex-from-remote had a race when it tried to clear the scroll. It first starts the request to clear the scroll and then submits a task to the generic threadpool to shutdown the client. These two things race and, in my experience, closing the scroll generally loses. That means that most of the time reindex-from-remote isn't clearing the scrolls that it uses. This isn't the end of the world because we flush old scroll contexts after a while but this isn't great. Noticed while experimenting with #22514.	2017-01-10 10:30:23 -05:00
Nik Everett	5ef78fd015	Fix source filtering in reindex-from-remote (#22514 ) Reindex-from-remote was accepting source filtering in the request but ignoring it and setting `_source=true` on the search URI. This fixes the filtering so it is piped through to the remote node and adds tests for that. Closes #22507	2017-01-10 09:00:12 -05:00
Martijn van Groningen	cb2333dacd	percolator: remove deprecated percolate and mpercolate apis	2017-01-10 11:18:27 +01:00
Nik Everett	3fb9254b95	Replace Suggesters with namedObject (#22491 ) Removes another parser registery type thing in favor of `XContentParser#namedObject`.	2017-01-09 16:51:08 -05:00
Nik Everett	057194f9ab	Fix test under windows Silly `\r`.	2017-01-09 16:29:59 -05:00
Nik Everett	e3f77b4795	Replace AggregatorParsers with namedObject (#22397 ) Removes `AggregatorParsers`, replacing all of its functionality with `XContentParser#namedObject`. This is the third bit of payoff from #22003, one less thing to pass around the entire application.	2017-01-09 13:59:38 -05:00
Nik Everett	fc1f7c2147	Remove content-type detection from reindex-from-remote (#22504 ) If the remote doesn't return a content type then reindex tried to guess the content-type. This didn't work most of the time and produced a rather useless error message. Given that Elasticsearch always returns the content-type we are dropping content-type detection in favor of just failing the request if the remote didn't return a content-type. Closes #22329	2017-01-09 11:50:20 -05:00
Nik Everett	f4884e0726	Replace SearchExtRegistry with namedObject (#22492 ) This is one of the last things in `SearchRequestParsers`.	2017-01-09 08:35:54 -05:00
javanna	ded694fc83	Make StatusToXContent extend ToXContentObject and rename it to StatusToXContentObject This also allows to make RestToXContentListener require ToXContentObject rather than ToXContent	2017-01-06 23:31:48 +01:00
javanna	4e49860f68	Make PercolateResponse a ToXContentObject	2017-01-06 23:31:48 +01:00
javanna	d5510701a0	Make SearchResponse a ToXContentObject	2017-01-06 23:31:48 +01:00
javanna	45d4938fcc	Migrate some more responses to ToXContentObject	2017-01-06 23:31:48 +01:00
Nik Everett	f24ca5188a	Fix some issues with painless's strings (#22393 ) 1. Escape sequences we're working. For example `\\` is now correctly interpreted as `\` instead of `\\`. Same with `\'` being `'` and `\"` being `"`. 2. `'` delimited strings weren't allowed to contain `"`s but it looked like they were intended to support it. Now they do. 3. Improves the error message when the script contains an invalid escape sequence inside a string to include a list of the valid escape sequences. Closes #22372	2017-01-06 11:35:22 -05:00
javanna	dea7d65439	remove ParseFieldMatcher usages from RestSearchTemplateAction	2017-01-05 19:33:04 +01:00
javanna	6102523033	remove ParseFieldMatcher usages from Script parsing code	2017-01-05 19:33:04 +01:00
javanna	9394792392	remove unused ParseFieldMatcher imports/arguments	2017-01-05 19:33:04 +01:00
Tim B	be22a250b6	Replace Socket, ServerSocket, and HttpServer usages in tests with mocksocket versions (#22287 ) This integrates the mocksocket jar with elasticsearch tests. Mocksocket wraps actions requiring SocketPermissions in doPrivilege blocks. This will eventually allow SocketPermissions to be assigned to the mocksocket jar opposed to the entire elasticsearch codebase.	2017-01-04 14:38:51 -06:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Jason Tedor	96ba45e310	Fix stale comment in Netty4Utils We previously named the thread using a frame from the stack trace, but this was removed to simplify the code here. However, the comment explaining this was left behind and this commit cleans that up.	2017-01-03 08:15:57 -05:00
Daniel Mitterdorfer	1ed64f0551	Eliminate unneccessary declaration of IOException With this commit we remove the declaration of IOException from assertWarnings and modify all call sites. Checked with @javanna	2017-01-03 12:36:28 +01:00
javanna	cd6b569286	Remove some usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-31 09:24:44 +01:00
javanna	df2acb3d9d	Remove some more usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
javanna	6c54cbade4	Remove some more usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
javanna	45d010e874	Remove some usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
Martijn van Groningen	9ccdd3303d	percolator: Fix NPE in percolator's 'now' range check for percolator queries with range queries. Closes #22355	2016-12-27 22:56:01 +01:00
Tal Levy	e6fb3a5d95	fix index out of bounds error in KV Processor (#22288 ) - checks for index-out-of-bounds - added unit tests for failed `field_split` and `value_split` scenarios missed this test in #22272.	2016-12-27 10:57:11 -08:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Jason Tedor	7946396fe6	Introduce translog no-op As the translog evolves towards a full operations log as part of the sequence numbers push, there is a need for the translog to be able to represent operations for which a sequence number was assigned, but the operation did not mutate the index. Examples of how this can arise are operations that fail after the sequence number is assigned, and gaps in this history that arise when an operation is assigned a sequence number but the operation never completed (e.g., a node crash). It is important that these operations appear in the history so that they can be replicated and replayed during recovery as otherwise the history will be incomplete and local checkpoints will not be able to advance. This commit introduces a no-op to the translog to set the stage for these efforts. Relates #22291	2016-12-21 23:08:16 -05:00
Nik Everett	567c65b0d5	Replace IndicesQueriesRegistry (#22289 ) * Switch query parsing to namedObject * Remove IndicesQueriesRegistry	2016-12-21 09:05:14 -05:00
Tal Levy	c53b2ee9cd	introduce KV Processor in Ingest Node (#22272 ) Now you can parse field values of the `key=value` variety and have `key` be inserted as a field name in an ingest document. Closes #22222.	2016-12-20 13:26:17 -08:00
Nik Everett	a04dcfb95b	Introduce XContentParser#namedObject (#22003 ) Introduces `XContentParser#namedObject which works a little like `StreamInput#readNamedWriteable`: on startup components register parsers under names and a superclass. At runtime we look up the parser and call it to parse the object. Right now the parsers take a context object they use to help with the parsing but I hope to be able to eliminate the need for this context as most what it is used for at this point is to move around parser registries which should be replaced by this method eventually. I make no effort to do so in this PR because it is big enough already. This is meant to the a start down a road that allows us to remove classes like `QueryParseContext`, `AggregatorParsers`, `IndicesQueriesRegistry`, and `ParseFieldRegistry`. The goal here is to reduce the amount of plumbing required to allow parsing pluggable things. With this you don't have to pass registries all over the place. Instead you must pass a super registry to fewer places and use it to wrap the reader. This is the same tradeoff that we use for NamedWriteable and it allows much, much simpler binary serialization. We think we want that same thing for xcontent serialization. The only parsing actually converted to this method is parsing `ScoreFunctions` inside of `FunctionScoreQuery`. I chose this because it is relatively self contained.	2016-12-20 11:05:24 -05:00
Nik Everett	73320566c1	Reindex test: catch exception name instead of reason It looks like the exception reason can differ in different default locales, so the build would fail in any non-English locale. This switches the catch to the name of the exception which shouldn't vary.	2016-12-20 10:00:14 -05:00
Nik Everett	8de4be9e4d	Reinex test: don't fail if iis is running on port 0	2016-12-19 16:44:08 -05:00
Grzegorz Gajos	f6b6e4e376	Added ability to remove pipelines via wildcards (#22149 ) (#22191 ) This commit is adding an ability to remove pipelines with wildcards.	2016-12-19 10:59:59 -08:00
javanna	5dae10db11	[TEST] add warnings check to ESTestCase We are currenlty checking that no deprecation warnings are emitted in our query tests. That can be moved to ESTestCase (disabled in ESIntegTestCase) as it allows us to easily catch where our tests use deprecated features and assert on the expected warnings.	2016-12-19 19:39:56 +01:00
javanna	6a27628f12	Remove support for strict parsing mode We return deprecation warnings as response headers, besides logging them. Strict parsing mode stayed around, but was only used in query tests, though we also introduced checks for deprecation warnings there that don't need strict parsing anymore (see #20993). We can then safely remove support for strict parsing mode. The final goal is to remove the ParseFieldMatcher class, but there are many many users of it. This commit prepares the field for the removal, by deprecating ParseFieldMatcher and making it effectively not needed. Strict parsing is removed from ParseFieldMatcher, and strict parsing is replaced in tests where needed with deprecation warnings checks. Note that the setting to enable strict parsing was never ported to the new settings infra hance it cannot be set in production. It is really only used in our own tests. Relates to #19552	2016-12-19 19:39:56 +01:00
Nik Everett	2d71ced221	Properly fail reindex-from-remote if can't detect content type	2016-12-19 12:51:38 -05:00
Yannick Welsch	63af03a104	Atomic mapping updates across types (#22220 ) This commit makes mapping updates atomic when multiple types in an index are updated. Mappings for an index are now applied in a single atomic operation, which also allows to optimize some of the cross-type updates and checks.	2016-12-19 14:39:50 +01:00
Simon Willnauer	ccfeac8dd5	Remove `doHandshake` test-only settings from TcpTransport (#22241 ) In #22094 we introduce a test-only setting to simulate transport impls that don't support handshakes. This commit implements the same logic without a setting.	2016-12-18 09:26:53 +01:00
Tal Levy	bb37167946	Enables the ability to inject serialized json fields into root of document. (#22179 ) The JSON processor has an optional field called "target_field". If you don't specify target_field then target_field becomes what you specified as "field". There isn't anyway to add the fields to the root of a document. By setting `add_to_root`, now serialized fields will be inserted into the top-level fields of the ingest document. Closes #21898.	2016-12-16 10:17:27 -08:00
Jason Tedor	df43c268da	Eagerly initialize Netty 4 Today we initialize Netty in a static initializer. We trigger this method via static initializers from Netty-related classes, but we can trigger this method earlier than we do to ensure that Netty is initialized how we want it to be.	2016-12-15 13:24:47 -05:00
Tal Levy	eaf82a6e7e	compile ScriptProcessor inline scripts when creating ingest pipelines (#21858 ) Inline scripts defined in Ingest Pipelines are now compiled at creation time to preemptively catch errors on initialization of the pipeline. Fixes #21842.	2016-12-14 17:26:51 -08:00
Simon Willnauer	80d6539e9c	Handle connection close / reset events gracefully during handshake (#22178 ) Low level handshake code doesn't handle situations gracefully if the connection is concurrently closed or reset by peer. This commit adds the relevant code to fail the handshake if the connection is closed.	2016-12-14 23:04:14 +01:00
Nik Everett	749039ad4f	Consolidate the last easy parser construction (#22095 ) Moves the last of the "easy" parser construction into `RestRequest`, this time with a new method `RestRequest#contentParser`. The rest of the production code that builds `XContentParser` isn't "easy" because it is exposed in the Transport Client API (a Builder) object.	2016-12-14 15:41:25 -05:00
Adrien Grand	149ef74b26	Fix `missing` on aggs on `boolean` fields. (#22135 ) The creation of the `ValuesSource` used to pass `DateTimeZone.UTC` as a time zone all the time in case of empty fields in spite of the fact that all doc value formats but the date one reject this parameter. This commit centralizes the creation of the `ValuesSource` and adds unit tests to it. Closes #22009	2016-12-14 10:03:09 +01:00
Daniel Mitterdorfer	7e5058037b	Enable strict duplicate checks for JSON content With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default. This ensures that JSON keys are always unique. While this has a performance impact, benchmarking has indicated that the typical drop in indexing throughput is around 1 - 2%. As a last resort, we allow users to still disable strict duplicate checks by setting `-Des.json.strict_duplicate_detection=false` which is intentionally undocumented. Closes #19614	2016-12-14 09:35:53 +01:00
Nik Everett	49bdd29f91	Consolidate more parser creation into ESTestCase This will make it easier to add the forthcoming required argument, `NamedXContentRegistry`.	2016-12-13 20:28:41 -05:00
Nik Everett	872984d21a	Continue consolidating `XContentParser` construction in tests (#22145 ) Consolidate more parser creation in tests Moves more parser creation in tests to the `createParser` methods in `ESTestCase`.	2016-12-13 17:22:39 -05:00
Tal Levy	f56097b57a	Fixes GrokProcessor's ignorance of named-captures with same name. (#22131 ) Grok was originally ignoring potential matches to named-capture groups larger than one. For example, If you had two patterns containing the same named field, but only the second pattern matched, it would fail to pick this up. This PR fixes this by exploring all potential places where a named-capture was used and chooses the first one that matched. Fixes #22117.	2016-12-13 13:19:55 -08:00
Simon Willnauer	7a9b667e98	Introduce a low level protocol handshake (#22094 ) Today we rely on the version that the API user passes in together with the DiscoveryNode. This commit introduces a low level handshake where nodes exchange their version to be used with the transport protocol that is executed every time a connection to a node is established. This, on the one hand allows to change the wire protocol based on the version we are talking to even without a full cluster restart. Today we would need to carry on a BWC layer across major versions but with a handshake we can rely on the fact that the latest version of the previous minor executes a handshake and uses the latest protocol version across all communication with the N+1 version nodes. This change is yet fully backwards compatible, a followup PR will remove the BWC in 6.0 once this has been back-ported to the 5.x branch	2016-12-13 21:06:23 +01:00
Adrien Grand	049fd3991c	Remove `AggregationContext`. (#22124 ) This class is just a wrapper around `SearchContext`, so let's use `SearchContext` directly. The change is mechanical, except the `ValuesSourceConfig` class, where I moved the logic to get a `ValuesSource` given a config.	2016-12-13 09:09:40 +01:00
Luca Cavanna	6d987a9b69	Remove support for empty queries (#22092 ) Our query DSL supports empty queries (`{}`), which have a different meaning depending on the query that holds it, either ignored, match_all or match_none. We deprecated the support for empty queries in 5.0, where we log a deprecation warning wherever they are used. The way we supported it once we moved query parsing to the coordinating node was having an Optional<QueryBuilder> return type in all of our parse methods (called fromXContent). See #17624. The central place for this was QueryParseContext#parseInnerQueryBuilder. We can now remove all the optional return types and simply throw an exception whenever an empty query is found.	2016-12-12 12:37:12 +01:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the nodes connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connect and publish the connection.	2016-12-10 10:03:26 +01:00
Nik Everett	3adefb7b4a	Begin centralizing XContentParser creation into RestRequest (#22041 ) To get #22003 in cleanly we need to centralize as much `XContentParser` creation as possible into `RestRequest`. That'll mean we have to plumb the `NamedXContentRegistry` into fewer places. This removes `RestAction.hasBody`, `RestAction.guessBodyContentType`, and `RestActions.getRestContent`, moving callers over to `RestRequest.hasContentOrSourceParam`, `RestRequest.contentOrSourceParam`, and `RestRequest.contentOrSourceParamParser` and `RestRequest.withContentOrSourceParamParserOrNull`. The idea is to use `withContentOrSourceParamParserOrNull` if you need to handle requests without any sort of body content and to use `contentOrSourceParamParser` otherwise. I believe the vast majority of this PR to be purely mechanical but I know I've made the following behavioral change (I'll add more if I think of more): * If you make a request to an endpoint that requires a request body and has cut over to the new APIs instead of getting `Failed to derive xcontent` you'll get `Body required`. * Template parsing is now non-strict by default. This is important because we need to be able to deprecate things without requests failing.	2016-12-09 20:23:02 -05:00
Nik Everett	fc2060ba7e	Don't close rest client from its callback (#22061 ) If you try to close the rest client inside one of its callbacks then it blocks itself. The thread pool switches the status to one that requests a shutdown and then waits for the pool to shutdown. When another thread attempts to honor the shutdown request it waits for all the threads in the pool to finish what they are working on. Thus thread a is waiting on thread b while thread b is waiting on thread a. It isn't quite that simple, but it is close. Relates to #22027	2016-12-09 10:39:51 -05:00
Adrien Grand	36f598138a	Start using `ObjectParser` for aggs. (#22048 ) This is an attempt to start moving aggs parsing to `ObjectParser`. There is still A LOT to do, but ObjectParser is way better than the way aggregations parsing works today. For instance in most cases, we reject numbers that are provided as strings, which we are supposed to accept since some client languages (looking at you Perl) cannot make sure to use the appropriate types. Relates to #22009	2016-12-09 09:45:16 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Lee Hinman	ef64d230e7	Merge remote-tracking branch 'dakrone/index-seq-id-and-primary-term'	2016-12-08 19:47:21 -07:00
Lee Hinman	ee22a477df	Add internal _primary_term doc values field, fix _seq_no indexing This adds the `_primary_term` field internally to the mappings. This field is populated with the current shard's primary term. It is intended to be used for collision resolution when two document copies have the same sequence id, therefore, doc_values for the field are stored but the filed itself is not indexed. This also fixes the `_seq_no` field so that doc_values are retrievable (they were previously stored but irretrievable) and changes the `stats` implementation to more efficiently use the points API to retrieve the min/max instead of iterating on each doc_value value. Additionally, even though we intend to be able to search on the field, it was previously not searchable. This commit makes it searchable. There is no user-visible `_primary_term` field. Instead, the fields are updated by calling: ```java index.parsedDoc().updateSeqID(seqNum, primaryTerm); ``` This includes example methods in `Versions` and `Engine` for retrieving the sequence id values from the index (see `Engine.getSequenceID`) that are only used in unit tests. These will be extended/replaced by actual implementations once we make use of sequence numbers as a conflict resolution measure. Relates to #10708 Supercedes #21480 P.S. As a side effect of this commit, `SlowCompositeReaderWrapper` cannot be used for documents that contain `_seq_no` because it is a Point value and SCRW cannot wrap documents with points, so the tests have been updated to loop through the `LeafReaderContext`s now instead.	2016-12-08 19:47:03 -07:00
Christoph Büscher	7454a9647b	Add fromXContent to HighlightField This adds a fromXContent method and unit test to the HighlightField class so we can parse it as part of a serch response. This is part of the preparation for parsing search responses on the client side.	2016-12-07 16:32:44 +01:00
Nik Everett	ef83dbfbe6	Reindex: Better error message for pipeline in wrong place (#21985 ) `_update_by_query` supports specifying the `pipeline` to process the documents as a url parameter but `_reindex` doesn't. It doesn't because everything about the `_reindex` request that has to do with writing the documents is grouped under the `dest` object in the request body. This changes the response parameter from `request [_reindex] contains unrecognized parameter: [pipeline]` to `_reindex doesn't support [pipeline] as a query parmaeter. Specify it in the [dest] object instead.`	2016-12-06 14:55:46 -05:00
Ryan Ernst	c8f241f284	Plugins: Remove response action filters (#21950 ) Action filters currently have the ability to filter both the request and response. But the response side was not actually used. This change removes support for filtering responses with action filters.	2016-12-05 16:14:04 -08:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Igor Motov	c391b3fff6	Add proper descriptions to reindex, update-by-query and delete-by-query tasks. Related to #21768	2016-12-02 21:46:38 -05:00
Jack Conradson	0ecdef026d	Test fix for def equals test in Painless. (#21945 ) Closes #21801	2016-12-02 14:41:13 -08:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Simon Willnauer	842e00c689	[TEST] Add back skip of external clusters	2016-12-02 11:53:33 +01:00
Simon Willnauer	572b4c3e72	Port assert from 5.x to master I added an assertion to Netty4/Netty3Transport in 5.x that is not in master yet. This commit port the assert to ensure we consumed all connection in `connectToChannels`	2016-12-02 10:34:33 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections this commit allows to specify a connection timeout per node such that depending on the context connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	fe01c0f83b	fix TemplateQueryBuilderTests & Murmur3FieldMapperTests	2016-12-01 14:21:57 +01:00
Simon Willnauer	dd5256c324	Reduce number of connections per node depending on the nodes role (#21849 ) We currently treat every node equally when we establish connections to a node. Yet, if we are not master eligible or can't hold any data there is no point in creating a dedicated connection for sending the cluster state or running remote recoveries respectively. The usage of STATE and RECOVERY connections on non-master and/or non-data nodes will result in an IllegalStateException.	2016-12-01 08:00:48 +01:00
Jason Tedor	6c45695d52	Add version 5.1.1 This commit removes the version constant for 5.1.0 (due to an inadvertent release) and adds the version constant for 5.1.1. Relates #21890	2016-11-30 11:14:17 -05:00
Luca Cavanna	5b8bdba12e	Remove subrequests method from CompositeIndicesRequest (#21873 )	2016-11-30 15:03:58 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Luca Cavanna	6eaff9432d	SearchTemplateRequest to implement CompositeIndicesRequest (#21865 ) SearchTemplateRequest to implement CompositeIndicesRequest Given that SearchTemplateRequest effectively delegates to search when a search is being executed, it should implement the CompositeIndicesRequest interface. The subrequests method should return a single search request. When a search is not going to be executed, because we are in simulate mode, there are no inner requests, and there are no corresponding indices to that request either. Closes #21747	2016-11-29 20:52:43 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concepts is confusing and not very flexible. neither really testable on a unittest level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows to connect to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Jason Tedor	a6082eb563	Grant Netty permission to read system somaxconn When Netty listens on a socket, it specifies the established connection backlog for the socket. On Linux, Netty tries to read the system-wide configuration for this from /proc/sys/net/core/somaxconn and falls back to a default value when it can not read this value. This commit grants Netty permission to read this file so that it can honor the system-wide configuration for the connection backlog for sockets that it is listening on. This also removes an obnoxious stack trace that appears when Netty logging is set to debug logging. Relates #21840	2016-11-28 18:47:32 -05:00
Luca Cavanna	360b74eda8	[TEST] Don't reinitialize YamlTestClient and RestClient before each single test (#21807 ) In the past we ran yaml tests against an internal cluster, which would get restarted after each test failure, hence the client objects needed to eventually be refreshed before each test. That is why we had the initClient method to re-initialize the YamlTestClient in the execution context. We ended up though re-initializing the client unconditionally, which is not needed. Also, ESRestTestCase recreates the RestClient against the external cluster before each test, which is not needed given that nothing changes in the external cluster. This commit removes the initClient method from the yaml tests execution context. The YamlTestClient can be eagerly created before the first yaml test runs and then re-used in subsequent tests. Also api calls to check for nodes versions etc. are moved out of YamlTestClient to ESClientYamlSuiteTestCase. Also the RestClient is now initialized in ESRestTestCase before the first test runs, and kept around afterwards as a static member. Basically each subclass of EsRestTestCase will have its own RestClient instance, but the client will be shared across the different tests within the same class. The yaml test suite is just a special suite, composed of 600+ tests that are loaded from files, which will share the same client instance. This change should speed tests up as well, as we don't recreate the RestClient before each single test, and we don't call _cat/nodes either before each single test.	2016-11-28 18:43:27 +01:00
Jason Tedor	6f95261632	Remove unused imports from Netty4Utils This commit removes two unused imports from Netty4Utils that were leftover from a previous change.	2016-11-27 13:18:50 -05:00
Jason Tedor	5e73282bbc	Simplify handling of fatal network layer errors This commit simplifies the handling of fatal errors on the network layer. The simplification here is to remove the use of a StringWriter/PrintWriter pair to format the stack trace, removing the need for the method to declare that it throws a checked IOException.	2016-11-27 13:14:24 -05:00
Tanguy Leroux	28dc02f01a	[Test] Mute EqualsTests..testBranch(Not)EqualsDefAndPrimitive It fails regurlarly and it is tracked by https://github.com/elastic/elasticsearch/issues/21801	2016-11-25 17:21:59 +01:00
Ryan Ernst	c3ec8e22b8	Wrap VerifyError in ScriptException (#21769 ) If a bug occurs in painless compilation (not from a user, but from the painless infrastructure), a VerifyError may be thrown when compiling the broken generated class. This commit wraps VerifyErrors in ScriptException so that useful information is returned to the user, which can be passed on to the ES team for analysis.	2016-11-23 14:45:21 -08:00
Jack Conradson	ba2d772668	Fix a VerifyError bug in Painless (#21765 ) This bug would cause a VerifyError when scripts using the === operator were comparing a def type against a primitive type since the primitive type wasn't being appropriately boxed.	2016-11-23 13:57:14 -08:00
Jason Tedor	8416b16dfd	Improve handling of unreleased versions Today when handling unreleased versions for backwards compatilibity support, we scatted version constants across the code base and add some asserts to support removing these constants when the version in question is actually released. This commit improves this situation, enabling us to just add a single unreleased version constant that can be renamed when the version is actually released. This should make maintenance of these versions simpler. Relates #21760	2016-11-23 15:49:05 -05:00
Nik Everett	434fa4bd26	Docs and tests for painless lack of boxing for ?: and ?. (#21756 ) NOTE: The result of `?.` and `?:` can't be assigned to primitives. So `int[] someArray = null; int l = someArray?.length` and `int s = params.size ?: 100` don't work. Do `def someArray = null; def l = someArray?.length` and `def s = params.size ?: 100` instead. Relates to #21748	2016-11-23 14:33:32 -05:00
Ryan Ernst	6940b2b8c7	Remove groovy scripting language (#21607 ) * Scripting: Remove groovy scripting language Groovy was deprecated in 5.0. This change removes it, along with the legacy default language infrastructure in scripting.	2016-11-22 19:24:12 -08:00
Nik Everett	dbdcf9e95c	Move painless yaml tests into painless dir They were in a directory named "plan_a", the old name for painless.	2016-11-22 20:27:14 -05:00
Nik Everett	457c2d8fb0	Add Debug.explain to painless You can use `Debug.explain(someObject)` in painless to throw an `Error` that can't be caught by painless code and contains an object's class. This is useful because painless's sandbox doesn't allow you to call `someObject.getClass()`. Closes #20263	2016-11-22 12:46:02 -05:00
Jason Tedor	446037ccb8	Die with dignity on the network layer When a fatal error is thrown on the network layer, such an error never makes its way to the uncaught exception handler. This prevents the node from being torn down if an out of memory error or other fatal error is thrown while handling HTTP or transport traffic. This commit adds logic to ensure that such errors bubble their way up to the uncaught exception handler, even though Netty tries really hard to swallow everything. Relates #21720	2016-11-21 22:14:30 -05:00
Nik Everett	f5c8c746e6	Implement toString in painless's AST This should make debugging painless' analysis and code generation a little easier. The `toString` implementations mirror the AST somewhat, and look like `(SSource (SReturn (ENumeric 1)))`.	2016-11-21 16:24:10 -05:00
Simon Willnauer	cb5c25ab4f	Add a StreamInput#readArraySize method that ensures sane array sizes (#21697 ) Today we read a vint from the stream to allocate the size of an array up-front before we start reading the values. This can be dangerous if for instance we read from a corrupted stream or if some manipulated bytes are send for instance from an attacker or a fuzzer. In most of the cases we can apply some best effort and validate the array size to be _sane_ by ensuring we can at read at least N bytes where N is the expected size of the array.	2016-11-21 21:39:21 +01:00
Jason Tedor	655c4fe172	Wrap GroovyBugErrors in ScriptExceptions When Groovy detects a bug in its runtime because an internal assertion was violated, it throws an GroovyBugError. This descends from AssertionError and if it goes uncaught will land in the uncaught exception handler and will not deliver any useful information to the user. This commit wraps GroovyBugErrors in ScriptExceptions so that useful information is returned to the user.	2016-11-19 07:11:13 -05:00
Nik Everett	ae468441dc	Implement the ?: operator in painless (#21506 ) Implements a null coalescing operator in painless that looks like `?:`. This form was chosen to emulate Groovy's `?:` operator. It is different in that it only coalesces null values, instead of Groovy's `?:` operator which coalesces all falsy values. I believe that makes it the same as Kotlin's `?:` operator. In other languages this operator looks like `??` (C#) and `COALESCE` (SQL) and `:-` (bash). This operator is lazy, meaning the right hand side is only evaluated at all if the left hand side is null.	2016-11-18 13:54:26 -05:00
Jack Conradson	ced433e9a8	Fix reserved variable availability in lambdas in Painless	2016-11-17 13:39:08 -08:00
Jason Tedor	b08a2e1f31	Expose executor service interface from thread pool This commit exposes the executor service interface from thread pool. This will enable some high-level concurrency primitives that will make some code cleaner and simpler. Relates #21608	2016-11-17 09:18:49 -05:00
Simon Willnauer	de04aad994	Remove `modules/transport_netty_3` in favor of `netty_4` (#21590 ) We kept `netty_3` as a fallback in the 5.x series but now that master is 6.0 we don't need this or in other words all issues coming up with netty 4 will be blockers for 6.0.	2016-11-17 12:44:42 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Tal Levy	6796464f16	add `ignore_missing` option to SplitProcessor (#20982 ) Closes #20840.	2016-11-16 15:46:09 +02:00
Tal Levy	04b712bdc5	fix trace_match behavior for when there is only one grok pattern (#21413 ) There is an issue in the Grok Processor, where trace_match: true does not inject the _ingest._grok_match_index into the ingest-document when there is just one pattern provided. This is due to an optimization in the regex construction. This commit adds a check for when this is the case, and injects a static index value of "0", since there is only one pattern matched (at the first index into the patterns). To make this clearer, more documentation was added to the grok-processor docs. Fixes #21371.	2016-11-16 15:41:54 +02:00
Boaz Leskes	2c0338fa87	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 17:09:08 +00:00
Adrien Grand	df4482fdc8	Do not cache the QueryShardContext in PercolatorFieldMapper: it is cheap to create.	2016-11-15 15:45:18 +01:00
Adrien Grand	54809065a6	Make PercolatorFieldMapper get a QueryShardContext lazily.	2016-11-15 12:02:40 +01:00
Boaz Leskes	c9f49039d3	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 10:14:47 +00:00
Ryan Ernst	d14c470b89	Remove generics from ActionRequest closes #21368	2016-11-14 15:32:01 -08:00
Adrien Grand	1fd5c47e7f	Upgrade to lucene-6.3.0. (#21464 )	2016-11-14 09:36:45 +01:00
Jason Tedor	c7a1b3eb50	Merge branch 'master' into feature/seq_no * master: Hack around cluster service and logging race Do not prematurely shutdown Log4j Support decimal constants with trailing [dD] in painless (#21412) In painless suggest a long constant if int won't do (#21415) Account for different paths for sysctl utilities [TEST] testRebalancePossible() may not have an assigned node id Tests: Disable merge in SearchCancellationTests Tests: clean search scroll at the end of SearchCancellationIT	2016-11-13 20:01:44 -05:00
Nik Everett	2a328034ef	Support decimal constants with trailing [dD] in painless (#21412 ) This adds support to painless for decimal constants with trailing `d` or `D` to make it compatible with Java. It already supported integer constants with a trailing `d` or `D` but this adds tests for it. Closes #21116	2016-11-12 11:08:39 -05:00
Nik Everett	a26b5a113c	In painless suggest a long constant if int won't do (#21415 ) In painless we prefer explicit types over implicit ones whereas groovy is the other way around. Take this groovy code: ``` > 86400000.class java.lang.Integer > 864000000000.class java.lang.Long ``` Painless accepts `86400000` just fine because that is a valid `int` in the jvm. It rejects `864000000000` as an invlid `int` constant because, in painless as in java, `long` constants always end in `L` or `l`. To ease the transition from groovy to painless, this changes the compilation error returned from these invalid constants from: ``` Invalid int constant [864000000000]. ``` to ``` Invalid int constant [864000000000]. If you want a long constant then change it to [864000000000L]. ``` Inspired by #21313	2016-11-12 11:08:18 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Jack Conradson	aeb97ff412	Clean up of Script. Closes #21321	2016-11-10 09:59:13 -08:00
Nik Everett	4db21db0aa	Wait for all reindex subtasks before rethrottling In the test for reindex and friend's rethrottling feature we were waiting only for a single reindex sub task to start before rethrottling. This mostly worked because starting tasks is fast. But it didn't *always work and CI found that for us. This fixes the test to wait for all subtasks to start before rethrottling. I reproduced this locally semi-consistently with some fairly creative `Thread.sleep` calls and this test fix fixes the issue even with the sleeps so I'm fairly sure this will work consistently. Closes #21446	2016-11-10 10:49:25 -05:00
Luca Cavanna	bd23921a3a	Fix InternalSearchHit#hasSource to return the proper boolean value (#21441 ) The method used to be called `isSourceEmpty`, and was renamed to `hasSource`, but the return value never changed. Updated tests and users accordingly. Closes #21419	2016-11-10 13:13:38 +01:00
Nik Everett	b0f5ea3f59	Skip reindex rethrottle tests with workers They are flakey and spuriously fail the build. I'll hunt down the cause soon and reenabled but for now they should stop. Relates #21446	2016-11-09 17:50:09 -05:00
Nik Everett	d03b8e4abb	Implement reading from null safe dereferences Null safe dereferences make handling null or missing values shorter. Compare without: ``` if (ctx._source.missing != null && ctx._source.missing.foo != null) { ctx._source.foo_length = ctx.source.missing.foo.length() } ``` To with: ``` Integer length = ctx._source.missing?.foo?.length(); if (length != null) { ctx._source.foo_length = length } ``` Combining this with the as of yet unimplemented elvis operator allows for very concise defaults for nulls: ``` ctx._source.foo_length = ctx._source.missing?.foo?.length() ?: 0; ``` Since you have to start somewhere, we started with null safe dereferenes. Anyway, this is a feature borrowed from groovy. Groovy allows writing to null values like: ``` def v = null v?.field = 'cat' ``` And the writes are simply ignored. Painless doesn't support this at this point because it'd be complex to implement and maybe not all that useful. There is no runtime cost for this feature if it is not used. When it is used we implement it fairly efficiently, adding a jump rather than a temporary variable. This should also work fairly well with doc values.	2016-11-09 07:20:11 -05:00
Nik Everett	a3bd6d1ad9	Switch reindex with slices error to IAE If you try to reindex with multiple slices against a node that doesn't support it we throw an `IllegalArgumentException` so `assertVersionSerializable` is ok with it and so if this happens in REST it comes back as a 400 error.	2016-11-08 11:42:07 -05:00
Luca Cavanna	293a3cab01	Rest client: don't reuse that same HttpAsyncResponseConsumer across multiple retries (#21378 ) * Rest client: don't reuse that same HttpAsyncResponseConsumer across multiple retries Turns out that AbstractAsyncResponseConsumer from apache async http client is stateful and cannot be reused across multiple requests. The failover mechanism was mistakenly reusing that same instance, which can be provided by users, across retries in case nodes are down or return 5xx errors. The downside is that we have to change the signature of two public methods, as HttpAsyncResponseConsumer cannot be provided directly anymore, rather its factory needs to be provided which is going to be used to create one instance of the consumer per request attempt. Up until now we tested our RestClient against multiple nodes only in a mock environment, where we don't really send http requests. In that scenario we can verify that retries etc. work properly but the interaction with the http client library in a real scenario is different and can catch other problems. With this commit we also add an integration test that sends requests to multiple hosts, and some of them may also get stopped meanwhile. The specific test for pathPrefix was also removed as pathPrefix is now randomly applied by default, hence implicitly tested. Moved also a small test method that checked the validity of the path argument to the unit test RestClientSingleHostTests. Also increase default buffer limit to 100MB and make it required in default consumer The default buffer limit used to be 10MB but that proved not to be high enough for scroll requests (see reindex from remote). With this commit we increase the limit to 100MB and make it a bit more visibile in the consumer factory.	2016-11-08 16:42:42 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproduceability in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Jason Tedor	23a271f092	Address race condition in HTTP pipeline tests This commit adapts a previous fix to the HTTP pipeline tests for Netty 4 to Netty 3. Relates #19845	2016-11-07 13:20:22 -05:00
Nik Everett	a13a050271	Add automatic parallelization support to reindex and friends (#20767 ) Adds support for `?slices=N` to reindex which automatically parallelizes the process using parallel scrolls on `_uid`. Performance testing sees a 3x performance improvement for simple docs on decent hardware, maybe 30% performance improvement for more complex docs. Still compelling, especially because clusters should be able to get closer to the 3x than the 30% number. Closes #20624	2016-11-04 20:59:15 -04:00
Adrien Grand	2a70f6e7b1	Upgrade to lucene-6.3.0-snapshot-a66a445. (#21309 ) This addresses a bug that was introduced with https://issues.apache.org/jira/browse/LUCENE-7501.	2016-11-04 10:34:04 +01:00
Nik Everett	24d5f31a54	Make painless's assertion about out of bound less brittle Instead of asserting that the message is shaped a certain way we cause the exception and catch it and assert that the messages are the same. This is the way to go because the exception message from the jvm is both local and jvm dependent. This is the CI failure that found this: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+java9-periodic/515/consoleFull	2016-11-02 12:38:51 -04:00
Christoph Büscher	b3370de715	Tests: Add warning header checks to QueryBuilder tests and QueryParseContextTests This adds checks for expected warning headers to the query builder test infrastructure. Tests that are adding deprecation warnings to the response headers need to check those, otherwise the abstract base class for the test class will complain at teardown.	2016-11-02 15:45:33 +01:00
Adrien Grand	aa6cd93e0f	Require arguments for QueryShardContext creation. (#21196 ) The `IndexService#newQueryShardContext()` method creates a QueryShardContext on shard `0`, with a `null` reader and that uses `System.currentTimeMillis()` to resolve `now`. This may hide bugs, since the shard id is sometimes used for query parsing (it is used to salt random score generation in `function_score`), passing a `null` reader disables query rewriting and for some use-cases, it is simply not ok to rely on the current timestamp (eg. percolation). So this pull request removes this method and instead requires that all call sites provide these parameters explicitly.	2016-11-02 09:48:49 +01:00
Nik Everett	a612e5988e	Bump reindex-from-remote's buffer to 200mb It was 10mb and that was causing trouble when folks reindex-from-remoted with large documents. We also improve the error reporting so it tells folks to use a smaller batch size if they hit a buffer size exception. Finally, adds some docs to reindex-from-remote mentioning the buffer and giving an example of lowering the size. Closes #21185	2016-11-01 13:19:28 -04:00
Jason Tedor	38663351dc	Fix logger names for Netty Previously Elasticsearch would only use the package name for logging levels, truncating the package prefix and the class name. This meant that logger names for Netty were just prefixed by netty3 and netty. We changed this for Elasticsearch so that it's the fully-qualified class name now, but never corrected this for Netty. This commit fixes the logger names for the Netty modules so that their levels are controlled by the fully-qualified class name. Relates #21223	2016-10-31 17:23:21 -04:00
Jack Conradson	185dff7346	Cleanup ScriptType (#21179 ) Refactored ScriptType to clean up some of the variable and method names. Added more documentation. Deprecated the 'in' ParseField in favor of 'stored' to match the indexed scripts being replaced by stored scripts.	2016-10-31 13:48:51 -07:00
Nik Everett	1bbd3c5400	Fix painless's out of bounds assertions in java 9 Java 9's exception message when lists have an out of bounds index is much better than java 8 but the painless code asserted on the java 8 message. Now it'll accept either. I'm tempted to weaken the assertion but I like asserting that the message is readable.	2016-10-29 22:21:57 -04:00
Nik Everett	3a7a218e8f	Support negative array ofsets in painless Adds support for indexing into lists and arrays with negative indexes meaning "counting from the back". So for if `x = ["cat", "dog", "chicken"]` then `x[-1] == "chicken"`. This adds an extra branch to every array and list access but some performance testing makes it look like the branch predictor successfully predicts the branch every time so there isn't a in execution time for this feature when the index is positive. When the index is negative performance testing showed the runtime is the same as writing `x[x.length - 1]`, again, presumably thanks to the branch predictor. Those performance metrics were calculated for lists and arrays but `def`s get roughly the same treatment though instead of inlining the test they need to make a invoke dynamic so we don't screw up maps. Closes #20870	2016-10-29 16:12:40 -04:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Jack Conradson	512a77a633	Refactor ScriptType to be a top-level class.	2016-10-26 10:21:22 -07:00
Jason Tedor	9c3e4d6e22	Add correct Content-Length on HEAD requests This commit fixes responses to HEAD requests so that the value of the Content-Length is correct per the HTTP spec. Namely, the value of this header should be equal to the Content-Length if the request were not a HEAD request. This commit also fixes a memory leak on HEAD requests to the main action that arose from the bytes on a builder not being released due to them being dropped on the floor to ensure that the response to the main action did not have a body. Relates #21123	2016-10-25 23:08:19 -04:00
Nik Everett	18393a06f3	Fix reindex-from-remote for parent/child from <2.0 Versions before 2.0 needed to be told to return interesting fields like `_parent`, `_routing`, `_ttl`, and `_timestamp`. And they come back inside a `fields` block which we need to parse. Closes #21044	2016-10-21 13:14:33 -04:00
Jason Tedor	f51bf8ee47	Upgrade to Netty 4.1.6 This commit upgrades the transport-netty4 module dependency from Netty version 4.1.5 to version 4.1.6. This is a bug fix release of Netty. Relates #21051	2016-10-20 20:13:29 -04:00
Jack Conradson	ceaae47d38	Remove more equivalents of the now method from the Painless whitelist.	2016-10-20 10:35:26 -07:00
Nik Everett	b5da42905f	Remove publishAddress from reindex whitelist Removes the `publishAddress` parameter from the reindex-from-remote whitelist checking because it isn't in use after #21004.	2016-10-20 12:51:10 -04:00
Fanfan	043a45746c	some misspelled words in code (#21012 ) as the title mentioned, misspelling as follows, "construct" to "constrcut", "cumulation" to "cumalation", "initialize" to "intialize".	2016-10-19 11:42:38 -04:00

... 3 4 5 6 7 ...

4121 Commits