OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	204f4022c2	percolator: Do not take duplicate query extractions into account for minimum_should_match attribute If a percolator query contains duplicate query clauses somewhere in the query tree then when these clauses are extracted then they should not affect the msm. This can lead a percolator query that should be a valid match not become a candidate match, because at query time, the msm that is being used by the CoveringQuery would never match with the msm used at index time. Closes #28315	2018-01-30 07:25:33 +01:00
Jack Conradson	1d01bcf421	Remove Painless Type in favor of Java Class in FunctionRef. (#28429 )	2018-01-29 16:43:36 -08:00
Jack Conradson	f13da9f534	Remove Painless Type from e-nodes in favor of Java Class (#28364 )	2018-01-29 12:44:50 -08:00
Simon Willnauer	7957c9751c	[TEST] fix test to use a dedicated index to ensure lucene docIDs are guaranteed	2018-01-26 14:52:39 +01:00
Jay Modi	e59f14d139	Update Netty to 4.1.16.Final (#28345 ) This commit updates netty to 4.1.16.Final. This is the latest version that we can have work without extra permissions. This updated version of netty fixes issues seen with Java 9 and some data not being sent, which results in timeouts.	2018-01-25 12:48:43 -07:00
Jack Conradson	a57a0ae78b	Remove Painless Type from MethodWriter in favor of Java Class. (#28346 )	2018-01-24 11:02:46 -08:00
Nik Everett	2eede9b876	Reindex: Shore up rethrottle test The rethrottle test fails from time to time because one of the child task that want to be rethrottled hasn't properly started yet. We retry in this case but it looks like the retry either isn't long enough or something else strange is happening. This change adds yet more logging so future failure of this kind will be easier to track down and it adds an extra wait condition: this waits for all child tasks to be running or completed before rethrottling. This might avoid the failure because once a child task is properly started it should be quite ok to rethrottle. Relates to #26192	2018-01-24 11:03:30 -05:00
Jack Conradson	b945006938	Completely remove Painless Type from AnalyzerCaster in favor of Java Class. (#28329 ) Second part in a series of PR's to remove Painless Type in favor of Java Class. This completely removes the Painless Type dependency from AnalyzerCaster. Both casting and promotion are now based on Java Class exclusively. This also allows AnalyzerCaster to be decoupled from Definition and make cast checks be static calls again.	2018-01-23 12:38:31 -08:00
Nik Everett	eded5bc4f3	Reindex: Wait for deletion in test The test failure tracked by #28053 occurs because we fail to get the failure response from the reindex on the first try and on our second try the delete index API call that was supposed to trigger the failure actually deletes the index during document creation. This causes the test to fail catastrophically. This PR attempts to wait for the failure to finish before the test moves on to the second attempt. The failure doesn't reproduce locally for me so I can't be sure that this helps at all with the failure, but it certainly feels like it should help some. Here is hoping this prevents similar failures in the future.	2018-01-23 13:35:23 -05:00
Nik Everett	049f29710e	Reindex: log more on rare test failure The test failure tracked by #26758 occurs when we cancel a running reindex request that has been sliced into many children. The main reindex response looks canceled but none of the children look canceled. This is super strange because for the main request to look canceled for any length of time one of the children has to be canceled. This change adds additional logging to the test so we have more to go on to debug this the next time it fails.	2018-01-23 12:21:28 -05:00
Simon Willnauer	4d3f7a7695	Ensure we protect Collections obtained from scripts from self-referencing (#28335 ) Self referencing maps can cause SOE if they are iterated ie. in their toString methods. This chance adds some protected to the usage of those collections.	2018-01-23 16:57:26 +01:00
Jack Conradson	ef5c041819	Painless: Replace Painless Type with Java Class during Casts (#27847 ) This is the first step in a series to replace Painless Type with Java Class for any casting done during compilation. There should be no behavioural change.	2018-01-22 13:01:13 -08:00
Christoph Büscher	a6bfe67f8b	[Test] Lower bwc version for rank-eval rest tests The API was backported to 6.2 so the version we test against on master can be lowered to that.	2018-01-22 13:33:42 +01:00
Adrien Grand	700d9ecc95	Remove the `update_all_types` option. (#28288 ) This option is not useful in 7.x since no indices may have more than one type anymore.	2018-01-22 12:03:07 +01:00
Ryan Ernst	ba9c9e08e7	Painless: Add spi jar that will be published for extending whitelists (#28302 ) In order to build a plugin that extends the painless whitelist, the spi classes must be available to the plugin at compile time. This commit moves the spi classes into a separate jar which will be published. Any plugin authors whiching to extend painless through spi would then add a compileOnly dependency on this jar.	2018-01-18 19:16:26 -08:00
Ryan Ernst	19a2b01e43	Build: Omit dependency licenses check for elasticsearch deps (#28304 ) Sometimes modules/plugins depend on locally built elasticsearch jars. This means not only that the jar is constantly changing (so no need for a sha check), but also that the license falls under the Elasticsearch license, and there is no need to keep another copy. This commit updates the dependencies checked by dependencyLicenses to exclude those that are built by elasticsearch.	2018-01-18 14:15:44 -08:00
Christoph Büscher	77dcaab34f	Simplify RankEvalResponse output (#28266 ) Currenty the rest response of the ranking evaluation API wraps all inside an enclosing `rank_eval` object. This is redundant since it is clear from the API call and it doesn't provide any other useful information. This change removes this.	2018-01-18 09:32:27 +01:00
Ryan Ernst	18463e7e9f	Painless: Add whitelist extensions (#28161 ) This commit adds a PainlessExtension which may be plugged in via SPI to add additional classes, methods and members to the painless whitelist on a per context basis. An example plugin adding and using a whitelist is also added.	2018-01-15 11:28:31 -08:00
Tim Brooks	ee7eac8dc1	`MockTcpTransport` to connect asynchronously (#28203 ) The method `initiateChannel` on `TcpTransport` is explicit in that channels can be connect asynchronously. All production implementations do connect asynchronously. Only the blocking `MockTcpTransport` connects in a synchronous manner. This avoids testing some of the blocking code in `TcpTransport` that waits on connections to complete. Additionally, it requires a more extensive method signature than required for other transports. This commit modifies the `MockTcpTransport` to make these connections asynchronously on a different thread. Additionally, it simplifies that `initiateChannel` method signature.	2018-01-15 10:20:30 -07:00
Tim Brooks	3895add2ca	Introduce elasticsearch-core jar (#28191 ) This is related to #27933. It introduces a jar named elasticsearch-core in the lib directory. This commit moves the JarHell class from server to elasticsearch-core. Additionally, PathUtils and some of Loggers are moved as JarHell depends on them.	2018-01-15 09:59:01 -07:00
Jim Ferenczi	be012b1326	upgrade to lucene 7.2.1 (#28218 )	2018-01-15 16:47:46 +01:00
Igor Motov	c75ac319a6	Add ability to associate an ID with tasks (#27764 ) Adds support for capturing the X-Opaque-Id header from a REST request and storing it's value in the tasks that this request started. It works for all user-initiated tasks (not only search). Closes #23250 Usage: ``` $ curl -H "X-Opaque-Id: imotov" -H "foo:bar" "localhost:9200/_tasks?pretty&group_by=parents" { "tasks" : { "7qrTVbiDQKiZfubUP7DPkg:6998" : { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6998, "type" : "transport", "action" : "cluster:monitor/tasks/lists", "start_time_in_millis" : 1513029940042, "running_time_in_nanos" : 266794, "cancellable" : false, "headers" : { "X-Opaque-Id" : "imotov" }, "children" : [ { "node" : "V-PuCjPhRp2ryuEsNw6V1g", "id" : 6088, "type" : "netty", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 67785, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } }, { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6999, "type" : "direct", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 98754, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } } ] } } } ```	2018-01-12 15:34:17 -05:00
Yu	228f7ffcdf	Add scroll parameter to _reindex API (#28041 ) Be able to change scroll timeout in _reindex API (by default: 5m)	2018-01-11 14:40:04 +01:00
Martijn van Groningen	73f6857dff	test: ensure we endup with a single segment Closes #28127	2018-01-10 15:14:26 +01:00
Jack Conradson	5d795afddb	Painless: Add public member read/write access test. (#28156 )	2018-01-09 15:06:51 -08:00
Jack Conradson	1d1dcd4ae7	Painless: Add a simple cache for whitelist methods and fields. (#28142 ) With support for multiple contexts we are adding some caching to the whitelist to keep the memory footprint for definitions from exploding.	2018-01-08 17:54:45 -08:00
Jack Conradson	b5377d294f	Painless: Modify Loader to Load Classes Directly from Definition (#28088 )	2018-01-05 13:06:36 -08:00
Tim Brooks	38701fb6ee	Create nio-transport plugin for NioTransport (#27949 ) This is related to #27260. This commit moves the NioTransport from :test:framework to a new nio-transport plugin. Additionally, supporting tcp decoding classes are moved to this plugin. Generic byte reading and writing contexts are moved to the nio library. Additionally, this commit adds a basic MockNioTransport to :test:framework that is a TcpTransport implementation for testing that is driven by nio.	2018-01-05 09:41:29 -07:00
Ryan Ernst	d36ec18029	Plugins: Add plugin extension capabilities (#27881 ) This commit adds the infrastructure to plugin building and loading to allow one plugin to extend another. That is, one plugin may extend another by the "parent" plugin allowing itself to be extended through java SPI. When all plugins extending a plugin are finished loading, the "parent" plugin has a callback (through the ExtensiblePlugin interface) allowing it to reload SPI. This commit also adds an example plugin which uses as-yet implemented extensibility (adding to the painless whitelist).	2018-01-03 11:12:43 -08:00
Christoph Büscher	29b07bb6c4	[Test] Fix scores for dcg in RankEvalRequestIT and RankEvalYamlIT Allow small deviations when asserting ranking scores, otherwise some tests break on floating point calculation differences e.g. when running on ARM.	2018-01-03 17:24:10 +01:00
Sian Lerk Lau	5e3ba8a88d	Enable convert processor to support Long and Double. (#27957 ) Closes #23085	2018-01-03 11:27:55 +01:00
Christoph Büscher	8925dabcb8	[Test] Fix allowed delta for calculated scores in DiscountedCumulativeGainTests	2018-01-02 16:46:31 +01:00
Tanguy Leroux	d2939a9daa	[Test] Mute DiscountedCumulativeGainTests on ARM These tests fail on ARM architectures. This is tracked in https://github.com/elastic/elasticsearch/issues/28048	2018-01-02 16:16:43 +01:00
Adrien Grand	7d88851766	Upgrade beats templates that we use for bwc testing. (#27929 ) These templates were generated with 5.0. We need those generated with 6.0 since we do not guarantee compatibility with previous versions of the template anyway. I removed the winlogbeat template which is a bit harder to generate as it requires Windows, since we do not aim to be exhaustive.	2017-12-21 08:50:14 +01:00
Andy Bristol	863432668b	[TEST] logging for update by query test #27820	2017-12-20 18:27:16 -05:00
Sian Lerk Lau	47eefbe889	Enable grok processor to support long, double and boolean (#27896 )	2017-12-20 11:19:49 -08:00
Adrien Grand	77711508b0	Upgrade to Lucene 7.2.0. (#27910 )	2017-12-20 14:17:40 +01:00
Alan Woodward	af3f63616b	Allow TrimFilter to be used in custom normalizers (#27758 ) AnalysisFactoryTestCase checks that the ES custom token filter multi-term awareness matches the underlying lucene factory. For the trim filter this won't be the case until LUCENE-8093 is released in 7.3, so we add a temporary exclusion Closes #27310	2017-12-18 14:27:03 +00:00
Jason Tedor	75c0cd0672	Move range field mapper back to core This commit moves the range field mapper back to core so that we can remove the compile-time dependency of percolator on mapper-extras which compilcates dependency management for the percolator client JAR, and modules should not be intertwined like this anyway. Relates #27854	2017-12-17 14:27:10 -05:00
Martijn van Groningen	e9160fc014	percolator: also extract match_all queries I've seen several cases where match_all queries were being used inside percolator queries, because these queries were created generated by other systems. Extracting these queries will allow the percolator at query time in a filter context to skip over these queries without parsing or validating that these queries actually match with the document being percolated.	2017-12-15 08:50:29 +01:00
Christoph Büscher	c541a0c60e	Add skip versions for rank_eval yaml tests	2017-12-14 22:18:37 +01:00
Jack Conradson	1de927c80d	Painless: Clean Up Painless Cast Object (#27794 ) Added static methods to make creating Painless casts obvious as to what is being boxed/unboxed.	2017-12-14 09:08:10 -08:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
Christoph Büscher	bb14b8f7c5	Merge branch 'rankeval' This commit adds a new module that provides an endpoint that can be used to evaluate search ranking results. Closes #19195	2017-12-14 16:45:03 +01:00
Christoph Büscher	33bcfddb54	Use SPI to provide named XContent parsers for ranking evaluation	2017-12-12 18:39:01 +01:00
Alan Woodward	77617c8e62	[TEST] Add test for _range fields in query_string queries (#27756 ) [TEST] Add test for _range fields in query_string queries Closes #26555	2017-12-12 13:33:37 +00:00
Jack Conradson	8188d9f7e5	Painless: Only allow Painless type names to be the same as the equivalent Java class. (#27264 ) Also adds a parameter called only_fqn to the whitelist to enforce that a painless type must be specified as the fully-qualifed java class name.	2017-12-11 16:37:35 -08:00
Christoph Büscher	97b25f3b0c	Merge branch 'master' into rankeval	2017-12-11 15:19:16 +01:00
Tim Brooks	d1acb7697b	Remove internal channel tracking in transports (#27711 ) This commit attempts to continue unifying the logic between different transport implementations. As transports call a `TcpTransport` callback when a new channel is accepted, there is no need to internally track channels accepted. Instead there is a set of accepted channels in `TcpTransport`. This set is used for metrics and shutting down channels.	2017-12-08 16:56:53 -07:00
Tim Brooks	d82c40d35c	Implement byte array reusage in `NioTransport` (#27696 ) This is related to #27563. This commit modifies the InboundChannelBuffer to support releasable byte pages. These byte pages are provided by the PageCacheRecycler. The PageCacheRecycler must be passed to the Transport with this change.	2017-12-08 10:39:30 -07:00
Tim Brooks	ad8a571677	Add read timeouts to http module (#27713 ) We currently do not have any server-side read timeouts implemented in elasticsearch. This commit adds a read timeout setting that defaults to 30 seconds. If after 30 seconds a read has not occurred, the channel will be closed. A timeout of value of 0 will disable the timeout.	2017-12-08 09:32:09 -07:00
Christoph Büscher	b83e14858a	Correcting some minor typos in comments	2017-12-07 16:39:23 +01:00
Christoph Büscher	52cb6c8ef2	Merge branch 'master' into rankeval	2017-12-07 14:22:46 +01:00
Robin Neatherway	057efea893	Correct two equality checks on incomparable types (#27688 )	2017-12-07 14:18:11 +01:00
Martijn van Groningen	4d78e1a9ad	Added msearch api to high level client	2017-12-05 10:17:47 +01:00
Christoph Büscher	bbec33d35c	Merge branch 'master' into rankeval	2017-12-04 12:57:19 +01:00
Christoph Büscher	72d0de4197	Add search window parameter k to MRR and DCG metric (#27595 )	2017-12-04 10:54:03 +01:00
Christoph Büscher	c4fe7d3f72	[Docs] add deprecation warning for `delimited_payload_filter` renaming	2017-12-04 10:22:05 +01:00
Adrien Grand	6323bb0d97	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27619 ) This new snapshot mostly brings a change to TopFieldCollector which can now early terminate collection when trackTotalHits is `false`. As a follow-up, we should replace our usage of `EarlyTerminatingSortingCollector` with this new option.	2017-12-04 09:40:08 +01:00
Christoph Büscher	35688f6441	Merge branch 'master' into rankeval	2017-11-29 15:24:06 +01:00
Christoph Büscher	7bfb273763	Add k parameter to PrecisionAtK metric (#27569 )	2017-11-29 15:19:16 +01:00
Jack Conradson	2d927fabab	Painless: Fix errors allowing void to be assigned to def. (#27460 )	2017-11-28 13:44:52 -08:00
Jack Conradson	9e42b77f7e	Painless: Fix variable scoping issue in lambdas not including captured variables. (#27571 )	2017-11-28 13:30:13 -08:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
Martijn van Groningen	cb1204774b	Include the _index, _type and _id to nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know to what documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
Christoph Büscher	1352b7c6ea	Use msearch instead of single search (#27520 ) Change TransportRankEvalAction to use one MultiSearchRequest instead of issuing several parallel search requests to simplify the transport action.	2017-11-27 10:15:59 +01:00
Martijn van Groningen	4ab638b71d	percolator: Avoid TooManyClauses exception if number of terms / ranges is exactly equal to 1024 The logic whether to use CoveringQuery was in two places which is why this bug snug in.	2017-11-27 08:55:11 +01:00
Christoph Büscher	5661b1c3df	Merge branch 'master' into rankeval	2017-11-24 16:25:05 +01:00
kel	4885acb048	Replace `delimited_payload_filter` by `delimited_payload` (#26625 ) The `delimited_payload_filter` is renamed to `delimited_payload`, the old name is deprecated and should be replaced by `delimited_payload`. Closes #21978	2017-11-24 13:03:19 +01:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Simon Willnauer	5a0b6d1977	Use the primary_term field to identify parent documents (#27469 ) This change stops indexing the `_primary_term` field for nested documents to allow fast retrieval of parent documents. Today we create a docvalues field for children to ensure we have a dense datastructure on disk. Yet, since we only use the primary term to tie-break on when we see the same seqID on indexing having a dense datastructure is less important. We can use this now to improve the nested docs performance and it's memory footprint. Relates to #24362	2017-11-21 15:14:03 +01:00
Christoph Büscher	d979ccace9	Merge branch 'master' into rankeval	2017-11-21 14:11:02 +01:00
Christoph Büscher	94a0631a3e	[Tests] Add testToXContent() RankEvalResponseTests	2017-11-21 14:09:50 +01:00
Christoph Büscher	35fabdaf8a	Parse EvluationMetrics as named Objects	2017-11-21 14:09:38 +01:00
Christoph Büscher	fdb24cd3e4	Fixing occasional test failure in RankEvalSpecTests	2017-11-21 14:09:13 +01:00
Christoph Büscher	3348d2317f	Reworking javadocs, minor changes in some implementation classes	2017-11-21 14:09:04 +01:00
Christoph Büscher	e278c1d17d	Improving and cleaning up tests Removing the unnecessary RankEvalTestHelper, making use of the common test infra in ESTestCase, also hardening a few of the classes by making more fields final.	2017-11-21 14:08:53 +01:00
Christoph Büscher	5c65a59369	Extending rank_eval asciidocs	2017-11-21 14:08:42 +01:00
Christoph Büscher	d9e67a2c95	Extending `_rank_eval` documentation	2017-11-21 14:08:28 +01:00
Christoph Büscher	0a6c6ac360	Remove usage of types in rank_eval endpoint	2017-11-21 14:07:41 +01:00
Jim Ferenczi	6319424e4a	Move composite aggregation to core (#27474 ) This change removes the module named aggs-composite and adds the `composite` aggs as a core aggregation. This allows other plugins to use this new aggregation and simplifies the integration in the HL rest client.	2017-11-21 13:31:01 +01:00
Luca Cavanna	29450de7b5	Cross Cluster Search: make remote clusters optional (#27182 ) Today Cross Cluster Search requires at least one node in each remote cluster to be up once the cross cluster search is run. Otherwise the whole search request fails despite some of the data (either local and/or remote) is available. This happens when performing the _search/shards calls to find out which remote shards the query has to be executed on. This scenario is different from shard failures that may happen later on when the query is actually executed, in case e.g. remote shards are missing, which is not going to fail the whole request but rather yield partial results, and the _shards section in the response will indicate that. This commit introduces a boolean setting per cluster called search.remote.$cluster_alias.skip_if_disconnected, set to false by default, which allows to skip certain clusters if they are down when trying to reach them through a cross cluster search requests. By default all clusters are mandatory. Scroll requests support such setting too when they are first initiated (first search request with scroll parameter), but subsequent scroll rounds (_search/scroll endpoint) will fail if some of the remote clusters went down meanwhile. The search API response contains now a new _clusters section, similar to the _shards section, that gets returned whenever one or more clusters were disconnected and got skipped: "_clusters" : { "total" : 3, "successful" : 2, "skipped" : 1 } Such section won't be part of the response if no clusters have been skipped. The per cluster skip_unavailable setting value has also been added to the output of the remote/info API.	2017-11-21 11:41:47 +01:00
Tim Brooks	4e04f95ab4	Fix issue where pages aren't released (#27459 ) This is related to #27422. Right now when we send a write to the netty transport, we attach a listener to the future. When you submit a write on the netty event loop and the event loop is shutdown, the onFailure method is called. Unfortunately, netty then tries to notify the listener which cannot be done without dispatching to the event loop. In this case, the dispatch fails and netty logs and error and does not tell us. This commit checks that netty is still not shutdown after sending a message. If netty is shutdown, we complete the listener.	2017-11-20 14:53:08 -07:00
Tim Brooks	0a8f48d592	Transition transport apis to use void listeners (#27440 ) Currently we use ActionListener<TcpChannel> for connect, close, and send message listeners in TcpTransport. However, all of the listeners have to capture a reference to a channel in the case of the exception api being called. This commit changes these listeners to be type <Void> as passing the channel to onResponse is not necessary. Additionally, this change makes it easier to integrate with low level transports (which use different implementations of TcpChannel).	2017-11-20 10:47:47 -07:00
Tim Brooks	80ef9bbdb1	Remove parameterization from TcpTransport (#27407 ) This commit is a follow up to the work completed in #27132. Essentially it transitions two more methods (sendMessage and getLocalAddress) from Transport to TcpChannel. With this change, there is no longer a need for TcpTransport to be aware of the specific type of channel a transport returns. So that class is no longer parameterized by channel type.	2017-11-16 11:19:36 -07:00
Jim Ferenczi	623367d793	Add composite aggregator (#26800 ) * This change adds a module called `aggs-composite` that defines a new aggregation named `composite`. The `composite` aggregation is a multi-buckets aggregation that creates composite buckets made of multiple sources. The sources for each bucket can be defined as: * A `terms` source, values are extracted from a field or a script. * A `date_histogram` source, values are extracted from a date field and rounded to the provided interval. This aggregation can be used to retrieve all buckets of a deeply nested aggregation by flattening the nested aggregation in composite buckets. A composite buckets is composed of one value per source and is built for each document as the combinations of values in the provided sources. For instance the following aggregation: ```` "test_agg": { "terms": { "field": "field1" }, "aggs": { "nested_test_agg": "terms": { "field": "field2" } } } ```` ... which retrieves the top N terms for `field1` and for each top term in `field1` the top N terms for `field2`, can be replaced by a `composite` aggregation in order to retrieve all the combinations of `field1`, `field2` in the matching documents: ```` "composite_agg": { "composite": { "sources": [ { "field1": { "terms": { "field": "field1" } } }, { "field2": { "terms": { "field": "field2" } } }, } } ```` The response of the aggregation looks like this: ```` "aggregations": { "composite_agg": { "buckets": [ { "key": { "field1": "alabama", "field2": "almanach" }, "doc_count": 100 }, { "key": { "field1": "alabama", "field2": "calendar" }, "doc_count": 1 }, { "key": { "field1": "arizona", "field2": "calendar" }, "doc_count": 1 } ] } } ```` By default this aggregation returns 10 buckets sorted in ascending order of the composite key. Pagination can be achieved by providing `after` values, the values of the composite key to aggregate after. For instance the following aggregation will aggregate all composite keys that sorts after `arizona, calendar`: ```` "composite_agg": { "composite": { "after": {"field1": "alabama", "field2": "calendar"}, "size": 100, "sources": [ { "field1": { "terms": { "field": "field1" } } }, { "field2": { "terms": { "field": "field2" } } } } } ```` This aggregation is optimized for indices that set an index sorting that match the composite source definition. For instance the aggregation above could run faster on indices that defines an index sorting like this: ```` "settings": { "index.sort.field": ["field1", "field2"] } ```` In this case the `composite` aggregation can early terminate on each segment. This aggregation also accepts multi-valued field but disables early termination for these fields even if index sorting matches the sources definition. This is mandatory because index sorting picks only one value per document to perform the sort.	2017-11-16 15:13:36 +01:00
Tim Brooks	ca11085bb6	Add TcpChannel to unify Transport implementations (#27132 ) Right now our different transport implementations must duplicate functionality in order to stay compliant with the requirements of TcpTransport. They must all implement common logic to open channels, close channels, keep track of channels for eventual shutdown, etc. Additionally, there is a weird and complicated relationship between Transport and TransportService. We eventually want to start merging some of the functionality between these classes. This commit starts moving towards a world where TransportService retains all the application logic and channel state. Transport implementations in this world will only be tasked with returning a channel when one is requested, calling transport service when a channel is accepted from a server, and starting / stopping itself. Specifically this commit changes how channels are opened and closed. All Transport implementations now return a channel type that must comply with the new TcpChannel interface. This interface has the methods necessary for TcpTransport to completely manage the lifecycle of a channel. This includes setting the channel up, waiting for connection, adding close listeners, and eventually closing.	2017-11-15 12:38:39 -07:00
Clinton Gormley	1caa5c8e32	Rest test fixes (#27354 ) * REST: Rename ingest.processor.grok to ingest.processor_grok * REST: Rename remote.info to cluster.remote_info * REST: Fixed bad YAML comments * REST: Force dummy scripts to be strings, not numbers * REST: Fix bad YAML in search/110_field_collapsing.yml * REST: Adjust percentile tests to work with Perl number handling	2017-11-14 11:14:14 +01:00
Tal Levy	5c34533761	add json-processor support for non-map json types (#27335 ) The Json Processor originally only supported parsing field values into Maps even though the JSON spec specifies that strings, null-values, numbers, booleans, and arrays are also valid JSON types. This commit enables parsing these values now. response to #25972.	2017-11-13 10:28:19 -08:00
Martijn van Groningen	7c056f4523	reword comment	2017-11-13 08:00:34 +01:00
Ryan Ernst	8b9e23de93	Plugins: Add versionless alias to all security policy codebase properties (#26756 ) This is a followup to #26521. This commit expands the alias added for the elasticsearch client codebase to all codebases. The original full jar name property is left intact. This only adds an alias without the version, which should help ease the pain in updating any versions (ES itself or dependencies).	2017-11-10 11:00:09 -08:00
Martijn van Groningen	1bd31e9b53	percolator: fixed issue where in indices created before 6.1 if minimum should match has been specified on a disjunction, the query would be marked as verified candidate match. This is wrong as it can only marked as verified candidate match on indices created on or after 6.1, due to the use of the CoveringQuery.	2017-11-10 12:02:33 +01:00
Martijn van Groningen	b4048b4e7f	Use CoveringQuery to select percolate candidate matches and extract all clauses from a conjunction query. When clauses from a conjunction are extracted the number of clauses is also stored in an internal doc values field (minimum_should_match field). This field is used by the CoveringQuery and allows the percolator to reduce the number of false positives when selecting candidate matches and in certain cases be absolutely sure that a conjunction candidate match will match and then skip MemoryIndex validation. This can greatly improve performance. Before this change only a single clause was extracted from a conjunction query. The percolator tried to extract the clauses that was rarest in order (based on term length) to attempt less candidate queries to be selected in the first place. However this still method there is still a very high chance that candidate query matches are false positives. This change also removes the influencing query extraction added via #26081 as this is no longer needed because now all conjunction clauses are extracted. https://www.elastic.co/guide/en/elasticsearch/reference/6.x/percolator.html#_influencing_query_extraction Closes #26307	2017-11-10 07:44:42 +01:00
Tal Levy	d22fd4ea58	Introduce templating support to timezone/locale in DateProcessor (#27089 ) Sometimes systems like Beats would want to extract the date's timezone and/or locale from a value in a field of the document. This PR adds support for mustache templating to extract these values. Closes #24024.	2017-11-09 09:45:32 -08:00
Mayya Sharipova	148376c2c5	Add limits for ngram and shingle settings (#27211 ) * Add limits for ngram and shingle settings (#27211) Create index-level settings: max_ngram_diff - maximum allowed difference between max_gram and min_gram in NGramTokenFilter/NGramTokenizer. Default is 1. max_shingle_diff - maximum allowed difference between max_shingle_size and min_shingle_size in ShingleTokenFilter. Default is 3. Throw an IllegalArgumentException when trying to create NGramTokenFilter, NGramTokenizer, ShingleTokenFilter where difference between max_size and min_size exceeds the settings value. Closes #25887	2017-11-07 08:14:55 -05:00
David Roberts	749c3ec716	Remove the single argument Environment constructor (#27235 ) Only tests should use the single argument Environment constructor. To enforce this the single arg Environment constructor has been replaced with a test framework factory method. Production code (beyond initial Bootstrap) should always use the same Environment object that Node.getEnvironment() returns. This Environment is also available via dependency injection.	2017-11-04 13:25:09 +00:00
Armin Braun	3deba0ed1f	#26260 Allow ip_range to accept CIDR notation (#27192 ) * #26260 Allow ip_range to accept CIDR notation * #26260 added non-byte-alligned cidr test cases	2017-11-03 13:34:48 -06:00
Armin Braun	8f0f024507	#27189 Fixed rounding of bounds in scaled float comparison (#27207 ) * #27189 Fixed rounding of bounds in scaled float comparison * #27189 more assertions from CR	2017-11-03 13:23:07 -06:00
Armin Braun	f9e755f980	Fixed byte buffer leak in Netty4 request handler If creating the REST request throws an exception (for example, because of invalid headers), we leak the request due to failure to release the buffer (which would otherwise happen after replying on the channel). This commit addresses this leak by handling the failure case. Relates #27222	2017-11-02 20:22:19 -04:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930 ) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the name all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
Jack Conradson	abaede2373	Upgrade Painless from ANTLR 4.5.1-1 to ANTLR 4.5.3. (#27153 )	2017-10-27 11:07:49 -07:00
Christoph Büscher	b88dbe8f49	[Tests] Fix occasional test failure due to two random values being the same	2017-10-27 12:06:16 +02:00
Jack Conradson	dda5d1af29	Allow for the Painless Definition to have multiple instances (#27096 )	2017-10-26 08:33:55 -07:00
Jason Tedor	9aae2f593a	Avoid stack overflow on search phases When a search is executing locally over many shards, we can stack overflow during query phase execution. This happens due to callbacks that occur after a phase completes for a shard and we move to the same phase on another shard. If all the shards for the query are local to the local node then we will never go async and these callbacks will end up as recursive calls. With sufficiently many shards, this will end up as a stack overflow. This commit addresses this by truncating the stack by forking to another thread on the executor for the phase. Relates #27069	2017-10-25 22:05:46 -04:00
Ryan Ernst	2a8452b513	Reindex: Fix headers in reindex action (#26937 ) The headers passed to reindex were skipped except for the last one. This commit fixes the copying of the headers, as well as adds a base test case for rest client builders to access the headers within the built rest client. relates #22976	2017-10-25 16:37:01 -07:00
Tim Brooks	a7fa5d3335	Remove dangerous `ByteBufStreamInput` methods (#27076 ) This commit removes the `ByteBufStreamInput` `readBytesReference` and `readBytesRef` methods. These methods are zero-copy which means that they retain a reference to the underlying netty buffer. The problem is that our `TcpTransport` is not designed to handle zero-copy. The netty implementation sets the read index past the current message once it has been deserialized, handled, and mostly likely dispatched to another thread. This means that netty is free to release this buffer. So it is unsafe to retain a reference to it without calling `retain`. And we cannot call `retain` because we are not currently designed to handle reference counting past the transport level. This should not currently impact us as we wrap the `ByteBufStreamInput` in `NamedWriteableAwareStreamInput` in the `TcpTransport`. This stream essentially delegates to the underling stream. However, in the case of `readBytesReference` and `readBytesRef` it leaves thw implementations to the standard `StreamInput` methods. These methods call the read byte array method which delegates to `ByteBufStreamInput`. The read byte array method on `ByteBufStreamInput` copies so it is safe. The only impact of this commit should be removing methods that could be dangerous if they were eventually called due to some refactoring.	2017-10-24 08:51:14 -06:00
Martijn van Groningen	93107f8466	removed unused import	2017-10-23 10:00:54 +02:00
Martijn van Groningen	141d1b62e9	ingest: date processor should not fail if timestamp is specified as json number Closes #26967	2017-10-23 09:32:44 +02:00
Tanguy Leroux	463e7e6fa3	Revert "Upgrade to Jackson 2.9.2 (#27032 )" This reverts commit `0b9acc5ace`.	2017-10-20 08:25:41 +02:00
Tanguy Leroux	0b9acc5ace	Upgrade to Jackson 2.9.2 (#27032 ) Upgrade to Jackson 2.9.2 and also use a boolean `closed` flag to indicate that a FastStringReader instance is closed, so that length is still correctly reported after the reader is closed.	2017-10-19 15:15:02 +02:00
Simon Willnauer	8dda827ff4	Don't refresh on `_flush` `_force_merge` and `_upgrade` (#27000 ) Today all these API calls have a sideeffect of making documents visible to search requests. While this is sometimes desired it's an unnecessary sideeffect and now that we have an internal (engine-private) index reader (#26972) we artificially add a refresh call for bwc. This change removes this sideeffect in 7.0.	2017-10-16 10:16:35 +02:00
Tim Brooks	277637f42f	Do not set SO_LINGER on server channels (#26997 ) Right now we are attempting to set SO_LINGER to 0 on server channels when we are stopping the tcp transport. This is not a supported socket option and throws an exception. This also prevents the channels from being closed. This commit 1. doesn't set SO_LINGER for server channges, 2. checks that it is a supported option in nio, and 3. changes the log message to warn for server channel close exceptions.	2017-10-13 13:06:38 -06:00
Anton Pozhidaev	cee9640c20	Update by Query is modified to accept short `script` parameter. (#26841 ) Update by Query is modified to accept short `script` parameter. Closes issue #24898	2017-10-11 21:57:46 +00:00
kel	2e36f19051	Add support for parsing inline script (#23824 ) (#26846 ) * Add support for parsing inline script (#23824) * Fix test	2017-10-11 09:15:37 -07:00
Jason Tedor	4c06b8f1d2	Check for closed connection while opening While opening a connection to a node, a channel can subsequently close. If this happens, a future callback whose purpose is to close all other channels and disconnect from the node will fire. However, this future will not be ready to close all the channels because the connection will not be exposed to the future callback yet. Since this callback is run once, we will never try to disconnect from this node again and we will be left with a closed channel. This commit adds a check that all channels are open before exposing the channel and throws a general connection exception. In this case, the usual connection retry logic will take over. Relates #26932	2017-10-10 13:34:51 -04:00
Nik Everett	4a06dd919a	Painless: add tests for cached boxing (#24163 ) We had a TODO about adding tests around cached boxing. In #24077 I tracked down the uncached boxing tests and saw the TODO. Cached boxing testing is a fairly small extension to that work.	2017-10-10 10:34:03 -04:00
Tanguy Leroux	6658ff0fd6	Don't detect source's XContentType in DocumentParser.parseDocument() (#26880 ) DocumentParser.parseDocument() auto detects the XContentType of the document to parse, but this information is already provided by SourceToParse.	2017-10-10 15:31:56 +02:00
Daniel Mitterdorfer	e22844bd2a	Allow only a fixed-size receive predictor (#26165 ) With this commit we simplify our network layer by only allowing to define a fixed receive predictor size instead of a minimum and maximum value. This also means that the following (previously undocumented) settings are removed: * http.netty.receive_predictor_min * http.netty.receive_predictor_max Using an adaptive sizing policy in the receive predictor is a very low-level optimization. The implications on allocation behavior are extremely hard to grasp (see our previous work in #23185) and adaptive sizing does not provide a lot of benefits (see benchmarks in #26165 for more details).	2017-10-10 13:29:45 +02:00
Martijn van Groningen	bba70205e3	ingest: Fix bug that prevent date_index_name processor from accepting timestamps specified as a json number Closes #26890	2017-10-10 10:04:29 +02:00
Ryan Ernst	6b53dadcf9	Scripting: Fix expressions to temporarily support filter scripts (#26824 ) This commit adds a hack converting 0.0 to false and non-zero to true for expressions operating under a filter context. closes #26429	2017-10-09 17:02:21 -07:00
Simon Willnauer	cdd7c1e6c2	Return List instead of an array from settings (#26903 ) Today we return a `String[]` that requires copying values for every access. Yet, we already store the setting as a list so we can also directly return the unmodifiable list directly. This makes list / array access in settings a much cheaper operation especially if lists are large.	2017-10-09 09:52:08 +02:00
Nik Everett	38989191e7	Use LF line endings in Painless generated files (#26822 ) Right now if you run `gradle regen` on Windows you'll get `CRLF` line endings on all the ANTLR generated files because we run ``` ant.fixcrlf(srcdir: outputPath) { patternset(includes: 'Painless*.java') } ``` The docs for fixcrlf say that the default line endings that it corrects to is based on the OS: https://ant.apache.org/manual/Tasks/fixcrlf.html This change locks it to `LF`.	2017-10-06 16:44:03 -04:00
Yannick Welsch	c1666f4a22	Use proper logging placeholder for Netty logging	2017-10-06 10:02:51 +02:00
Yannick Welsch	ec6ea9b403	Add Netty channel information on write and flush failure	2017-10-06 09:16:58 +02:00
Jason Tedor	470e5e7cfc	Add additional low-level logging handler () * Add additional low-level logging handler We have the trace handler which is useful for recording sent messages but there are times where it would be useful to have more low-level logging about the events occurring on a channel. This commit adds a logging handler that can be enabled by setting a certain log level (org.elasticsearch.transport.netty4.ESLoggingHandler) to trace that provides trace logging on low-level channel events and includes some information about the request/response read/write events on the channel as well. * Remove imports * License header * Remove redundant * Add test * More assertions	2017-10-05 12:10:58 -04:00
Jason Tedor	597187048b	Unwrap causes when maybe dying We should unwrap the cause looking for any suppressed errors or root causes that are errors when checking if we should maybe die. This commit causes that to be the case. Relates #26884	2017-10-05 12:00:30 -04:00
Jason Tedor	4835d61a48	Change log level on write and flush failure to warn This commit changes the log level on a write and flush failure to warn as this is not necessarily an Elasticsearch problem but more likely indicative of an infrastructure problem.	2017-10-05 11:18:43 -04:00
Martijn van Groningen	b27e408ed2	Removed void token filter entries and added two tests	2017-10-05 13:25:05 +02:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Simon Willnauer	00dfdf50cf	Represent lists as actual lists inside Settings (#26878 ) Today we represent each value of a list setting with it's own dedicated key that ends with the index of the value in the list. Aside of the obvious weirdness this has several issues especially if lists are massive since it causes massive runtime penalties when validating settings. Like a list of 100k words will literally cause a create index call to timeout and in-turn massive slowdown on all subsequent validations runs. With this change we use a simple string list to represent the list. This change also forbids to add a settings that ends with a .0 which was internally used to detect a list setting. Once this has been rolled out for an entire major version all the internal .0 handling can be removed since all settings will be converted. Relates to #26723	2017-10-05 09:27:08 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Simon Willnauer	d1533e2397	Remove Settings#getAsMap() (#26845 ) Since `#getAsMap` exposes internal representation we are trying to remove it step by step. This commit is cleaning up some xcontent writing as well as usage in tests	2017-10-04 01:21:38 -06:00
Tim Brooks	d80ad7f097	Check channel i open before setting SO_LINGER (#26857 ) This commit fixes a #26855. Right now we set SO_LINGER to 0 if we are stopping the transport. This can throw a ChannelClosedException if the raw channel is already closed. We have a number of scenarios where it is possible this could be called with a channel that is already closed. This commit fixes the issue be checking that the channel is not closed before attempting to set the socket option.	2017-10-02 15:09:52 -06:00
Jason Tedor	5869a7482b	Maybe die before trying to log cause This commit reorders a maybe die check and a logging statement for the following reasons: - we should die as quickly as possible if the cause is fatal - we do not want the JVM to be so broken that when we try to log another exception is thrown (maybe another out of memory exception) and then the maybe die is never invoked - maybe die will log the cause anyway if the cause is fatal so we only need to log if the cause is not fatal	2017-10-01 09:45:36 -04:00
Jason Tedor	1084c7b6b2	Log cause when a write and flush fails This commit logs the cause of a write and flush operation on the network layer that failed.	2017-10-01 09:41:13 -04:00
Jason Tedor	f79842be6f	Die if write listener fails due to fatal error This commit performs a maybe die check after a write listener fails.	2017-09-30 18:52:54 -04:00
Simon Willnauer	25d6778d31	Add comment to TCP transport impls why we set SO_LINGER on close	2017-09-28 13:07:01 +02:00
Martijn van Groningen	805437b8bc	percolator: Also support query extraction for queries wrapped inside a ESToParentBlockJoinQuery	2017-09-28 09:28:50 +02:00
Armin Braun	af06231d4c	#26701 Close TcpTransport on RST in some Spots to Prevent Leaking TIME_WAIT Sockets (#26764 ) #26701 Added option to RST instead of FIN to TcpTransport#closeChannels	2017-09-26 19:58:11 +00:00
Christoph Büscher	6189c54c84	Reject the `index_options` parameter for numeric fields (#26668 ) Numeric fields no longer support the index_options parameter. This changes the parameter to be rejected in numeric field types after it was deprecated in 6.0. Closes #21475	2017-09-25 23:43:14 +02:00
Nik Everett	eb754a71be	Fix update_by_query's default size parameter (#26784 ) We were accidentally defaulting it to the scroll size. Untwists some of the tricks that we play with parsing so that the size is no longer scrambled. Closes #26761	2017-09-25 16:25:27 -04:00
Jiri Tyr	76f8701eec	Fixing Grok pattern for Apache 2.4 (#26635 )	2017-09-25 07:59:37 -07:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739 ) This change adds a fromXContent method to Settings that allows to read the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal represenation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Luence 7! Relates #26744	2017-09-21 19:19:33 -04:00
Christoph Büscher	86b00b84bc	Remove parse field deprecations in query builders (#26711 ) The `fielddata` field and the use of the `_name` field in the short syntax of the range query have been deprecated in 5.0 and can be removed. The same goes for the deprecated `score_mode` field in HasParentQueryBuilder, the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query, the deprecated query name in the short version of the `regexp` query, and several deprecated alternative field names in other query builders.	2017-09-20 16:22:21 +02:00
Christoph Büscher	22e200e79a	Remove deprecated type and slop field in MatchQueryBuilder (#26720 ) The `type` field has been deprecated in 5.0 and can be removed. It has been replaced by using the MatchPhraseQueryBuilder or the MatchPhrasePrefixQueryBuilder. The `slop` field has also been deprecated and can be removed, the phrase and phrase prefix query builders still provide this parameter.	2017-09-20 14:24:30 +02:00
Yannick Welsch	ff1e26276d	Deguice ActionFilter (#26691 ) Allows to instantiate TransportAction instances without Guice.	2017-09-20 10:30:21 +02:00
Jack Conradson	c3746b268c	Separate Painless Whitelist Loading from the Painless Definition (#26540 ) Adds several small whitelist data structures and a new Whitelist class to separate the idea of loading a whitelist from the actual Painless Definition class. This is the first step of many in allowing users to define custom whitelists per context. Also supports the idea of loading multiple whitelists from different sources for a single context.	2017-09-18 15:51:07 -07:00
Simon Willnauer	9f97f9072a	Allow `InputStreamStreamInput` array size validation where applicable (#26692 ) Today we can't validate the array length in `InputStreamStreamInput` since we can't rely on `InputStream.available` yet in some situations we know the size of the stream and can apply additional validation.	2017-09-18 17:52:36 +02:00
Christoph Büscher	c83ec1f133	Fixing test after merging in master	2017-09-15 13:44:40 +02:00
Christoph Büscher	bea8451b2f	Merge branch 'master' into feature/rank-eval	2017-09-15 11:44:51 +02:00
Michael Basnight	f385e0cf26	Add bad_request to the rest-api-spec catch params (#26539 ) This adds another request to the catch params. It also makes sure that the generic request param does not allow 400 either.	2017-09-14 14:24:03 -05:00
Christoph Büscher	c7c6443b10	[Docs] "The the" is a great band, but ... (#26644 ) Removing several occurrences of this typo in the docs and javadocs, seems to be a common mistake. Corrections turn up once in a while in PRs, better to correct some of this in one sweep.	2017-09-14 15:08:20 +02:00
Jim Ferenczi	401f4ba2ce	Fix percolator highlight sub fetch phase to not highlight query twice (#26622 ) * Fix percolator highlight sub fetch phase to not highlight query twice The PercolatorHighlightSubFetchPhase does not override hitExecute and since it extends HighlightPhase the search hits are highlighted twice (by the highlight phase and then by the percolator). This does not alter the results, the second highlighting just overrides the first one but this slow down the request because it duplicates the work.	2017-09-14 09:31:14 +02:00
Adrien Grand	93da7720ff	Move non-core mappers to a module. (#26549 ) Today we have all non-plugin mappers in core. I'd like to start moving those that neither map to json datatypes nor are very frequently used like `date` or `ip` to a module. This commit creates a new module called `mappers-extra` and moves the `scaled_float` and `token_count` mappers to it. I'd like to eventually move `range` fields there but it's more complicated due to their intimate relationship with range queries. Relates #10368	2017-09-13 17:58:53 +02:00
Ryan Ernst	9834081254	Fix reference to painless inside expression engine (#26528 ) This was a simple copy/paste bug in an earlier refactoring.	2017-09-11 15:44:23 -07:00
Adrien Grand	1adee8b5a8	Fix the MapperFieldType.rangeQuery API. (#26552 ) RangeQueryBuilder needs to perform too many `instanceof` checks in order to check for `date` or `range` fields in order to know what it should do with the shape relation, time zone and date format. This commit adds those 3 parameters to the `rangeQuery` factory method so that those instanceof checks are not necessary anymore.	2017-09-11 11:02:05 +02:00
Martijn van Groningen	b391425da1	Added support to the percolate query to percolate multiple documents The percolator will add a `_percolator_document_slot` field to all percolator hits to indicate with what document it has matched. This number matches with the order in which the documents have been specified in the percolate query. Also improved the support for multiple percolate queries in a search request.	2017-09-08 17:28:39 +02:00
Ryan Ernst	c9964d17bf	Internal: Add versionless alias for rest client codebase in policy files (#26521 ) Security manager policy files contains grants for specific codebases, where a codebase is a jar file. We use a system property containing the name of the jar file to resolve the jar file location when parsing the policy file. However, this means the version of the jars must be modified when versions of dependencies change. This is particularly messy for elasticsearch, where we now have a dependency on the rest client, and need to support both a snapshot version for testing and non snapshot for release. This commit adds an alias for the elasticsearch rest client without a version to be used in policy files. That allows the policy files to not care whether the rest client is a snapshot or release.	2017-09-06 18:57:10 -07:00
Martijn van Groningen	6bdf591193	removed unused import	2017-09-06 07:01:58 +02:00
Martijn van Groningen	77bbe99102	Fix two unreleased percolator query analyze bugs * If in a range query upper is smaller than lower then ignore the range query * If two empty range extractions are compared don't fail with NoSuchElementException	2017-09-06 06:47:01 +02:00
Martijn van Groningen	2ad3608245	percolator: handle point queries with 2 or more dimensions correctly	2017-09-06 06:36:47 +02:00
Martijn van Groningen	78e9c96d7f	Added a limit to from + size in top_hits and inner hits. Relates to #11511	2017-09-05 08:44:45 +02:00
Martijn van Groningen	a4d5c6418e	percolator: Rename map_unmapped_fields_as_string setting to map_unmapped_fields_as_text The `index.percolator.map_unmapped_fields_as_text` is a more better name, because unmapped fields are mapped to a text field with default settings and string is no longer a field type (it is either keyword or text).	2017-09-04 14:12:44 +02:00
Alexander Reelsen	80d0a32f8e	ScriptService: Replace max compilation per minute setting with max compilation rate (#26399 ) The current script service has a script compilation limit for a one minute window. This is set to a small default value of 15. Instead of increasing that default value, this commit introduces a new setting that allows to configure a rate per time unit, so that the script service can deal with bursts better. The new setting is named `script.max_compilations_rate`, requires a nonnegative number and a positive time value. The default is `75/5m`, which is equivalent to the existing 15 per minute.	2017-09-01 10:15:27 +02:00
Adrien Grand	78681bc9e5	Upgrade to lucene-7.0.0-snapshot-d94a5f0. (#26441 )	2017-08-31 09:06:40 +02:00
Tal Levy	ed151d829d	Migrate Search requests to use Writeable reading strategies (#26428 ) Migrates many SearchRequest objects to use Writeable conventions and rejects usage of `readFrom` in these new classes.	2017-08-30 11:00:33 -07:00
Colin Goodheart-Smithe	ce1d85d7d0	Moves deferring code into its own subclass (#26421 ) * Moves deferring code into its own subclass This change moves the code that deals with deferring collection to a subclass of BucketAggregator called DeferringBucketAggregator. This means that the code in AggregatorBase is simplified and also means that the code for deferring colleciton is in one place and easier to maintain. * Makes SIngleBucketAggregator an interface This is so aggregators that extend BucketsAggregator directly and those that extend DeferringBucketAggregator can be a single bucket aggregator * review comments * More review comments	2017-08-30 11:15:40 +01:00
Adrien Grand	34a6c7af26	Consolidate locale parsing. (#26400 ) Mappings and ingest have different locale parsing code.	2017-08-30 10:58:33 +02:00
Adrien Grand	06b7f9c78e	Do not test the ingest date processor against random locales. Random locales include locales whose country name is obsolete like `CS` or have usage restrictions like `DG`. Closes #26425	2017-08-30 09:48:26 +02:00
Ryan Ernst	b56615ef46	Test: disable locale parsing test that is broken with some randomized values See https://github.com/elastic/elasticsearch/issues/26425	2017-08-29 11:57:57 -07:00
Andy Bristol	791cbc2ba7	[TEST] test logging for reindex RethrottleTests #26192	2017-08-28 15:50:38 -07:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Stuart Neivandt	f842ff1ae1	Simple verification of the format of the language tag used in DateProcessor. (#25513 ) Closes #26186	2017-08-28 10:59:00 +02:00
Adrien Grand	eb782492be	Remove support for lenient booleans. Closes #22298	2017-08-28 09:56:01 +02:00
Michael Basnight	cfd14cd2b8	Revert shading for the low level rest client (#26367 ) At current, we do not feel there is enough of a reason to shade the low level rest client. It caused problems with commons logging and IDE's during the brief time it was used. We did not know exactly how many users will need this, and decided that leaving shading out until we gather more information is best. Users can still shade the jar themselves. For information and feeback, see issue #26366. Closes #26328 This reverts commit `3a20922046`. This reverts commit `2c271f0f22`. This reverts commit `9d10dbea39`. This reverts commit `e816ef89a2`.	2017-08-25 14:13:12 -05:00
Tim Brooks	0551d2ff68	Move generic http settings out of netty module (#26310 ) There is a group of five settings relating to raw tcp configurations (no_delay, buffer sizes, etc) that we have for the http transport. These currently live in the netty module. As they are unrelated to netty specifically, this commit moves these settings to the `HttpTransportSettings` class in core.	2017-08-24 19:27:56 -05:00
Christoph Büscher	cb4fd3bac6	Fix more tests	2017-08-23 13:14:48 +02:00
Christoph Büscher	56360ecfb5	Fix failing tests due to xContent changes	2017-08-23 12:22:07 +02:00
Christoph Büscher	bc544e2d1b	Adapt branch to changes on master	2017-08-23 12:05:52 +02:00
Christoph Büscher	62a7cac3a0	Merge branch 'master' into feature/rank-eval	2017-08-23 11:19:16 +02:00
Yannick Welsch	0dbebd9177	Set classpath on Painless apiJavadoc task so it also works on JDK 9	2017-08-23 14:45:40 +09:30
Sergey Galkin	9a3216dfee	Stricter validation for min/max values for whole numbers (#26137 )	2017-08-21 12:16:45 +02:00
Andy Bristol	6eef6c4f7a	[TEST] wait until reindex tasks ready for rethrottle (#26250 ) When slices is set as auto, there's an additional network call needed for the reindex tasks to know how to rethrottle. Sometimes the rethrottle action happens before the reindex task is fully initialized, so in the test we wait for the task to be ready. This commit also adds some safeguards to ensure that cancel and rethrottle operations are handled correctly Closes #26192	2017-08-18 11:01:27 -07:00
Jack Conradson	23858789f0	Remove Sort enum from Painless Definition (#26179 ) This is step toward making Definition instanceable which is necessary for custom whitelists in different contexts.	2017-08-17 12:02:11 -07:00
Nik Everett	b840fa3117	Fix some links in Painless method reference Links to inner classes were using `$` in urls instead of `.`, causing them to 404. Also fixes the doc generation code to generate docs into the correct directory. We moved the docs but never updated the generation code.	2017-08-17 10:50:38 -04:00
Tim Brooks	f69cc78b67	Release pipelined http responses on close (#26226 ) Right now it is possible for the `HttpPipeliningHandler` to queue pipelined responses. On channel close, we do not clear and release these responses. This commit releases the responses and completes the promise.	2017-08-16 13:23:32 -05:00
Yannick Welsch	01f6851691	Serialize and expose timeout of acknowledged requests in REST layer (#26189 ) Due to the weird way of structuring the serialization code in AcknowledgedRequest, many request types forgot to properly serialize the request timeout, for example "index deletion", "index rollover", "index shrink", "putting pipeline", and other requests. This means that if those requests were not directly sent to the master node, the acknowledgement timeout information would be lost (and the default used instead). Some requests also don't properly expose the timeout mechanism in the REST layer, such as put / delete stored script. This commit fixes all that.	2017-08-16 07:43:05 +08:00
desmorto	292dd8f992	(refactor) some opportunities to use diamond operator (#25585 ) * (refactor) some opportunities to use diamond operator * Update ExceptionRetryIT.java update typo	2017-08-15 16:36:42 -06:00
Tal Levy	0c76d17fe1	fix targetField randomization in JoinProcessorTests (#26206 ) Closes #26203.	2017-08-14 09:26:47 -07:00
Andy Bristol	fcd8483505	AwaitsFix failing RethrottleTests	2017-08-14 08:10:47 -07:00
Tal Levy	10c3c1aef0	fix SplitProcessor targetField test (#26178 ) This test was too lenient with its randomization of targetFieldName and resulting in a conflict with the original existing fields. This commit fixes that. Closes #26177.	2017-08-11 16:18:04 -07:00
Martijn van Groningen	1146a35870	Move more token filters to analysis-common module The following token filters were moved: arabic_stem, brazilian_stem, czech_stem, dutch_stem, french_stem, german_stem and russian_stem. Relates to #23658	2017-08-11 17:39:24 +02:00
Andy Bristol	7e3cd6a019	reindex: automatically choose the number of slices (#26030 ) In reindex APIs, when using the `slices` parameter to choose the number of slices, adds the option to specify `slices` as "auto" which will choose a reasonable number of slices. It uses the number of shards in the source index, up to a ceiling. If there is more than one source index, it uses the smallest number of shards among them. This gives users an easy way to use slicing in these APIs without having to make decisions about how to configure it, as it provides a good-enough configuration for them out of the box. This may become the default behavior for these APIs in the future.	2017-08-11 08:25:25 -07:00
Martijn van Groningen	636e85e5b7	percolator: Hint what clauses are important in a conjunction query based on fields The percolator field mapper doesn't need to extract all terms and ranges from a bool query with must or filter clauses. In order to help to default extraction behavior, boost fields can be configured, so that fields that are known for not being selective enough can be ignored in favor for other fields or clauses with specific fields can forcefully take precedence over other clauses. This can help selecting clauses for fields that don't match with a lot of percolator queries over other clauses and thus improving performance of the percolate query. For example a status like field is something that should configured as an ignore field. Queries on this field tend to match with more documents and so if clauses for this fields get selected as best clause then that isn't very helpful for the candidate query that the percolate query generates to filter out percolator queries that are likely not going to match.	2017-08-11 15:32:01 +02:00
Daniel Mitterdorfer	637cc872f4	Remove unused Netty-related settings (#26161 ) With this commit we remove the following three previously unused (and undocumented) Netty 4 related settings: * transport.netty.max_cumulation_buffer_capacity, * transport.netty.max_composite_buffer_components and * http.netty.max_cumulation_buffer_capacity from Elasticsearch.	2017-08-11 12:03:00 +02:00
Martijn van Groningen	8285a0f399	percolator: Use correct version for bwc checking now that the change has been backported to 6.0 branch	2017-08-09 13:49:20 +02:00
Adrien Grand	f0c1e30544	Upgrade to lucene-7.0.0-snapshot-a128fcb. (#26090 )	2017-08-08 13:03:19 +02:00
olcbean	5c4c1c5e15	Verify that _bulk and _msearch requests are terminated by a newline (#25740 )	2017-08-08 10:45:44 +02:00
Simon Willnauer	82fa531ab4	Remove `_index` fielddata hack if cluster alias is present (#26082 ) We introduced a hack in #25885 to respect the cluster alias if available on the `_index` field. This is important if aggregations or other field data related operations are executed. Yet, we added a small hack that duplicated an implementation detail from the `_index` field data builder to make this work. This change adds a necessary but simple API change that allows us to remove the hack and only have a single implementation.	2017-08-08 09:24:24 +02:00
Adrien Grand	f0cba4fce5	Add a scripted similarity. (#25831 ) The goal of this similarity is to help users who would like to keep the functionality of the `tf-idf` similarity that we want to remove, or to allow for specific usec-cases (disabling idf, disabling tf, disabling length norm, etc.) to not have to build a custom plugin and familiarize with the low-level Lucene API.	2017-08-08 08:55:12 +02:00
Tal Levy	872526cad3	add URL-Decode Processor to Ingest (#26045 ) closes #25837 Adds a URL Decoder Processor to Ingest this will decode urls like: https%3a%2f%2felastic.co%2 to https://elastic.co/	2017-08-07 10:26:11 -07:00
Christoph Büscher	18155ed69a	Merge branch 'master' into feature/rank-eval	2017-08-07 16:07:34 +02:00
Luca Cavanna	14ba36977e	[TEST] prevent yaml tests from using raw requests (#26044 ) Raw requests are supported only by the java yaml test runner and were introduced to test docs snippets. Some yaml tests ended up using them (see #23497) which causes failures for other language clients. This commit migrates those yaml tests to Java tests that send requests through the Java low-level REST client, and also moves the ability to send raw requests to a special client that's only available when testing docs snippets. Closes #25694	2017-08-07 11:02:16 +02:00
Martijn van Groningen	11ce6b91a4	test: Do not use random index writer as test expects a single segment check against right version	2017-08-07 09:40:54 +02:00
Colin Goodheart-Smithe	bb3d5b7426	[TEST] Fix internalMatrixStatsTests failure	2017-08-02 16:36:34 +01:00
Colin Goodheart-Smithe	87c6e63e73	Adds mutate function to various tests (#25999 ) * Adds mutate function to various tests Relates to #25929 * fix test * implements mutate function for all single bucket aggs * review comments * convert getMutateFunction to mutateIInstance	2017-08-02 11:38:31 +01:00
Martijn van Groningen	53dd8afaea	fix test	2017-08-02 11:25:03 +02:00
Martijn van Groningen	a3d1248014	percolator: use correct version.	2017-08-02 10:37:59 +02:00
Adrien Grand	88d456989e	Make FieldMapper.copyTo() always non-null. (#25994 ) Otherwise it is confusing that both a null copyTo and an empty copyTo should be treated the same.	2017-08-02 10:07:29 +02:00
Tim Brooks	0f4f49496f	Use nio transport in test clusters (#25986 ) This commit adds the nio transport as an option in place of the mock tcp transport for tests. Each test will only use one transport type. The transport type is decided by a random boolean generated inside of the `ESTestCase` class.	2017-08-01 16:19:31 -05:00
Ryan Ernst	072281d5aa	Update version to 7.0.0-alpha1 (#25876 ) This commit updates the version for master to 7.0.0-alpha1. It also adds the 6.1 version constant, and fixes many tests, as well as marking some as awaits fix. Closes #25893 Closes #25870	2017-08-01 15:47:48 -04:00
Adrien Grand	53c829b6bc	Painless: allow doubles to be casted to longs. (#25936 ) Running `(long) someDoubleValue` currently throws a `ClassCastException` while eg. `(int) someDoubleValue` is accepted.	2017-08-01 16:22:55 +02:00
Jason Tedor	764f7ef2ef	Fix Netty 4 multi-port test This commit fixes an issue with the Netty 4 multi-port test that a transport client can connect. The problem here is that in case the bottom of the random port range was already bound to (for example, by another JVM) then then transport client could not connect to the data node. This is because the transport client was in fact using the bottom of the port range only. Instead, we simply try all the ports that the data node might be bound to. Closes #24441	2017-08-01 19:47:20 +09:00
Martijn van Groningen	5f36bdfda0	percolator: Also support IndexOrDocValuesQuery Otherwise ranges are never extracted properly.	2017-08-01 09:44:42 +02:00
Martijn van Groningen	ff3b909a83	Moved HtmlStripCharFilterFactory to analyis.common package like the other factories.	2017-07-31 15:34:54 +02:00
Martijn van Groningen	0b776a1de0	Move more token filters to analysis-common module The following token filters were moved: delimited_payload_filter, keep, keep_types, classic, apostrophe, decimal_digit, fingerprint, min_hash and scandinavian_folding. Relates to #23658	2017-07-31 15:15:04 +02:00
Martijn van Groningen	7c3735bdc4	percolator: Store the QueryBuilder's Writable representation instead of its XContent representation. The Writeble representation is less heavy to parse and that will benefit percolate performance and throughput. The query builder's binary format has now the same bwc guarentees as the xcontent format. Added a qa test that verifies that percolator queries written in older versions are still readable by the current version.	2017-07-28 12:24:10 +02:00
Yannick Welsch	1a01514081	Move tribe to a module (#25778 ) This commit moves tribe to a module, stripping core from the tribe functionality.	2017-07-28 11:23:50 +02:00
Jim Ferenczi	562c3744ca	Merge FunctionScoreQuery and FiltersFunctionScoreQuery (#25889 ) This change merges the functionality of the FiltersFunctionScoreQuery in the FunctionScoreQuery. It also ensures that an exception is thrown when the computed score is equals to Float.NaN or Float.NEGATIVE_INFINITY. These scores are invalid for TopDocsCollectors that relies on score comparison. Fixes #15709 Fixes #23628	2017-07-28 09:22:20 +02:00
Martijn van Groningen	edad7b4737	Add support for selecting percolator query candidate matches containing range queries. Extracts ranges from range queries on byte, short, integer, long, half_float, scaled_float, float, double, date and ip fields. byte, short, integer and date ranges are normalized to Lucene's LongRange. half_float and float are normalized to Lucene's DoubleRange. When extracting range queries, the QueryAnalyzer computes the width of the range. This width is used to determine what range should be preferred in a conjunction query. The QueryAnalyzer prefers the smaller ranges, because these ranges tend to match with less documents. Closes #21040	2017-07-26 21:25:45 +02:00
Simon Willnauer	b72c71083c	Cleanup IndexFieldData visibility (#25900 ) Today we expose `IndexFieldDataService` outside of IndexService to do maintenance or lookup field data in different ways. Yet, we have a streamlined way to access IndexFieldData via `QueryShardContext` that should encapsulate all access to it. This also ensures that we control all other functionality like cache clearing etc. This change also removes the `recycler` option from `ClearIndicesCacheRequest` this option is a no-op and should have been removed long ago.	2017-07-26 20:03:42 +02:00
Simon Willnauer	634ce90dc0	Respect cluster alias in `_index` aggs and queries (#25885 ) Today when we aggregate on the `_index` field the cross cluster search alias is not taken into account. Neither is it respected when we search on the field. This change adds support for cluster alias when the cluster alias is present on the `_index` field. Closes #25606	2017-07-26 09:16:52 +02:00
Michael Basnight	e816ef89a2	Shade external dependencies in the rest client jar This commit removes all external dependencies from the rest client jar and shades them in an 'org.elasticsearch.client' package within the jar using shadowJar gradle plugin. All projects that depended on the existing jar have been converted to using the 'org.elasticsearch.client' package prefixes to interact with the rest client. Closes #25208	2017-07-24 12:55:43 -05:00
Jim Ferenczi	ab3b5c695a	Pre-configured shingle filter should disable graph analysis (#25853 ) This change disables the graph analysis on default `shingle` filter. The pre-configured shingle filter produces shingles of different size. Graph analysis on such token stream is useless and dangerous as it may create too many paths. Fixes #25555	2017-07-24 18:42:15 +02:00
Simon Willnauer	0e3ad522a2	Rewrite search requests on the coordinating nodes (#25814 ) This change rewrites search requests on the coordinating node before we send requests to the individual shards. This will reduce the rewrite load and object creation for each rewrite on the executing nodes and will fetch resources only once instead of N times once per shard for queries like `terms` query with index lookups. (among percolator and geo-shape) Relates to #25791	2017-07-21 09:38:38 +02:00
Jack Conradson	9f7463e796	remove lang url parameter from stored script requests (#25779 ) Also has updates to ScriptMetaData for allowing the old namespace format to be loaded all the way back through 5.0; however, it will throw an exception if two scripts share the same id but different languages.	2017-07-20 08:51:08 -07:00
Simon Willnauer	5e629cfba0	Ensure query resources are fetched asynchronously during rewrite (#25791 ) The `QueryRewriteContext` used to provide a client object that can be used to fetch geo-shapes, terms or documents for percolation. Unfortunately all client calls used to be blocking calls which can have significant impact on the rewrite phase since it occupies an entire search thread until the resource is received. In the case that the index the resource is fetched from isn't on the local node this can have significant impact on query throughput. Note: this doesn't fix MLT since it fetches stuff in doQuery which is a different beast. Yet, it is a huge step in the right direction	2017-07-20 15:37:50 +02:00
Jay Modi	3e4bc027eb	RestClient uses system properties and system default SSLContext (#25757 ) This commit calls the `useSystemProperties` method on the HttpAsyncClientBuilder so that the jvm system properties are used. The primary reason for doing this is to ensure the builder uses the system default SSLContext rather than the default instance created by the http client library. Closes #23231	2017-07-20 07:36:56 -06:00
Simon Willnauer	4d78935df7	Introduce a new Rewriteable interface to streamline rewriting (#25788 ) Today we have duplicated code that is quite complicated to iterate over rewriteable (`QueryBuilders` mainly) This change introduces a `Rewriteable` interface that allow to share code to do the rewriting as well as encapsulation and composition of queries.	2017-07-19 15:06:49 +02:00
Adrien Grand	f1ff7f2454	Require a field when a `seed` is provided to the `random_score` function. (#25594 ) We currently use fielddata on the `_id` field which is trappy, especially as we do it implicitly. This changes the `random_score` function to use doc ids when no seed is provided and to suggest a field when a seed is provided. For now the change only emits a deprecation warning when no field is supplied but this should be replaced by a strict check on 7.0. Closes #25240	2017-07-19 14:11:15 +02:00
Martijn van Groningen	8003171a0c	Move more token filters to analysis-common module The following token filters were moved: arabic_normalization, german_normalization, hindi_normalization, indic_normalization, persian_normalization, scandinavian_normalization, serbian_normalization, sorani_normalization, cjk_width and cjk_width Relates to #23658	2017-07-17 08:29:44 +02:00
Ryan Ernst	072402463b	Scripting: Remove search template actions (#25717 ) The dedicated search template put/get/delete actions are deprecated in 5.6. This commit removes them from 6.0.	2017-07-14 23:12:05 -07:00
Christoph Büscher	887ed68cf2	Fixing compilation issues and tests after merging in master	2017-07-14 19:23:35 +02:00
Christoph Büscher	6d999f074a	Merge branch 'master' into feature/rank-eval	2017-07-14 18:36:08 +02:00
Jim Ferenczi	13da3eb53e	Refactor QueryStringQuery for 6.0 (#25646 ) This change refactors the query_string query to analyze the query text around logical operators of the query string the same way than a match_query/multi_match_query. It also adds a type parameter that can be used to change the way multi fields query are built the same way than a multi_match query does. Now that these queries share the same behavior regarding text analysis, some parameters are obsolete and have been deprecated: split_on_whitespace: This setting is now ignored with a deprecation notice if it is used explicitely. With this PR The query_string always splits on logical operator. It simplifies the understanding of the other parameters that can have different meanings depending on the value of split_on_whitespace. auto_generate_phrase_queries: This setting is now ignored with a deprecation notice if it is used explicitely. This setting only makes sense when the parser splits on whitespace. use_dismax: This setting is now ignored with a deprecation notice if it is used explicitely. The tie_breaker parameter is sufficient to handle best_fields/most_fields. Fixes #25574	2017-07-13 15:32:17 +02:00
Luca Cavanna	ec66d655b5	Rename client artifacts (#25693 ) It was brought up that our current client artifacts have generic names like 'rest' that may cause conflicts with other artifacts. This commit renames: - rest -> elasticsearch-rest-client - sniffer -> elasticsearch-rest-client-sniffer - rest-high-level -> elasticsearch-rest-high-level-client A couple of small changes are also preparing the high level client for its first release. Closes #20248	2017-07-13 09:44:25 +02:00
Ryan Ernst	70b2897bdf	Scripting: Deprecate stored search template apis (#25437 ) This commit deprecates the PUT, GET and DELETE search template apis. Instead, the stored script api should be used. closes #24596	2017-07-12 16:07:28 -07:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Jack Conradson	d2b4f7ac5a	Disallow lang to be used with Stored Scripts (#25610 ) Requests that execute a stored script will no longer be allowed to specify the lang of the script. This information is stored in the cluster state making only an id necessary to execute against. Putting a stored script will still require a lang.	2017-07-12 07:55:57 -07:00
Tim Brooks	a3ade99fcf	Fix BytesReferenceStreamInput#skip with offset (#25634 ) There is a bug when a call to `BytesReferenceStreamInput` skip is made on a `BytesReference` that has an initial offset. The offset for the current slice is added to the current index and then subtracted from the length. This introduces the possibility of a negative number of bytes to skip. This happens inside a loop, which leads to an infinte loop. This commit correctly subtracts the current slice index from the slice.length. Additionally, the `BytesArrayTests` are modified to test instances that include an offset.	2017-07-11 09:54:29 -05:00
Adrien Grand	481d5d09b2	Upgrade to lucene-7.0.0-snapshot-00142c9. (#25641 ) Lucene 7.0 is feature-frozen now, so there should not be many changes until GA.	2017-07-11 13:58:55 +02:00
Tim Brooks	b22bbf94da	Avoid blocking on channel close on network thread (#25521 ) Currently when we close a channel in Netty4Utils.closeChannels we block until the closing is complete. This introduces the possibility that a network selector thread will block while waiting until a separate network selector thread closes a channel. For instance: T1 closes channel 1 (which is assigned to a T1 selector). Channel 1's close listener executes the closing of the node. That means that T1 now tries to close channel 2. However, channel 2 is assigned to a selector that is running on T2. T1 now must wait until T2 closes that channel at some point in the future. This commit addresses this by adding a boolean to closeChannels indicating if we should block on close. We only set this boolean to true if we are closing down the server channels at shutdown. This call is never made from a network thread. When we call the closeChannels method with that boolean set to false, we do not block on close.	2017-07-10 10:50:51 -05:00
Jason Tedor	c084542731	Bump version to 6.0.0-beta1 This commit does two things: - bumps the version from 6.0.0-alpha3 to 6.0.0-beta1 - renames the 6.0.0-alpha3 version constant to 6.0.0-beta1 Relates #25621	2017-07-09 18:12:50 -04:00
Adrien Grand	40bb1663ee	Index ids in binary form. (#25352 ) Indexing ids in binary form should help with indexing speed since we would have to compare fewer bytes upon sorting, should help with memory usage of the live version map since keys will be shorter, and might help with disk usage depending on how efficient the terms dictionary is at compressing terms. Since we can only expect base64 ids in the auto-generated case, this PR tries to use an encoding that makes the binary id equal to the base64-decoded id in the majority of cases (253 out of 256). It also specializes numeric ids, since this seems to be common when content that is stored in Elasticsearch comes from another database that uses eg. auto-increment ids. Another option could be to require base64 ids all the time. It would make things simpler but I'm not sure users would welcome this requirement. This PR should bring some benefits, but I expect it to be mostly useful when coupled with something like #24615. Closes #18154	2017-07-07 14:22:47 +02:00
Martijn van Groningen	6db708ef75	Move more token filters to analysis-common module The following token filters were moved: common grams, limit token, pattern capture and pattern raplace. Relates to #23658	2017-07-07 10:02:52 +02:00
Simon Willnauer	1f67d079b1	Validate `transport.profiles.` settings (#25508 ) Transport profiles unfortunately have never been validated. Yet, it's very easy to make a mistake when configuring profiles which will most likely stay undetected since we don't validate the settings but allow almost everything based on the wildcard in `transport.profiles.`. This change removes the settings subset based parsing of profiles but rather uses concrete affix settings for the profiles which makes it easier to fall back to higher level settings since the fallback settings are present when the profile setting is parsed. Previously, it was unclear in the code which setting is used ie. if the profiles settings (with removed prefixes) or the global node setting. There is no distinction anymore since we don't pull prefix based settings.	2017-07-07 09:40:59 +02:00
Jason Tedor	c96257ca73	Upgrade to Netty 4.1.13.Final This commit upgrades the Netty dependency from version 4.1.11.Final to 4.1.13.Final. Relates #25581	2017-07-06 15:37:00 -04:00
Martijn van Groningen	d0f9f425bd	parent/child: Removed ParentJoinFieldSubFetchPhase	2017-07-06 13:15:02 +02:00
Martijn van Groningen	407273f81d	parent/child: Support parent id being specified as number in the _source	2017-07-06 11:48:57 +02:00
Jun Ohtani	6894ef6057	[Analysis] Support normalizer in request param (#24767 ) * [Analysis] Support normalizer in request param Support normalizer param Support custom normalizer with char_filter/filter param Closes #23347	2017-07-04 19:16:56 +09:00
Colin Goodheart-Smithe	43efcffcc2	Adds check for negative search request size (#25397 ) * Adds check for negative search request size This change adds a check to `SearchSourceBuilder` to throw and exception if the size set on it is set to a negative value. Closes #22530 * fix error in reindex * update re-index tests * Addresses review comment * Fixed tests * Added random negative size test * Fixes test	2017-07-04 10:51:38 +01:00
Christoph Büscher	f576c987ce	Remove QueryParseContext (#25486 ) QueryParseContext is currently only used as a wrapper for an XContentParser, so this change removes it entirely and changes the appropriate APIs that use it so far to only accept a parser instead.	2017-07-03 17:30:40 +02:00
Simon Willnauer	5a7c8bb04e	Cleanup network / transport related settings (#25489 ) This commit makes the use of the global network settings explicit instead of implicit within NetworkService. It cleans up several places where we fall back to the global settings while we should have used tcp or http ones. In addition this change also removes unnecessary settings classes	2017-07-02 10:16:50 +02:00
Simon Willnauer	6f131a63d3	Remove unregistered `transport.netty.*` settings (#25476 ) These settings have not be working for a full major version since they are not registered. Given that they are simply duplicates we can just remove them.	2017-06-29 20:56:18 +02:00
Christoph Büscher	927111c91d	Remove QueryParseContext from parsing QueryBuilders (#25448 ) Currently QueryParseContext is only a thin wrapper around an XContentParser that adds little functionality of its own. I provides helpers for long deprecated field names which can be removed and two helper methods that can be made static and moved to other classes. This is a first step in helping to remove QueryParseContext entirely.	2017-06-29 17:10:20 +02:00
Christoph Büscher	2708bcc6ed	Merge branch 'master' into feature/rank-eval	2017-06-29 15:07:45 +02:00
olcbean	3518e313b8	Unify the result interfaces from get and search in Java client (#25361 ) As GetField and SearchHitField have the same members, they have been unified into DocumentField. Closes #16440	2017-06-29 11:35:28 +02:00
Martijn van Groningen	c85ac402b0	test: Make many percolator integration tests real integration tests	2017-06-27 17:44:30 +02:00
Simon Willnauer	d338a09812	Remove `mapping.single_type` from parent join test (#25391 ) This removes the remaining usage of `mapping.single_type` from the parent join module and moves it's bwc test to the mixed cluster tests Relates to #24961 Relates to #20257	2017-06-26 17:33:07 +02:00
Martijn van Groningen	a34f5fa812	Move more token filters to analysis-common module The following token filters were moved: stemmer, stemmer_override, kstem, dictionary_decompounder, hyphenation_decompounder, reverse, elision and truncate. Relates to #23658	2017-06-26 09:02:16 +02:00
Nik Everett	da0b991331	Remove `index.mapping.single_type=false` from reindex tests (#25365 ) * Remove the setting from the yml tests and replace with tests using `join` field. We can't use the setting in yml tests without lots of backflips but we have `ReindexParentChildTests` for the coverage. There weren't tests for `join` field with reindex before this. Adding these tests discovered #25363. * Remove the setting from `ReindexParentChildTests` and replace with `index.version.created=V_5_6_0`. This test can be entirely removed when legacy parent/child support is dropped from core. * Port the yml tests that set _parent into integ tests so they can set the index created version. These tests can be removed when we drop support for _parent in core. * Port a delete-by-query test for filtering based on type to an `ESIntegTestCase` so it can use `index.version.created=5.6.0` to setup documents of multiple types. This whole feature can be dropped when we no longer support multiple types per index. Relates to #24961	2017-06-23 17:14:59 -04:00
Simon Willnauer	4ae426a552	Remove remaining `index.mapping.single_type=false` (#25369 ) This change cleans up remaining tests to not use index.mapping.single_type=false but instead where applicable use a single type or markt the index as created with a pre 6.x version. Yet, there is still on leftover in the client tests that needs special attention. See `org.elasticsearch.client.SearchIT` Relates to #24961	2017-06-23 10:26:06 +02:00
Tal Levy	1ac7818201	fix sort and string processor tests around targetField (#25358 ) Tests were randomly assigning `targetField` to an existing field that was an array, causing path resolution issues. This PR fixes those tests Closes #25346 & #25348	2017-06-22 13:14:18 -07:00
Jack Conradson	96b62409a8	Update Painless to Allow Augmentation from Any Class (#25360 ) Custom whitelists in Painless will need to allow classes to be augmented beyond the currently hard-coded Augmentation class tied to Painless directly. This change allows any class to specify an augmentation on a Painless struct using an appropriate static method. Changes to loading the whitelist have also been created to allow for this specification of a different class for augmentation.	2017-06-22 12:16:46 -07:00
Martijn van Groningen	343e7571b9	test: single type defaults to true since alpha1 and not alpha3 Closes #25354	2017-06-22 16:31:15 +02:00
Adrien Grand	44e9c0b947	Upgrade to lucene-7.0.0-snapshot-ad2cb77. (#25349 ) Most notable changes: - better update concurrency: LUCENE-7868 - TopDocs.totalHits is now a long: LUCENE-7872 - QueryBuilder does not remove the boolean query around multi-term synonyms: LUCENE-7878 - removal of Fields: LUCENE-7500 For the `TopDocs.totalHits` change, this PR relies on the fact that the encoding of vInts and vLongs are compatible: you can write and read with any of them as long as the value can be represented by a positive int.	2017-06-22 12:35:33 +02:00
Martijn van Groningen	a977569085	percolator: Deprecate `document_type` parameter. The `document_type` parameter is no longer required to be specified, because by default from 6.0 only a single type is allowed. (`index.mapping.single_type` defaults to `true`)	2017-06-22 09:55:06 +02:00
Nik Everett	8d9a08e239	Fix reindex test when log level is debug When log level is debug we'd dereference null because the test was being cute and cutting corners. Relates to #25256	2017-06-20 16:06:58 -04:00
Jun Ohtani	62d1969595	Parse synonyms with the same analysis chain (#8049 ) * [Analysis] Parse synonyms with the same analysis chain Synonym Token Filter / Synonym Graph Filter tokenize synonyms with whatever tokenizer and token filters appear before it in the chain. Close #7199	2017-06-20 21:50:33 +09:00
Nik Everett	3261586cac	Tweak reindex cancel logic and add many debug logs (#25256 ) I'm still trying to hunt down rare failures in the cancelation tests for reindex and friends. Here is the latest: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=ubuntu/876/console It doesn't show much, other than that one of the tasks didn't kill itself when asked to cancel. So I'm going a bit crazy with debug logging so that the next time this comes up I can trace exactly what happened. Additionally, this tweaks the logic around how rethrottles were performed around cancel. Previously we set the `requestsPerSecond` to `0` when we cancelled the task. That was the "old way" to set them to inifity which was the intent. This switches that from `0` to `Float.MAX_VALUE` which is the "new way" to set the `requestsPerSecond` to infinity. I don't know that this is much better, but it feels better.	2017-06-19 18:46:42 -04:00
Andy Bristol	4c5bd57619	Rename simple pattern tokenizers (#25300 ) Changed names to be snake case for consistency Related to #25159, original issue #23363	2017-06-19 13:48:43 -07:00
Simon Willnauer	a8d5a58801	Replace deprecated API usage in Netty4HttpChannel	2017-06-17 14:04:23 +02:00
Christoph Büscher	e99ced06cc	[Tests] Check that parsing aggregations works in a forward compatible way (#25219 ) This change adds tests for the aggregation parsing that try to simulate that we can parse existing aggregations in a forward compatible way in the future, ignoring potential newly added fields or substructures to the xContent response.	2017-06-17 13:06:31 +02:00
Simon Willnauer	f18b0d293c	Move TransportStats accounting into TcpTransport (#25251 ) Today TcpTransport is the de-facto base-class for transport implementations. The need for all the callbacks we have in TransportServiceAdaptor are not necessary anymore since we can simply have the logic inside the base class itself. This change moves the stats metrics directly into TcpTransport removing the need for low level bytes send / received callbacks.	2017-06-16 22:34:11 +02:00
Nik Everett	ecc87f613f	Move pre-configured "keyword" tokenizer to the analysis-common module (#24863 ) Moves the keyword tokenizer to the analysis-common module. The keyword tokenizer is special because it is used by CustomNormalizerProvider so I pulled it out into its own PR. To get the move to work I've reworked the lookup from static to one using the AnalysisRegistry. This seems safe enough. Part of #23658.	2017-06-16 11:48:15 -04:00
Jack Conradson	50db8cb351	Add needs methods for specific variables to Painless script context factories. (#25267 )	2017-06-15 17:00:33 -07:00
Martijn van Groningen	428e70758a	Moved more token filters to analysis-common module. The following token filters were moved: `edge_ngram`, `ngram`, `uppercase`, `lowercase`, `length`, `flatten_graph` and `unique`. Relates to #23658	2017-06-15 18:28:31 +02:00
Jim Ferenczi	5e64cd08bc	[Test] restore BWC for parent-join now that the new mapping format is in 5.x	2017-06-15 15:15:48 +02:00
Jim Ferenczi	9ca33e2450	Add a section named "relations" in the ParentJoinFieldMapper (#25248 ) * Add a section named "relation" in the ParentJoinFieldMapper This commit puts the parent/child definition in an inner section named "relation". Mapping for the parent-join will look like this: ``` "join_field": { "type": "join" "relations": "parent": "child" } } ```	2017-06-15 14:56:20 +02:00
Tal Levy	2cd771a230	fix: Sort Processor does not have proper behavior with targetField (#25237 ) to specify a `targetField`. This results in some interesting behavior that was missed in the review. This processor sorts in-place, so there is a side-effect in both the original field and the target field. Another bug was that the targetField was not being set if the list being sorted was fewer than two elements. The new behavior works like this: If targetField and fieldName are not the same, we copy the list.	2017-06-15 05:28:54 -07:00
Boaz Leskes	648b4717a4	move assertBusy to use CheckException (#25246 ) We use assertBusy in many places where the underlying code throw exceptions. Currently we need to wrap those exceptions in a RuntimeException which is ugly.	2017-06-15 13:24:07 +02:00
Tanguy Leroux	27f1206999	Use SPI in High Level Rest Client to load XContent parsers (#25098 ) This commit adds a NamedXContentProvider interface that can be implemented by plugins or modules using Java's SPI feature in order to provide additional NamedXContent parsers to external applications like the Java High Level Rest Client.	2017-06-15 12:50:02 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Ryan Ernst	caf7792db1	Scripting: Rename SearchScript.needsScores to needs_score (#25235 ) This commit renames the needsScores method so as to make it automatically generatable, based on the name of the `_score` variable which is available in search scripts. It also adds documentation to ScriptContext to explain the naming and signature of such methods.	2017-06-14 22:01:19 -07:00
Jack Conradson	a4471f51e4	Support script context stateful factory in Painless. (#25233 )	2017-06-14 16:44:41 -07:00
Christoph Büscher	4de4c795b7	Fix issues after merging in master	2017-06-14 12:16:58 +02:00
Christoph Büscher	ac3db8c30f	Merge branch 'master' into feature/rank-eval	2017-06-14 11:57:05 +02:00
Andy Bristol	48696ab544	expose simple pattern tokenizers (#25159 ) Expose the experimental simplepattern and simplepatternsplit tokenizers in the common analysis plugin. They provide tokenization based on regular expressions, using Lucene's deterministic regex implementation that is usually faster than Java's and has protections against creating too-deep stacks during matching. Both have a not-very-useful default pattern of the empty string because all tokenizer factories must be able to be instantiated at index creation time. They should always be configured by the user in practice.	2017-06-13 12:46:59 -07:00
Alexander Kazakov	a7dafdaa05	Add target_field parameter to gsub, join, lowercase, sort, split, trim, uppercase (#24133 ) Closes #23682 #23228	2017-06-13 09:40:44 -07:00
Simon Willnauer	186c16ea41	Ensure pending transport handlers are invoked for all channel failures (#25150 ) Today if a channel gets closed due to a disconnect we notify the response handler that the connection is closed and the node is disconnected. Unfortunately this is not a complete solution since it only works for published connections. Connections that are unpublished ie. for discovery can indefinitely hang since we never invoke their handers when we get a failure while a user is waiting for the response. This change adds connection tracking to TcpTransport that ensures we are notifying the corresponding connection if there is a failure on a channel.	2017-06-13 09:37:05 +02:00
Jason Tedor	dcf57f296e	Fix get mappings HEAD requests Get mappings HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get mappings HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23192	2017-06-11 14:58:56 -04:00
Jason Tedor	7182577904	Fix handling of exceptions thrown on HEAD requests Today when an exception is thrown handling a HEAD request, the body is swallowed before the channel has a chance to see it. Yet, the channel is where we compute the content length that would be returned as a header in the response. This is a violation of the HTTP specification. This commit addresses the issue. To address this issue, we remove the special handling in bytes rest response for HEAD requests when an exception is thrown. Instead, we let the upstream channel handle the special case, as we already do today for the non-exceptional case. Relates #25172	2017-06-10 23:44:18 -04:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Jim Ferenczi	8250aa4267	Remove the postings highlighter and make unified the default highlighter choice (#25028 ) This change removes the `postings` highlighter. This highlighter has been removed from Lucene master (7.x) because it behaves exactly like the `unified` highlighter when index_options is set to `offsets`: https://issues.apache.org/jira/browse/LUCENE-7815 It also makes the `unified` highlighter the default choice for highlighting a field (if `type` is not provided). The strategy used internally by this highlighter remain the same as before, it checks `term_vectors` first, then `postings` and ultimately it re-analyzes the text. Ultimately it rewrites the docs so that the options that the `unified` highlighter cannot handle are clearly marked as such. There are few features that the `unified` highlighter is not able to handle which is why the other highlighters (`plain` and `fvh`) are still available. I'll open separate issues for these features and we'll deprecate the `fvh` and `plain` highlighters when full support for these features have been added to the `unified`.	2017-06-09 14:09:57 +02:00
Tal Levy	a771912a22	Add Ingest-Processor specific Rest Endpoints & Add Grok endpoint (#25059 ) This PR enables Ingest plugins to leverage processor-scoped REST endpoints. First of which being the Grok endpoint that retrieves Grok Patterns for users to retrieve all the built-in patterns. Example usage: Kibana Grok Autocomplete!	2017-06-08 15:24:35 -07:00
Tal Levy	340909582f	remove Ingest's Internal Template Service (#25085 ) Ingest was using it's own wrapper around TemplateScripts and the ScriptService. This commit removes that abstraction	2017-06-08 15:24:03 -07:00
Guillaume Le Floch	3f6d80aa66	Allow removing multiple fields in ingest processor (#24750 ) * Allow removing multiple fields in ingest processor * Iteration 2 * Few fixes	2017-06-08 13:17:44 -07:00
Jim Ferenczi	21a57c1494	Always use DisjunctionMaxQuery to build cross fields disjunction (#25115 ) This commit modifies query_string, simple_query_string and multi_match queries to always use a DisjunctionMaxQuery when a disjunction over multiple fields is built. The tiebreaker is set to 1 in order to behave like the boolean query in terms of scoring. The removal of the coord factor in Lucene 7 made this change mandatory to correctly handle minimum_should_match. Closes #23966	2017-06-08 11:18:17 +02:00
Adrien Grand	a8ea2f0df4	Leverage scorerSupplier when applicable. (#25109 ) The `scorerSupplier` API allows to give a hint to queries in order to let them know that they will be consumed in a random-access fashion. We should use this for aggregations, function_score and matched queries.	2017-06-08 10:19:38 +02:00
Jack Conradson	d187fa78fd	Generate Painless Factory for Creating Script Instances (#25120 )	2017-06-07 16:06:11 -07:00
Jim Ferenczi	3924fd79ef	Add BWC rest test for parent-join after the backport to 5.x	2017-06-07 19:29:01 +02:00
Martijn van Groningen	db8aa8e94e	Changed inner_hits to work with the new join field type and at the same time maintaining support for the `_parent` meta field type/ Relates to #20257	2017-06-07 10:52:49 +02:00
Jason Tedor	e03c4938c5	GET aliases should 404 if aliases are missing Previously the HEAD and GET aliases endpoints were misaigned in behavior. The HEAD verb would 404 if any aliases are missing while the GET verb would not if any aliases existed. When HEAD was aligned with GET, this broke the previous usage of HEAD to serve as an existence check for aliases. It is the behavior of GET that is problematic here though, if any alias is missing the request should 404. This commit addresses this by modifying the behavior of GET to behave in this way. This fixes the behavior for HEAD to also 404 when aliases are missing. Relates #25043	2017-06-06 14:37:29 -04:00
Jim Ferenczi	7e60cf3e54	Move parent_id query to the parent-join module (#25072 ) This change moves the parent_id query to the parent-join module and handles the case when only the parent-join field can be declared on an index (index with single type on). If single type is off it uses the legacy parent join field mapper and switch to the new one otherwise (default in 6). Relates #20257	2017-06-06 19:35:14 +02:00
Tal Levy	d6d0c13bd6	fix grok's pattern parsing to validate pattern names in expression (#25063 ) Unknown patterns used to silently be ignored. This was a problem because users did not know they were providing an invalid pattern name, and maybe thought the rest of their regexes were invalid. Fixes #22831.	2017-06-06 08:07:53 -07:00
Tal Levy	e51246023a	add `exclude_keys` option to KeyValueProcessor (#24876 ) and modify data-structure of `include_keys` and `exclude_keys` to be backed by a HashSet	2017-06-05 14:12:48 -07:00
Alex Benusovich	5463294ec4	Fixed NPEs caused by requests without content. (#23497 ) REST handlers that require a body will throw an an ElasticsearchParseException "request body required". REST handlers that require a body OR source param will throw an ElasticsearchParseException "request body or source param required". Replaced asserts in BulkRequest parsing code with a more descriptive IllegalArgumentException if the line contains an empty object. Updated bulk REST test to verify an empty action line is rejected properly. Updated BulkRequestTests with randomized testing for an empty action line. Used try-with-resouces for XContentParser in AbstractBulkByQueryRestHandler.	2017-06-05 09:08:14 -06:00
Nik Everett	73307a2144	Plugins can register pre-configured char filters (#25000 ) Fixes the plumbing so plugins can register char filters and moves the `html_strip` char filter into analysis-common. Relates to #23658	2017-06-05 09:25:15 -04:00
Jack Conradson	8999104b14	Change ScriptContexts to use needs instead of uses$. (#25036 )	2017-06-02 14:50:51 -07:00
Martijn van Groningen	2a71a7bffc	Change `has_child`, `has_parent` queries and `childen` aggregation to work with the new join field type and at the same time maintaining support for the `_parent` meta field type. Relates to #20257	2017-06-02 23:27:16 +02:00
Ryan Ernst	0d8216d5af	Scripting: Convert CompiledTemplate to a ScriptContext (#25032 ) This commit creates TemplateScript and associated classes so that templates no longer need a special ScriptService.compileTemplate method. The execute() method is equivalent to the old run() method. relates #20426	2017-06-02 13:41:26 -07:00
Jack Conradson	a926ace2e1	Update Painless to Use New Script Contexts (#25015 ) * All public methods starting with get will be added as local variables to the execute method. * The execute method on a ScriptContext must be both public and abstract. This method will be implemented by the Painless compiler. * A static list of parameter names for the execute method must be provided since the names will be eliminated at runtime. * The uses$ methods will still be implemented as before. * A single constructor may be provided by the ScriptContext. This constructor will be overridden by the Painless compiler to include the exact same arguments. This allows instances of a Painless script to potentially contain state. If a constructor is not provided it is assumed the default constructor with no arguments will be used.	2017-06-02 13:36:45 -07:00
Jim Ferenczi	4077600035	Disallow the new parent join field on indices with multiple types Relates https://github.com/elastic/elasticsearch/pull/24978	2017-06-02 18:28:03 +02:00
Chris Earle	6464add551	Always Accumulate Transport Exceptions (#25017 ) This removes the `accumulateExceptions()` method (and its usage) from `TransportNodesAction` and `TransportTasksAction`, forcing both transport actions to always accumulate exceptions. Without this change, some transport actions, like `TransportNodesStatsAction` would respond in very unexpected ways by returning no response due to some failure, but instead of returning an error the response would simply be empty: no response and no error. This results in a very trappy response structure where users can check for an error, then attempt to blindly use the response when no error is returned.	2017-06-02 10:01:42 -04:00
Jim Ferenczi	b8605775df	Add the ability to set eager_global_ordinals in the new parent-join field (#25019 ) Defaults to true	2017-06-02 15:34:22 +02:00
Colin Goodheart-Smithe	779fb9a1c0	Adds nodes usage API to monitor usages of actions (#24169 ) * Adds nodes usage API to monitor usages of actions The nodes usage API has 2 main endpoints /_nodes/usage and /_nodes/{nodeIds}/usage return the usage statistics for all nodes and the specified node(s) respectively. At the moment only one type of usage statistics is available, the REST actions usage. This records the number of times each REST action class is called and when the nodes usage api is called will return a map of rest action class name to long representing the number of times each of the action classes has been called. Still to do: * [x] Create usage service to store usage statistics * [x] Record usage in REST layer * [x] Add Transport Actions * [x] Add REST Actions * [x] Tests * [x] Documentation * Rafactors UsageService so counts are done by the handlers * Fixing up docs tests * Adds a name to all rest actions * Addresses review comments	2017-06-02 08:46:38 +01:00
Jim Ferenczi	f4aee1e583	Disallow multiple parent-join fields per mapping (#25002 ) This change ensures that there is a single parent-join field defined per mapping. The verification is done through the addition of a special field mapper (MetaJoinFieldMapper) with a unique name (_parent_join) that is registered to the mapping service when the first parent-join field is defined. If a new parent-join is added, this field mapper will clash with the new one and the update will fail. This change also simplifies the parent join fetch sub phase by retrieving the parent-join field without iterating on all fields in the mapping.	2017-06-02 09:21:15 +02:00
Ryan Ernst	8d88b94372	Scripting: Add optional context parameter to put stored script requests (#25014 ) This commit adds an optional `context` url parameter to the put stored script request. When a context is specified, the script is compiled against that context before storing, as a validation the script will work when used in that context.	2017-06-01 17:53:48 -07:00
Jim Ferenczi	b5d62ae747	Introduce ParentJoinFieldMapper, a field mapper that creates parent/child relation within documents of the same index (#24978 ) * Introduce ParentJoinFieldMapper, a field mapper that creates parent/child relation within documents of the same index This change adds a new field mapper named ParentJoinFieldMapper. This mapper is a replacement for the ParentFieldMapper but instead of using the types in the mapping it uses an internal field to materialize parent/child relation within a single index. This change also adds a fetch sub phase that automatically retrieves the join name (parent or child name) and the parent id for child documents in the response hit fields. The compatibility with `has_parent`, `has_child` queries and `children` agg will be added in a follow up. Relates #20257	2017-05-31 18:07:21 +02:00
Martijn van Groningen	258be2b135	Moved `keyword_marker`, `trim`, `snowball` and `porter_stemmer` tokenfilter factories from core to common-analysis module. Relates to #23658	2017-05-31 09:34:08 +02:00
Martijn van Groningen	6945d7b046	test: Stop using the `mapping.single_type` setting in percolator tests. Closes #24958	2017-05-31 09:11:33 +02:00
Ryan Ernst	7c1211d2ed	Scripting: Add StatefulFactoryType as optional intermediate factory in script contexts (#24974 ) ScriptContexts currently understand a FactoryType that can produce instances of the script InstanceType. However, for search scripts, this does not work as we have the concept of LeafSearchScript that is created per lucene segment. This commit effectively renames the existing SearchScript class into SearchScript.LeafFactory, which is a new, optional, class that can be defined within a ScriptContext. LeafSearchScript is effectively renamed back into SearchScript. This change allows the model of stateless factory -> stateful factory -> script instance to continue, but in a generic way that any script context may take advantage of. relates #20426	2017-05-30 16:32:14 -07:00
Jack Conradson	04daac2243	Make Painless Compiler Use an Instance Per Context (#24972 ) Allows for easier management of compilation of individual interfaces on a per script context basis.	2017-05-30 14:30:48 -07:00
Tal Levy	2a6e6866bd	Fix floating-point error when DateProcessor parses UNIX (#24947 ) DateProcessor's DateFormat UNIX format parser resulted in a floating point rounding error when parsing certain stringed epoch times. Now Double.parseDouble is used, preserving the intented input.	2017-05-30 09:42:26 -07:00
Jack Conradson	5bcae914d9	Make PainlessScript an interface (#24966 ) Allows more flexibility for the specified script context interface if we want to allow script contexts to specify an abstract class instead.	2017-05-30 09:03:46 -07:00
Christoph Büscher	5a4124d4fb	Fixing template rendering after changes in master	2017-05-30 15:30:24 +02:00
Christoph Büscher	3d6fb4eb0b	Merge branch 'master' into feature/rank-eval	2017-05-30 14:24:26 +02:00
Tanguy Leroux	eea010b408	Add doc_count to ParsedMatrixStats (#24952 ) This commit adds support in ParsedMatrixStats for parsing the doc_count field. Related to #24776	2017-05-30 10:16:08 +02:00
Tanguy Leroux	28d97df67c	Add document count to Matrix Stats aggregation response (#24776 ) This commit adds a `doc_count` field to the response body of Matrix Stats aggregation. It exposes the number of documents involved in the computation of statistics, a value that can already be retrieved using the method MatrixStats.getDocCount() in the Java API.	2017-05-30 09:39:41 +02:00
Nik Everett	5da8ce8318	Remove the need for _UNRELEASED suffix in versions (#24798 ) Removes the need for the `_UNRELEASED` suffix on versions by detecting if a version should be unreleased or not based on the versions around it. This should make it simpler to automate the task of adding a new version label.	2017-05-26 18:36:32 -04:00
Ryan Ernst	74e031e842	Scripting: Rename CompiledType to FactoryType in ScriptContext (#24897 ) This commit renames the concept of the "compiled type" to a "factory type", along with all implementations of this class to be named Factory. This brings it inline with the classes purpose.	2017-05-26 00:02:54 -07:00
Ryan Ernst	8eab1fefa1	Scripting: Make contexts available to ScriptEngine construction (#24896 ) This commit adds collection of all contexts to the parameters of getScriptEngine. This will allow script engines like painless to precache extra information about the contexts.	2017-05-25 16:55:47 -07:00
Ryan Ernst	8aaea51a0a	Scripting: Move context definitions to instance type classes (#24883 ) This is a simple refactoring to move the context definitions into the type that they use. While we have multiple context names for the same class at the moment, this will eventually become one ScriptContext per instance type, so the pattern of a static member on the interface called CONTEXT can be used. This commit also moves the consolidated list of contexts provided by core ES into ScriptModule.	2017-05-25 12:18:45 -07:00
Ryan Ernst	7d03cff820	Scripting: Make ScriptEngine.compile generic on the script context (#24873 ) This commit changes the compile method of ScriptEngine to be generic in the same way it is on ScriptService. This moves the shim of handling the two existing context classes into each script engine, so that each engine can be worked on independently to convert to real handling of contexts.	2017-05-24 20:06:32 -07:00
Ryan Ernst	1daacd97b0	Scripting: Add instance and compiled classes to script contexts (#24868 ) This commit modifies the compile method of ScriptService to be context aware. The ScriptContext is now a generic class which contains both the instance type and compiled type for a script. Instance type may be stateful (for example, pre loading field information for the index a script will execute on, like in expressions), while the compiled type is stateless and used to construct instance type instances. This change is only a first step to cutover ScriptService to the new paradigm. It only converts callers to the script service, and has a small shim to wrap compilation from the script engines to support the current two fixed instance types, SearchScript and ExecutableScript.	2017-05-24 14:29:02 -07:00
Ryan Ernst	0ddd219423	Scripting: Add default implementation of close() for ScriptEngine (#24851 ) Since groovy was removed, we no longer have any ScriptEngines with resources to release. We may want to keep the option open for a script engine to close resources, but this would not be common. This commit adds a default implementation to ScriptEngine for `close()` to reduce the boiler plate that must be added for a ScriptEngine implementation.	2017-05-24 13:19:27 -07:00
Jim Ferenczi	4707377cea	Move InnerHitBuilder queries BWC version to 5.5 after the backport Relates #24676	2017-05-23 22:41:39 +02:00
Martijn van Groningen	34093735e3	Added unit tests for MatrixStatsAggregator	2017-05-23 16:19:12 +02:00
Luca Cavanna	747fa721e4	Build: add client jar for aggs-matrix-stats (#24827 ) This will be useful for the high level client to add support for the matrix stats aggregation, as we will ship with this jar by default like we do for parent-join-client which is aligned with distributing core with the modules already included. Relates to #24796	2017-05-23 13:33:54 +02:00
Jim Ferenczi	9087803cd9	Add the ability to define custom inner hit sub context builder (#24676 ) This commit moves the handling of nested and parent/child inner hits to specialized classes that can be defined outside of ES core. InnerHitBuilderContext is now used by the parent query (nested or hasChild, ...) to build the sub context from the InnerHitBuilder definition. BWC is also ensured so that nodes in previous versions can still send/receive inner hits to/from this version. Relates #20257	2017-05-23 13:06:22 +02:00
Jack Conradson	8887bcc4c6	Fix settings names for script.allowed_types and script.allowed_contexts. (#24831 ) Fixes #24830	2017-05-22 15:08:45 -07:00
Ryan Ernst	52d504bb5f	Scripting: Simplify ScriptContext (#24818 ) As we work towards contexts implying the return type of compilation, we first need ScriptContext to not be an enum. This commit removes the Standard enum and Plugin subclass of ScriptContext.	2017-05-22 13:11:15 -07:00
javanna	7a3e38eb8e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-22 12:25:14 +02:00
Martijn van Groningen	08eda43899	percolator: Use QueryBuilder.rewriteQuery(...) to rewrite query builder instead of QueryBuilder.rewrite(...) Relates to #24617	2017-05-22 12:20:26 +02:00
Ryan Ernst	2de748859f	Scripting: Remove "inline script enabled" on script engines (#24815 ) ScriptEngine implementations have an overridable method to indicate they are safe to use as inline scripts. Since groovy was removed fro 6.0, there are no longer any implementations which used the default false value. Furthermore, the value was not actually read anywhere. This commit removes the method. The ScriptEngineRegistry was also no longer necessary as it only was used to build a map from language to engine.	2017-05-20 12:01:25 -07:00
javanna	db0490343e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-19 18:17:06 +02:00
Nik Everett	b9ea579633	Allow plugins to register pre-configured tokenizers (#24751 ) Allows plugins to register pre-configured tokenizers. Much of the decisions are the same as those in #24223, #24572, and #24223. This only migrates the lowercase tokenizer but I figure that is a good start because it proves out the features.	2017-05-19 12:07:04 -04:00
Nicholas Knize	deb7caf4d3	Upgrade to lucene-7.0.0-snapshot-a0aef2f This commit upgrades master to a current lucene snapshot with commit id a0aef2f.	2017-05-19 10:20:55 -05:00
Jim Ferenczi	d241c4898e	Removes parent child fielddata specialization (#24737 ) This change removes the field data specialization needed for the parent field and replaces it with a simple DocValuesIndexFieldData. The underlying global ordinals are retrieved via a new function called IndexOrdinalsFieldData#getOrdinalMap. The children aggregation is also modified to use a simple WithOrdinals value source rather than the deleted WithOrdinals.Parent. Relates #20257	2017-05-19 17:11:23 +02:00
Tanguy Leroux	83aa00b3f6	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing	2017-05-19 13:13:00 +02:00
Tanguy Leroux	dd731d9e98	Add parsing method for Matrix Stats (#24746 ) Related to #23331	2017-05-19 12:22:54 +02:00
Jack Conradson	1196dfb6bb	Remove Deprecated Script Settings (#24756 ) Removes all fine-grained script settings replaced by scripts.types_allowed and scripts.contexts_allowed.	2017-05-18 13:32:46 -07:00
Christoph Büscher	10d308578e	Fix compilation issues after merge with master	2017-05-18 17:52:58 +02:00
Christoph Büscher	cd0941810f	Merge branch 'master' into feature/rank-eval	2017-05-18 16:47:47 +02:00
Tanguy Leroux	eeef2e6c31	Merge remote-tracking branch 'origin/master' into feature/client_aggs_parsing	2017-05-18 09:43:57 +02:00
Ryan Ernst	463fe2f4d4	Scripting: Remove file scripts (#24627 ) This commit removes file scripts, which were deprecated in 5.5. closes #21798	2017-05-17 14:42:25 -07:00
Ryan Ernst	f8a48badcf	Settings: Remove shared setting property (#24728 ) Shared settings were added intially to allow the few common settings names across aws plugins. However, in 6.0 these settings have been removed. The last use was in netty, but since 6.0 also has the netty 3 modules removed, there is no longer a need for the shared property. This commit removes the shared setting property.	2017-05-17 13:14:12 -07:00
javanna	ce7326eb88	Merge branch 'master' into feature/client_aggs_parsing	2017-05-17 17:59:00 +02:00
Simon Willnauer	2ccc223ff7	Fix Version based BWC and set correct minCompatVersion (#24732 ) Approaching the release of 6.0 we need to sort out the usage of `Version#minimumCompatibilityVersion` which was still set to 5.0.0. Now this change moves it to the latest released version of 5.x (5.4 at this point) to ensure we are compatible with the latest minor of the previous major. This change also removes all the `_UNRELEASED` from the versions that where released and drops versions that were never released and are not expected to be released (bugfixes in minors that are not the latest in the previous major).	2017-05-17 17:27:09 +02:00
Nik Everett	0189a65e6b	Fail rest tests on yaml files (#24740 ) We've switched to supporting only `yml` files but anyone who didn't notice will commit a `yaml` file which won't be executed which is bad because it is easy not to notice. The test to catch this is simple enough that I think it is worth adding just to warn folks about their mistake.	2017-05-17 10:24:57 -04:00
Nik Everett	5fc6f17121	Fix new analysis-common yml tests These tests are broken because I added them with the `yml` extension and didn't realize that we weren't running tests with that extension until we merged #24659. I used that extension in anticipation of #24659 but didn't verify that the tests were actually running. Ooops! Closes #24734	2017-05-17 07:55:04 -04:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Nik Everett	7ef390068a	Move remaining pre-configured token filters into analysis-common (#24716 ) Moves the remaining preconfigured token figured into the analysis-common module. There were a couple of tests in core that depended on the pre-configured token filters so I had to touch them: * `GetTermVectorsCheckDocFreqIT` depended on `type_as_payload` but didn't do anything important with it. I dropped the dependency. Then I moved the test to a single node test case because we're trying to cut down on the number of `ESIntegTestCase` subclasses. * `AbstractTermVectorsTestCase` and its subclasses depended on `type_as_payload`. I dropped their usage of the token filter and added an integration test for the termvectors API that uses `type_as_payload` to the `analysis-common` module. * `AnalysisModuleTests` expected a few pre-configured token filtes be registered by default. They aren't any more so I dropped this assertion. We assert that the `CommonAnalysisPlugin` registers these pre-built token filters in `CommonAnalysisFactoryTests` * `SearchQueryIT` and `SuggestSearchIT` had tests that depended on the specific behavior of the token filters so I moved the tests to integration tests in `analysis-common`.	2017-05-16 13:10:24 -04:00
Simon Willnauer	1cae850cf5	Add a cluster block that allows to delete indices that are read-only (#24678 ) Today when an index is `read-only` the index is also blocked from being deleted which sometimes is undesired since in-order to make changes to a cluster indices must be deleted to free up space. This is a likely scenario in a hosted environment when disk-space is limited to switch indices read-only but allow deletions to free up space.	2017-05-16 17:34:37 +02:00
Martijn van Groningen	f6e19dcedc	percolator: Fix range queries with date range based on current time. Range queries with now based date ranges were previously not allowed, but since #23921 these queries were allowed. This change should really fix range queries with now based date ranges.	2017-05-16 13:13:11 +02:00
Christoph Büscher	059b23e92e	Merge branch 'master' into feature/client_aggs_parsing	2017-05-16 11:54:02 +02:00
Ryan Ernst	6ce597a378	Scripts: Convert template script engines to return String instead of BytesReference (#24447 ) Template script engines (mustache, the only one) currently return a BytesReference that users must know is utf8 encoded. This commit modifies all callers and mustache to have the template engine return String. This is much simpler, and does not require decoding in order to use (for example, in ingest).	2017-05-15 22:37:31 -07:00
Igor Motov	243635222a	Move ReindexAction class to core (#24684 ) This class is also needed for plugins to use reindex functionality. Relates to #24578	2017-05-15 14:28:59 -04:00
Christoph Büscher	0b688a8733	Small improvement in InternalAggregationTestCase test setup after changes in master (#24675 )	2017-05-15 15:06:01 +02:00
Christoph Büscher	42e8d4b761	Merge branch 'master' into feature/client_aggs_parsing Conflicts: core/src/test/java/org/elasticsearch/search/aggregations/bucket/filter/InternalFilterTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/global/InternalGlobalTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/missing/InternalMissingTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/nested/InternalReverseNestedTests.java core/src/test/java/org/elasticsearch/search/aggregations/bucket/sampler/InternalSamplerTests.java modules/parent-join/src/test/java/org/elasticsearch/join/aggregations/InternalChildrenTests.java test/framework/src/main/java/org/elasticsearch/search/aggregations/InternalSingleBucketAggregationTestCase.java	2017-05-15 12:25:07 +02:00
Koen De Groote	f185b69e04	Replace manual copying an array or collection with static methods calls (#24657 )	2017-05-15 09:35:48 +02:00
Jason Tedor	5da940532d	Remove Netty logging hack (#24653 ) Netty removed a logging guarded we added to prevent a scary logging message. We added a hack to work around this. They've added the guard back, so we can remove the hack now.	2017-05-12 16:05:13 -04:00
Jason Tedor	458129a85a	Upgrade to Netty 4.1.11.Final This commit upgrades the Netty dependency from 4.1.10.Final to 4.1.11.Final. Relates #24652	2017-05-12 15:53:51 -04:00
Koen De Groote	878ae8eb3c	Size lists in advance when known When constructing an array list, if we know the size of the list in advance (because we are adding objects to it derived from another list), we should size the array list to the appropriate capacity in advance (to avoid resizing allocations). This commit does this in various places. Relates #24439	2017-05-12 10:36:13 -04:00
Jim Ferenczi	279a18a527	Add parent-join module (#24638 ) * Add parent-join module This change adds a new module named `parent-join`. The goal of this module is to provide a replacement for the `_parent` field but as a first step this change only moves the `has_child`, `has_parent` queries and the `children` aggregation to this module. These queries and aggregations are no longer in core but they are deployed by default as a module. Relates #20257	2017-05-12 15:58:06 +02:00
Simon Willnauer	be2a6ce80b	Notify onConnectionClosed rather than onNodeDisconnect to prune transport handlers (#24639 ) Today we prune transport handlers in TransportService when a node is disconnected. This can cause connections to starve in the TransportService if the connection is opened as a short living connection ie. without sharing the connection to a node via registering in the transport itself. This change now moves to pruning based on the connections cache key to ensure we notify handlers as soon as the connection is closed for all connections not just for registered connections. Relates to #24632 Relates to #24575 Relates to #24557	2017-05-12 15:40:40 +02:00
Nik Everett	a40c3a99c9	Reindex: don't duplicate _source parameter (#24629 ) If the request asks for the `_source` stored field then don't duplicate it when forcing the `_source` parameter to onto the request for reindex-from-remote from versions before 1.0. Closes #24628	2017-05-11 16:30:06 -04:00
Simon Willnauer	1155615536	Move DeleteByQuery and Reindex requests into core (#24578 ) This allows other plugins to use a client to call the functionality that is in the core modules without duplicating the logic. Plugins can now safely send the request and response classes via the client even if the requests are executed locally. All relevant classes are loaded by the core classloader such that plugins can share them. This is re-adds this commit that was revered in `952feb58e4`	2017-05-11 20:22:30 +02:00
Simon Willnauer	952feb58e4	Revert "Move DeleteByQuery and Reindex requests into core (#24578 )" This reverts commit `6ea2ae32b8`.	2017-05-11 18:26:40 +02:00
Uwe Schindler	f7c50f5f71	Painless: Optimize instance creation in LambdaBootstrap (#24618 ) Optimize instance creation in LambdaBootstrap to allow Hotspot's escape analysis, preventing us from creating many instances stressing GC	2017-05-11 09:10:27 -07:00
Simon Willnauer	6ea2ae32b8	Move DeleteByQuery and Reindex requests into core (#24578 ) This allows other plugins to use a client to call the functionality that is in the core modules without duplicating the logic. Plugins can now safely send the request and response classes via the client even if the requests are executed locally. All relevant classes are loaded by the core classloader such that plugins can share them.	2017-05-11 16:20:40 +02:00
Nik Everett	8188569fd1	Add qa module that tests reindex-from-remote against pre-5.0 versions of Elasticsearch (#24561 ) Adds tests for reindex-from-remote for the latest 2.4, 1.7, and 0.90 releases. 2.4 and 1.7 are fairly popular versions but 0.90 is a point of pride. This fixes any issues those tests revealed. Closes #23828 Closes #24520	2017-05-11 10:06:20 -04:00
Martijn van Groningen	840da4aebf	Removed deprecated template query. Relates to #19390	2017-05-11 14:56:45 +02:00
Nik Everett	e49ebd04fd	Get more information when reindex test fails It looks like it leaks contexts but it isn't clear why so this adds a little more logging. This is the failure: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.4+multijob-intake/94/console	2017-05-11 08:36:54 -04:00
Nik Everett	65f2717ab7	Make PreConfiguredTokenFilter harder to misuse (#24572 ) There are now three public static method to build instances of PreConfiguredTokenFilter and the ctor is private. I chose static methods instead of constructors because those allow us to change out the implementation returned if we so desire. Relates to #23658	2017-05-10 22:39:43 -04:00
Martijn van Groningen	0ff5933a55	Rewrite multi search template api to delegate to multi search api instead of to search template api. The max concurrent searches logic is complex and we shouldn't duplicate that in multi search template api, so we should template each individual template search request and then delegate to multi search api.	2017-05-10 11:47:53 +02:00
Martijn van Groningen	760e5fce77	Rewrite multi search template api to delegate to multi search api instead of to search template api. The max concurrent searches logic is complex and we shouldn't duplicate that in multi search template api, so we should template each individual template search request and then delegate to multi search api.	2017-05-10 11:23:24 +02:00
Isabel Drost-Fromm	bd559d96d4	This adds max_concurrent_searches to multi-search-template endpoint. Closes #20912	2017-05-10 11:23:24 +02:00
Martijn van Groningen	51c74ce547	Added unit tests for InternalMatrixStats. Also moved InternalAggregationTestCase to test-framework module in order to make use of it from other modules than core. Relates to #22278	2017-05-10 11:06:18 +02:00
Ryan Ernst	9ca7d28552	Scripting: Remove "service" from ScriptEngine interface name (#24574 ) This commit renames ScriptEngineService to ScriptEngine. It is often confusing because we have the ScriptService, and then ScriptEngineService implementations, but the latter are not services as we see in other places in elasticsearch.	2017-05-10 00:47:33 -07:00
Nik Everett	bb06d8ec4f	Allow plugins to build pre-configured token filters (#24223 ) This changes the way we register pre-configured token filters so that plugins can declare them and starts to move all of the pre-configured token filters out of core. It doesn't finish the job because doing so would make the change unreviewably large. So this PR includes a shim that keeps the "old" way of registering pre-configured token filters around. The Lowercase token filter is special because there is a "special" interaction between it and the lowercase tokenizer. I'm not sure exactly what to do about it so for now I'm leaving it alone with the intent of figuring out what to do with it in a followup. This also renames these pre-configured token filters from "pre-built" to "pre-configured" because that seemed like a more descriptive name. This is a part of #23658	2017-05-09 14:50:49 -04:00
javanna	e875f7f72e	remove duplicated import in AppendProcessor	2017-05-08 10:36:36 +02:00
Koen De Groote	13c17c75b5	Remove unneeded empty string concatentation This commit removes concatenation by empty string in places where it is simply not needed to obtain a string representation. Relates #24411	2017-05-06 00:28:53 -04:00
Nik Everett	9bc7e210a0	Test: Move flag to painless tests (#24494 ) The `-XX:-OmitStackTraceInFastThrow` flag is only required by Painless's tests so we'll only set it there. This is much simpler.	2017-05-04 13:11:09 -04:00
Jason Tedor	06364cf6f0	You had one job Netty logging guard In pre-release versions of Elasticsearch 5.0.0, users were subject to log messages of the form "your platform does not.reliably.potential system instability". This is because we disable Netty from being unsafe, and Netty throws up this scary info-level message when unsafe is unavailable, even if it was unavailable because the user requested that it be unavailabe. Users were rightly confused, and concerned. So, we contributed a guard to Netty to prevent this log message from showing up when unsafe was explicitly disabled. This guard shipped with all versions of Netty that shipped starting with Elasticsearch 5.0.0. Unfortunately, this guard was lost in an unrelated refactoring and now with the 4.1.10.Final upgrade, users will again see this message. This commit is a hack around this until we can get a fix upstream again. Relates #24469	2017-05-03 18:49:08 -04:00
Adrien Grand	7311aaa2eb	Fix PercolatorQuerySearchIT to not create multiple types.	2017-05-03 16:44:14 +02:00
Luca Cavanna	92bfd16c58	Java api: ActionRequestBuilder#execute to return a PlainActionFuture (#24415 ) This change makes the request builder code-path same as `Client#execute`. The request builder used to return a `ListenableActionFuture` when calling execute, which allows to associate listeners with the returned future. For async execution though it is recommended to use the `execute` method that accepts an `ActionListener`, like users would do when using `Client#execute`. Relates to #24412 Relates to #9201	2017-05-03 11:20:53 +02:00
Nik Everett	3b47355e56	Try not to lose stacktraces (#24426 ) This adds `-XX:-OmitStackTraceInFastThrow` to the JVM arguments which should prevent the JVM from omitting stack traces on common exception sites. Even though these sites are common, we'd still like the exceptions to debug them. This also adds the flag when running tests and adapts some tests that had workarounds for the absense of the flag. Closes #24376	2017-05-02 11:34:12 -04:00
Uwe Schindler	62fa7081b0	Painless: Add tests to check for existence and correct detection of the special Java 9 optimizations: Indified String concat and MethodHandles#ArrayLengthHelper() (#24405 )	2017-05-02 08:08:51 -07:00
Jason Tedor	40ff169c54	Set available processors for Netty Netty uses the number of processors for sizing various resources (e.g., thread pools, buffer pools, etc.). However, it uses the runtime number of available processors which might not match the configured number of processors as set in Elasticsearch to limit the number of threads (for example, in Docker containers). A new feature was added to Netty that enables configuring the number of processors Netty should see for sizing this various resources. This commit takes advantage of this feature to set this number of available processors to be equal to the configured number of processors set in Elasticsearch. Relates #24420	2017-05-01 19:27:28 -04:00
Uwe Schindler	e88d54bf0a	Painless: Fix method references to ctor with the new LambdaBootstrap and cleanup code (#24406 ) * Fix wrong delegation to constructors when compiling lambdas with method references to ctors. Also remove the get$lambda factory. * Cleanup code and remove unneeded transformations between binary and internal class names (uses ASM Type class instead) * Cleanup Exception handling * Simplification by moving the type adaption to the outside * Remove STATIC access flag from our Lambda class (not required and also officially not allowed) * Move the lambda counter to the classloader, so we have a per-script lambda ID * Change Codesource of generated lambdas to be consistent	2017-05-01 16:15:13 -07:00
Jason Tedor	eefcad94b8	Upgrade Netty to 4.1.10.Final This commit upgrades the Netty dependency from version 4.1.9.Final to version 4.1.10.Final. Relates #24414	2017-05-01 10:25:32 -04:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Nik Everett	bc45d10e82	Remove most usages of 1-arg Script ctor (#24325 ) The one argument ctor for `Script` creates a script with the default language but most usages of are for testing and either don't care about the language or are for use with `MockScriptEngine`. This replaces most usages of the one argument ctor on `Script` with calls to `ESTestCase#mockScript` to make it clear that the tests don't need the default scripting language. I've also factored out some copy and pasted script generation code into a single place. I would have had to change that code to use `mockScript` anyway, so it was easier to perform the refactor. Relates to #16314	2017-04-26 16:04:38 -04:00
Nik Everett	7c3efb829b	Move char filters into analysis-common (#24261 ) Another step down the road to dropping the lucene-analyzers-common dependency from core. Note that this removes some tests that no longer compile from core. I played around with adding them to the analysis-common module where they would compile but we already test these in the tests generated from the example usage in the documentation. I'm not super happy with the way that `requriesAnalysisSettings` works with regards to plugins. I think it'd be fairly bug-prone for plugin authors to use. But I'm making it visible as is for now and I'll rethink later. A part of #23658	2017-04-26 13:25:34 -04:00
Martijn van Groningen	c17de49a6d	[percolator] Fix memory leak when percolator uses bitset or field data cache. The percolator doesn't close the IndexReader of the memory index any more. Prior to 2.x the percolator had its own SearchContext (PercolatorContext) that did this, but that was removed when the percolator was refactored as part of the 5.0 release. I think an alternative way to fix this is to let percolator not use the bitset and fielddata caches, that way we prevent the memory leak. Closes #24108	2017-04-26 11:08:15 +02:00
Jason Tedor	1b660c5127	Fix incorrect logger invocation It looks like auto-complete gave us a nasty surprise here with Logger#equals being invoked instead of Logger#error swallowing the absolute worst-possible level of a log message. This commit fixes the invocation.	2017-04-25 16:25:52 -04:00
Ryan Ernst	6ebf08759b	Templates: Add compileTemplate method to ScriptService for template consumers (#24280 ) This commit adds a compileTemplate method to the ScriptService. Eventually this will be used to easily cutover all consumers to a new TemplateService. relates #16314	2017-04-24 15:45:20 -07:00
Nik Everett	5fbc86e2aa	Allow painless to load stored fields (#24290 ) We document that painless can load stored fields but it can't because the classes that make that work aren't whitelisted.	2017-04-24 14:22:39 -04:00
Jack Conradson	30cc33e2e5	Fix Painless Lambdas for Java 9 (#24070 ) Replaces LambdaMetaFactory with LambdaBootstrap, a custom solution for lambdas in Painless using a design similar to LambdaMetaFactory, but allows for custom adaptation of types which recent changes to LambdaMetaFactory no longer allowed.	2017-04-24 09:58:02 -07:00
Christoph Büscher	d1703decee	Adapting to changes in master	2017-04-22 22:06:06 +02:00
Christoph Büscher	5254731039	Merge branch 'master' into feature/rank-eval	2017-04-22 21:47:32 +02:00
Ryan Ernst	473e98981b	Scripts: Remove unnecessary executable shortcut (#24264 ) ScriptService has two executable methods, one which takes a CompiledScript, which is similar to search, and one that takes a raw Script and both compiles and returns an ExecutableScript for it. The latter is not needed, and the call sites which used one or the other were mixed. This commit removes the extra executable method in favor of callers first calling compile, then executable.	2017-04-21 17:53:03 -07:00
Ryan Ernst	aadc33d260	Scripts: Remove unwrap method from executable scripts (#24263 ) The unwrap method was leftover from support javascript and python. Since those languages are removed in 6.0, this commit removes the unwrap feature from scripts.	2017-04-21 17:50:22 -07:00
Nik Everett	caf376c8af	Start building analysis-common module (#23614 ) Start moving built in analysis components into the new analysis-common module. The goal of this project is: 1. Remove core's dependency on lucene-analyzers-common.jar which should shrink the dependencies for transport client and high level rest client. 2. Prove that analysis plugins can do all the "built in" things by moving all "built in" behavior to a plugin. 3. Force tests not to depend on any oddball analyzer behavior. If tests need anything more than the standard analyzer they can use the mock analyzer provided by Lucene's test infrastructure.	2017-04-19 18:51:34 -04:00
Nik Everett	db0a5e4263	Painless: more testing for script_stack (#24168 ) `script_stack` is super useful when debugging Painless scripts because it skips all the "weird" stuff involved that obfuscates where the actual error is. It skips Painless's internals and call site bootstrapping. It works fine, but it didn't have many tests. This converts a test that we had for line numbers into a test for the `script_stack`. The line numbers test was an indirect test for `script_stack`.	2017-04-18 22:52:59 -04:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Nik Everett	0b15fde27a	Start on custom whitelists for Painless (#23563 ) We'd like to be able to support context-sensitive whitelists in Painless but we can't now because the whitelist is a static thing. This begins to de-static the whitelist, in particular removing the static keyword from most of the methods on `Definition` and plumbing the static instance into the appropriate spots as though it weren't static. Once we de-static all the methods we should be able to fairly simply build context-sensitive whitelists. The only "fun" bit of this is that I added another layer in the chain of methods that bootstraps `def` calls. Instead of running `invokedynamic` directly on `DefBootstrap` we now `invokedynamic` `$bootstrapDef` on the script itself loads the `Definition` that the script was compiled against and then calls `DefBootstrap`. I chose to put `Definition` into `Locals` so I didn't have to change the signature of all the `analyze` methods. I could have do it another way, but that seems ok for now.	2017-04-18 10:39:42 -04:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Nik Everett	25119a7e78	Harden painless test against "fun" caching (#24077 ) The JVM caches `Integer` objects. This is known. A test in Painless was relying on the JVM not caching the particular integer `1000`. It turns out that when you provide `-XX:+AggressiveOpts` the JVM does cache `1000`, causing the test to fail when that is specified. This replaces `1000` with a randomly selected integer that we test to make sure isn't cached by the JVM. Hopefully this test is good enough. It relies on the caching not changing in between when we check that the value isn't cached and when we run the painless code. The cache now is a simple array but there is nothing preventing it from changing. If it does change in a way that thwarts this test then the test fail fail again. At least when that happens the next person can see the comment about how it is important that the integer isn't cached and can follow that line of inquiry. Closes #24041	2017-04-17 13:44:05 -04:00
Jason Tedor	972bdc09ee	Reject empty IDs When indexing a document via the bulk API where IDs can be explicitly specified, we currently accept an empty ID. This is problematic because such a document can not be obtained via the get API. Instead, we should rejected these requets as accepting them could be a dangerous form of leniency. Additionally, we already have a way of specifying auto-generated IDs and that is to not explicitly specify an ID so we do not need a second way. This commit rejects the individual requests where ID is specified but empty. Relates #24118	2017-04-15 10:36:03 -04:00
Jay Modi	30ab8739a6	Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23941 ) This commit makes closing a ReleasableBytesStreamOutput release the underlying BigArray so that we can use try-with-resources with these streams and avoid leaking memory by not returning the BigArray. As part of this change, the ReleasableBytesStreamOutput adds protection to only release the BigArray once. In order to make some of the changes cleaner, the ReleasableBytesStream interface has been removed. The BytesStream interface is changed to a abstract class so that we can use it as a useable return type for a new method, Streams#flushOnCloseStream. This new method wraps a given stream and overrides the close method so that the stream is simply flushed and not closed. This behavior is used in the TcpTransport when compression is used with a ReleasableBytesStreamOutput as we need to close the compressed stream to ensure all of the data is written from this stream. Closing the compressed stream will try to close the underlying stream but we only want to flush so that all of the written bytes are available. Additionally, an error message method added in the BytesRestResponse did not use a builder provided by the channel and instead created its own JSON builder. This changes that method to use the channel builder and in turn the bytes stream output that is managed by the channel. Note, this commit differs from `6bfecdf921` in that it updates ReleasableBytesStreamOutput to handle the case of the BigArray decreasing in size, which changes the reference to the BigArray. When the reference is changed, the releasable needs to be updated otherwise there could be a leak of bytes and corruption of data in unrelated streams. This reverts commit `afd45c1432`, which reverted #23572.	2017-04-14 10:50:31 -04:00
Tim Brooks	ffaac5a08a	Simplify BulkProcessor handling and retry logic (#24051 ) This commit collapses the SyncBulkRequestHandler and AsyncBulkRequestHandler into a single BulkRequestHandler. The new handler executes a bulk request and awaits for the completion if the BulkProcessor was configured with a concurrentRequests setting of 0. Otherwise the execution happens asynchronously. As part of this change the Retry class has been refactored. withSyncBackoff and withAsyncBackoff have been replaced with two versions of withBackoff. One method takes a listener that will be called on completion. The other method returns a future that will been complete on request completion.	2017-04-13 14:48:52 -05:00
Nik Everett	e99f90fb46	Add more debugging information to rethrottles I'm still trying to track down failures like: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+dockeralpine-periodic/1180/console It looks like a task is hanging but I'm not sure why. So this adds more logging for next time.	2017-04-12 08:37:31 -04:00
Jason Tedor	653619079c	Skip two Painless branch tests on Windows This commit skips the two Painless tests EqualsTests#testBranchEqualsDefAndPrimitive and EqualsTests#testBranchNotEqualsDefAndPrimitive on Windows as the tests are repeatedly failing there.	2017-04-11 06:19:42 -04:00
Colin Goodheart-Smithe	0114f0061c	Removes version 2.x constants from Version (#24011 ) * Removes version 2.x constants from Version Closes #21887 * Addresses review comments	2017-04-11 08:31:22 +01:00
Luca Cavanna	2c545c064d	Move getProperty method out of MultiBucketsAggregation.Bucket interface (#23988 ) The getProperty method is an internal method needed to run pipeline aggregations and retrieve info by path from the aggs tree. It is not needed in the MultiBucketsAggregation.Bucket interface, which is returned to users running aggregations from the transport client. The method is moved to the InternalMultiBucketAggregation class as that's where it belongs.	2017-04-10 13:35:01 +02:00
Nik Everett	de6837b7ac	Fix throttled reindex_from_remote (#23953 ) reindex_from_remote was using `TimeValue#toString` to generate the scroll timeout which is bad because that generates fractional time values that are useful for people but bad for Elasticsearch which doesn't like to parse them. This switches it to using `TimeValue#getStringRep` which spits out whole time values. Closes to #23945 Makes #23828 even more desirable	2017-04-07 15:56:52 -04:00
Martijn van Groningen	3d9671a668	[PERCOLATOR] Allowing range queries with now ranges inside percolator queries. Before now ranges where forbidden, because the percolator query itself could get cached and then the percolator queries with now ranges that should no longer match, incorrectly will continue to match. By disabling caching when the `percolator` is being used, the percolator can now correctly support range queries with now based ranges. I think this is the right tradeoff. The percolator query is likely to not be the same between search requests and disabling range queries with now ranges really disabled people using the percolator for their use cases. Also fixed an issue that existed in the percolator fieldmapper, it was unable to find forbidden queries inside `dismax` queries. Closes #23859	2017-04-07 08:44:43 +02:00
Tim Brooks	5b1fbe5e6c	Decouple BulkProcessor from client implementation (#23373 ) This commit modifies the BulkProcessor to be decoupled from the client implementation. Instead it just takes a BiConsumer<BulkRequest, ActionListener<BulkResponse>> that executes the BulkRequest.	2017-04-05 12:12:43 -05:00
Jason Tedor	afd45c1432	Revert "Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 )" This reverts commit `6bfecdf921`.	2017-04-04 20:33:51 -04:00
Christoph Büscher	6cfbef73a0	Follow renaming of randomAsciiOfLength() to randomAlphaOfLength()	2017-04-04 18:31:00 +02:00
Christoph Büscher	024ed1b6ca	Merge branch 'master' into feature/rank-eval	2017-04-04 18:23:41 +02:00
Jay Modi	6bfecdf921	Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 ) This commit makes closing a ReleasableBytesStreamOutput release the underlying BigArray so that we can use try-with-resources with these streams and avoid leaking memory by not returning the BigArray. As part of this change, the ReleasableBytesStreamOutput adds protection to only release the BigArray once. In order to make some of the changes cleaner, the ReleasableBytesStream interface has been removed. The BytesStream interface is changed to a abstract class so that we can use it as a useable return type for a new method, Streams#flushOnCloseStream. This new method wraps a given stream and overrides the close method so that the stream is simply flushed and not closed. This behavior is used in the TcpTransport when compression is used with a ReleasableBytesStreamOutput as we need to close the compressed stream to ensure all of the data is written from this stream. Closing the compressed stream will try to close the underlying stream but we only want to flush so that all of the written bytes are available. Additionally, an error message method added in the BytesRestResponse did not use a builder provided by the channel and instead created its own JSON builder. This changes that method to use the channel builder and in turn the bytes stream output that is managed by the channel.	2017-04-04 17:01:30 +01:00
Jason Tedor	3136ed1490	Rename random ASCII helper methods This commit renames the random ASCII helper methods in ESTestCase. This is because this method ultimately uses the random ASCII methods from randomized runner, but these methods actually only produce random strings generated from [a-zA-Z]. Relates #23886	2017-04-04 11:04:18 -04:00
Nik Everett	ebd74f09cf	Add extra debugging to reindex cancel tests Adds more diagnostics when reindex's cancel tests fail. It fails every once in a while and didn't have useful failure messages: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.3+multijob-unix-compatibility/os=amazon/86/consoleFull	2017-03-31 11:17:06 -04:00
Jim Ferenczi	f3a925fdbe	Fix reindex with a remote source on a version before 2.0.0 (#23805 ) Send the scroll id in the body as plain text when the remote version is before 2.0.0	2017-03-31 09:07:43 +02:00
Tim Brooks	5fa80a6521	Pass exception from sendMessage to listener (#23559 ) This commit changes the listener passed to sendMessage from a Runnable to a ActionListener. This change also removes IOException from the sendMessage signature. That signature is misleading as it allows implementers to assume an exception will be thrown in case of failure. That does not happen due to Netty's async nature.	2017-03-30 15:08:23 -05:00
Dimitris Athanasiou	34f116eae3	Require explicit query in _delete_by_query API (#23632 ) As the query of a search request defaults to match_all, calling _delete_by_query without an explicit query may result in deleting all data. In order to protect users against falling into that pitfall, this commit adds a check to require the explicit setting of a query. Closes #23629	2017-03-28 15:44:57 +01:00
Jim Ferenczi	0e95c90e9f	Upgrade to Lucene 6.5.0 (#23750 )	2017-03-27 15:57:54 +02:00
Ryan Ernst	4cb8a0100c	Build: Rewrite antlr regeneration in gradle (#23733 ) This change ports the regeneration of antlr parser/lexer into gradle (but does still take advantage of ant calls where appropriate).	2017-03-24 09:44:53 -07:00
Ryan Ernst	8c53555b28	Tests: Use local clone build of 5.x with bwc tests (#22946 ) The current rest backcompat tests, which run against a mixed cluster of 5.x and 6.0 nodes, depend on snapshot builds of 5.x. However, this has the potential for inconsistency that results in CI failures, and happens quite often, whenever some backcompat logic is added to 5.x, but the bwc test on master fails because the 5.x code has not yet been published as a snapshot. This change creates a git clone of the 5.x branch, builds the zip distribution, and ties that into gradle substitutions for the 5.x version.	2017-03-23 22:32:13 -07:00
Christoph Büscher	4a75ede208	Reformatting source to fit 100 character line length restriction	2017-03-23 20:20:22 +01:00
Christoph Büscher	96fc3aaf6f	Merge branch 'master' into feature/rank-eval	2017-03-23 19:55:47 +01:00
AdityaJNair	63757efe9c	Remove DocumentMapper#parse(String index, String type, String id, BytesReference source) (#23706 ) Removed `parse(String index, String type, String id, BytesReference source)` in DocumentMapper.java and replaced all of its use in Test files with `parse(SourceToParse source)`. `parse(String index, String type, String id, BytesReference source)` was only used in test files and never in the main code so it was removed. All of the test files that used it was then modified to use `parse(SourceToParse source)` method that existing in DocumentMapper.java	2017-03-23 11:01:09 -04:00
Nik Everett	257a7d77ed	Painless: Fix regex lexer and error messages (#23634 ) Without this change, if write a script with multiple regexes sometimes the lexer will decide to look at them like one big regex and then some trailing garbage. Like this discuss post: https://discuss.elastic.co/t/error-with-the-split-function-in-painless-script/79021 ``` def val = /\\\\/.split(ctx._source.event_data.param17); if (val[2] =~ /\\./) { def val2 = /\\./.split(val[2]); ctx._source['user_crash'] = val2[0] } else { ctx._source['user_crash'] = val[2] } ``` The error message you get from the lexer is `lexer_no_viable_alt_exception` right after the second regex. With this change each regex is just a single regex like it ought to be. As a bonus, while looking into this issue I found that the error reporting for regexes wasn't very nice. If you specify an invalid pattern then you get an error marker on the start of the pattern with the JVM's regex error message which attempts to point you to the location in the regex but is totally unreadable in the JSON response. This change fixes the location to point to the appropriate spot inside the pattern and removes the portion of the JVM's error message that doesn't render well. It is no longer needed now that we point users to the appropriate spot in the pattern.	2017-03-22 15:56:17 -04:00
Nik Everett	bc65be2a65	Reindex: wait for cleanup before responding (#23677 ) Changes reindex and friends to wait until the entire request has been "cleaned up" before responding. "Clean up" in this context is clearing the scroll and (for reindex-from-remote) shutting down the client. Failures to clean up are still only logged, not returned to the user. Closes #23653	2017-03-21 15:33:39 -04:00
Christoph Büscher	f5388e5799	Adapting rank_eval integration tests	2017-03-14 12:21:28 -07:00
Christoph Büscher	cf35545e2d	Merge branch 'master' into feature/rank-eval	2017-03-13 17:36:13 -07:00
Jason Tedor	8dfb68cf1c	Upgrade to Netty 4.1.9 This commit upgrades the Netty dependencies from version 4.1.8 to version 4.1.9. This commit picks up a few bug fixes that impacted us: - Netty was incorrectly ignoring interfaces with self-assigned MAC addresses (e.g., instances running in Docker containers or on EC2) - incorrect handling of the Expect: 100-continue header Relates #23540	2017-03-11 18:28:31 -08:00
Daniel Mitterdorfer	6f7cd71e1f	Adjust default Netty receive predictor size to 64k (#23542 ) With this commit we change the default receive predictor size for Netty from 32kB to 64kB as our testing has shown that this leads to less allocations on smaller heaps like the default out of the box configuration and this value also works reasonably well for larger heaps. Closes #23185	2017-03-11 17:32:35 -08:00
Jason Tedor	8e09eca9a6	Mute Painless lambda tests on JDK 9 This commit mutes a ton of Painless lambda tests on JDK 9. This commit did not attempt to discover exactly which tests are failing, but instead just blanket muted all tests in LambdaTests, FunctionRefTests, and AugmentationTests. Relates #23473	2017-03-02 22:36:26 -05:00
Jay Modi	01502893eb	HTTP transport stashes the ThreadContext instead of the RestController (#23456 ) Previously, the RestController would stash the context prior to copying headers. However, there could be deprecation log messages logged and in turn warning headers being added to the context prior to the stashing of the context. These headers in the context would then be removed from the request and also leaked back into the calling thread's context. This change moves the stashing of the context to the HttpTransport so that the network threads' context isn't accidentally populated with warning headers and to ensure the headers added early on in the RestController are not excluded from the response.	2017-03-02 14:44:01 -05:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Nik Everett	2dcdaa1c9d	Mustache: don't extend AbstractComponent (#23419 ) Don't extend `AbstractComponent` in `MustacheScriptEngine` because it doesn't buy anything.	2017-03-01 14:54:27 -05:00
Ryan Ernst	019263d664	Revert "Internal: Change version constant names for already released versions (#23416 )" This reverts commit `dc0e93ed62`.	2017-02-28 14:45:13 -08:00
Ryan Ernst	dc0e93ed62	Internal: Change version constant names for already released versions (#23416 ) We have many version constants in master that have already been released, but are still marked (by naming convention) as unreleased. This commit renames those version constants.	2017-02-28 13:05:44 -08:00
Tanguy Leroux	33eb6a13bf	Tests: Fix RemoteScrollableHitSourceTests With #23307, the expected exception is wrapped two times into a RuntimeException instead of being thrown directly.	2017-02-28 11:30:33 +01:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
javanna	9a2dba3036	[TEST] add support for binary responses to REST tests infra	2017-02-27 12:27:03 +01:00
javanna	dad025a6ad	[TEST] move test for binary field to specific test file that sets Content-Type header explicitly	2017-02-27 12:27:03 +01:00
Christoph Büscher	ddae32705c	Adapting build.gradle to changes on master	2017-02-27 11:42:46 +01:00
Christoph Büscher	1f4c4d99b9	Merge branch 'master' into feature/rank-eval	2017-02-27 11:25:17 +01:00
Ryan Ernst	9df95def90	Build: Remove extra copies of netty license (#23361 ) The dependencyLicenses check has the ability to map multiple jar files to the same license file. However, netty was not taking advantage of this, and had duplicate copies of its license/notice files for each jar. This commit reduces the copies to one and uses the mapping feature.	2017-02-24 14:40:07 -08:00
Jason Tedor	f85a7aed37	Keep the pipeline handler queue small initially This commit sets the intial size of the pipeline handler queue small to prevent waste if pipelined requests are never sent. Since the queue will grow quickly if pipeline requests are indeed set, this should not be problematic. Relates #23335	2017-02-23 14:17:46 -05:00
sabi0	09b3c7f270	Do not create String instances in 'Strings' methods accepting StringBuilder (#22907 )	2017-02-23 10:57:34 -08:00
Christoph Büscher	8b1b152e91	Remove abstract InternalMetricsAggregation class (#23326 ) This class doesn't seem to do much other than to group together certain types of aggregations.	2017-02-23 18:03:40 +01:00
Jason Tedor	3e69c38dbd	Respect promises on pipelined responses When pipelined responses are sent to the pipeline handler for writing, they are not necessarily written immediately. They must be held in a priority queue until all responses preceding the given response are written. This means that when write is invoked on the handler, the promise that is attached to the write invocation will not necessarily be the promise associated with the responses that are written while the queue is drained. To address this, the promise associated with a pipelined response must be held with the response and then used when the channel context is actually written to. This was introduced when ensuring that the releasing promise is always chained through on write calls lest the releasing promise never be invoked. This leads to many failing test cases, so no new test cases are needed here. Relates #23317	2017-02-23 09:32:43 -05:00
Jason Tedor	6ca90a61a6	Relocate a comment in HttpPipeliningHandler This commit moves a comment in HttpPipeliningHandler as it makes more sense for this comment to be where the field that it is explaining is declared.	2017-02-22 20:51:18 -05:00
Jason Tedor	30f723d2b0	Add comments to HttpPipeliningHandler This commit adds some comments explaining the design of HttpPipeliningHandler.	2017-02-22 20:47:34 -05:00
Ryan Ernst	175bda64a0	Build: Rework integ test setup and shutdown to ensure stop runs when desired (#23304 ) Gradle's finalizedBy on tasks only ensures one task runs after another, but not immediately after. This is problematic for our integration tests since it allows multiple project's integ test clusters to be simultaneously. While this has not been a problem thus far (gradle 2.13 happened to keep the finalizedBy tasks close enough that no clusters were running in parallel), with gradle 3.3 the task graph generation has changed, and numerous clusters may be running simultaneously, causing memory pressure, and thus generally slower tests, or even failure if the system has a limited amount of memory (eg in a vagrant host). This commit reworks how integ tests are configured. It adds an `integTestCluster` extension to gradle which is equivalent to the current `integTest.cluster` and moves the rest test runner task to `integTestRunner`. The `integTest` task is then just a dummy task, which depends on the cluster runner task, as well as the cluster stop task. This means running `integTest` in one project will both run the rest tests, and shut down the cluster, before running `integTest` in another project.	2017-02-22 12:43:15 -08:00
Jason Tedor	708d11f54a	Ensure that releasing listener is called When sending a response to a client, we attach a releasing listener to the channel promise. If the client disappears before the response is sent, the releasing listener was never notified. The reason the listeners were never notified was due to a mistaken invocation of write and flush on the channel which has two overrides: one that takes an existing promise, and one that does not and instead creates a new promise. When the client disappears, it is this latter promise that is notified, which does not contain the releasing listener. This commit addreses this issue by invoking the override that passes our channel promise through. Relates #23310	2017-02-22 13:54:17 -05:00
javanna	594f00c582	Remove content type auto-detection from search templates Now that search templates always get converted to json, we don't need to try and auto-detect their content-type, which anyways didn't work as expected before given that only json was really working.	2017-02-22 16:20:53 +01:00
javanna	f2acf466aa	Convert script/template objects to json format Elasticsearch accepts multiple content-type formats, hence scripts can be stored/provided in json, yaml, cbor or smile. Yet the format that should be used internally is json. This is a problem mainly around search templates, as they only support json out of the four content-types, so instead of maintaining the content-type of the request we should rather convert the scripts/templates to json. Binary formats were not previously supported. If you stored a template in yaml format, you'd get back an error "No encoder found for MIME type [application/yaml]" when trying to execute it. With this commit the request content-type is independent from the template, which always gets converted to json internally. That is transparent to users and doesn't affect the content type of the response obtained when executing the template.	2017-02-22 16:20:53 +01:00
javanna	9391c6ffa9	Replace CustomMustacheFactory constant with same constant from Script (CONTENT_TYPE_OPTION)	2017-02-22 16:20:53 +01:00
Nik Everett	38d25a0369	Fix Painless's implementation of interfaces returning primitives (#23298 ) Fixes Painless to properly implement scripts that return primitives and void. Adds some simple tests that we emit sane opcodes and some other tests that we implement primitives as expected. Mostly this is just a fix following up from #22983 but there is one thing I did really worth talking about, I think. So, before this script Painless scripts could only ever return Object and they did would always return null for paths that didn't return any values. Now that they can return primitives the question is "what should Painless return from paths that don't return any values?" And I answered that with "whatever the JLS default value is". So 0/0L/0f/0d/false.	2017-02-21 17:10:55 -05:00
Martijn van Groningen	81d53470e7	percolator: add support for term extraction for MultiPhraseQuery	2017-02-21 21:10:55 +01:00
Nik Everett	9105672969	Allow painless to implement more interfaces (#22983 ) Generalizes three previously hard coded things in painless into generic concepts: 1. The "main method" is no longer hardcoded to: ``` public abstract Object execute(Map<String, Object> params, Scorer scorer, LeafDocLookup doc, Object value); ``` Instead Painless's compiler takes an interface and implements it. It looks like: ``` public interface SomeScript { // Argument names we expose to Painless scripts String[] ARGUMENTS = new String[] {"a", "b"}; // Method implemented by Painless script. Must be named execute but can have any parameters or return any value. Object execute(String a, int b); // Is the "a" argument used by the script? boolean uses$a(); } SomeScript script = scriptEngine.compile(SomeScript.class, null, "the_script_here", emptyMap()); Object result = script.execute("a", 1); ``` `PainlessScriptEngine` now compiles all scripts to the new `GenericElasticsearchScript` interface by default for compatibility with the rest of Elasticsearch until it is able to use this new ability. 2. `_score` and `ctx` are no longer hardcoded to be extracted from `#score` and `params` respectively. Instead Painless's default implementation of Elasticsearch scripts uses the `uses$_score` and `uses$ctx` methods to determine if it is used and gives them dummy values if they are not used. 3. Throwing the `ScriptException` is now handled by the Painless script itself. That way Painless doesn't have to leak the metadata that is required to build the fancy stack trace. And all painless scripts get the fancy stack trace.	2017-02-21 14:08:57 -05:00
Jack Conradson	fac2d954e3	Fix certain bad casts in Painless due to boxing/unboxing. (#23282 )	2017-02-21 10:23:27 -08:00
Daniel Mitterdorfer	0744a00001	Set network receive predictor size to 32kb (#23284 ) Previously we calculated Netty' receive predictor size for HTTP and transport traffic based on available memory and worker nodes. This resulted in a receive predictor size between 64kb and 512kb. In our benchmarks this leads to increased GC pressure. With this commit we set Netty's receive predictor size to 32kb. This value is in a sweet spot between heap memory waste (-> GC pressure) and effect on request metrics (achieved throughput and latency numbers). Closes #23185	2017-02-21 14:45:33 +01:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Christoph Büscher	6f6b2933b1	Fixing compile issues after merging in master	2017-02-16 11:02:02 +01:00
Christoph Büscher	cfa52f8b9a	Merge branch 'master' into feature/rank-eval	2017-02-16 10:39:07 +01:00
Jason Tedor	0a5917d182	Fix get HEAD requests Get HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23186	2017-02-15 13:07:29 -05:00
Ryan Ernst	79a1629f74	Fix line length	2017-02-14 21:23:21 -08:00
Jason Tedor	9e80e290d6	Add failing tests for expect header violations This commit adds unit tests for two cases where Elasticsearch violates expect header handling. These tests are marked as awaits fix. Relates #23173	2017-02-14 19:24:22 -05:00
Jason Tedor	673754b1d5	Fix get source HEAD requests Get source HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get source HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23151	2017-02-14 16:37:22 -05:00
Martijn van Groningen	cab43707dc	[percolator] Removed old 2.x bwc logic.	2017-02-14 22:17:17 +01:00
Simon Willnauer	aef0665ddb	Detach SearchPhases from AbstractSearchAsyncAction (#23118 ) Today all search phases are inner classes of AbstractSearchAsyncAction or one of it's subclasses. This makes unit testing of these classes practically impossible. This commit Extracts `DfsQueryPhase` and `FetchSearchPhase` or of the code that composes the actual query execution types and moves most of the fan-out and collect code into an `InitialSearchPhase` class that can be used to build initial search phases (phases that retry on shards). This will make modification to these classes simpler and allows to easily compose or add new search phases down the road if additional roundtrips are required.	2017-02-14 12:34:25 +01:00
Jason Tedor	5343b87502	Handle bad HTTP requests When Netty decodes a bad HTTP request, it marks the decoder result on the HTTP request as a failure, and reroutes the request to GET /bad-request. This either leads to puzzling responses when a bad request is sent to Elasticsearch (if an index named "bad-request" does not exist then it produces an index not found exception and otherwise responds with the index settings for the index named "bad-request"). This commit addresses this by inspecting the decoder result on the HTTP request and dispatching the request to a bad request handler preserving the initial cause of the bad request and providing an error message to the client. Relates #23153	2017-02-13 17:39:25 -05:00
Jay Modi	61e383813d	Make the version of the remote node accessible on a transport channel (#23019 ) This commit adds a new method to the TransportChannel that provides access to the version of the remote node that the response is being sent on and that the request came from. This is helpful for serialization of data attached as headers.	2017-02-13 15:15:57 -05:00
jaymode	d8d03f45c2	Fix communication with 5.3.0 nodes This commit fixes communication with 5.3.0 nodes to send XContentType to these nodes since #22691 was backported to the 5.3 branch.	2017-02-13 13:15:51 -05:00
Jason Tedor	0f21ed5b70	Fix template HEAD requests Template HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for template HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23130	2017-02-11 18:30:16 -05:00
Jason Tedor	a6158398dd	Fix index HEAD requests Index HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for index HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23112	2017-02-10 09:44:01 -05:00
Jason Tedor	7ac44656df	Fix alias HEAD requests Alias HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for alias HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23094	2017-02-10 09:19:35 -05:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Tanguy Leroux	e2e5937455	Use `typed_keys` parameter to prefix suggester names by type in search responses (#23080 ) This pull request reuses the typed_keys parameter added in #22965, but this time it applies it to suggesters. When set to true, the suggester names in the search response will be prefixed with a prefix that reflects their type.	2017-02-10 10:53:38 +01:00
Nik Everett	0250c7ab18	Fix reindex test after toString change Weakens the assertion on wait_for_active_shards so that we don't check the toString of the bulk request because it isn't important. Relates to #22900	2017-02-09 16:48:40 -05:00
Tim Brooks	a331405aff	Isolated SocketPermissions to Netty (#23057 ) Netty 4.1.8 wraps connect and accept operations in doPrivileged blocks. This means that we not need to give permissions to the entire transport module. Additionally this commit deletes the privileged socket channel and privileged server socket chanel.	2017-02-09 10:00:25 -06:00
Tanguy Leroux	3553522328	Add parameter to prefix aggs name with type in search responses (#22965 ) This pull request adds a new parameter to the REST Search API named `typed_keys`. When set to true, the aggregation names in the search response will be prefixed with a prefix that reflects the internal type of the aggregation. Here is a simple example: ``` GET /_search?typed_keys { "aggs": { "tweets_per_user": { "terms": { "field": "user" } } }, "size": 0 } ``` And the response: ``` { "aggs": { "sterms:tweets_per_user": { ... } } } ``` This parameter is intended to make life easier for REST clients that could parse back the prefix and could detect the type of the aggregation to parse. It could also be implemented for suggesters.	2017-02-09 11:19:04 +01:00
Tim Brooks	735e5b1983	Upgrade to Netty 4.1.8 (#23055 ) This commit upgrades the Netty dependency to version 4.1.8.Final.	2017-02-08 11:44:36 -06:00
Simon Willnauer	ecb01c15b9	Fold InternalSearchHits and friends into their interfaces (#23042 ) We have a bunch of interfaces that have only a single implementation for 6 years now. These interfaces are pretty useless from a SW development perspective and only add unnecessary abstractions. They also require lots of casting in many places where we expect that there is only one concrete implementation. This change removes the interfaces, makes all of the classes final and removes the duplicate `foo` `getFoo` accessors in favor of `getFoo` from these classes.	2017-02-08 14:40:08 +01:00
Tim Brooks	fcc568fd8d	Add methods requiring connect to forbidden apis (#22964 ) This is related to #22116. This commit adds calls that require SocketPermission connect to forbidden APIs. The following calls are now forbidden: - java.net.URL#openStream() - java.net.URLConnection#connect() - java.net.URLConnection#getInputStream() - java.net.Socket#connect(java.net.SocketAddress) - java.net.Socket#connect(java.net.SocketAddress, int) - java.nio.channels.SocketChannel#open(java.net.SocketAddress) - java.nio.channels.SocketChannel#connect(java.net.SocketAddress)	2017-02-07 14:41:50 -06:00
Boaz Leskes	ba06c14a97	TransportService.connectToNode should validate remote node ID (#22828 ) #22194 gave us the ability to open low level temporary connections to remote node based on their address. With this use case out of the way, actual full blown connections should validate the node on the other side, making sure we speak to who we think we speak to. This helps in case where multiple nodes are started on the same host and a quick node restart causes them to swap addresses, which in turn can cause confusion down the road.	2017-02-07 22:11:32 +02:00
Tim Brooks	27b7d9bd8d	Add FileSystemUtil method to read 'file:/' URLs (#23020 ) As part of #22116 we are going to forbid usage of api java.net.URL#openStream(). However in a number of places across the we use this method to read files from the local filesystem. This commit introduces a helper method openFileURLStream(URL url) to read files from URLs. It does specific validation to only ensure that file:/ urls are read. Additionlly, this commit removes unneeded method FileSystemUtil.newBufferedReader(URL, Charset). This method used the openStream () method which will soon be forbidden. Instead we use the Files.newBufferedReader(Path, Charset).	2017-02-07 10:24:22 -06:00
Jay Modi	c898e8ab83	Add support for newline delimited JSON Content-Type (#22947 ) This commit adds support for the newline delimited JSON Content-Type, which is how the bulk, multi-search, and multi-search template APIs expect data to be formatted. The `elasticsearch-js` client has also been using this content type for these types of requests. Closes #22943	2017-02-07 09:20:06 -05:00
Nik Everett	0d6e622242	Make dates be ReadableDateTimes in scripts (#22948 ) Instead of longs. If you want millis since epoch you can call doc.date_field.value.millis. Relates to #22875	2017-02-06 16:44:56 -05:00
Nicholas Knize	1c9fdfd1b3	Remove GeoPointFieldMapper abstraction In order to support the evolving GeoPoint encodings in Lucene 5 and 6, ES 2.x and 5.x implements an abstraction layer to the GeoPointFieldMapper classes. As of 5.x the geo_point field mapper settled on using Lucene's more performant LatLonPoint field type and deprecated all other encodings. In 6.0 all encodings except LatLonPoint have been removed rendering this abstraction layer useless. This commit removes the abstraction layer and renames the LatLonPointFieldMapper back to GeoPointFieldMapper to mantain consistency with ES field naming.	2017-02-06 14:17:21 -06:00
Adrien Grand	c8496fc4f4	Upgrade to Lucene 6.4.1. (#22978 )	2017-02-06 09:28:43 +01:00
Nik Everett	b0c9759441	Painless: Don't allow casting from void to def (#22969 ) Painless can cast anything into the magic type `def` but it really shouldn't try to cast nothing into `def`. That causes the byte code generation library to freak out a little. Closes #22908	2017-02-03 16:38:47 -05:00
Christoph Büscher	dde2a09ba5	Updating rank-eval module after major changes on master	2017-02-03 21:17:46 +01:00
Nik Everett	9ca871af7e	Test: weaken assertion in fix sliced reindex test This test was using initial count of slices instead of the count of unfinished slices to pick the expected throttle. Unfortunely due to race conditions the actual rethrottle count is between the two. So we weaken the assertion from "the new throttle is exactly X" to "the new throttle is between X and Y (inclusive)".	2017-02-03 13:00:49 -05:00
Christoph Büscher	4cb8d9d08c	Merge branch 'master' into feature/rank-eval Conflicts: core/src/main/java/org/elasticsearch/script/Script.java docs/reference/search.asciidoc	2017-02-03 17:27:20 +01:00
Tim Brooks	f70188ac58	Remove connect SocketPermissions from core (#22797 ) This is related to #22116. Core no longer needs `SocketPermission` `connect`. This permission is relegated to these modules/plugins: - transport-netty4 module - reindex module - repository-url module - discovery-azure-classic plugin - discovery-ec2 plugin - discovery-gce plugin - repository-azure plugin - repository-gcs plugin - repository-hdfs plugin - repository-s3 plugin And for tests: - mocksocket jar - rest client - httpcore-nio jar - httpasyncclient jar	2017-02-03 09:39:56 -06:00
Christoph Büscher	c33f894846	Fixing compilation problem in Eclipse (#22956 )	2017-02-03 16:16:51 +01:00
Nik Everett	18eb0827e6	Reindex: do not log when can't clear old scroll (#22942 ) Versions of Elasticsearch prior to 2.0 would return a scroll id even with the last scroll response. They'd then automatically clear the scroll because it is empty. When terminating reindex will attempt to clear the last scroll it received, regardless of the remote version. This quiets the warning when the scroll cannot be cleared for versions before 2.0. Closes #22937	2017-02-03 10:08:27 -05:00
Jason Tedor	9a0b216c36	Upgrade checkstyle to version 7.5 This commit upgrades the checkstyle configuration from version 5.9 to version 7.5, the latest version as of today. The main enhancement obtained via this upgrade is better detection of redundant modifiers. Relates #22960	2017-02-03 09:46:44 -05:00
Nik Everett	ea4eb06b0a	Test: Make update-by-query test more resilient `UpdateByQueryWhileModifyingTests#testUpdateWhileReindexing` runs update-by-query and concurrently updates, asserting that the update-by-query never reverts any changes made by the update. It is a smoke test for concurrent updates. Now, it expects to hit a certain number of version conflicts during the updates. This is normal as it is racing the update-by-query. We have a maximum number of failures we expect (10) and I'd never seen us come close until https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=sles/495/console This bumps the max failures from 10 to 50 and improves logging a bit. If we continue to see this failure then we have some other issue. Closes #22938	2017-02-03 09:18:26 -05:00
Jay Modi	7520a107be	Optionally require a valid content type for all rest requests with content (#22691 ) This change adds a strict mode for xcontent parsing on the rest layer. The strict mode will be off by default for 5.x and in a separate commit will be enabled by default for 6.0. The strict mode, which can be enabled by setting `http.content_type.required: true` in 5.x, will require that all incoming rest requests have a valid and supported content type header before the request is dispatched. In the non-strict mode, the Content-Type header will be inspected and if it is not present or not valid, we will continue with auto detection of content like we have done previously. The content type header is parsed to the matching XContentType value with the only exception being for plain text requests. This value is then passed on with the content bytes so that we can reduce the number of places where we need to auto-detect the content type. As part of this, many transport requests and builders were updated to provide methods that accepted the XContentType along with the bytes and the methods that would rely on auto-detection have been deprecated. In the non-strict mode, deprecation warnings are issued whenever a request with body doesn't provide the Content-Type header. See #19388	2017-02-02 14:07:13 -05:00
Nik Everett	ce8e042b66	Reindex: fix reindex-from-remote from <2.0 (#22931 ) In 5.2 we stopped sending the source parameter if the user didn't specify it. This was a mistake as versions before 2.0 look like they don't always include the `_source`. This is because reindex requests some metadata fields. Anyway, now we say `"_source": true` if there isn't a `_source` configured in the reindex request. Closes #22893	2017-02-02 11:46:24 -05:00
Nik Everett	73bf29072f	Painless: Fix def invoked qualified method refs (#22918 ) We were incorrectly resolving qualified method references at run time when invoked on `def`. This lead to errors like `The struct with name [org] has not been defined.` when attempting ``` doc.date.dates.stream().map( org.joda.time.ReadableDateTime::centuryOfEra ).collect(Collectors.toList()) ```	2017-02-02 10:15:03 -05:00
Nik Everett	dacc150934	Expose multi-valued dates to scripts and document painless's date functions (#22875 ) Implemented by wrapping an array of reused `ModuleDateTime`s that we grow when needed. The `ModuleDateTime`s are reused when we move to the next document. Also improves the error message returned when attempting to modify the `ScriptdocValues`, removes a couple of allocations, and documents that the date functions are available in Painless. Relates to #22162	2017-02-01 21:57:07 -05:00
Jack Conradson	3d2626c4c6	Change Namespace for Stored Script to Only Use Id (#22206 ) Currently, stored scripts use a namespace of (lang, id) to be put, get, deleted, and executed. This is not necessary since the lang is stored with the stored script. A user should only have to specify an id to use a stored script. This change makes that possible while keeping backwards compatibility with the previous namespace of (lang, id). Anywhere the previous namespace is used will log deprecation warnings. The new behavior is the following: When a user specifies a stored script, that script will be stored under both the new namespace and old namespace. Take for example script 'A' with lang 'L0' and data 'D0'. If we add script 'A' to the empty set, the scripts map will be ["A" -- D0, "A#L0" -- D0]. If a script 'A' with lang 'L1' and data 'D1' is then added, the scripts map will be ["A" -- D1, "A#L1" -- D1, "A#L0" -- D0]. When a user deletes a stored script, that script will be deleted from both the new namespace (if it exists) and the old namespace. Take for example a scripts map with {"A" -- D1, "A#L1" -- D1, "A#L0" -- D0}. If a script is removed specified by an id 'A' and lang null then the scripts map will be {"A#L0" -- D0}. To remove the final script, the deprecated namespace must be used, so an id 'A' and lang 'L0' would need to be specified. When a user gets/executes a stored script, if the new namespace is used then the script will be retrieved/executed using only 'id', and if the old namespace is used then the script will be retrieved/executed using 'id' and 'lang'	2017-01-31 13:27:02 -08:00
Nik Everett	2e48fb8294	Move delete by query helpers into core (#22810 ) This moves the building blocks for delete by query into core. This should enabled two thigns: 1. Plugins other than reindex to implement "bulk by scroll" style operations. 2. Plugins to directly call delete by query. Those plugins should be careful to make sure that task cancellation still works, but this should be possible. Notes: 1. I've mostly just moved classes and moved around tests methods. 2. I haven't been super careful about cohesion between these core classes and reindex. They are quite interconnected because I wanted to make the change as mechanical as possible. Closes #22616	2017-01-27 16:09:18 -05:00
Nik Everett	8a2d424d68	Generate reference links for painless API (#22775 ) Adds "Appending B. Painless API Reference", a reference of all classes and methods available from Painless. Removes links to java packages because they contain methods that we don't expose and don't contain methods that we do expose (the ones in Augmentation). Instead this generates a list of every class and every exposed method using the same type information available to the interpreter/compiler/whatever-we-call-it. From there you can jump to the relevant docs. Right now you build all the asciidoc files by running ``` gradle generatePainlessApi ``` These files are expected to be committed because we build the docs without running `gradle`. Also changes the output of `Debug.explain` so that it is easy to search for the class in the generated reference documentation. You can also run it in an IDE safely if you pass the path to the directory in which to generate the docs as the first parameter. It'll blow away the entire directory an recreate it from scratch so be careful. And then you can build the docs by running something like: ``` ../docs/build_docs.pl --out ../built_docs/ --doc docs/reference/index.asciidoc --open ``` That is, if you have checked out https://github.com/elastic/docs in `../docs`. Wait a minute or two and your browser will pop open in with all of Elasticsearch's reference documentation. If you go to `http://localhost:8000/painless-api-reference.html` you can see this list. Or you can get there by following the links to `Modules` and `Scripting` and `Painless` and then clicking the link in the paragraphs below titled `Appendix B. Painless API Reference`. I like having these in asciidoc because we can deep link to them from the rest of the guide with constructs like `<<painless-api-reference-Object-hashCode-0>>` and `<<painless-api-reference->>` and we get link checking. Then the only brittle link maintenance bit is the link generation for javadoc. Which sucks. But I think it is important that we link to the methods directly so they are easy to find. Relates to #22720	2017-01-26 10:39:19 -05:00
Tim Brooks	719e75bb3f	Add repository-url module and move URLRepository (#22752 ) This is related to #22116. URLRepository requires SocketPermission connect. This commit introduces a new module called "repository-url" where URLRepository will reside. With the new module, permissions can be removed from core.	2017-01-25 17:09:25 -06:00
Tal Levy	e9a68b3287	fix date-processor to a new default year for every new pipeline execution. (#22601 ) Beforehand, the DateProcessor constructs its joda pattern formatter during processor construction. This led to newly ingested documents being defaulted to the year that the pipeline was constructed, not that of processing. Fixes #22547.	2017-01-25 15:09:07 -08:00
Chris Earle	f0f75b187a	Support Preemptive Authentication with RestClient (#21336 ) This adds the necessary `AuthCache` needed to support preemptive authorization. By adding every host to the cache, the automatically added `RequestAuthCache` interceptor will add credentials on the first pass rather than waiting to do it after _each_ anonymous request is rejected (thus always sending everything twice when basic auth is required).	2017-01-24 11:34:05 -05:00
Luca Cavanna	47c0e13a3b	Stop returning "es." internal exception headers as http response headers (#22703 ) move "es." internal headers to separate metadata set in ElasticsearchException and stop returning them as response headers Closes #17593 * [TEST] remove ESExceptionTests, move its methods to ElasticsearchExceptionTests or ExceptionSerializationTests	2017-01-24 16:12:45 +01:00
Nik Everett	28cfc533e2	Generate javadoc jar for painless's public API (#22704 ) The simplest way to do that is to move the public API into a new package and generate javadoc for that package.	2017-01-23 17:16:20 -05:00
Jim Ferenczi	e48bc2eed7	Add field collapsing for search request (#22337 ) * Add top hits collapsing to search request The field collapsing is done with a custom top docs collector that "collapse" search hits with same field value. The distributed aspect is resolve using the two passes that the regular search uses. The first pass "collapse" the top hits, then the coordinating node merge/collapse the top hits from each shard. ``` GET _search { "collapse": { "field": "category", } } ``` This change also adds an ExpandCollapseSearchResponseListener that intercepts the search response and expands collapsed hits using the CollapseBuilder#innerHit} options. The retrieval of each inner_hits is done by sending a query to all shards filtered by the collapse key. ``` GET _search { "collapse": { "field": "category", "inner_hits": { "size": 2 } } } ```	2017-01-23 16:33:51 +01:00
Tim Brooks	a4ac29c005	Add single static instance of SpecialPermission (#22726 ) This commit adds a SpecialPermission constant and uses that constant opposed to introducing new instances everywhere. Additionally, this commit introduces a single static method to check that the current code has permission. This avoids all the duplicated access blocks that exist currently.	2017-01-21 12:03:52 -06:00
Jim Ferenczi	8028578305	Upgrade to Lucene 6.4.0 (#22724 ) * Upgrade to Lucene 6.4.0 `ValueSource`s are now converted to `DoubleValueSource`s using the Lucene adapter made for the migration to the new API in 6.4.0.	2017-01-21 04:48:01 +01:00
Nik Everett	6265ef1c1b	Deguice rest handlers (#22575 ) There are presently 7 ctor args used in any rest handlers: * `Settings`: Every handler uses it to initialize a logger and some other strange things. * `RestController`: Every handler registers itself with it. * `ClusterSettings`: Used by `RestClusterGetSettingsAction` to render the default values for cluster settings. * `IndexScopedSettings`: Used by `RestGetSettingsAction` to get the default values for index settings. * `SettingsFilter`: Used by a few handlers to filter returned settings so we don't expose stuff like passwords. * `IndexNameExpressionResolver`: Used by `_cat/indices` to filter the list of indices. * `Supplier<DiscoveryNodes>`: Used to fill enrich the response by handlers that list tasks. We probably want to reduce these arguments over time but switching construction away from guice gives us tighter control over the list of available arguments. These parameters are passed to plugins using `ActionPlugin#initRestHandlers` which is expected to build and return that handlers immediately. This felt simpler than returning an reference to the ctors given all the different possible args. Breaks java plugins by moving rest handlers off of guice.	2017-01-20 11:48:51 -05:00
Tim Brooks	bc16162d21	Remove accept SocketPermissions from core (#22622 ) This is related to #22116. Core no longer needs SocketPermission accept. This permission is relegated to the transport-netty4 module and (for tests) to the mocksocket jar.	2017-01-20 09:27:45 -06:00
Nik Everett	22f1c9fa0f	Remove @header we no longer need	2017-01-19 11:44:13 -05:00
Nik Everett	bb83c283bb	Make lexer abstract	2017-01-19 11:41:50 -05:00
Nik Everett	dbb4a2ca6c	Move lexer hacks to EnhancedPainlessLexer This "feels" nicer. Less classes at least.	2017-01-19 11:23:16 -05:00
Nik Everett	e2da6a8ee5	Improve painless's javadocs Hopefully useful references.	2017-01-19 11:04:08 -05:00
Tim Brooks	a10aa8aade	Add TestWithDependenciesPlugin to build (#22646 ) This commit adds a MessyRestTestPlugin to the gradle build. It extends StandaloneRestTestPlugin. The main piece of functionality that it adds is to copy plugin-metadata from dependencies into the generated-resources for the current test source. This is necessary to ensure that permissions for dependencies are applied when running the tests. A current limitation is that the permissions are applied differently than in the distribution sources. When permissions are granted to all depedencies for a module or plugin, the permissions are granted to all dependencies on the classpath for tests besides a few hardcoded exclusions: - es core - es test framework - lucene test framework - randomized runner - junit library	2017-01-19 09:43:53 -06:00
Nik Everett	3ce41a0e15	Painless: Add augmentation to string for base 64 (#22665 ) We don't want to expose `String#getBytes` which is required for `Base64.getEncoder.encode` to work because we're worried about character sets. This adds `encodeBase64` and `decodeBase64` methods to `String` in Painless that are duals of one another such that: `someString == someString.encodeBase64().decodeBase64()`. Both methods work with the UTF-8 encoding of the string. Closes #22648	2017-01-19 09:31:45 -05:00
Nik Everett	ee5f8c4522	Consolidate some reindex utility classes (#22666 ) Everything that extended `AbstractAsyncBulkByScrollAction` also extended `AbstractAsyncBulkIndexByScrollAction` so this removes `AbstractAsyncBulkIndexByScrollAction`, merging it into `AbstractAsyncBulkByScrollAction`.	2017-01-18 16:58:39 -05:00
Nik Everett	1fe74a6b4b	Better error when can't auto create index (#22488 ) Changes the error message when `action.auto_create_index` or `index.mapper.dynamic` forbids automatic creation of an index from `no such index` to one of: * `no such index and [action.auto_create_index] is [false]` * `no such index and [index.mapper.dynamic] is [false]` * `no such index and [action.auto_create_index] contains [-<pattern>] which forbids automatic creation of the index` * `no such index and [action.auto_create_index] ([all patterns]) doesn't match` This should make it more clear why there is `no such index`. Closes #22435	2017-01-18 15:18:32 -05:00
Simon Willnauer	24e2847af2	Streamline foreign stored context restore and allow to perserve response headers (#22677 ) Today we do not preserve response headers if they are present on a transport protocol response. While preserving these headers is not always desired, in the most cases we should pass on these headers to have consistent results for depreciation headers etc. yet, this hasn't been much of a problem since most of the deprecations are detected early ie. on the coordinating node such that this bug wasn't uncovered until #22647 This commit allow to optionally preserve headers when a context is restored and also streamlines the context restore since it leaked frequently into the callers thread context when the callers context wasn't restored again.	2017-01-18 16:17:54 +01:00
Igor Motov	500548fcda	Remove taskManager.registerChildTask Instead of forcing each task to register all nodes where its children are running, this commit runs cancellation on all nodes. The task cancellation operation doesn't run too frequently, so this optimization doesn't seem to be worth additional complexity of the interface.	2017-01-17 18:07:31 -05:00
Ali Beyad	e2977889b8	Allow comma delimited array settings to have a space after each entry (#22591 ) Previously, certain settings that could take multiple comma delimited values would pick up incorrect values for all entries but the first if each comma separated value was followed by a whitespace character. For example, the multi-value "A,B,C" would be correctly parsed as ["A", "B", "C"] but the multi-value "A, B, C" would be incorrectly parsed as ["A", " B", " C"]. This commit allows a comma separated list to have whitespace characters after each entry. The specific settings that were affected by this are: cluster.routing.allocation.awareness.attributes index.routing.allocation.require.* index.routing.allocation.include.* index.routing.allocation.exclude.* cluster.routing.allocation.require.* cluster.routing.allocation.include.* cluster.routing.allocation.exclude.* http.cors.allow-methods http.cors.allow-headers For the allocation filtering related settings, this commit also provides validation of each specified entry if the filtering is done by _ip, _host_ip, or _publish_ip, to ensure that each entry is a valid IP address. Closes #22297	2017-01-17 08:51:04 -06:00
Tanguy Leroux	f5542ed47f	Simplify ElasticsearchException rendering as a XContent (#22611 ) This commit tries to simplify the way ElasticsearchException are rendered to xcontent. It adds some documentation and renames and merges some methods. Current behavior is preserved, the goal is to be more readable and centralize everything in the ElasticsearchException class.	2017-01-17 15:44:49 +01:00
Tim Brooks	16a76d9bc0	Remove blocking TCP clients and servers (#22639 ) This commit removes the option to use the blocking variants of the TCP transport server, TCP transport client, or http server.	2017-01-16 18:38:51 -06:00
Simon Willnauer	f30b1f82ee	Remove HttpServer and HttpServerAdapter in favor of a simple dispatch method (#22636 ) Today we have quite some abstractions that are essentially providing a simple dispatch method to the plugins defining a `HttpServerTransport`. This commit removes `HttpServer` and `HttpServerAdaptor` and introduces a simple `Dispatcher` functional interface that delegate to `RestController` by default. Relates to #18482	2017-01-16 21:06:08 +01:00
javanna	a8a13bb46f	replace custom functional interface with CheckedFunction in percolate module	2017-01-16 13:57:58 +01:00
Alexander Reelsen	f6ee6e420b	Indexing: Add shard id to indexing operation listener (#22606 ) The IndexingOperationListener interface did not provide any information about the shard id when a document was indexed. This commit adds the shard id as the first parameter to all methods in the IndexingOperationListener.	2017-01-16 09:08:16 +01:00
Tim Brooks	f4270f9914	Wrap netty accept/connect ops with doPrivileged (#22572 ) This is related to #22116. netty channels require socket `connect` and `accept` privileges. Netty does not currently wrap these operations with `doPrivileged` blocks. These changes extend the netty channels and wrap calls to the relevant super methods in doPrivileged blocks.	2017-01-13 14:27:09 -06:00
Zachary Tong	18fdc39b8c	Increase visibility of doExecute so it can be used directly (#22614 )	2017-01-13 09:42:02 -05:00
Nik Everett	baed02bbe2	Whitelist some ScriptDocValues in painless (#22600 ) Without this whitelist painless can't use ip or binary doc values. Closes #22584	2017-01-12 15:26:09 -05:00
Jason Tedor	126efea56c	Upgrade to Netty 4.1.7 This commit upgrades the Netty dependency to version 4.1.7.Final, picking up some important bug fixes. Relates #22587	2017-01-12 10:58:21 -05:00
javanna	64c3212fdb	Remove ParseFieldMatcher usages from IndexSettings	2017-01-12 14:43:35 +01:00
javanna	8072f168a3	Remove ParseFieldMatcher usages from QueryParseContext	2017-01-12 14:43:35 +01:00
Luca Cavanna	0f7d52df68	Remove some more ParseFieldMatcher usages (#22571 )	2017-01-12 10:04:10 +01:00
Nik Everett	25a5f1869a	Improve error message when reindex-from-remote gets bad json (#22536 ) Adds a message about how the remote is unlikely to be Elasticsearch. This isn't as good as including the whole message from the remote but we can't do that because we are stream parsing it and we don't want to mark the whole request. Closes #22330	2017-01-11 12:55:23 -05:00
Jack Conradson	0c694b3d19	Update loop counter to be higher (1000000) instead of (10000).	2017-01-11 09:22:24 -08:00
Nik Everett	abb7d7841f	Remove SearchRequestParsers (#22538 ) It is empty now that we've moved all the parsing into `namedObject`.	2017-01-11 10:28:14 -05:00
Luca Cavanna	0f391336f5	Clean up SearchShardTarget (#22468 ) * unify shard target setter * Remove indexText member from SearchShardTarget * Remove duplicated indexName getter from SearchShardTarget * Remove duplicated shardId getter from SearchShardTarget * Remove duplicated nodeIde getter from SearchShardTarget * Rename SearchShardTarget#nodeIdText getter to getNodeIdText * Remove unused InternalSearchHit#internalSourceRef unused method * Remove unused InternalSearchHit#internalHighlightFields unused method * Make SearchShardTarget members final	2017-01-11 10:08:31 +01:00
Nik Everett	b71b8acf59	Remove ClusterService from ctors in reindex (#22539 ) Moves fetching the local node id into `NodeClient` which is a fairly useful place to put it so you can generate task ids from `NodeClient#executeLocally`.	2017-01-10 18:26:06 -05:00
Nik Everett	d50f96e122	Remove InternalAggregation.Type (#22511 ) It is no longer needed. It used to contain a lot of strings used by serialization but those have since been removed. Now it is just another thing to pass around that we don't really need.	2017-01-10 11:57:19 -05:00
Nik Everett	78bb56671e	Fix reindex from remote clearing scroll (#22525 ) Reindex-from-remote had a race when it tried to clear the scroll. It first starts the request to clear the scroll and then submits a task to the generic threadpool to shutdown the client. These two things race and, in my experience, closing the scroll generally loses. That means that most of the time reindex-from-remote isn't clearing the scrolls that it uses. This isn't the end of the world because we flush old scroll contexts after a while but this isn't great. Noticed while experimenting with #22514.	2017-01-10 10:30:23 -05:00
Nik Everett	5ef78fd015	Fix source filtering in reindex-from-remote (#22514 ) Reindex-from-remote was accepting source filtering in the request but ignoring it and setting `_source=true` on the search URI. This fixes the filtering so it is piped through to the remote node and adds tests for that. Closes #22507	2017-01-10 09:00:12 -05:00
Martijn van Groningen	cb2333dacd	percolator: remove deprecated percolate and mpercolate apis	2017-01-10 11:18:27 +01:00
Nik Everett	3fb9254b95	Replace Suggesters with namedObject (#22491 ) Removes another parser registery type thing in favor of `XContentParser#namedObject`.	2017-01-09 16:51:08 -05:00
Nik Everett	057194f9ab	Fix test under windows Silly `\r`.	2017-01-09 16:29:59 -05:00
Nik Everett	e3f77b4795	Replace AggregatorParsers with namedObject (#22397 ) Removes `AggregatorParsers`, replacing all of its functionality with `XContentParser#namedObject`. This is the third bit of payoff from #22003, one less thing to pass around the entire application.	2017-01-09 13:59:38 -05:00
Nik Everett	fc1f7c2147	Remove content-type detection from reindex-from-remote (#22504 ) If the remote doesn't return a content type then reindex tried to guess the content-type. This didn't work most of the time and produced a rather useless error message. Given that Elasticsearch always returns the content-type we are dropping content-type detection in favor of just failing the request if the remote didn't return a content-type. Closes #22329	2017-01-09 11:50:20 -05:00
Nik Everett	f4884e0726	Replace SearchExtRegistry with namedObject (#22492 ) This is one of the last things in `SearchRequestParsers`.	2017-01-09 08:35:54 -05:00
javanna	ded694fc83	Make StatusToXContent extend ToXContentObject and rename it to StatusToXContentObject This also allows to make RestToXContentListener require ToXContentObject rather than ToXContent	2017-01-06 23:31:48 +01:00
javanna	4e49860f68	Make PercolateResponse a ToXContentObject	2017-01-06 23:31:48 +01:00
javanna	d5510701a0	Make SearchResponse a ToXContentObject	2017-01-06 23:31:48 +01:00
javanna	45d4938fcc	Migrate some more responses to ToXContentObject	2017-01-06 23:31:48 +01:00
Nik Everett	f24ca5188a	Fix some issues with painless's strings (#22393 ) 1. Escape sequences we're working. For example `\\` is now correctly interpreted as `\` instead of `\\`. Same with `\'` being `'` and `\"` being `"`. 2. `'` delimited strings weren't allowed to contain `"`s but it looked like they were intended to support it. Now they do. 3. Improves the error message when the script contains an invalid escape sequence inside a string to include a list of the valid escape sequences. Closes #22372	2017-01-06 11:35:22 -05:00
javanna	dea7d65439	remove ParseFieldMatcher usages from RestSearchTemplateAction	2017-01-05 19:33:04 +01:00
javanna	6102523033	remove ParseFieldMatcher usages from Script parsing code	2017-01-05 19:33:04 +01:00
javanna	9394792392	remove unused ParseFieldMatcher imports/arguments	2017-01-05 19:33:04 +01:00
Tim B	be22a250b6	Replace Socket, ServerSocket, and HttpServer usages in tests with mocksocket versions (#22287 ) This integrates the mocksocket jar with elasticsearch tests. Mocksocket wraps actions requiring SocketPermissions in doPrivilege blocks. This will eventually allow SocketPermissions to be assigned to the mocksocket jar opposed to the entire elasticsearch codebase.	2017-01-04 14:38:51 -06:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Jason Tedor	96ba45e310	Fix stale comment in Netty4Utils We previously named the thread using a frame from the stack trace, but this was removed to simplify the code here. However, the comment explaining this was left behind and this commit cleans that up.	2017-01-03 08:15:57 -05:00
Daniel Mitterdorfer	1ed64f0551	Eliminate unneccessary declaration of IOException With this commit we remove the declaration of IOException from assertWarnings and modify all call sites. Checked with @javanna	2017-01-03 12:36:28 +01:00
javanna	cd6b569286	Remove some usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-31 09:24:44 +01:00
javanna	df2acb3d9d	Remove some more usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
javanna	6c54cbade4	Remove some more usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
javanna	45d010e874	Remove some usages of ParseFieldMatcher in favour of using ParseField directly Relates to #19552 Relates to #22130	2016-12-30 18:57:47 +01:00
Martijn van Groningen	9ccdd3303d	percolator: Fix NPE in percolator's 'now' range check for percolator queries with range queries. Closes #22355	2016-12-27 22:56:01 +01:00
Tal Levy	e6fb3a5d95	fix index out of bounds error in KV Processor (#22288 ) - checks for index-out-of-bounds - added unit tests for failed `field_split` and `value_split` scenarios missed this test in #22272.	2016-12-27 10:57:11 -08:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Jason Tedor	7946396fe6	Introduce translog no-op As the translog evolves towards a full operations log as part of the sequence numbers push, there is a need for the translog to be able to represent operations for which a sequence number was assigned, but the operation did not mutate the index. Examples of how this can arise are operations that fail after the sequence number is assigned, and gaps in this history that arise when an operation is assigned a sequence number but the operation never completed (e.g., a node crash). It is important that these operations appear in the history so that they can be replicated and replayed during recovery as otherwise the history will be incomplete and local checkpoints will not be able to advance. This commit introduces a no-op to the translog to set the stage for these efforts. Relates #22291	2016-12-21 23:08:16 -05:00
Nik Everett	567c65b0d5	Replace IndicesQueriesRegistry (#22289 ) * Switch query parsing to namedObject * Remove IndicesQueriesRegistry	2016-12-21 09:05:14 -05:00
Tal Levy	c53b2ee9cd	introduce KV Processor in Ingest Node (#22272 ) Now you can parse field values of the `key=value` variety and have `key` be inserted as a field name in an ingest document. Closes #22222.	2016-12-20 13:26:17 -08:00
Nik Everett	a04dcfb95b	Introduce XContentParser#namedObject (#22003 ) Introduces `XContentParser#namedObject which works a little like `StreamInput#readNamedWriteable`: on startup components register parsers under names and a superclass. At runtime we look up the parser and call it to parse the object. Right now the parsers take a context object they use to help with the parsing but I hope to be able to eliminate the need for this context as most what it is used for at this point is to move around parser registries which should be replaced by this method eventually. I make no effort to do so in this PR because it is big enough already. This is meant to the a start down a road that allows us to remove classes like `QueryParseContext`, `AggregatorParsers`, `IndicesQueriesRegistry`, and `ParseFieldRegistry`. The goal here is to reduce the amount of plumbing required to allow parsing pluggable things. With this you don't have to pass registries all over the place. Instead you must pass a super registry to fewer places and use it to wrap the reader. This is the same tradeoff that we use for NamedWriteable and it allows much, much simpler binary serialization. We think we want that same thing for xcontent serialization. The only parsing actually converted to this method is parsing `ScoreFunctions` inside of `FunctionScoreQuery`. I chose this because it is relatively self contained.	2016-12-20 11:05:24 -05:00
Nik Everett	73320566c1	Reindex test: catch exception name instead of reason It looks like the exception reason can differ in different default locales, so the build would fail in any non-English locale. This switches the catch to the name of the exception which shouldn't vary.	2016-12-20 10:00:14 -05:00
Nik Everett	8de4be9e4d	Reinex test: don't fail if iis is running on port 0	2016-12-19 16:44:08 -05:00
Grzegorz Gajos	f6b6e4e376	Added ability to remove pipelines via wildcards (#22149 ) (#22191 ) This commit is adding an ability to remove pipelines with wildcards.	2016-12-19 10:59:59 -08:00

... 10 11 12 13 14 ...

5076 Commits