OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	e23c3f915f	Save a little space on empty BitArrays (#53243 ) (#53316 ) It doesn't make a whole lot of sense for `BitArray#clear` to grow the underlying storage array just to clear the bit. We already treat indices outside of the storage array as unset. This turns such operations into a noop.	2020-03-10 09:22:19 -04:00
Alan Woodward	5c861cfe6e	Upgrade to final lucene 8.5.0 snapshot (#53293 ) Lucene 8.5.0 release candidates are imminent. This commit upgrades master to use the latest snapshot to check that there are no last-minute bugs or regressions.	2020-03-10 09:32:59 +00:00
Gordon Brown	1cb0a4399d	Fix Get Alias API handling of hidden indices with visible aliases (#53147 ) This commit changes the Get Aliases API to include hidden indices by default - this is slightly different from other APIs, but is necessary to make this API work intuitively.	2020-03-09 16:16:29 -06:00
William Brafford	2bb4b96a7f	Serialize NodesStatsRequest as set of strings (#53235 ) (#53313 ) * Add unit tests before refactoring * Convert boolean fields to set of strings In order to make nodes stats plugins pluggable, we need to make the NodesStatsRequest class capable of carrying a flexible list of metrics rather than a fixed list of boolean flags. This commit changes the internal storage of the class without changing its serialization. * Change serialization of NodesStatsRequest * Set up BWC before merging * Singularize enum name	2020-03-09 18:13:29 -04:00
Jason Tedor	1860c57147	Deprecate the listener thread pool (#53266 ) The listener thread pool is being removed from use in the server codebase. This commit deprecates configuring the listener thread pool.	2020-03-09 16:56:01 -04:00
David Turner	b20f86e450	Clarify JavaDoc for DiscoveryNodes#resolveNodes (#53277 ) Closes #52887	2020-03-09 14:44:29 +00:00
David Turner	52ff341814	Deprecate passing settings in restore requests (#53268 ) Today we accept a `settings` field in snapshot restore requests, but this field is not used. This commit deprecates it.	2020-03-09 12:01:07 +00:00
Christoph Büscher	2fd954a3b7	Fix potential NPE in FuzzyTermsEnum (#53231 ) Under certain circumstances SpanMultiTermQueryWrapper uses SpanBooleanQueryRewriteWithMaxClause as its rewrite method, which in turn tries to get a TermsEnum from the wrapped MultiTermQuery currently using a `null` AttributeSource. While queries TermsQuery or subclasses of AutomatonQuery ignore this argument, FuzzyQuery uses it to create a FuzzyTermsEnum which triggers an NPE when the AttributeSource is not provided. This PR fixes this by supplying an empty AttributeSource instead of a `null` value. Closes #52894	2020-03-09 12:59:08 +01:00
Jason Tedor	5e96d3e59a	Use given executor for global checkpoint listener (#53260 ) Today when notifying a global checkpoint listener, we use the listener thread pool. This commit turns this inside out so that the global checkpoint listener must provide an executor on which to notify the listener.	2020-03-08 13:51:05 -04:00
Jason Tedor	79b67eb3ba	Drop action future that forks on listener executor (#53261 ) This commit drops the dispatching listenable action future that forks to the listener thread pool. This was previously used in the transport client but is no longer used.	2020-03-08 12:36:09 -04:00
Jason Tedor	a0b235888f	Avoid self-suppression on grouped action listener (#53262 ) It can be that a failure is repeated to a grouped action listener. For example, if the same exception, such as a connect transport exception, is the cause of repeated failures. Previously we were unconditionally self-suppressing the exception into the first exception, but self-suppressing is not allowed. Thus, we would throw an exception and the grouped action listener would never complete. This commit addresses this by guarding against self-suppression.	2020-03-08 08:59:57 -04:00
Jason Tedor	c5738ae312	Notify refresh listeners on the calling thread (#53259 ) Today we notify refresh listeners by forking to the listener thread pool and then serially notifying listeners on a thread there. Refreshes are expensive though, so the expectation is that we are executing refreshes on threads that can afford an expensive operation (e.g., not a network thread) and as such, executing listeners that we expect to be cheap on the calling thread is okay. This commit removes the forking when notifying refresh listeners so that they run directly on the calling thread that executed a refresh.	2020-03-07 13:12:40 -05:00
Gordon Brown	ff9b8bda63	Implement hidden aliases (#52547 ) This commit introduces hidden aliases. These are similar to hidden indices, in that they are not visible by default, unless explicitly specified by name or by indicating that hidden indices/aliases are desired. The new alias property, `is_hidden` is implemented similarly to `is_write_index`, except that it must be consistent across all indices with a given alias - that is, all indices with a given alias must specify the alias as either hidden, or all specify it as non-hidden, either explicitly or by omitting the `is_hidden` property.	2020-03-06 16:02:38 -07:00
Nik Everett	7c9641ef9d	Simplify BucketedSort (#53199 ) (#53240 ) Our lovely `BitArray` compactly stores "flags", lazily growing its underlying storage. It is super useful when you need to store one bit of data for a zillion buckets or documents or something. Usefully, it defaults to `false`. But there is a wrinkle! If you ask it whether or not a bit is set but it hasn't grown its underlying storage array "around" that index then it'll throw an `ArrayIndexOutOfBoundsException`. The per-document use cases tend to show up in order and don't tend to mind this too much. But the use case in aggregations, the per-bucket use case, does. Because buckets are collected out of order all the time. This changes `BitArray` so it'll return `false` if the index is too big for the underlying storage. After all, that index can't have been set or else we would have grown the underlying array. Logically, I believe this makes sense. And it makes my life easy. At the cost of three lines. But this adds an extra test to every call to `get`. I think this is likely ok because it is "very close" to an array index lookup that already runs the same test. So I think it'll end up merged with the array bounds check.	2020-03-06 15:27:51 -05:00
Jay Modi	a81460dbf5	Make watch history indices hidden (#52974 ) This commit updates the template used for watch history indices with the hidden index setting so that new indices will be created as hidden. Relates #50251 Backport of #52962	2020-03-06 09:47:03 -07:00
Christoph Büscher	9e561c2921	Fix AbstractBulkByScrollRequest slices parameter via Rest (#53068 ) Currently the AbstractBulkByScrollRequest accepts slice values of 0 via its `setSlices` method, denoting the "auto" slicing behaviour that is usable by setting the "slices=auto" parameter on rest requests. When using the High Level Rest Client, however, we send the 0 value as an integer, which is then rejected as invalid by `AbstractBulkByScrollRequest#parseSlices`. Instead of making parsing of the rest request more lenient, this PR opts for changing the RequestConverter logic in the client to translate 0 values to "auto" on the rest requests. Closes #53044	2020-03-06 15:38:04 +01:00
William Brafford	d145b5536f	Serialize NodesInfoRequest as a set of strings (#53140 ) (#53202 ) For Node Info to be pluggable, NodesInfoRequest must be able to carry arbitrary strings. This commit reworks the internals of that class to use a set rather than hard-coded boolean fields. NodesInfoRequest defaults to specifying all values. We test for this behavior as we refactor and use random testing for the various combinations of metrics. Add backwards compatibility for transport requests.	2020-03-06 09:07:49 -05:00
Marios Trivyzas	7ddbda4c20	Check for query cancellation during rewrite (#53166 ) (#53203 ) With ExitableDirectoryReader in place, check for query cancellation during QueryPhase#preProcess where the query rewriting takes place. Follows: #52822 (cherry picked from commit 0d38626d8e6e9e2620a7a446b617a2ac42852461)	2020-03-06 11:04:01 +01:00
Alan Woodward	c204137451	Deprecate BoolQueryBuilder's mustNot field (#53125 ) The bool query builder in elasticsearch accepts both must_not and mustNot fields. Given that leniency is abhorrent and must be eschewed, we should deprecate the latter as it doesn't fit with the style of parameters elsewhere in the DSL.	2020-03-06 09:11:34 +00:00
Henning Andersen	2e924e4a83	Fix ClusterDisruptionIT.testAckedIndexing (#53169 ) Use assertBusy when doing reroute after bridged disruption, since it can return non-acked if a node is marked faulty by follower check after disruption ended. Closes #53064	2020-03-06 08:56:55 +01:00
Nhat Nguyen	5476a49833	Revert "upgrade to lucene-snapshot-fa75139efea (#53150 ) (#53151 )" This reverts commit `058113aa42`.	2020-03-05 17:33:00 -05:00
Nhat Nguyen	d456e8ffca	Revert "Mute InternalEngineTests.testVersionOnPrimaryWithConcurrentRefresh" This reverts commit `66788afa67`.	2020-03-05 17:32:18 -05:00
Nhat Nguyen	e9e209ae58	Revert "Mute InternalEngineTests.testRandomOperations" This reverts commit `d1cc2e68d5`.	2020-03-05 17:32:11 -05:00
Nhat Nguyen	dc78cc6131	Revert "Mute InternalEngineTests.testForceMergeWithSoftDeletesRetentionAndRecoverySource" This reverts commit `da8aac9e66`.	2020-03-05 17:31:56 -05:00
Nhat Nguyen	f11ae5fd14	Revert "Mute GatewayMetaStatePersistedStateTests.testDataOnlyNodePersistence" This reverts commit `4452addf10`.	2020-03-05 17:31:38 -05:00
James Baiera	4452addf10	Mute GatewayMetaStatePersistedStateTests.testDataOnlyNodePersistence	2020-03-05 16:44:03 -05:00
James Baiera	da8aac9e66	Mute InternalEngineTests.testForceMergeWithSoftDeletesRetentionAndRecoverySource	2020-03-05 15:55:50 -05:00
James Baiera	d1cc2e68d5	Mute InternalEngineTests.testRandomOperations	2020-03-05 15:09:47 -05:00
James Baiera	66788afa67	Mute InternalEngineTests.testVersionOnPrimaryWithConcurrentRefresh	2020-03-05 15:09:47 -05:00
Mayya Sharipova	7e2a9f58ee	script_score query errors on negative scores (#53133 ) 7.5 and 7.6 had a regression that allowed for script_score queries to have negative scores. We have corrected this regression in #52478. This is an addition to #52478 that adds a test and release notes.	2020-03-05 14:23:39 -05:00
Marios Trivyzas	487d442760	Implement Exitable DirectoryReader (#52822 ) (#53162 ) Implement an Exitable DirectoryReader that wraps the original DirectoryReader so that when a search task is cancelled the DirectoryReaders also stop their work fast. This is useful for expensive operations like wildcard/prefix queries where the DirectoryReaders can spend lots of time and consume resources, as previously their work wouldn't stop even though the original search task was cancelled (e.g. because of timeout or dropped client connection). (cherry picked from commit 67acaf61f33bc5f54e26541514d07e375c202e03)	2020-03-05 14:17:31 +01:00
Nik Everett	28df7ae5ed	Support multiple metrics in `top_metrics` agg (backport of #52965 ) (#53163 ) This adds support for returning multiple metrics to the `top_metrics` agg. It looks like: ``` POST /test/_search?filter_path=aggregations { "aggs": { "tm": { "top_metrics": { "metrics": [ {"field": "v"}, {"field": "m"} ], "sort": {"s": "desc"} } } } } ```	2020-03-05 08:12:01 -05:00
Alan Woodward	3cd4b97618	Remove UnknownNamedObjectException (#53105 ) This was originally thrown from NamedXContentRegistry#parseNamedObject() but that method now throws a NamedObjectNotFoundException, so this is unused.	2020-03-05 10:06:59 +00:00
Ignacio Vera	058113aa42	upgrade to lucene-snapshot-fa75139efea (#53150 ) (#53151 )	2020-03-05 10:04:05 +01:00
Nik Everett	302980e0c4	Remove some ceremony in agg parsing (#53078 ) (#53117 ) With #50871 aggregations should now be parsed directly by an `ObjectParser` or `ConstructingObjectParser` without the need for the ceremonial `parse` method. This removes 9 of those `parse` methods and parses the aggregation directly from their `ObjectParser`.	2020-03-04 13:06:41 -05:00
Tim Brooks	f68917160e	Fix RemoteConnectionManager size() method (#52823 ) Currently the remote connection manager will delegate the size() call to the underlying cluster connection manager. This introduces the possibility that the call will return 1 before the nodeConnection method has been triggered to add the connection to the remote connection list. This can cause issues, as the ensureConnected method checks the connection manager's size and executes synchronously if the size is > 0. This leads to a potential cluster not connected exception while we are still waiting for the connection opened callback to be triggered. This commit fixes this issue by using the remote connection manager's size to report the connection manager's size. Fixes #52029.	2020-03-04 09:53:22 -07:00
Yannick Welsch	8ab74fea58	[7.x] Add 7.6.2 as version (#53114 )	2020-03-04 10:39:09 -06:00
Jake Landis	f08ed1f69a	[7.x] add 6.8.8 as version (#53021 )	2020-03-04 10:38:07 -06:00
Alan Woodward	dfebbbf862	BoolQueryBuilder uses ObjectParser (#52880 ) This commit removes the hand-rolled x-content parsing logic from BoolQueryBuilder and instead uses an ObjectParser to handle parsing. It also removes the long-deprecated (since version 6) disable_coord parameter.	2020-03-04 15:48:38 +00:00
Zachary Tong	3fcf598b92	Reduce deprecation log noise from DateIntervalWrapper (#52655 ) Converts the deprecations to `deprecatedAndMaybeLog` to reduce the number of times we log deprecations, since some of these could be called at a high frequency (due to unconverted queries, aggs, etc)	2020-03-03 17:08:10 -05:00
Jay Modi	c610e0893d	Introduce system index APIs for Kibana (#53035 ) This commit introduces a module for Kibana that exposes REST APIs that will be used by Kibana for access to its system indices. These APIs are wrapped versions of the existing REST endpoints. A new setting is also introduced since the Kibana system indices' names are allowed to be changed by a user in case multiple instances of Kibana use the same instance of Elasticsearch. Additionally, the ThreadContext has been extended to indicate that the use of system indices may be allowed in a request. This will be built upon in the future for the protection of system indices. Backport of #52385	2020-03-03 14:11:36 -07:00
Nik Everett	7339427af5	Remove some deprecation warnings parsing aggs (backport of #53026 ) (#53072 ) With #50871 aggregations should now be parsed directly by an `ObjectParser` or `ConstructingObjectParser` without the need for the ceremonial `parse` method. This removes 10 of those `parse` methods and parses the aggregation directly from their `ObjectParser`.	2020-03-03 15:27:49 -05:00
Luca Cavanna	8a05b670ca	Address MinAndMax generics warnings (#52642 ) `MinAndMax` encapsulates min and max values for a shard. It uses generics to make sure that the values are of the same type and are also comparable. However, there are warnings wherever this class is currently used; this commit addresses them. Relates to #49092	2020-03-03 16:08:10 +01:00
Adrien Grand	cb868d2f5e	Introduce a `constant_keyword` field. (#49713 ) (#53024 ) This field is a specialization of the `keyword` field for the case when all documents have the same value. It typically performs more efficiently than keywords at query time by figuring out whether all or none of the documents match at rewrite time, like `term` queries on `_index`. The name is up for discussion. I liked including `keyword` in it, so that we still have room for a `singleton_numeric` in the future. However I'm unsure whether to call it `singleton`, `constant` or something else, any opinions? For this field there is a choice between 1. accepting values in `_source` when they are equal to the value configured in mappings, but rejecting mapping updates 2. rejecting values in `_source` but then allowing updates to the value that is configured in the mapping This commit implements option 1, so that it is possible to reindex from/to an index that has the field mapped as a keyword with no changes to the source. Backport of #49713	2020-03-03 16:01:47 +01:00
Jason Tedor	a154f9c657	Early return if no global checkpoint listeners (#53036 ) When notifying global checkpoint listeners, we have an opportunity to early return if there are not any registered listeners. This is important since it saves some allocations, and also saves forking some empty work to another thread. This commit adds an early return from notifying listeners if there are not any registered.	2020-03-02 23:28:22 -05:00
Stuart Tettemer	210aab0935	Settings: AffixSettings as validator dependencies (#52973 ) (#52982 ) Allow AffixSetting as validator dependencies. If a validator specifies AffixSettings as a dependency, then `validate(T, Map)` will have the concrete setting in a map. Backport of: #52973, 1e0ba70 Fixes: #52933	2020-02-29 09:38:46 -07:00
Nhat Nguyen	e6755afeeb	Upgrade to Lucene 8.5.0-snapshot-c4475920b08 (#52950 ) (#52977 ) To give LUCENE-9228 more CI cycles	2020-02-29 09:29:16 -05:00
Jay Modi	1cd0eee723	Remove TODO in IndexNameExpressionResolver (#52969 ) This commit removes a TODO in the IndexNameExpressionResolver that indicated the API should use a Set instead of a List. However, this TODO was not completely correct since the ordering of arguments matters due to negations when evaluating wildcards and since we also allow a list of patterns like `,-foo,`, which would have a different meaning even when using a Set with insertion ordering. Relates #52788 Backport of #52963	2020-02-28 13:56:28 -07:00
Adrien Grand	331d4bb0af	HybridDirectory should mmap postings. (#52641 ) (#52873 ) Since version 8.4, `MMapDirectory` has an optimization to read long[] arrays directly in little endian order, which postings leverage. So it'd be more efficient to open postings with `MMapDirectory`. I refactored the existing logic a bit to better explain why every listed file extension is opened with `mmap`.	2020-02-28 18:45:46 +01:00
Martijn van Groningen	6aa9aaa2c6	Add validation for dynamic templates (#52890 ) Backport of #51233 to the seven dot x branch. Tries to load a `Mapper` instance for the mapping snippet of a dynamic template. This should catch things like using an analyzer that is undefined or mapping attributes that are unused. This is best effort: * If `{{name}}` placeholder is used in the mapping snippet then validation is skipped. * If `match_mapping_type` is not specified then validation is performed for all mapping types. If parsing succeeds with a single mapping type then the dynamic mapping is considered valid. If it is detected that a dynamic template mapping snippet is invalid at mapping update time then the mapping update is failed for indices created on 8.0.0-alpha1 and later. For indices created on prior versions a deprecation warning is emitted instead. In 7.x clusters the mapping update will never fail in case of an invalid dynamic template mapping snippet and a deprecation warning will always be emitted. Closes #17411 Closes #24419 Co-authored-by: Adrien Grand <jpountz@gmail.com>	2020-02-28 10:35:04 +01:00
Nik Everett	407101c39b	Clean and document sorting with partially built buckets (backport of #52769 ) (#52925 ) The `terms` aggregation can be sorted by the results of its sub-aggregations. Because it uses that sorting for filtering to the top-n it tries not to construct all of the buckets for the child aggregations. This has its own interesting problems around reduction, but they aren't super relevant to this change. This change moves that optimization from the `TermsAggregator` and into the aggregators being sorted on. This should make it more clear what is going on and it unifies this optimization with validating the sort. Finally, this should enable some minor optimizations to save a few comparisons when sorting multi-valued buckets. I'll get those in a follow up because they are now fairly obvious. They probably won't be a huge performance improvement, but it'll be nice anyway.	2020-02-27 17:50:55 -05:00
Nik Everett	1d1956ee93	Add size support to `top_metrics` (backport of #52662 ) (#52914 ) This adds support for returning the top "n" metrics instead of just the very top. Relates to #51813	2020-02-27 16:12:52 -05:00
Lee Hinman	e139d70abe	Remove TODO in MaxSizeCondition (#52854 ) Similar to what we did in #52794, this removes the TODO. Relates again to #52505	2020-02-27 09:29:12 -07:00
Dan Hermann	3c8b46a8c1	[7.x] Handle errors when evaluating if conditions in processors (#52892 )	2020-02-27 09:00:51 -06:00
hezhen Zhang	280d59c724	Append index name for the source of the cluster put-mapping task (#52690 ) Add index name(s) into the source for the cluster state update done when putting mapping. This ensures that the pending tasks API includes information on source indices.	2020-02-27 12:16:24 +01:00
David Turner	52fa465300	Cache completion stats between refreshes (#52872 ) Computing the stats for completion fields may involve a significant amount of work since it walks every field of every segment looking for completion fields. Innocuous-looking APIs like `GET _stats` or `GET _cluster/stats` do this for every shard in the cluster. This repeated work is unnecessary since these stats do not change between refreshes; in many indices they remain constant for a long time. This commit introduces a cache for these stats which is invalidated on a refresh, allowing most stats calls to bypass the work needed to compute them on most shards. Closes #51915 Backport of #51991	2020-02-27 10:01:24 +00:00
Nhat Nguyen	814c275f35	Add more assertions to testMaybeFlush (#52792 ) We aren't able to reproduce this test failure or figure out its cause. This commit adds more assertions so we can narrow the scope. Relates #52223	2020-02-26 17:08:18 -05:00
Nhat Nguyen	0a15a6bfad	Fix testSeqNoCollision (#52588 ) Adjusts the assertion as we trim translog more eagerly since #52556. Relates #52556 Closes #52148	2020-02-26 17:08:18 -05:00
Nhat Nguyen	87e765609e	Fix testResyncAfterPrimaryPromotion (#52615 ) Adjusts the assertion as we might eagerly clean up translog during resync since #52556 Relates #52556 Closes #52598	2020-02-26 17:08:18 -05:00
Nhat Nguyen	5aa612c275	Fix testRestoreLocalHistoryFromTranslog (#52441 ) Asserts that no new operations are made into the translog since we re-opened the engine. Relates #51905 Closes #52410	2020-02-26 17:08:18 -05:00
Nhat Nguyen	a92bf5ec61	Fix IndexShardIT#testMaybeFlush (#52247 ) Since #51905, we use the local checkpoint of the safe commit to calculate the number of uncommitted operations of a translog stats. If a periodic flush triggered by afterWriteOperation completes before we sync translog, then the last commit is not safe. We also need to sync translog from Engine instead of the translog so that we can advance the safe commit. Relates #51905 Closes #52223	2020-02-26 17:08:18 -05:00
Nhat Nguyen	d7fe135d90	Fix testPrepareIndexForPeerRecovery (#52245 ) Since #51905, we skip translog recovery if the local checkpoint of the safe commit equals the global checkpoint. This change adjusts the test not to create a new snapshot in that case. Closes #52221 Relates #51905	2020-02-26 17:08:18 -05:00
Yannick Welsch	82ab1bc1ff	Separate translog from index deletion conditions (#52556 ) Separates the translog from the index deletion conditions (allowing the translog to be cleaned up more eagerly), and avoids taking the write lock on the translog if no clean-up is actually necessary.	2020-02-26 17:08:18 -05:00
Nhat Nguyen	db6b9c21c7	Use local checkpoint to calculate min translog gen for recovery (#51905 ) Today we use the translog_generation of the safe commit as the minimum required translog generation for recovery. This approach has a limitation, where we won't be able to clean up translog unless we flush. Reopening an already recovered engine will create a new empty translog, and we leave it there until we force flush. This commit removes the translog_generation commit tag and uses the local checkpoint of the safe commit to calculate the minimum required translog generation for recovery instead. Closes #49970	2020-02-26 17:08:18 -05:00
Dan Hermann	3ffd34617f	Switch to AtomicLong for ingestCurrent metric to prevent negative values (#52581 ) (#52834 )	2020-02-26 13:26:26 -06:00
Jay Modi	07ef8ccff4	Allow dynamic updates for index.hidden setting (#52837 ) This commit changes the `index.hidden` setting from being final to a dynamic setting. While the setting being final allows for easier reasoning about an index, making this setting update-able has more benefits in that we can upgrade existing indices to be hidden and it will enable future features that would dynamically make indices hidden. Backport of #52772	2020-02-26 11:46:29 -07:00
Nik Everett	bfaa487757	Switch pipeline agg parsing to ContextParser (#52776 ) (#52832 ) We've pretty well settled on `ContextParser` for a generic interface to `ObjectParser`-like-things. This switches the interface used for building parsing pipeline aggregations to `ContextParser` which saves a couple of little wrappers around `ObjectParser`.	2020-02-26 12:57:20 -05:00
Tim Brooks	be8d704e2b	Remove seeds dependency for remote cluster settings (#52829 ) Currently 3 remote cluster settings (ping interval, skip unavailable, and compression) have a dependency on the seeds setting being configured. With proxy mode, it is now possible that the seeds setting has not been configured. This commit removes this dependency and adds new validation for these settings.	2020-02-26 10:17:25 -07:00
Adrien Grand	1807f86751	Generalize how queries on `_index` are handled at rewrite time (#52815 ) Generalize how queries on `_index` are handled at rewrite time (#52486) Since this change refactors rewrites, I also took it as an opportunity to address #49254: instead of returning the same queries you would get on a keyword field when a field is unmapped, queries get rewritten to a MatchNoDocsQueryBuilder. This change exposed a couple bugs, like the fact that the percolator doesn't rewrite queries at query time, or that the significant_terms aggregation doesn't rewrite its inner filter, which I fixed. Closes #49254	2020-02-26 15:37:43 +01:00
Luca Cavanna	9e38125464	Clarify when shard iterators get sorted (#52810 ) Currently we have two ways to create a GroupShardsIterator: one that will resort the iterators based on their natural ordering, and another one that will leave them in their original order. This is currently done through two constructors, one that accepts a single argument which does the sorting, and another which accepts a second boolean argument to control whether sorting should happen or not. This second constructor is only called externally to disable the sorting. By introducing a specific method to create a sorted shard iterator we clarify and make it easier to track when we do sort and when we do not as the iterators are externally sorted.	2020-02-26 13:58:20 +01:00
Jim Ferenczi	a73ad248e8	Fix backport of #46731 (#52744 ) This change fixes the incomplete backport of #46731 in 7.x (as of 7.5). We now check if `max_children` is set on the top level nested sort and fail with an exception if it's not the case. Relates #46731 Closes #52202	2020-02-26 10:46:51 +01:00
Sachin Frayne	d3c0a2f013	Improve the error message when loading text fielddata. (#52753 ) Emphasize keyword over fielddata as the preferred way to use String fields for aggregations or sorting.	2020-02-25 15:45:44 -08:00
Lee Hinman	662f21fcea	Remove TODO in MaxAgeCondition serialization (#52794 ) * Remove TODO in MaxAgeCondition serialization This removes the TODO with a message for any future readers regarding the code in question. Resolves #52505	2020-02-25 15:47:36 -07:00
Tim Brooks	c8ef9649e2	Force execution of finish shard bulk request (#51957 ) (#52484 ) Currently the shard bulk request can be rejected by the write threadpool after a mapping update. This introduces a scenario where the mapping listener thread will attempt to finish the request and fsync. This thread can potentially be a transport thread. This commit fixes this issue by forcing the finish action to happen on the write threadpool. Fixes #51904.	2020-02-25 14:37:11 -07:00
Nhat Nguyen	848d3bc153	Revert "Fix testKeepTranslogAfterGlobalCheckpoint" This reverts commit `a88d54eb2d`.	2020-02-25 14:12:35 -05:00
Nhat Nguyen	a88d54eb2d	Fix testKeepTranslogAfterGlobalCheckpoint Read the last synced global checkpoint after flushing as we might advance it during committing. CI: https://gradle-enterprise.elastic.co/s/7o6qengg4gva2	2020-02-25 11:49:24 -05:00
Alan Woodward	638f3e4183	Use ByteBuffersDirectory rather than RAMDirectory (#52768 ) Lucene's RAMDirectory has been deprecated. This commit replaces all uses of RAMDirectory in elasticsearch with the newer ByteBuffersDirectory. Most uses are in tests, but the percolator and painless executor may get some small speedups.	2020-02-25 15:46:35 +00:00
Alan Woodward	18663b0a85	Don't index ranges including NOW in percolator (#52748 ) Currently, date ranges queries using NOW-based date math are rewritten to MatchAllDocs queries when being preprocessed for the percolator. However, since we added the verification step, this can result in incorrect matches when percolator queries are run without scores. This commit changes things to instead wrap date queries that use NOW with a new DateRangeIncludingNowQuery. This is a simple wrapper query that returns its delegate at rewrite time, but it can be detected by the percolator QueryAnalyzer and be dealt with accordingly. This also allows us to remove a method on QueryRewriteContext, and push all logic relating to NOW-based ranges into the DateFieldMapper. Fixes #52617	2020-02-25 12:18:16 +00:00
Ryan Ernst	5fba8cbc7b	Rename local Environment var in Node to avoid confusion (#52602 ) When the Node class is being constructed, an initial environment is passed in with the initial settings for the node. Once the plugin service is initialized, the final Environment+Settings are created, at which point the initial environment should no longer be used. This commit renames the constructor arg to avoid naming clashes with the final environment variable.	2020-02-24 11:14:46 -08:00
Lee Hinman	7d9de8412a	[7.x] fix npe in RestPluginsAction (#52620 ) (de56de9a) (#52721 ) Relates #45321 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Kaihong.Wang <kyra.wkh@alibaba-inc.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-24 11:57:01 -07:00
Mayya Sharipova	034b1c0ba3	Correct boost calculation in script_score query (#52478 ) (#52724 ) Previously, the boost in a script_score query was wrongly applied only to the subquery. This commit makes sure that the boost is applied to the whole score that comes out of the script. Closes #48465	2020-02-24 13:48:21 -05:00
Adrien Grand	f993ef80f8	Move the terms index of `_id` off-heap. (#52518 ) In #42838 we moved the terms index of all fields off-heap except the `_id` field because we were worried it might make indexing slower. In general, the indexing rate is only affected if explicit IDs are used, as otherwise Elasticsearch almost never performs lookups in the terms dictionary for the purpose of indexing. So it's quite wasteful to require the terms index of `_id` to be loaded on-heap for users who have append-only workloads. Furthermore I've been conducting benchmarks when indexing with explicit ids on the http_logs dataset that suggest that the slowdown is low enough that it's probably not worth forcing the terms index to be kept on-heap. Here are some numbers for the median indexing rate in docs/s: \| Run \| Master \| Patch \| \| --- \| ------- \| ------- \| \| 1 \| 45851.2 \| 46401.4 \| \| 2 \| 45192.6 \| 44561.0 \| \| 3 \| 45635.2 \| 44137.0 \| \| 4 \| 46435.0 \| 44692.8 \| \| 5 \| 45829.0 \| 44949.0 \| And now heap usage in MB for segments: \| Run \| Master \| Patch \| \| --- \| ------- \| -------- \| \| 1 \| 41.1720 \| 0.352083 \| \| 2 \| 45.1545 \| 0.382534 \| \| 3 \| 41.7746 \| 0.381285 \| \| 4 \| 45.3673 \| 0.412737 \| \| 5 \| 45.4616 \| 0.375063 \| Indexing rate decreased by 1.8% on average, while memory usage decreased by more than 100x. The `http_logs` dataset contains small documents and has a simple indexing chain. More complex indexing chains, e.g. with more fields, ingest pipelines, etc. would see an even lower decrease of indexing rate.	2020-02-24 18:14:12 +01:00
Alan Woodward	7dc41a3b83	Use BoostQuery rather than FunctionScoreQuery for query-time indices_boost (#52272 ) This is a trivial change, but it should result in a slightly more efficient query boost.	2020-02-24 14:41:46 +00:00
Nik Everett	d26d7721ea	Continue realizing sorting by aggregations (backport of #52298 ) (#52667 ) This drops more of the `instanceof`s from `AggregationPath`. There are still a couple in `AggregationPath`. And I ended up moving two into `BucketsAggregator`, but I think this is still an improvement!	2020-02-23 17:13:55 -05:00
bellengao	02cb5b6c0e	Return 429 status code on read_only_allow_delete index block (#50166 ) We consider index level read_only_allow_delete blocks temporary since the DiskThresholdMonitor can automatically release those when an index is no longer allocated on nodes above high threshold. The rest status has therefore been changed to 429 when encountering this index block to signal retryability to clients. Related to #49393	2020-02-22 16:24:25 +01:00
Jay Modi	8abfda0b59	Rename assertThrows to prevent naming clash (#52651 ) This commit renames ElasticsearchAssertions#assertThrows to assertRequestBuilderThrows and assertFutureThrows to avoid a naming clash with JUnit 4.13+ and static imports of these methods. Additionally, these methods have been updated to make use of expectThrows internally to avoid duplicating the logic there. Relates #51787 Backport of #52582	2020-02-21 13:30:11 -07:00
Stuart Tettemer	376932a47d	Scripting: split out compile limits and caching (#52498 ) (#52652 ) Phase 1 of adding compilation limits per context. * Refactor rate limiting and caching into separate class, `ScriptCache`, which will be used per context. * Disable compilation limit for certain tests. Backport of 0866031 Refs: #50152	2020-02-21 12:10:51 -07:00
Jay Modi	f3f6ff97ee	Single instance of the IndexNameExpressionResolver (#52604 ) This commit modifies the codebase so that our production code uses a single instance of the IndexNameExpressionResolver class. This change is being made in preparation for allowing name expression resolution to be augmented by a plugin. In order to remove some instances of IndexNameExpressionResolver, the single instance is added as a parameter of Plugin#createComponents and PersistentTaskPlugin#getPersistentTasksExecutor. Backport of #52596	2020-02-21 07:50:02 -07:00
markharwood	96d603979b	Upgrade Lucene to 8.5.0-snapshot-b01d7cb (#52584 ) Upgrading 7x to same Lucene 8.5 version used in master	2020-02-21 10:25:03 +00:00
Armin Braun	0a09e15959	Add Caching for RepositoryData in BlobStoreRepository (#52341 ) (#52566 ) Cache latest `RepositoryData` on heap when it's absolutely safe to do so (i.e. when the repository is in strictly consistent mode). `RepositoryData` can safely be assumed to not grow to a size that would cause trouble because we often have at least two copies of it loaded at the same time when doing repository operations. Also, concurrent snapshot API status requests currently load it independently of each other and so on, making it safe to cache on heap and assume as "small" IMO. The benefits of this move are: * Much faster repository status API calls * listing all snapshot names becomes instant * Other operations are sped up massively too because they mostly operate in two steps: load repository data then load multiple other blobs to get the additional data * Additional cloud cost savings * Better resiliency, saving another spot where an IO issue could break the snapshot * We can simplify a number of spots in the current code that currently pass around the repository data in tricky ways to avoid loading it multiple times in follow ups.	2020-02-21 10:20:07 +01:00
Armin Braun	4bb780bc37	Refactor Inflexible Snapshot Repository BwC (#52365 ) (#52557 ) * Refactor Inflexible Snapshot Repository BwC (#52365) Transport the version to use for a snapshot instead of whether to use shard generations in the snapshots in progress entry. This allows making upcoming repository metadata changes in a flexible manner in an analogous way to how we handle serialization BwC elsewhere. Also, exposing the version at the repository API level will make it easier to do BwC relevant changes in derived repositories like source only or encrypted.	2020-02-21 09:14:34 +01:00
Ignacio Vera	107f00a4ec	Add support for multipoint geoshape queries (#52133 ) (#52553 ) Currently multi-point queries are not supported when indexing your data using BKD-backed geoshape strategy. This commit removes this limitation.	2020-02-21 07:45:53 +01:00
Yannick Welsch	d76358c875	Deprecate fixed_auto_queue_size thread pool type (#52399 ) Relates #52280	2020-02-20 11:11:06 +01:00
Yannick Welsch	3afb5ca133	Fix synchronization in ByteSizeCachingDirectory (#52512 ) One particular code place was synchronizing on the wrong object.	2020-02-19 16:10:39 +01:00
Przemysław Witek	7cd997df84	[ML] Make ml internal indices hidden (#52423 ) (#52509 )	2020-02-19 14:02:32 +01:00
Ignacio Vera	8d2261fe47	Refactor GeoShapeIndexer by extracting polygon / line decomposers (#52422 ) (#52506 ) Refactor GeoShapeIndexer. We extract Polygon and Line decomposers which are in charge of breaking a shape around the dateline if needed.	2020-02-19 12:04:29 +01:00
Henning Andersen	9d40277d4c	Deciders should not by default collect yes'es (#52438 ) AllocationDeciders would collect Yes decisions when not asking for debug info. Changed to only include Yes decisions when debug is requested (explain).	2020-02-19 11:18:03 +01:00
Henning Andersen	d4bc3b75dc	Reindex: allow comma separated source indices (#52044 ) Added ability to specify comma separated list of source indices without array. Also fixed so that empty string results in validation error rather than index does not exist. Closes #51949	2020-02-19 09:23:15 +01:00
David Turner	baf184c93f	Avoid using WindowsFS in ClusterRerouteIT (#52488 ) Issue #52000 looks like a case of cluster state updates being slower than expected, but it seems that these slowdowns are relatively rare: most invocations of `testDelayWithALargeAmountOfShards` take well under a minute in CI, but there are occasional failures that take 6+ minutes instead. When it fails like this, cluster state persistence seems generally slow: most are slower than expected, with some small updates even taking over 2 seconds to complete. The failures all have in common that they use `WindowsFS` to emulate Windows' behaviour of refusing to delete files that are still open, by tracking all files (really, inodes) and validating that deleted files are really closed first. There is a suggestion that this is a little slow in the Lucene test framework [1]. To see if we can attribute the slowdown to that common factor, this commit suppresses the use of `WindowsFS` for this test suite. [1] `4a513fa99f/lucene/test-framework/src/java/org/apache/lucene/util/TestRuleTemporaryFilesCleanup.java (L166)`	2020-02-19 07:52:49 +00:00
Tim Brooks	8038f9bba6	Do not lock when generating time based uuid (#52436 ) Currently we lock when generating time based uuids. The lock is implemented to prevent concurrent writes to the last timestamp. The uuid generation is an area of contention when indexing. This commit modifies the code to use atomic compare and set operations to update the last timestamp.	2020-02-18 09:55:51 -07:00
Tim Brooks	7fcd997b39	Do not lock on settings keyset if keys initialized (#52435 ) Every time a setting#exist call is made we lock on the keyset to ensure that it has been initialized. This is a heavyweight operation that should only be done once. This commit moves to a volatile read instead to prevent unnecessary locking.	2020-02-18 09:36:07 -07:00
Tim Brooks	a742c58d45	Extract a ConnectionManager interface (#51722 ) Currently we have three different implementations representing a `ConnectionManager`. There is the basic `ConnectionManager` which holds all connections for a cluster. And a remote connection manager which supports proxy behavior. And a stubbable connection manager for tests. The remote and stubbable instances use the delegate pattern, so this commit extracts an interface for them all to implement.	2020-02-18 09:19:24 -07:00
Benedict Jin	0c4f7dc193	Minor code improvements (#51921 ) Fix some whitespaces, comments and usage of `this.`. (cherry picked from commit 9f59900bf6389172811eb2279c17a2dc7cd9dfdf)	2020-02-18 16:00:05 +01:00
David Turner	3d57a78deb	Add extra logging for investigation into #52000 (#52472 ) It looks like #52000 is caused by a slowdown in cluster state application (maybe due to #50907) but I would like to understand the details to ensure that there's nothing else going on here too before simply increasing the timeout. This commit enables some relevant `DEBUG` loggers and also captures stack traces from all threads rather than just the three hottest ones.	2020-02-18 13:02:33 +00:00
Armin Braun	57d6dd7e31	Fix Non-Verbose Snapshot List Missing Empty Snapshots (#52433 ) (#52456 ) We were not including snapshots without indices in the non-verbose listing because we used the snapshot -> indices mapping to get the snapshots.	2020-02-18 11:37:53 +01:00
Armin Braun	cc628748e1	Optimize FilterStreamInput for Network Reads (#52395 ) (#52403 ) When `FilterStreamInput` wraps a Netty `ByteBuf` based stream it did not forward the bulk primitive reads to the delegate. These are optimized on the delegate but if they're not forwarded then the delegate will be called e.g. 4 times to read an `int`. This happens for essentially all network reads prior to this change because they all run from a `NamedWritableAwareStreamInput`. This also required optimising `BufferedChecksumStreamInput` individually to use bulk reads from the buffer because it implicitly assumed that the filter stream input wouldn't override any of the bulk operations.	2020-02-17 13:07:19 +01:00
Nik Everett	146def8caa	Implement top_metrics agg (#51155 ) (#52366 ) The `top_metrics` agg is kind of like `top_hits` but it only works on doc values so it should be faster. At this point it is fairly limited in that it only supports a single, numeric sort and a single, numeric metric. And it only fetches the "very topest" document worth of metric. We plan to support returning a configurable number of top metrics, requesting more than one metric and more than one sort. And, eventually, non-numeric sorts and metrics. The trick is doing those things fairly efficiently. Co-Authored by: Zachary Tong <zach@elastic.co>	2020-02-14 11:19:11 -05:00
Nik Everett	53b6583fed	Decode max and min optimization more carefully (#52336 ) (#52358 ) Fixes the no-query optimization for `min` and `max` aggregations for `date_nanos` fields by delegating decoding dates "through" their `resolution` member. Closes #52220	2020-02-14 07:07:56 -05:00
Julie Tibshirani	0d7165a40b	Standardize naming of fetch subphases. (#52171 ) This commit makes the names of fetch subphases more consistent: * Now the names end in just 'Phase', whereas before some ended in 'FetchSubPhase'. This matches the query subphases like AggregationPhase. * Some names include 'fetch' like FetchScorePhase to avoid ambiguity about what they do.	2020-02-13 13:00:46 -08:00
Nik Everett	2dac36de4d	HLRC support for string_stats (#52163 ) (#52297 ) This adds a builder and parsed results for the `string_stats` aggregation directly to the high level rest client. Without this the HLRC can't access the `string_stats` API without the elastic licensed `analytics` module. While I'm in there this adds a few of our usual unit tests and modernizes the parsing.	2020-02-12 19:25:05 -05:00
Nik Everett	7efce22f19	Fix a DST error in date_histogram (backport #52016 ) (#52237 ) When `date_histogram` attempts to optimize itself for a particular time zone it checks to see if the entire shard is within the same "transition". Most time zones transition once every six months or thereabouts so the optimization can usually kick in. But it crashes when you attempt to feed it a time zone whose last DST transition was before epoch. The reason for this is a little twisted: before this patch it'd find the next and previous transitions in milliseconds since epoch. Then it'd cast them to `Long`s and pass them into the `DateFieldType` to check if the shard's contents were within the range. The trouble is they are then converted to `String`s which are then parsed back to `Instant`s which are then converted to `long`s. And the parser doesn't like most negative numbers. And everything before epoch is negative. This change removes the `long` -> `Long` -> `String` -> `Instant` -> `long` chain in favor of passing the `long` -> `Instant` -> `long` which avoids the fairly complex parsing code and handles a bunch of interesting edge cases around epoch. And other edge cases around `date_nanos`. Closes #50265	2020-02-12 17:57:04 -05:00
Nhat Nguyen	12cb6dcefe	Fix testFlushOnInactive (#52275 ) We need to reduce the translog sync interval for indices with translog async setting so that we can have the safe commit in the assertBusy interval. This is needed since #51905, where we use the local checkpoint of the safe commit to calculate the number of uncommitted operations of a translog stats. Closes #52251 Relates #51905	2020-02-12 17:19:02 -05:00
Jay Modi	5bcc6fce5c	Remove DeprecationLogger from route objects (#52285 ) This commit removes the need for DeprecatedRoute and ReplacedRoute to have an instance of a DeprecationLogger. Instead the RestController now has a DeprecationLogger that will be used for all deprecated and replaced route messages. Relates #51950 Backport of #52278	2020-02-12 15:05:41 -07:00
Marios Trivyzas	dac720d7a1	Add a cluster setting to disallow expensive queries (#51385 ) (#52279 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that usually have slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled) - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050 (cherry picked from commit a8b39ed842c7770bd9275958c9f747502fd9a3ea)	2020-02-12 22:56:14 +01:00
Ryan Ernst	c07f46409c	Fix single newline in logging output stream buffer (#52253 ) The buffer in LoggingOutputStream skips flushing when only a newline appears. However, if a windows newline appeared, the buffer length was not reset. This commit resets the length so the \r does not appear in the next logging message. closes #51838	2020-02-12 10:48:55 -08:00
Nhat Nguyen	e098e837f7	Fix testShouldPeriodicallyFlushAfterMerge (#52243 ) MockRandomMergePolicy randomly determines if a segment should use a compound format. This can cause a force merge performing two merges: (1) merging to a single segment, (2) rewriting the new segment using the compound format. If the second merge completes after we have flushed, then it can flip the flag shouldPeriodicallyFlushAfterBigMerge to true. Closes #52205	2020-02-12 11:25:39 -05:00
Gordon Brown	d48ce12920	Convert ILM and SLM histories into hidden indices (#51456 ) Modifies SLM's and ILM's history indices to be hidden indices for added protection against accidental querying and deletion, and improves IndexTemplateRegistry to handle upgrading index templates. Also modifies the REST test cleanup to delete hidden indices.	2020-02-11 14:18:55 -07:00
Nik Everett	86d5211c05	Make sorting by an agg results a real abstraction (#52007 ) (#52212 ) This removes a bunch of `instanceof`s in favor of two new methods on `InternalAggregation`. The default implementations of these methods just throw exceptions explaining that you can't sort on this aggregation. They are overridden by all of the classes that used to have `instanceof` checks against them. I doubt this is really any faster in practice. The real benefit here is that it is a little more obvious that you can sort by the results of an aggregation and it should be much more obvious where to look at how aggregations sort themselves. There are still a bunch more `instanceof`s left in `AggregationPath` but those will wait for a followup change.	2020-02-11 12:58:40 -05:00
Hendrik Muhs	098380e483	Percentiles aggregation validation checks for range (#51871 ) disallow to specify percentile out of range [0,100]. This also fixes a problem in transform by failing validation if an invalid percentile configuration is used.	2020-02-11 17:25:39 +01:00
David Roberts	473468d763	[ML] Better error when persistent task assignment disabled (#52014 ) Changes the misleading error message when attempting to open a job while the "cluster.persistent_tasks.allocation.enable" setting is set to "none" to a clearer message that names the setting. Closes #51956	2020-02-11 15:23:21 +00:00
Zachary Tong	87854573e4	Add version constant for 7.6.1	2020-02-11 09:44:43 -05:00
Igor Motov	667e1a5225	Add Boxplot Aggregation (#52174 ) Adds a `boxplot` aggregation that calculates min, max, median and the first and the third quartiles of the given data set. Closes #33112	2020-02-11 09:38:17 -05:00
David Turner	00b9098250	Ignore timeouts with single-node discovery (#52159 ) Today we use `cluster.join.timeout` to prevent nodes from waiting indefinitely if joining a faulty master that is too slow to respond, and `cluster.publish.timeout` to allow a faulty master to detect that it is unable to publish its cluster state updates in a timely fashion. If these timeouts occur then the node restarts the discovery process in an attempt to find a healthier master. In the special case of `discovery.type: single-node` there is no point in looking for another healthier master since the single node in the cluster is all we've got. This commit suppresses these timeouts and instead lets the node wait for joins and publications to succeed no matter how long this might take.	2020-02-11 14:15:01 +00:00
David Kyle	343ced42be	Mute LoggingOutputStreamTests.testMaxBuffer (#52193 ) Relates to https://github.com/elastic/elasticsearch/issues/51838	2020-02-11 11:46:17 +00:00
Gordon Brown	350288ddf8	Check dot-index rules after template application (#52087 ) Previously, the check of the dot-index rules (namely, that indices with dot-prefixed names should be either hidden indices or system indices) was done before template application, and so only checked for the `index.hidden` setting in the request, ignoring whether that setting was set via a template. This commit moves that check to a different method, which is applied after templates have been resolved and applied to the index settings.	2020-02-10 17:01:59 -07:00
Ryan Ernst	88cf8ac0a8	Fix windows empty line in logging capture (#52162 ) This commit fixes another edge case in handling windows newlines in our capture of stdout/stderr to log4j. The case is that the \r appears at the beginning of the buffer when flushing, which would unintentionally be emitted as an empty string. This commit skips the flush if only a \r was found. closes #51838	2020-02-10 13:29:50 -08:00
Julie Tibshirani	28a8db730f	In FieldTypeLookup, factor out flat object field logic. (#52091 ) Currently, the logic for looking up `flattened` field types lives in the top-level `FieldTypeLookup`. This PR moves it into a dedicated class `DynamicKeyFieldTypeLookup`.	2020-02-10 10:44:02 -08:00
Armin Braun	d8169e5fdc	Don't Upload Redundant Shard Files (#51729 ) (#52147 ) Segment(s) info blobs are already stored with their full content in the "hash" field in the shard snapshot metadata as long as they are smaller than 1MB. We can make use of this fact and never upload them physically to the repo. This saves a non-trivial number of uploads and downloads when restoring and might also lower the latency of searchable snapshots since they can save physically loading this information as well.	2020-02-10 16:50:09 +01:00
Ignacio Vera	80e3c97210	Upgrade to lucene-8.5.0-snapshot-d62f6307658 (#52039 ) (#52130 )	2020-02-10 10:13:22 +01:00
Alan Woodward	9b7e688f5b	Don't use a static QueryShardResult for a null instance (#52063 ) Fixes #52042	2020-02-10 09:03:43 +00:00
Ioannis Kakavas	343fb36c7f	Test modifications for FIPS 140 mode (#51832 ) (#52128 ) - Enable SunJGSS provider for Kerberos tests - Handle the fact that the decrypt method in KeyStoreWrapper might not throw immediately when the GCM cipher is from BouncyCastle FIPS and we end up with a DataInputStream that has reached its end. - Disable tests, jarHell, testingConventions for ingest attachment plugin. We don't support this plugin (and document this) in FIPS mode. - Don't attempt to install ingest-attachment in smoke-test-plugins	2020-02-10 10:57:03 +02:00
Jay Modi	3edadfefd0	RestHandlers declare handled routes (#52123 ) This commit changes how RestHandlers are registered with the RestController so that a RestHandler no longer needs to register itself with the RestController. Instead the RestHandler interface has new methods which when called provide information about the routes (method and path combinations) that are handled by the handler including any deprecated and/or replaced combinations. This change also makes the publication of RestHandlers safe since they no longer publish a reference to themselves within their constructors. Closes #51622 Co-authored-by: Jason Tedor <jason@tedor.me> Backport of #51950	2020-02-09 22:48:32 -07:00
Nhat Nguyen	80a9a08b05	Fix leaking searcher when shards are removed or relocated (#52099 ) We might leak a searcher if the target shard is removed (i.e., its index is deleted) or relocated while we are creating a SearchContext from a SearchRewriteContext. Relates #51708 Closes #52021 I labelled this non-issue for an unreleased bug introduced in #51708.	2020-02-09 22:13:35 -05:00
Armin Braun	90eb6a020d	Remove Redundant Loading of RepositoryData during Restore (#51977 ) (#52108 ) We can just put the `IndexId` instead of just the index name into the recovery source and save one load of `RepositoryData` on each shard restore that way.	2020-02-09 21:44:18 +01:00
Nhat Nguyen	9f541d909d	Always create search context for scroll queries (#52078 ) We need to either exclude null responses from the scroll search response or always create a search context for every target shard, although that scroll query can be written to match_no_docs. Otherwise, we won't find search_context for subsequent scroll requests. This commit implements the latter option as it's less error-prone. Relates #51708	2020-02-08 13:01:01 -05:00
Armin Braun	b77ef1f61b	Cleanup some Dead Code in o.e.index.store (#52045 ) (#52084 ) One obviously unused method and an incorrect Javadoc that referenced an otherwise unused class.	2020-02-08 12:14:51 +01:00
Mark Vieira	e5a9e44ca4	Mute IndicesRequestCacheIT.testQueryRewriteDatesWithNow() Signed-off-by: Mark Vieira <portugee@gmail.com>	2020-02-07 13:14:32 -08:00
Julie Tibshirani	337d73a7c6	Rename MapperService#fullName to fieldType. The new name more accurately describes what the method returns.	2020-02-07 10:35:53 -08:00
Tanguy Leroux	7c6264b28c	Mute IndicesRequestCacheIT.testQueryRewrite() Relates #32827	2020-02-07 19:44:32 +02:00
Armin Braun	91e938ead8	Add Trace Logging of REST Requests (#51684 ) (#52015 ) Being able to trace log all REST requests to a node would make debugging a number of issues a lot easier.	2020-02-07 09:03:20 +01:00
Jim Ferenczi	0f333c89b9	Always rewrite search shard request outside of the search thread pool (#51708 ) (#51979 ) This change ensures that the rewrite of the shard request is executed in the network thread or in the refresh listener when waiting for an active shard. This allows queries that rewrite to match_no_docs to bypass the search thread pool entirely even if the can_match phase was skipped (pre_filter_shard_size > number of shards). Coordinating nodes don't have the ability to create empty responses so this change also ensures that at least one shard creates a full empty response while the others can return null ones. This is needed since creating true empty responses on shards requires creating concrete aggregators which would be too costly to build on a network thread. We should move this functionality to aggregation builders in a follow up but that would be a much bigger change. This change is also important for #49601 since we want to add the ability to use the result of other shards to rewrite the request of subsequent ones. For instance if the first M shards have their top N computed, the top worst document in the global queue can be passed to subsequent shards that can then rewrite to match_no_docs if they can guarantee that they don't have any document better than the provided one.	2020-02-06 10:53:11 +01:00
Jim Ferenczi	fb710cc62b	Remove the query builder serialization from QueryShardException message (#51885 ) QueryBuilders that throw exceptions on shards when building the Lucene query return the full serialization of the query builder in the exception message. For large queries that fail to execute due to the max boolean clause, this means that we keep a reference of these big messages for every shard that participates in the request. In order to limit the memory needed to hold these query shard exceptions in the coordinating node, this change removes the query builder serialization from the shard exception. The query is known by the user so there should be no need to repeat it on every shard exception. We could also omit the entire stack trace for known bad request exceptions but it would deserve a separate issue/pr. Closes #51843 Closes #48910	2020-02-06 08:26:15 +01:00
Nik Everett	80e29a47d8	Fix a sneaky bug in rare_terms (#51868 ) (#51959 ) When the `rare_terms` aggregation contained another aggregation it'd break them. Most of the time. This happened because the process that it uses to remove buckets that turn out not to be rare was incorrectly merging results from multiple leaves. This'd cause array index out of bounds issues. We didn't catch it in the test because the issue doesn't happen on the very first bucket. And the tests generated data in such a way that the first bucket always contained the rare terms. Randomizing the order of the generated data fixed the test so it caught the issue. Closes #51020	2020-02-05 16:32:55 -05:00
Adrien Grand	ad9d2f1922	Move analysis/mappings stats to cluster-stats. (#51875 ) Closes #51138	2020-02-05 11:02:25 +01:00
Yannick Welsch	b4480bb8a4	Mute LoggingOutputStreamTests (#51917 ) Relates #51838	2020-02-05 10:46:45 +01:00
Julie Tibshirani	38ce428831	Create a class to hold field capabilities for one index. (#51844 ) Currently, the same class `FieldCapabilities` is used both to represent the capabilities for one index, and also the merged capabilities across indices. To help clarify the logic, this PR proposes to create a separate class `IndexFieldCapabilities` for the capabilities in one index. The refactor will also help when adding `source_path` information in #49264, since the merged source path field will have a different structure from the field for a single index. Individual changes: * Add a new class IndexFieldCapabilities. * Remove extra constructor from FieldCapabilities. * Combine the add and merge methods in FieldCapabilities.Builder.	2020-02-04 11:24:57 -08:00
Maria Ralli	8d3e73b3a0	Add host address to BindTransportException message (#51269 ) When bind fails, show the host address in addition to the port. This helps debugging cases with wrong "network.host" values. Closes #48001	2020-02-04 17:13:19 +00:00
feifeiiiiiiiiii	337153b29f	Throw better exception on wrong `dynamic_templates` syntax (#51783 ) Currently, a mappings update request, where dynamic_templates is an object instead of an array, results in an HTTP response with a 500 code. This PR checks for this condition and throws a MapperParsingException like we do for other malformed mapping cases. Closes #51486	2020-02-04 17:01:55 +01:00
Henning Andersen	41552359a2	Increase master disruption test assert timeouts (#51810 ) After #51803, the timeouts waiting for assertions around master change were too short.	2020-02-03 15:51:33 +01:00
Henning Andersen	1800b2730f	Fix completeWith exception handling (#51734 ) ActionListener.completeWith would catch exceptions from listener.onResponse and deliver them to listener.onFailure, essentially double notifying the listener. Instead we now assert that listeners do not throw when using ActionListener.completeWith. Relates #50886	2020-02-03 14:22:55 +01:00
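
The double notification described in the last entry above (1800b2730f) can be illustrated with a small, self-contained sketch. This is not the Elasticsearch code: the `Listener` interface, the class name `CompleteWithSketch`, and both `completeWith...` helpers are hypothetical stand-ins written for illustration. The second helper also simply lets a listener exception propagate to the caller, whereas the commit message says the real change asserts that listeners do not throw.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Supplier;

public class CompleteWithSketch {

    // Hypothetical stand-in for a listener with a success and a failure callback.
    interface Listener<T> {
        void onResponse(T value);
        void onFailure(Exception e);
    }

    // Problematic shape: the try block covers onResponse as well as the supplier,
    // so an exception thrown by onResponse is routed to onFailure and the
    // listener ends up notified twice.
    static <T> void completeWithDoubleNotify(Listener<T> listener, Supplier<T> supplier) {
        try {
            listener.onResponse(supplier.get());
        } catch (Exception e) {
            listener.onFailure(e);
        }
    }

    // Safer shape: only the supplier is guarded. A throwing onResponse is treated
    // as a listener bug and is not converted into a second notification.
    static <T> void completeWithOnce(Listener<T> listener, Supplier<T> supplier) {
        T value;
        try {
            value = supplier.get();
        } catch (Exception e) {
            listener.onFailure(e);
            return;
        }
        listener.onResponse(value);
    }

    // Records which callbacks fire when onResponse itself throws.
    static List<String> run(boolean fixed) {
        List<String> events = new ArrayList<>();
        Listener<String> listener = new Listener<String>() {
            @Override
            public void onResponse(String value) {
                events.add("onResponse");
                throw new IllegalStateException("listener bug");
            }

            @Override
            public void onFailure(Exception e) {
                events.add("onFailure");
            }
        };
        try {
            if (fixed) {
                completeWithOnce(listener, () -> "ok");
            } else {
                completeWithDoubleNotify(listener, () -> "ok");
            }
        } catch (IllegalStateException e) {
            events.add("propagated");
        }
        return events;
    }

    public static void main(String[] args) {
        System.out.println(run(false)); // prints [onResponse, onFailure]
        System.out.println(run(true));  // prints [onResponse, propagated]
    }
}
```

In the first variant the listener sees both callbacks for a single completion; in the second it sees only `onResponse`, and the failure surfaces at the call site instead.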


4386 Commits