Node roles vary by version, and new roles are suppressed for BWC. This
means we can receive a join from a node that's already in the cluster
but with a different set of roles: the node didn't change roles, but the
cluster state came via an older master. This commit ensures that we
properly process a join from such a node to ensure that the roles are
correct.
Closes #62840
This change fixes a bug introduced in #61779 that uses a compound order to
compare buckets when merging. The bug is triggered when the compound order
uses a primary sort ordered by key (asc or desc).
This commit ensures that we always extract the primary sort when comparing keys
during merging.
The PR is marked as no-issue since the bug has not been released in any official version.
This commit internalizes whether or not a role represents the ability to
contain data. In the future, this will let us remove the compatibility
role notion.
Now that we're consistently using `can_match` to filter which shards we
run on we can get this confusing case:
1. You have a search with, say, a range and a sub-agg.
2. That search has a query that `can_match` can recognize will match no
docs. On *any* shard.
3. So we dutifully run it on a single shard so it can produce the
"empty" aggs.
4. The shard we pick happens to not have the target of the range mapped.
5. This kicks in the special range aggregator that doesn't collect any
documents.
6. Before this commit, that range aggregator *also* never produced any
sub-aggs.
So, without this change, it was quite possible for a search that
happened to match no documents to "throw away" the sub-aggs of a range
and a few other aggs.
We've had this problem for a long, long time but it is more confusing
now because `can_match` is really kicking in and causing us to see cases
where it looks like you are targeting a lot of shards but you really are
only targeting a couple. It used to be that to get the "no sub-aggs"
behavior you had to explicitly target only shards that didn't map the
target field of the `range` agg. And, like, in that case it isn't too
bad because you targeted a sort of degenerate shard. But now that
`can_match` is doing its thing you can end up with the confusing steps
above. It took me several hours to track down what was happening, and I
know how the individual pieces of all of this work. It took four hours to
figure out how they fit together in this case....
Anyway! This replaces all the aggregator implementations that throw out
the sub-aggregators with ones that keep them. I think this'll be less
confusing in the future.
Closes #64142
This commit adds logging to indicate whether or not we are using the
bundled JDK. We distinguish between using a distribution that bundles
the JDK versus using a distribution that does not bundle the JDK.
In 7.x we can't just generate this setting by default, as it might not be
supported by data nodes that are assigned shards for an older version in mixed-version
clusters.
Closes #64152
This commit fixes an issue with the detection on macOS for whether or
not the bundled JDK is being used. The logic between macOS and non-macOS
is different because the JDK has a different directory structure on
macOS versus non-macOS. However, due to notarization issues, we changed
the top-level directory from jdk to jdk.app, yet never updated this
detection logic to account for that.
Ideally, we would have a packaging test that asserts that this behavior is
correct and that it stays correct over time. Alas, we do not
currently have packaging tests on macOS.
With this change, we will always return the same point in time in a
search response as its input until we implement the retry mechanism
for points in time.
The formatting of the global bottom value does not take the resolution of the provided
numeric_type into account. This change fixes this bug by providing the resolution
directly in the doc value format if the numeric_type is provided as `date_nanos`.
Closes #63719
We must not remove the snapshot from the initializing set
in the `timeout` getter. This was a plain oversight/mistake
and went unnoticed. It can lead to the removal of a valid
snapshot clone from the cluster state in rare circumstances
(e.g. when a node concurrently joins the cluster or a routing
change happens as it did in the linked test failure).
Closes #64115
If we run into a background merge between creating the snapshot and closing the index
then with compound files we could be in a situation where we get zero file reuse
on restore.
Force merging before the snapshot gives us a single segment that won't change down the line
so the restore always sees file reuse from the closed index.
Closes #63476
Assuming the clone failed when the request failed is not sufficient.
There are failure modes where the request fails but the clone still works out,
because the data node resent the request after the first clone had already been
failed and removed from the cluster state when the master was restarted.
Closes #63473
We have to wait for there to be no more operations here, not for `1`. This mostly worked
because the test thread would add the listener quickly enough that it saw the
state where either the snapshot or the clone, but not both, had already finished,
but randomly the test thread would be slow and time out on a state without snapshots in it.
testHealthOnMasterFailover could time out on some of the health requests
in the case where an index is added, since the recovery leads to
extended test run time.
Closes #62690
We had an error when serializing fully reduced scripted metrics.
Small typo and a severe lack of tests... Anyway, this fixes the one-character
typo and adds a bunch more tests.
This commit adds a test in DiskThresholdDeciderTests that verifies
the allocation of a snapshot recovery source based shard in the
situation where the snapshot shard size was successfully provided
by the SnapshotInfoService introduced in #61906 and when the
service failed to provide the size.
Relates #61906
In #57892 I broke *some* sub-aggregations inside of the `parent` and
`child` aggregator, specifically any sub-aggregations that do work in
the `postCollect` phase. This fixes it by delaying the post collect
phase of aggs under `parent` and `child` until `beforeBuildingBuckets`
because, well, we haven't done *any* collection until after that phase.
Currently, if a distance_feature query contains a boost, the boost
incorrectly gets applied twice: once in AbstractQueryBuilder::toQuery, and
again when we pass the boost to Lucene's LongPoint.newDistanceFeatureQuery.
As a result we get incorrect scores.
This change fixes the error to ensure that the boost is applied only once.
Closes #63691
This commit fixes the UpdateThreadPoolSettingsTests to be aware of the
hard limit on the maximum size of the system_write executor. This
executor has a hard limit that matches the write executor, which is
the number of allocated processors.
Closes #63131
Backport #63700
Today indexing to a shard with 2147483519 documents will fail that
shard. We should check the number of documents and reject the write
requests instead.
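Roughly, such a guard could look like the sketch below (the class and field names are illustrative, not the actual engine code); 2147483519 is Lucene's per-index document ceiling (IndexWriter.MAX_DOCS):

```java
// Hypothetical sketch: reject a write up front when it would push the shard past the
// per-shard document limit, instead of letting the underlying Lucene index fail the shard.
public class ShardDocLimitSketch {

    // Lucene's hard cap on documents per index (IndexWriter.MAX_DOCS).
    private static final long MAX_DOCS_PER_SHARD = 2_147_483_519L;

    private long currentDocCount;

    public ShardDocLimitSketch(long currentDocCount) {
        this.currentDocCount = currentDocCount;
    }

    /** Throws instead of accepting an operation that would exceed the limit. */
    public void ensureCapacity(int newDocs) {
        if (currentDocCount + newDocs > MAX_DOCS_PER_SHARD) {
            throw new IllegalArgumentException(
                "Number of documents in the shard cannot exceed [" + MAX_DOCS_PER_SHARD + "]");
        }
        currentDocCount += newDocs;
    }
}
```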
Closes #51136
This fixes a gap in testing and a bug that can occur in various forms:
When we would start a snapshot or clone related to a shard that was done
snapshotting/cloning but its overall operation was not yet finalized
at the time of starting the operation, we would base the operation off of
the wrong generation. This would not cause a corrupted repo, but would
cause the operation to be `PARTIAL`.
This commit fixes the state machine to take into account the correct generation
in this case.
Closes #63498
This PR implements value fetching for the following field types:
* `text` phrase and prefix subfields
* `search_as_you_type`, plus its subfields
* `token_count`, which is implemented by fetching doc values
Supporting these types helps ensure that retrieving all fields through
`"fields": ["*"]` doesn't fail because of unsupported value fetchers.
Currently we flush the Translog buffer when a new operation causes the
buffer to breach 1MB. This introduces a scenario where an exception is
thrown AFTER the writer has accepted the operation. To avoid this, this
commit flushes the Translog in an #add call before adding a new
operation.
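A minimal sketch of the idea, with illustrative names rather than the real TranslogWriter API: flush the buffered bytes before accepting the new operation, so any flush failure happens while the operation is still unacknowledged.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.util.ArrayList;
import java.util.List;

class BufferedOpWriterSketch {
    private static final int FLUSH_THRESHOLD_BYTES = 1024 * 1024; // ~1MB

    private final OutputStream channel;
    private final List<byte[]> buffer = new ArrayList<>();
    private int bufferedBytes = 0;

    BufferedOpWriterSketch(OutputStream channel) {
        this.channel = channel;
    }

    synchronized void add(byte[] operation) throws IOException {
        // Flush first: if this throws, the new operation has not been accepted yet.
        if (bufferedBytes + operation.length > FLUSH_THRESHOLD_BYTES) {
            flushBuffer();
        }
        buffer.add(operation);
        bufferedBytes += operation.length;
    }

    private void flushBuffer() throws IOException {
        for (byte[] op : buffer) {
            channel.write(op);
        }
        buffer.clear();
        bufferedBytes = 0;
    }
}
```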
This fixes #63299.
This PR adds factory methods for the most common implementations:
* `SourceValueFetcher.identity` to pass through the source value untouched.
* `SourceValueFetcher.toString` to simply convert the source value to a string.
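Conceptually the two factories behave like the following sketch (the real SourceValueFetcher has a richer API; this only illustrates the two behaviors):

```java
import java.util.function.Function;

final class ValueFetcherSketch {
    /** Passes the parsed source value through untouched. */
    static Function<Object, Object> identity() {
        return value -> value;
    }

    /** Converts whatever was found in the source to a string. */
    static Function<Object, Object> toStringFetcher() {
        return value -> value == null ? null : value.toString();
    }
}
```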
#63214 made TypeFieldType a constant field, and fixed things so that it always
emits deprecation warnings whenever it is referenced in a query or aggregation.
However, it also emits warnings when it is used to build a type filter through
the search context; this is unnecessary, as warnings are already emitted by
the REST layer when types are specified as part of the URL, and it is causing
failures in some BWC tests.
This commit adds a specialised typeFilter method to TypeFieldType to handle
this case without emitting any extra warnings. It also removes an unused duplicate
TypeFieldType class that resulted from a backport merge error.
Fixes #63366
As a result of this, we can remove a chunk of code from TypeParsers as well. Tests
for search/index mode analyzers have moved into their own file. This commit also
rationalises the serialization checks for parameters into a single SerializerCheck
interface that takes the values includeDefaults, isConfigured and the value
itself.
Relates to #62988
We were not consistent in checking for node roles before adding listeners.
In some cases we did check the necessity of a CS listener and in others we did not.
This commit fixes a number of cases of redundant listeners that don't apply to all node roles.
In #61906 we agreed on always providing the default value
ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE
when the SnapshotInfoService failed to retrieve the exact
size for a given snapshot shard. The motivation was to
allow the shard allocation to move forward in case of
failures (so that the unassigned shard does not get stuck
in an unassigned state for too long) while relying on the
fallback values for shard sizes.
Sadly a bug in
SnapshotShardSizeInfo#getShardSize(ShardRouting, long)
causes the default value to be ignored when the snapshot
shard size retrieval previously failed, returning
ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE
instead of the provided default value. With DiskThresholdDecider
also not relying on the provided default value, this triggers
assertions like the one in #63376, which helped us spot the bug.
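The corrected behavior is roughly the following (illustrative names, not the actual SnapshotShardSizeInfo code): a stored "retrieval failed" sentinel must be treated the same as "unknown" so that the caller-provided default wins.

```java
import java.util.Map;

class ShardSizeLookupSketch {
    static final long UNAVAILABLE = -1L; // stands in for UNAVAILABLE_EXPECTED_SHARD_SIZE

    private final Map<String, Long> knownSizes;

    ShardSizeLookupSketch(Map<String, Long> knownSizes) {
        this.knownSizes = knownSizes;
    }

    long getShardSize(String shardId, long fallback) {
        Long size = knownSizes.get(shardId);
        // The buggy version returned `size` even when it was UNAVAILABLE; the fix is to
        // treat the sentinel the same as "unknown" and return the fallback instead.
        if (size == null || size == UNAVAILABLE) {
            return fallback;
        }
        return size;
    }
}
```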
Closes #63376
The first refreshDiskUsage() refreshes the ClusterInfo update, which in turn
calls listeners like DiskThresholdMonitor. That listener triggers a reroute as
expected and sets an internal checkInProgress flag before submitting
a cluster state update to relocate shards (the internal flag is toggled
again once the cluster state update is processed).
In the test I suspect that the second refreshDiskUsage() may complete
before DiskThresholdMonitor's internal flag is set back to its initial state,
resulting in the second ClusterInfo update being ignored and messages
like "[node_t0] skipping monitor as a check is already in progress" being
logged. Adding another wait for languid events to be processed
before executing the second refreshDiskUsage() should help here.
Closes #62326
Currently we add translog operation bytes to an array list and flush
them on the next write. Unfortunately, this does not currently play well
with our byte pooling which means each operation is backed, at minimum,
by a 16KB array. This commit improves memory efficiency for small
operations by serializing the operations to an output stream.
Currently a TranslogWriter add operation is synchronized. This operation
adds the bytes to the file output stream buffer and issues a write
system call if the buffer is filled. This happens every 8KB which means
that we routinely block other add calls on system writes.
This commit modifies the add operation to simply place the operation in
an array list. The array list is flushed when the sync call occurs or
when 1MB is buffered.
Plugins are loaded in isolated child class loaders of the root class loader. However, some libraries depend on the context class loader being set. This commit sets the context class loader for the duration of calling each plugin's constructor.
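A minimal sketch of the pattern, assuming a plain reflective constructor call (the real plugin loading code is more involved): swap in the plugin's class loader for the duration of the constructor and restore the previous loader afterwards.

```java
import java.lang.reflect.Constructor;

final class ContextLoaderSketch {
    static <T> T construct(Constructor<T> ctor, ClassLoader pluginLoader, Object... args)
            throws ReflectiveOperationException {
        Thread thread = Thread.currentThread();
        ClassLoader previous = thread.getContextClassLoader();
        thread.setContextClassLoader(pluginLoader);
        try {
            // Libraries called from the constructor can now resolve classes and resources
            // through the plugin's own class loader.
            return ctor.newInstance(args);
        } finally {
            // Always restore the previous loader so the change does not leak to other code.
            thread.setContextClassLoader(previous);
        }
    }
}
```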
relates #52320
Co-authored-by: Ryan Ernst <ryan@iernst.net>
When constructing a value fetcher, the 'parsesArrayValue' flag must match
`FieldMapper#parsesArrayValue`. However there is nothing in code or tests to
help enforce this.
This PR reworks the value fetcher constructors so that `parsesArrayValue` is
'false' by default. Just as for `FieldMapper#parsesArrayValue`, field types must
explicitly set it to true and ensure the behavior is covered by tests.
Follow-up to #62974.
This PR adds deprecation warnings when accessing System Indices via the REST layer. At this time, these warnings are only enabled for Snapshot builds by default, to allow projects external to Elasticsearch additional time to adjust their access patterns.
Deprecation warnings will be triggered by all REST requests which access registered System Indices, except for a few purpose-specific APIs which access System Indices as an implementation detail and which will continue to allow access to system indices by default:
- `GET _cluster/health`
- `GET {index}/_recovery`
- `GET _cluster/allocation/explain`
- `GET _cluster/state`
- `POST _cluster/reroute`
- `GET {index}/_stats`
- `GET {index}/_segments`
- `GET {index}/_shard_stores`
- `GET _cat/[indices,aliases,health,recovery,shards,segments]`
Deprecation warnings for accessing system indices take the form:
```
this request accesses system indices: [.some_system_index], but in a future major version, direct access to system indices will be prevented by default
```
Determines the size of shards before allocating those that are
recovering from snapshots. It ensures during shard allocation that the
node selected as the recovery target will have enough free
disk space for the recovery event. This applies to regular restores,
CCR bootstrap from remote, as well as mounting searchable snapshots.
The InternalSnapshotInfoService is responsible for fetching snapshot
shard sizes from repositories. It provides a getShardSize() method
to other components of the system that can be used to retrieve the
latest known shard size. If the latest snapshot shard size retrieval
failed, getShardSize() returns
ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE. While
we'd like a better way to handle such failures, returning this value
allows us to keep the existing behavior for now.
Note that this PR does not address an issue (which we already have today)
where a replica is being allocated without knowing how much disk
space is being used by the primary.
Co-authored-by: Yannick Welsch <yannick@welsch.lu>
Even if we increase the limit it might not take effect straight away if a thread is
blocked on a long wait in `org.elasticsearch.index.snapshots.blobstore.RateLimitingInputStream#maybePause`.
Let's increase the limit a little and see if that deals with the remaining failures for good and stop burning
cycles busy asserting a future completion.
Closes #63246
MapperService carries a lot of weight and is only used to determine if loading of field data for the id field is enabled, which can be done in a different way.
Just a few spots where we can dry up these tests using the snapshot test infrastructure
in core that I found while studying the existing searchable snapshot tests.
In #62509 we already plugged in faster sequential access for stored fields in the fetch phase.
This PR now also uses the potentially better field reader in SourceLookup.
Rally experiments show that this speeds things up, e.g. when runtime fields that use
"_source" are added via "docvalue_fields" or are used in queries or aggs.
Closes #62621
In 6x and 7x, indexes can have only one type, which means that we can rework
all queries against the type field to use a ConstantFieldType. This has already
been done in master with the removal of the TypeFieldMapper, but we still need
that class in 7x to deal with nested documents. This commit leaves
TypeFieldMapper in place, but refactors TypeFieldType to extend
ConstantFieldType and consolidates deprecation warnings within that class.
It also incidentally removes the requirement to pass a MapperService to
IndexFieldData.Builder#build, which should allow #63197 to be backported.
There is no need to let snapshots that haven't yet written anything to the repo
finalize with `FAILED`. When we still had the `INIT` state we would also just remove
these snapshots from the state without any further action.
This is not just a theoretical optimization. Currently, the situation of having a lot of
queued up snapshots is fairly complicated to resolve when all the queued shards move to aborted
since it is now necessary to execute tasks on the `SNAPSHOT` pool (that might be very busy) to
remove the snapshot from the CS (including a number of redundant CS updates and repo writes
for finalizing these snapshots before deleting them right away after).
If the connection between clusters is disconnected or the leader cluster
is offline, then CCR shard-follow tasks can stop with "no seed node
left". CCR should retry on this error.
The copy constructors previously used were hard to read and the exact state changes
were not obvious at all.
Refactored those into a number of named constructors instead, added additional assertions
and moved the snapshot abort logic into `SnapshotsInProgress`.
In #63242 we changed how we build `nextRoundingValue` to, well, be
correct. But the old `org.elasticsearch.common.rounding.Rounding`
implementation didn't get the fix. Which is fine, because that method on
that implementation doesn't receive any use outside of tests. In fact, it
is entirely removed in master. Anyway, now that the two implementations
produce different values we really can't go around asserting that they
produce the same values now, can we? Well, we were!
This skips that assertion if we know `nextRoundingValue` is implemented
differently.
Closes #63256
* Setting `script.painless.regex.enabled` has a new option,
`use-factor`, the default. This defaults to using regular
expressions but limiting the complexity of the regular
expressions.
In addition to `use-factor`, the setting can be `true`, as
before, which enables regular expressions without limiting them.
`false` totally disables regular expressions, which was the
old default.
* New setting `script.painless.regex.limit-factor`. This limits
regular expression complexity by limiting the number of characters
a regular expression can consider based on input length.
The default is `6`, so a regular expression can consider
`6 * input length` characters. With input
`foobarbaz` (length `9`), for example, the regular expression
can consider `54` (`6 * 9`) characters (see the sketch after this list).
This reduces the impact of exponential backtracking in Java's
regular expression engine.
* add `@inject_constant` annotation to whitelist.
This annotation signals that a compiler setting will
be injected at the beginning of a whitelisted method.
The format is `argnum=settingname`:
`1=foo_setting 2=bar_setting`.
Argument numbers must start at one and must be sequential.
* Augment
`Pattern.split(CharSequence)`,
`Pattern.split(CharSequence, int)`,
`Pattern.splitAsStream(CharSequence)`, and
`Pattern.matcher(CharSequence)`
to take the value of `script.painless.regex.limit-factor` as
an injected parameter, limiting as explained above when this
setting is in use.
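One way such a budget can be enforced is sketched below, assuming a wrapper CharSequence that counts every character the regex engine reads (names are illustrative, not the Painless internals):

```java
import java.util.regex.Pattern;

final class LimitedCharSequenceSketch implements CharSequence {
    private final CharSequence wrapped;
    private final int budget;
    private int readCount = 0;

    LimitedCharSequenceSketch(CharSequence wrapped, int limitFactor) {
        this.wrapped = wrapped;
        this.budget = limitFactor * wrapped.length();
    }

    @Override
    public char charAt(int index) {
        // Every read by the regex engine consumes budget; runaway backtracking hits the cap.
        if (++readCount > budget) {
            throw new IllegalStateException("regex exceeded [" + budget + "] character reads");
        }
        return wrapped.charAt(index);
    }

    @Override
    public int length() {
        return wrapped.length();
    }

    @Override
    public CharSequence subSequence(int start, int end) {
        return wrapped.subSequence(start, end);
    }

    public static void main(String[] args) {
        // With factor 6 and a 9-character input, the budget is 54 character reads.
        CharSequence input = new LimitedCharSequenceSketch("foobarbaz", 6);
        System.out.println(Pattern.compile("foo.*baz").matcher(input).matches());
    }
}
```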
Fixes: #49873
Backport of: 93f29a4
We only ever use this with `XContentParser`, so there is no need to make inlining
worse by forcing the lambda and hence a dynamic callsite here.
=> Extracted the exception-formatting code path, which is likely very cold,
into a separate method and removed the lambda usage in hot loops by simplifying
the signature here.
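The pattern looks roughly like this sketch (illustrative names): the hot check stays small, and the message formatting only runs on the cold failure path.

```java
final class EnsureExpectedTokenSketch {

    static void ensureExpectedToken(String expected, String actual) {
        if (expected.equals(actual) == false) {
            throw unexpectedTokenException(expected, actual); // cold path, rarely taken
        }
    }

    // Kept out of the hot method so the common case stays small and easy to inline;
    // no lambda is captured, so there is no dynamic call site in the hot loop.
    private static IllegalStateException unexpectedTokenException(String expected, String actual) {
        return new IllegalStateException(
            "Failed to parse object: expecting token of type [" + expected + "] but found [" + actual + "]");
    }
}
```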
Small refactoring to shorten the diff with the clone logic in #61839:
* Since clones will create a different kind of shard state update that
isn't the same request sent by the snapshot shards service (and cannot be
the same request because we have no `ShardId`) base the shard state updates
on a different class that can be extended to be general enough to accommodate
shard clones as well.
* Make the update executor a singleton (can't make it an inline lambda as that
would break CS update batching because the executor is used as a map key but
this change still makes it crystal clear that there's no internal state to the
executor)
* Make shard state update responses a singleton (can't use TransportResponse.Empty because
we need an action response but still it makes it clear that there's no actual
response with content here)
* Just some obvious drying up of these super complex tests.
* Mainly just shortening the diff of #61839 here by moving test utilities
to the abstract test case.
Also, making use of the now available functionality to simplify existing tests
and improve logging in them.
"interval" style roundings were implementing `nextRoundingValue` in a
fairly inconsistent way - it'd produce a value, but sometimes that
value would be the same as the previous rounding value. This makes it
consistently the next value that `rounding` would make.
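As a toy illustration of the invariant (a fixed numeric interval, not the real Rounding class): `nextRoundingValue(v)` should always be the start of the bucket after `round(v)`, never the same bucket again.

```java
final class IntervalRoundingSketch {
    private final long interval;

    IntervalRoundingSketch(long interval) {
        this.interval = interval;
    }

    long round(long value) {
        return Math.floorDiv(value, interval) * interval;
    }

    long nextRoundingValue(long value) {
        // Round first, then step one full interval forward so the result is always
        // the start of the next bucket rather than (sometimes) the current one.
        return round(value) + interval;
    }
}
```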
As long as `bestEffortConsistency` is `true`, the value of `latestKnownRepoGen`
can be updated as a result of reads. We can only assert that `latestKnownRepoGen`
and cluster state move in lock-step if `bestEffortConsistency` was `false` before
updating the metadata generation as well as after.
Closes #62877
There is a small race when processing the cluster state that is used to
establish a newly elected leader as master of the cluster: it can pick the term
in its master state update task from a different (newer) election. This trips
an assertion in `Coordinator.publish(...)` where we claim that the term on the
state allows us to uniquely define the pre-state, but this isn't so. There are no
bad consequences of this race since such a publication fails later on anyway.
This PR fixes things so that the assertion holds true by improving the handling
of terms during cluster state processing by associating each master state
update task that is used to establish a newly elected leader with the correct
corresponding term from its election. It also explicitly handles the case where
the pre-state that is used as base state has already superseded the current
state. As a nice side-effect, join batching now only happens based on the same
term.
Closes #61437
The iteration over `timeoutClusterStateListeners` starts when the CS applier
thread is still running. This can lead to entries being added to it that never
get their listener resolved on shutdown and thus leak that listener as observed
in a stuck test in #62863.
Since `listener.onClose()` is idempotent we can just call it if we run into a stopped service
on the CS thread to avoid the race with certainty (because the iteration in `doStop` starts after
the stopped state has been set).
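The double-check idea is roughly this sketch (illustrative names, not the actual ClusterApplierService code): after registering, look at the stopped flag again and resolve the listener ourselves if shutdown raced with us; an idempotent onClose() makes the possible duplicate call harmless.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicBoolean;

final class TimeoutListenerRegistrySketch {
    interface TimeoutListener {
        void onClose();
    }

    private final Set<TimeoutListener> listeners = ConcurrentHashMap.newKeySet();
    private final AtomicBoolean stopped = new AtomicBoolean(false);

    void addListener(TimeoutListener listener) {
        listeners.add(listener);
        // Double-check after registering: if stop() raced with us, resolve the listener
        // here instead of leaking it. onClose() being idempotent makes this safe even if
        // stop() also calls it for the same listener.
        if (stopped.get()) {
            listener.onClose();
        }
    }

    void stop() {
        stopped.set(true);
        for (TimeoutListener listener : listeners) {
            listener.onClose();
        }
        listeners.clear();
    }
}
```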
Closes #62863
For runtime fields, we will want to do all search-time interaction with
a field definition via a MappedFieldType, rather than a FieldMapper, to
avoid interfering with the logic of document parsing. Currently, fetching
values for runtime scripts and for building top hits responses need to
call a method on FieldMapper. This commit moves this method to
MappedFieldType, incidentally simplifying the current call sites and freeing
us up to implement runtime fields as pure MappedFieldType objects.
In #22721, the decision to throttle indexing was inadvertently flipped,
so that until this commit we throttled indexing during recovery but
never throttled user-initiated indexing requests. This commit
fixes that to throttle user-initiated indexing requests and never
throttle recovery requests.
Closes #61959