OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-20 03:45:02 +00:00

Author	SHA1	Message	Date
David Turner	aec44fecbc	Decouple DiskThresholdMonitor & ClusterInfoService (#44105 ) Today the `ClusterInfoService` requires the `DiskThresholdMonitor` at construction time so that it can notify it when nodes report changes in their disk usage, but this is awkward to construct: the `DiskThresholdMonitor` requires a `RerouteService` which requires an `AllocationService` which comees from the `ClusterModule` which requires the `ClusterInfoService`. Today we break the cycle with a `LazilyInitializedRerouteService` which is itself a little ugly. This commit replaces this with a more traditional subject/observer relationship between the `ClusterInfoService` and the `DiskThresholdMonitor`.	2019-07-09 18:43:32 +01:00
David Turner	e70cad4c52	Remove node conn block after connection barrier (#44114 ) Today `testOnlyBlocksOnConnectionsToNewNodes` fails (extremely rarely) if the last attempt to connect to `node0` is delayed for so long that the test runs `nodeConnectionsBlocks.clear()` before the connection attempt obtains the expected connection block. We can turn this into a reliable failure with this delay: ```diff diff --git a/server/src/main/java/org/elasticsearch/cluster/NodeConnectionsService.java b/server/src/main/java/org/elasticsearch/cluster/NodeConnectionsService.java index f48413824d3..9a1d0336bcd 100644 --- a/server/src/main/java/org/elasticsearch/cluster/NodeConnectionsService.java +++ b/server/src/main/java/org/elasticsearch/cluster/NodeConnectionsService.java @@ -300,6 +300,13 @@ public class NodeConnectionsService extends AbstractLifecycleComponent { private final Runnable connectActivity = () -> threadPool.executor(ThreadPool.Names.MANAGEMENT).execute(new AbstractRunnable() { @Override protected void doRun() { + + try { + Thread.sleep(500); + } catch (InterruptedException e) { + throw new AssertionError("unexpected", e); + } + assert Thread.holdsLock(mutex) == false : "mutex unexpectedly held"; transportService.connectToNode(discoveryNode); consecutiveFailureCount.set(0); ``` This commit reverts the extra logging introduced in #43979 and fixes this failure by waiting for the connection attempt to hit the barrier before removing it. Fixes #40170	2019-07-09 17:03:26 +01:00
Mark Vieira	a406ef1d38	Fix eclipse project file generation (#44080 )	2019-07-09 08:55:37 -07:00
marcos ramos	88ee47c9ba	Fix OIDC documentation settings (#44115 ) Current kibana setting is xpack.security.auth.oidc.realm, but the correct one is xpack.security.authc.oidc.realm	2019-07-09 18:44:35 +03:00
David Kyle	23d7e309da	Mute put job docs test Relates to #43271	2019-07-09 13:23:31 +01:00
Sachin Frayne	389c923a82	[Docs] Fix json syntax in watcher compare condition (#44032 )	2019-07-09 13:43:18 +02:00
David Turner	268971db03	Wait for blackholed connection before discovery (#44077 ) Since #42636 we no longer treat connections specially when simulating a blackholed connection. This means that at the end of the safety phase we may have just started a connection attempt which will time out, but the default timeout is 30 seconds, much longer than the 2 seconds we normally allow for post-safety-phase discovery. This commit adds time for such a connection attempt to time out. It also fixes some spurious logging of `this` that now refers to an object with an unhelpful `toString()` implementation introduced in #42636. Fixes #44073	2019-07-09 10:59:53 +01:00
Henning Andersen	748a10866d	Reindex ScrollableHitSource pump data out (#43864 ) Refactor ScrollableHitSource to pump data out and have a simplified interface (callers should no longer call startNextScroll, instead they simply mark that they are done with the previous result, triggering a new batch of data). This eases making reindex resilient, since we will sometimes need to rerun search during retries. Relates #43187 and #42612	2019-07-09 11:50:09 +02:00
Henning Andersen	859709cc94	Closed index noop recovery during upgrade (#44072 ) Test that closed indices do noop recovery during rolling upgrade.	2019-07-09 11:46:42 +02:00
David Turner	fd9eebae81	Only apply initial recovery filter to shrunk shard (#44054 ) Today the `index.routing.allocation.initial_recovery._id` setting can only be set on indices that are the result of a shrink, but the filtered allocation decider also applies this filter to shards with a recovery source of `EMPTY_STORE`. The only way to have this setting set while the recovery source is `EMPTY_STORE` is to force-allocate an empty primary, but such a forced allocation ignores this allocation decider. This commit simplifies the allocation decider so that the `initial_recovery` setting only applies to shards with a recovery source of `LOCAL_SHARDS`.	2019-07-09 08:42:18 +01:00
Armin Braun	9eac5ceb1b	Dry up inputstream to bytesreference (#43675 ) (#44094 ) * Dry up Reading InputStream to BytesReference * Dry up spots where we use the same pattern to get from an InputStream to a BytesReferences	2019-07-09 09:18:25 +02:00
Armin Braun	f1ebb82031	Update the gcs chunk_size documentation. (#38749 ) (#44098 ) Remove `1g` from the examples, as the GCS repository chunk_size can be at most 100m.	2019-07-09 09:18:03 +02:00
Armin Braun	dc8f8e40eb	Fix DedicatedClusterSnapshotRestoreIT testSnapshotWithStuckNode (#43537 ) (#44082 ) * Fix DedicatedClusterSnapshotRestoreIT testSnapshotWithStuckNode * See comment in the test: The problem is that when the snapshot delete works out partially on master failover and the retry fails on `SnapshotMissingException` no repository cleanup is run => we still failed even with repo cleanup logic in the delete path now * Fixed the test by rerunning a create snapshot and delete loop to clean up the repo before verifying file counts * Closes #39852	2019-07-09 06:32:08 +02:00
Lisa Cawley	94578a8b47	[DOCS] Defines data frame transform resources (#43996 ) Co-Authored-By: István Zoltán Szabó <istvan.szabo@elastic.co>	2019-07-08 17:53:00 -07:00
Ryan Ernst	2b1cd58648	Remove ActionResponse uses from HLRC (#44091 ) The rest client does not communicate over the transport protocol. However, in the move to make all apis supported in the HLRC, some response classes were copied with extending ActionResponse, which is meant strictly for the transport protocol. This commit removes uses of that base class from HLRC.	2019-07-08 17:27:29 -07:00
lcawl	cd4021274a	[DOCS] Enables testing for create job ML API (#44022 )	2019-07-08 11:43:18 -07:00
Lisa Cawley	117f14e0ed	[DOCS] Updates 7.x version in data frame analytics API (#44026 )	2019-07-08 11:20:57 -07:00
Lisa Cawley	efddbcc1d1	[DOCS] Fixes earliest_record_timestamp data type (#44030 )	2019-07-08 10:16:07 -07:00
David Turner	6dce458ecc	Randomise retention lease expiry time (#44067 ) In today's test suite indices mostly use the default value of `12h` for the `index.soft_deletes.retention_lease.period` setting, which in the context of the test suite essentially means "never expires". In fact, the tests should all behave correctly even if the lease period is much shorter; tests that rely on leases not expiring should configure their indices appropriately. This commit randomises the lease expiry time for those indices created during tests which do not set a specific value for this setting.	2019-07-08 18:29:27 +02:00
Lisa Cawley	4b3f1003b0	[DOCS] Reformat freeze unfreeze APis to use new API format (#43948 )	2019-07-08 09:01:06 -07:00
Christoph Büscher	8e8d7667cb	[Tests] Fix type inference issue (#44063 )	2019-07-08 17:34:35 +02:00
Armin Braun	03332b5aeb	Don't Consistency Check Broken Repository in Test (#43499 ) (#44071 ) * Missed this one in #42189 and it randomly runs into a situation where the broken mock repo is broken such that we can't get to a consistent end state via a delete * Closes #43498	2019-07-08 17:21:40 +02:00
Tanguy Leroux	251287f89d	Check again on-going snapshots/restores of indices before closing (#43873 ) Today we prevent any index that is actively snapshotted or restored to be closed. This verification is done during the execution of the first phase of index closing (ie before blocking the indices). We should also do this verification again in the last phase of index closing (ie after the shard sanity checks and right before actually changing the index state and the routing table) because a snapshot/restore could sneak in while the shards are verified-before-close.	2019-07-08 17:07:04 +02:00
Alpar Torok	0c8294e633	Make sure the clean task doesn't break test fixtures (#43641 ) Use a dedicated fixture dir.	2019-07-08 17:58:27 +03:00
Mark Tozzi	299a52c17d	Enable validating user-supplied missing values on unmapped fields (#43718 ) (#43940 ) Provides a hook for aggregations to introspect the `ValuesSourceType` for a user supplied Missing value on an unmapped field, when the type would otherwise be `ANY`. Mapped field behavior is unchanged, and still applies the `ValuesSourceType` of the field. This PR just provides the hook for doing this, no existing aggregations have their behavior changed.	2019-07-08 10:46:23 -04:00
James Rodewig	4390d4a8af	[DOCS] Clarify array is not a field datatype (#43931 )	2019-07-08 08:58:10 -04:00
Armin Braun	2918363e90	Simplify BlobStoreRepository (Flatten Nested Classes) (#42833 ) (#44060 ) * In the current codebase it is hardly obvious what code operates on a shard and is run by a datanode what code operates on the global metadata and is run on master * Fixed by adjusting the method names accordingly * The nested context classes don't add much if any value, they simply spread out the parameters that go into a shard snapshot create or delete all over the place since their constructors can be inlined in all spots * Fixed by flattening the nested classes into BlobStoreRepository * Also: * Inlined the other single use inner classes	2019-07-08 14:57:27 +02:00
Armin Braun	afe81fd625	Some Cleanup in Test Framework (#44039 ) (#44059 ) * Remove some obvious dead code * Move assert methods that were only used in a single test class to the child they belong to * Inline some redundant methods	2019-07-08 14:15:31 +02:00
David Kyle	5fc12917c3	Data frame task failure does not make a 500 response (#44058 ) Data frame task responses had logic to return a HTTP 500 status code if there was any node or task failures even if other tasks in the same request reported correctly. This is different to how other task responses are handled where a 200 is always returned leaving the client should check for failures. Returning a 500 also breaks the high level rest client so always return a 200 Closes #44011	2019-07-08 11:53:11 +01:00
David Turner	3f3bcb23c2	AwaitsFix testForceStaleReplicaToBePromotedToPrimary Relates #44049	2019-07-08 11:26:57 +01:00
David Turner	3129f5b42e	Do not copy initial recovery filter during split (#44053 ) If an index is the result of a shrink then it will have a value set for `index.routing.allocation.initial_recovery._id`. If this index is subsequently split then this value will be copied over, forcing the initial allocation of the split shards to occur on the node on which the shrink took place. Moreover if this node no longer exists then the split will fail. This commit suppresses the copying of this setting when splitting an index. Fixes #43955	2019-07-08 10:32:05 +01:00
Armin Braun	af9b98e81c	Recursively Delete Unreferenced Index Directories (#42189 ) (#44051 ) * Use ability to list child "folders" in the blob store to implement recursive delete on all stale index folders when cleaning up instead of using the diff between two `RepositoryData` instances to cover aborted deletes * Runs after ever delete operation * Relates #13159 (fixing most of this issues caused by unreferenced indices, leaving some meta files to be cleaned up only)	2019-07-08 10:55:39 +02:00
Przemyslaw Gomulka	247f2dabad	Fix decimal point parsing for date_optional_time backport(#43859 ) #44050 Joda allowed for date_optional_time and strict_date_optional_time a decimal point to be . dot or , comma For our java.time implementation we should also extend this for strict_date_optional_time-nanos the approach to fix this is the same as in iso8601 parser closes #43730	2019-07-08 09:56:01 +02:00
Armin Braun	2176d09c37	Provide an Option to Use Path-Style-Access with S3 Repo (#41966 ) (#44046 ) * Provide an Option to Use Path-Style-Access with S3 Repo * As discussed, added the option to use path style access back again and deprecated it. * Defaulted to `false` * Added warning to docs * Closes #41816	2019-07-08 08:10:01 +02:00
Ioannis Kakavas	9beb51fc44	Revert "Mute testEnableDisableBehaviour (#42929 )" This reverts commit 6ee578c6eb3f2a6164934720ee82e2456d7bb226.	2019-07-08 08:52:21 +03:00
Armin Braun	f6efc55556	Fix SnapshotResiliencyTest (#44015 ) (#44041 ) * Closes #43989	2019-07-07 19:59:16 +02:00
Armin Braun	990ac4ca83	Some Cleanup in BlobStoreRepository (#43323 ) (#44043 ) * Some Cleanup in BlobStoreRepository * Extracted from #42833: * Dry up index and shard path handling * Shorten XContent handling	2019-07-07 19:50:46 +02:00
Nhat Nguyen	9089820d8f	Enable indexing optimization using sequence numbers on replicas (#43616 ) This PR enables the indexing optimization using sequence numbers on replicas. With this optimization, indexing on replicas should be faster and use less memory as it can forgo the version lookup when possible. This change also deactivates the append-only optimization on replicas. Relates #34099	2019-07-05 22:12:08 -04:00
Dimitris Athanasiou	d3ddedf9fc	[7.x][ML] Add missing doc links to df-analytics rest spec and HLRC javadocs (#44025 ) (#44033 )	2019-07-06 02:03:29 +03:00
Mayya Sharipova	37e1ad7062	Forbid empty doc values on vector functions (#43944 ) Currently when a document misses a vector value, vector function returns 0 as a score for this document. We think this is incorrect behaviour. With this change, an error will be thrown if vector functions are used with docs that are missing vector doc values. Also VectorScriptDocValues is modified to allow size() function, which can be used to check if a document has a value for the vector field.	2019-07-05 18:09:06 -04:00
Dimitris Athanasiou	a1a62fded3	[7.x][ML] Stop df-analytics action request should filter tasks (#44016 ) (#44023 ) As a `BaseTasksRequest`, `StopDataFrameAnalyticsAction.Request` should implement a `match` method that makes sure only df-analytics tasks are applied.	2019-07-05 23:10:45 +03:00
Yannick Welsch	504a43d43a	Move ConnectionManager to async APIs (#42636 ) This commit converts the ConnectionManager's openConnection and connectToNode methods to async-style. This will allow us to not block threads anymore when opening connections. This PR also adapts the cluster coordination subsystem to make use of the new async APIs, allowing to remove some hacks in the test infrastructure that had to account for the previous synchronous nature of the connection APIs.	2019-07-05 20:40:22 +02:00
Nhat Nguyen	8bfe18477e	Clarify consequence of translog async setting (#44020 ) Relates #43915	2019-07-05 13:56:42 -04:00
lcawl	a831d4707c	[DOCS] Temporarily disables data frame API testing	2019-07-05 10:56:09 -07:00
Yannick Welsch	88783927d1	Weaken assertion in PublicationTransportHandler (#44014 ) These assertions do not hold true when a master fails during publication and quickly becomes master again, publishing a new cluster state in a higher term which races against the previous cluster state publication to self (which does not matter anyway). Relates #43994 Closes #44012	2019-07-05 18:27:42 +02:00
István Zoltán Szabó	5aeb736801	Merge branch '7.x' of github.com:elastic/elasticsearch into 7.x	2019-07-05 14:26:47 +02:00
István Zoltán Szabó	7242267f5d	[DOCS] Adds data frame analytics APIs to the ML APIs (#43875 ) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool.	2019-07-05 14:25:54 +02:00
Akshesh Doshi	01b982fd31	Draw attention to transport layer in remote cluster docs (#43883 ) Closes #43858	2019-07-05 13:44:36 +02:00
Yannick Welsch	1220ff5b6d	Publish to self through transport (#43994 ) This commit ensures that cluster state publications to self also go through the transport layer. This allows voting-only nodes to intercept the publication to self. Fixes an issue discovered by a test failure where a voting-only node, which was the only bootstrapped node, would not step down as master after state transfer because publishing to self would succeed. Closes #43631	2019-07-05 13:00:52 +02:00
Yannick Welsch	5cdf3ff3fa	Revert "[TEST] Mute RemoteClusterServiceTests.testCollectNodes" This reverts commit d8a2970fa40d34676242d520502fcd00a2a2fbfa.	2019-07-05 11:02:42 +02:00

1 2 3 4 5 ...

46622 Commits