OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tim Brooks	e63d52af63	Move page size constants to PageCacheRecycler (#36524 ) `PageCacheRecycler` is the class that creates and holds pages of arrays for various uses. `BigArrays` is just one user of these pages. This commit moves the constants that define the page sizes for the recycler to be on the recycler class.	2018-12-12 07:00:50 -07:00
Alpar Torok	c00d0fc814	Test fixtures improovements (#36037 ) * Upgrae plugin to latest and expose udp * Explicit check for windows * Rename the properties for the port numbers * Tasks for pre and pos container actions	2018-12-12 12:00:47 +02:00
Nik Everett	03daad9812	Re-deprecate xpack rollup endpoints (#36451 ) Redeprecates the `/_xpack/rollup` endpoints in favor of `/_rollup`. When we cleanup the rollup in a cluster containing 6.x nodes we need to use `/_xpack/rollup` instead of `/_rollup` because the 6.x nodes don't know about `/_rollup`. In those cases we must ignore the deprecation warnings that the 7.0 node will return for the end point. Closes #36044	2018-12-11 19:43:17 -05:00
Tim Brooks	797f985067	Add version to handshake requests (#36171 ) Currently our handshake requests do not include a version. This is unfortunate as we cannot rely on the stream version since it is not the sending node's version. Instead it is the minimum compatibility version. The handshake request is currently empty and we do nothing with it. This should allow us to add data to the request without breaking backwards compatibility. This commit adds the version to the handshake request. Additionally, it allows "future data" to be added to the request. This allows nodes to craft a version compatible response. And will properly handle additional data in future handshake requests. The proper handling of "future data" is useful as this is the only request where we do not know the other node's version. Finally, it renames the TcpTransportHandshaker to TransportHandshaker.	2018-12-11 16:09:28 -07:00
Mayya Sharipova	2f18325384	Deprecate types in update_by_query and delete_by_query (#36365 ) Relates to #35190	2018-12-11 17:09:59 -05:00
Tim Brooks	790f8102e9	Modify `BigArrays` to take name of circuit breaker (#36461 ) This commit modifies BigArrays to take a circuit breaker name and the circuit breaking service. The default instance of BigArrays that is passed around everywhere always uses the request breaker. At the network level, we want to be using the inflight request breaker. So this change will allow that. Additionally, as this change moves away from a single instance of BigArrays, the class is modified to not be a Releasable anymore. Releasing big arrays was always dispatching to the PageCacheRecycler, so this change makes the PageCacheRecycler the class that needs to be managed and torn-down. Finally, this commit closes #31435 be making the serialization of transport messages use the inflight request breaker. With this change, we no longer push the global BigArrays instnace to the network level.	2018-12-11 11:55:41 -07:00
markharwood	a9eccbcd02	Tests- added helper methods to ESRestTestCase for checking warnings (#36443 ) Added helper methods to ESRestTestCase for checking warnings in mixed and current-version-only clusters. This is supported by a new VersionSpecificWarningsHandler class with associated unit test. Closes #36251	2018-12-11 17:30:15 +00:00
Andrey Ershov	8b821706cc	Switch more tests to zen2 (#36367 ) 1. CCR tests work without any changes 2. `testDanglingIndices` require changes the source code (added TODO). 3. `testIndexDeletionWhenNodeRejoins` because it's using just two nodes, adding the node to exclusions is needed on restart. 4. `testCorruptTranslogTruncationOfReplica` starts dedicated master one, because otherwise, the cluster does not form, if nodes are stopped and one node is started back. 5. `testResolvePath` needs TEST cluster, because all nodes are stopped at the end of the test and it's not possible to perform checks needed by SUITE cluster. 6. `SnapshotDisruptionIT`. Without changes, the test fails because Zen2 retries snapshot creation as soon as network partition heals. This results into the race between creating snapshot and test cleanup logic (deleting index). Zen1 on the other hand, also schedules retry, but it takes some time after network partition heals, so cleanup logic executes latter and test passes. The check that snapshot is eventually created is added to the end of the test.	2018-12-11 17:12:17 +01:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Jernej Klancic	d615add1b1	Add pipeline parent validation for auto date histogram (#35670 ) Allow `auto_date_histogram` as a valid parent agg for derivative, cumulative sum, moving average, moving function and serial differencing pipeline aggregations. Since all these aggs share the same requirement (sequentially ordered parent aggs), this commit also refactors to share the same validation code so that any newly added aggs won't be forgotten. Closes #35578	2018-12-10 16:02:49 -05:00
Jim Ferenczi	75392adf60	[TEST] Convert SearchHitsTests to AbstractStreamableXContentTestCase (#36313 ) This change adds a way to provide the content type of the rest serialization tests when creating random instances. This is used by SearchHitsTests to ensure that the internal members of the class are created with the same xContentType and that equals can be used to compare an instances created from an XContent view.	2018-12-10 20:41:20 +01:00
Nik Everett	9626e700ce	LLRC: Make warning behavior pluggable per request (#36345 ) This allows you to plug the behavior that the LLRC uses to handle warnings on a per request basis. We entertained the idea of allowing you to set the warnings behavior to strict mode on a per request basis but that wouldn't allow the high level rest client to fail when it sees an unexpected warning. We also entertained the idea of adding a list of "required warnings" to the `RequestOptions` but that won't work well with failures that occur sometimes like those we see in mixed clusters. Adding a list of "allowed warnings" to the `RequestOptions` would work for mixed clusters but it'd leave many of the assertions in our tests weaker than we'd like. This behavior plugging implementation allows us to make a "required warnings" option when we need it and an "allowed warnings" behavior when we need it. I don't think this behavior is going to be commonly used by used outside of the Elasticsearch build, but I expect they'll be a few commendably paranoid folks who could use this behavior.	2018-12-10 08:32:00 -05:00
David Turner	9f86e996fe	[Zen2] Support rolling upgrades from Zen1 (#35737 ) We support rolling upgrades from Zen1 by keeping the master as a Zen1 node until there are no more Zen1 nodes in the cluster, using the following principles: - Zen1 nodes will never vote for Zen2 nodes - Zen2 nodes will, while not bootstrapped, vote for Zen1 nodes - Zen2 nodes that were previously part of a mixed cluster will automatically (and unsafely) bootstrap themselves when the last Zen1 node leaves.	2018-12-08 07:33:35 +00:00
Tim Brooks	8a53f2b464	Implement basic `CcrRepository` restore (#36287 ) This is related to #35975. It implements a basic restore functionality for the CcrRepository. When the restore process is kicked off, it configures the new index as expected for a follower index. This means that the index has a different uuid, the version is not incremented, and the Ccr metadata is installed. When the restore shard method is called, an empty shard is initialized.	2018-12-07 15:27:04 -07:00
Tim Brooks	5556204f81	Use MockNioTransport in MockTransportService (#36346 ) The default transport used in the MockTransportService is the MockTcpTransport. This commit changes that to be the MockNioTransport.	2018-12-07 11:17:11 -07:00
Nhat Nguyen	f2df0a5be4	Remove LocalCheckpointTracker#resetCheckpoint (#34667 ) In #34474, we added a new assertion to ensure that the LocalCheckpointTracker is always consistent with Lucene index. However, we reset LocalCheckpoinTracker in testDedupByPrimaryTerm cause this assertion to be violated. This commit removes resetCheckpoint from LocalCheckpointTracker and rewrites testDedupByPrimaryTerm without resetting the local checkpoint. Relates #34474	2018-12-07 12:22:20 -05:00
David Turner	ed1c5a0241	Introduce `zen2` discovery type (#36298 ) With this change it is now possible to start a node running Zen2.	2018-12-06 16:20:08 +00:00
David Turner	38ab15c6fb	Avoid shutting down the only master (#36272 ) Today the InternalTestClusterTests sometimes set up a cluster with a single master, start some other ndoes, shut the original master down, and then reset the cluster. This doesn't really work, because the original master may be stale. This change avoids shutting down the only master in this situation.	2018-12-06 08:27:38 +01:00
Yannick Welsch	a0ae1cc987	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 23:13:12 +01:00
Yannick Welsch	03d0ea91ef	Zen2: Rename tombstones to exclusions (#36226 ) Renames the withdrawal / tombstones APIs to voting configuration exclusions.	2018-12-05 23:12:28 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Yannick Welsch	b20497560c	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 14:06:38 +01:00
Yannick Welsch	0b9efff5cb	Zen2: Persist cluster states the old way on non-master-eligible nodes (#36247 ) The shard deletion logic (triggered by IndicesStore), which also leads to index metadata deletion on non-master-eligible data nodes, currently races against the new cluster state persistence logic triggered by accepting cluster states. One thread is writing the index metadata while another one is deleting the index metadata, leading to exceptions and assertions tripping (see below). The solution proposed by this PR is to move the cluster state persistence of non-master-eligible nodes back to the cluster applier service, just as it used to be for Zen1. This ensures that the index metadata deletion logic, which is triggered by the shard deletion logic, runs on the same thread on which we persist the cluster state.	2018-12-05 14:04:45 +01:00
Alpar Torok	60e45cd81d	Testing conventions task part 2 (#36107 ) Closes #35435 - make it easier to add additional testing tasks with the proper configuration and add some where they were missing. - mute or fix failing tests - add a check as part of testing conventions to find classes not included in any testing task.	2018-12-05 14:20:01 +02:00
Yannick Welsch	70c361ea5a	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 21:26:11 +01:00
Adrien Grand	0df08dd458	Set Lucene version upon index creation. (#36038 ) It is important that all shards of a given index have the same `indexCreatedVersionMajor` to Lucene, or eg. merging those shards is going to be considered illegal. At the moment, we use the latest Lucene version when creating a shard, which could cause shards to have different created versions eg. in case of forced allocation. This commit makes sure to reuse the appropriate Lucene version in order to avoid such issues. Closes #33826	2018-12-04 17:53:20 +01:00
Nhat Nguyen	b59deb573e	Always set soft-deletes field of IndexWriterConfig (#36196 ) Today we configure the soft-deletes field iff soft-deletes enabled. Although this choice was correct, it prevents an engine with soft-deletes disabled from opening a Lucene index with soft-deletes. Moreover, this change should not have any side-effect if a Lucene index does not have any soft-deletes. Relates #36141	2018-12-04 11:15:34 -05:00
Andrey Ershov	35e3d77e2c	[Zen2] Implement state recovery (#36013 ) This commit implements proper metadata recovery for Zen2. GatewayService is responsible for the recovery. In Zen1 GatewayService creates an instance of Gateway, that is used to reach out to other cluster nodes, get their state and calculate the most up-to-date state based on versions. After that Gateway performs upgrade and archival of ClusterSettings and closes bad indices. Then recovered state is passed to GatewayService.GatewayRecoveryListener that mixes up current state and restored state, removes state not recovered block, creates the routing table and performs re-routing. In Zen2 we should perform this kind of logic on cluster startup, except mixing state (because there is nothing to mix) and opening routing table. This commit refactors out all `ClusterUpdate` functions in a separate class `ClusterStateUpdaters`, which is used by `Gateway` and `GatewayService` in case of Zen1, and by `GatewayMetaState` and `GatewayService` in case of Zen2. This commit also switches all integration tests that are already using Zen2 from InMemoryPersistedState to GatewayMetaState.	2018-12-04 14:45:45 +01:00
Yannick Welsch	80ee7943c9	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 09:37:09 +01:00
Alpar Torok	d036e0ca89	Testclusters: implement starting, waiting for and stopping single cluster nodes (#35599 )	2018-12-04 10:16:51 +02:00
David Turner	034c7655b7	[Zen2] Reduce cluster scope in NodeDisconnectIT (#36168 ) This test suite can stop all the shared master-eligible nodes, which breaks the cluster since any non-shared master-eligible nodes are stopped first in the reset process between tests. Since this test suite can leave the cluster in this somewhat broken state, it seems best that it uses a new cluster for each test.	2018-12-04 07:48:56 +00:00
Armin Braun	433a506d06	SNAPSHOT: Improve Resilience SnapshotShardService (#36113 ) * Resolve the index in the snapshotting thread * Added test for routing table - snapshot state mismatch	2018-12-03 16:39:29 +01:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354 ) When building a query Lucene distinguishes two cases, queries that require to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in es that require this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context.. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
David Turner	8011438ea8	Use correct source of randomness This fixes a failure of InternalTestClusterTests#testBeforeTest which checks that the cluster is set up the same when starting from the same seed. Trappily, using ESTestCase#randomIntBetween() is no good, we have to use InternalTestCluster#random via RandomNumbers#randomIntBetween() instead.	2018-12-02 09:39:43 +00:00
David Turner	8191348d6b	[Zen2] Only bootstrap a single node (#36119 ) Today, we allow all nodes in an integration test to bootstrap. However this seems to lead to test failures due to post-election instability. The change avoids this instability by only bootstrapping a single node in the cluster.	2018-12-01 06:43:11 +00:00
Luca Cavanna	0ebc17743a	Histogram aggs: add empty buckets only in the final reduce step (#35921 ) Empty buckets don't need to be added when performing an incremental reduction step, they can be added later in the final reduction step. This will allow us to later remove the max buckets limit when performing non final reduction.	2018-11-30 20:33:09 +01:00
Tim Brooks	ea7ea51050	Make `TcpTransport#openConnection` fully async (#36095 ) This is a follow-up to #35144. That commit made the underlying connection opening process in TcpTransport asynchronous. However the method still blocked on the process being complete before returning. This commit moves the blocking to the ConnectionManager level. This is another step towards the top-level TransportService api being async.	2018-11-30 11:30:42 -07:00
Tim Brooks	26dcbcc8cc	Remove `MockTcpTransport` for ESIntegTestCase (#36089 ) This commit removes the `MockTcpTransport` as a transport option for `ESIntegTestCase`. It is the first step in replacing the usages of `MockTcpTransport` with `MockNioTransport`.	2018-11-30 09:04:51 -07:00
Adrien Grand	fa3d365ee8	Fix CompositeBytesReference#slice to not throw AIOOBE with legal offsets. (#35955 ) CompositeBytesReference#slice has two bugs: - One that makes it fail if the reference is empty and an empty slice is created, this is #35950 and is fixed by special-casing empty-slices. - One performance bug that makes it always create a composite slice when creating a slice that ends on a boundary, this is fixed by computing `limit` as the index of the sub reference that holds the last element rather than the next element after the slice. Closes #35950	2018-11-30 10:32:46 +01:00
Zachary Tong	61c2db5ebb	Revert "Deprecate X-Pack centric rollup endpoints (#35962 )" This reverts commit `b84f1f6a3a`.	2018-11-29 12:58:23 -05:00
Zachary Tong	40c5445480	Revert "[TEST] Use deprecated form of rollup endpoint in mixed cluster (#36000 )" This reverts commit `85cdf4f913`.	2018-11-29 12:56:25 -05:00
Tim Brooks	c305f9dc03	Make keepalive pings bidirectional and optimizable (#35441 ) This is related to #34405 and a follow-up to #34753. It makes a number of changes to our current keepalive pings. The ping interval configuration is moved to the ConnectionProfile. The server channel now responds to pings. This makes the keepalive pings bidirectional. On the client-side, the pings can now be optimized away. What this means is that if the channel has received a message or sent a message since the last pinging round, the ping is not sent for this round.	2018-11-29 08:55:53 -07:00
Zachary Tong	85cdf4f913	[TEST] Use deprecated form of rollup endpoint in mixed cluster (#36000 ) When wiping rollup jobs, if we are in a mixed cluster with < v7.0 nodes we need to fall back to the deprecated endpoint because we may talk to a 6.x node.	2018-11-29 07:37:33 -05:00
David Turner	7f257187af	[Zen2] Update default for USE_ZEN2 to true (#35998 ) Today the default for USE_ZEN2 is false and it is overridden in many places. By defaulting it to true we can be sure that the only places in which Zen2 does not work are those in which it is explicitly set to false.	2018-11-29 12:18:35 +00:00
Jason Tedor	b84f1f6a3a	Deprecate X-Pack centric rollup endpoints (#35962 ) This commit is part of our plan to deprecate and ultimately remove the use of _xpack in the REST APIs.	2018-11-27 20:34:17 -05:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Andrey Ershov	0e283f9670	[Zen2] PersistedState interface implementation (#35819 ) Today GatewayMetaState is capable of atomically storing MetaData to disk. We've also moved fields that are needed to be persisted in Zen2 from ClusterState to ClusterState.MetaData.CoordinationMetaData. This commit implements PersistedState interface. version and currentTerm are persisted as a part of Manifest. GatewayMetaState now implements both ClusterStateApplier and PersistedState interfaces. We started with two descendants Zen1GatewayMetaState and Zen2GatewayMetaState, but it turned out to be not easy to glue it. GatewayMetaState now constructs previousClusterState (including MetaData) and previousManifest inside the constructor so that all PersistedState methods are usable as soon as GatewayMetaState instance is constructed. Also, loadMetaData is renamed to getMetaData, because it just returns previousClusterState.metaData(). Sadly, we don't have access to localNode (obtained from TransportService in the constructor, so getLastAcceptedState should be called, after setLocalNode method is invoked. Currently, when deciding whether to write IndexMetaData to disk, we're comparing current IndexMetaData version and received IndexMetaData version. This is not safe in Zen2 if the term has changed. So updateClusterState now accepts incremental write method parameter. When it's set to false, we always write IndexMetaData to disk. Things that are not covered by GatewayMetaStateTests are covered by GatewayMetaStatePersistedStateTests. This commit also adds an option to use GatewayMetaState instead of InMemoryPersistedState in TestZenDiscovery. However, by default InMemoryPersistedState is used and only one test in PersistedStateIT used GatewayMetaState. In order to use it for other tests, proper state recovery should be implemented.	2018-11-27 15:04:52 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Andrey Ershov	a056bd8c1c	[Zen2] Move ClusterState fields to be persisted to ClusterState.MetaData (#35625 ) Today we have a way to atomically persist global MetaData and IndexMetaData to disk when new ClusterState is received. All other ClusterState fields are not persisted. However, there are other parts of ClusterState that should be persisted, namely: version term lastCommittedConfiguration lastAcceptedConfiguration votingTombstones version is changed frequently, other fields are not. We decided to group term, lastCommittedConfiguration, lastAcceptedConfiguration and votingTombstones into CoordinationMetaData class and make CoordinationMetaData a field inside MetaData. MetaData.toXContent and MetaData.fromXContent should take care of CoordinationMetaData. version stays as a top level field in ClusterState and will be persisted as part of Manifest in a follow-up commit. Also MetaData.isGlobalStateEquals should be extended to include coordinationMetaData in comparison. This commit favors exposing getters, such as getTerm directly in ClusterState to avoid massive code changes. An example of CoordinationMetaState.toXContent: { "term": 1, "last_committed_config": [ "TiIuBcbBtpuXyDDVHXeD", "ZIAoVbkjjLPLUuYLaTkw" ], "last_accepted_config": [ "OwkXbXZNOZPJqccdFHdz", "LouzsGYwmQzpeQMrboZe", "fCKGRZdjLTqzXAqPUtGL", "pLoxshjpJXwDhbgjfYJy", "SjINLwFIlIEFZCbjrSFo", "MDkVncJEVyZLJktopWje" ] }	2018-11-21 17:03:26 +01:00
Andrey Ershov	6ac0cb1842	Merge branch master into zen2 2 types of conflicts during the merge: 1) Line length fix 2) Classes no longer extend AbstractComponent	2018-11-21 15:36:49 +01:00
Yannick Welsch	8939a7894f	Zen2: Move disruption tests to Zen2 (#35724 ) - Moves disruption tests to Zen2 - Registers a few missing settings - Removes .put(TestZenDiscovery.USE_ZEN2.getKey(), true) from tests where Zen2 is now enabled by default through the parent test class - Moves QuorumGatewayIT back to Zen1, as it is not stable with Zen2 as it currently relies on dangling indices due to the lack of proper CS persistence, which triggers secondary failures	2018-11-21 14:43:33 +01:00
Armin Braun	7a210342ab	TESTS: Remove Dead Code in Disruption Tests (#35768 ) * Neither this class nor the constructor are used anywhere	2018-11-21 10:33:50 +01:00
Christoph Büscher	5847f8379c	Move ScoreAccessor to test-framework (#35766 ) This class is only used by RandomScoreFunctionIT and the MockScriptEngine, so it shouldn't be part of the server codebase.	2018-11-21 10:28:31 +01:00
Armin Braun	33c713ba60	TESTS: More Logging in LongGcDisruptionTests (#35702 ) * The existing logging is not helpful enough to track down which threads hang, we need the hanging thread's stacktraces too * Relates #35686	2018-11-20 15:36:01 +01:00
Alpar Torok	8659af68e0	Auto skip license headers on no source (#35640 ) * Unmute BuildExamplePluginsIT * Skip licenseHeaders when there are no sources	2018-11-20 13:02:33 +02:00
Simon Willnauer	29ef442841	Add a `_freeze` / `_unfreeze` API (#35592 ) This commit adds a rest endpoint for freezing and unfreezing an index. Among other cleanups mainly fixing an issue accessing package private APIs from a plugin that got caught by integration tests this change also adds documentation for frozen indices. Note: frozen indices are marked as `beta` and available as a basic feature. Relates to #34352	2018-11-20 08:03:24 +01:00
Yannick Welsch	47ada69c46	Zen2: Move most integration tests to Zen2 (#35678 ) Zen2 is now feature-complete enough to run most ESIntegTestCase tests. The changes in this PR are as follows: - ClusterSettingsIT is adapted to not be Zen1 specific anymore (it was using Zen1 settings). - Some of the integration tests require persistent storage of the cluster state, which is not fully implemented yet (see #33958). These tests keep running with Zen1 for now but will be switched over as soon as that is fully implemented. - Some very few integration tests are not running yet with Zen2 for other reasons, depending on some of the other open points in #32006.	2018-11-19 21:15:29 +01:00
Gordon Brown	b2057138a7	Remove AbstractComponent from AbstractLifecycleComponent (#35560 ) AbstractLifecycleComponent now no longer extends AbstractComponent. In order to accomplish this, many, many classes now instantiate their own logger.	2018-11-19 09:51:32 -07:00
Arthur Gavlyukovskiy	022726011c	Remove use of AbstractComponent in server (#35444 ) Removed extending of AbstractComponent and changed logger usage to explicit declaration. Abstract classes still have logger declaration using this.getClass() in order to show implementation class name in its logs. See #34488	2018-11-16 16:10:32 -05:00
Jernej Klancic	baf33b3162	Removes AbstractComponent from several classes (#35566 ) Removes inhertiting from AbstractComponent for some classes (mostly in the plugins module). Relates to #34488	2018-11-16 20:50:18 +01:00
Lee Hinman	ce35d049e9	[TEST] Fix ClusterApplierServiceTests.testClusterStateUpdateLogging This changes the test to not use a `CountDownlatch`, instead adding an assertion for the final logging message and waiting until the `MockAppender` has seen it before proceeding. Resolves #23739	2018-11-15 14:15:23 -07:00
David Turner	86ef041539	[Zen2] Introduce ClusterBootstrapService (#35488 ) Today, the bootstrapping of a Zen2 cluster is driven externally, requiring something else to wait for discovery to converge and then to inject the initial configuration. This is hard to use in some situations, such as REST tests. This change introduces the `ClusterBootstrapService` which brings the bootstrap retry logic within each node and allows it to be controlled via an (unsafe) node setting.	2018-11-15 20:09:22 +00:00
Tanguy Leroux	c9b4ef0dfd	Use RunOnce when appropriate (#35553 ) This pull request replaces some blocks of code that must be run once and that are currently based on AtomicBoolean by the convient RunOnce class added in #35489.	2018-11-15 09:24:40 +01:00
Andrey Ershov	045fdd0d3b	Merge master into zen2	2018-11-14 15:37:13 +03:00
Yannick Welsch	4cfdb0609e	Adapt InternalCluster#fullRestart to call onNodeStopped when all nodes are stopped (#35494 ) Refactors and simplifies the logic around stopping nodes, making sure that for a full cluster restart onNodeStopped is only called after the nodes are actually all stopped (and in particular not while starting up some nodes again). This change also ensures that a closed node client is not being used anymore (which required a small change to a test). Relates to #35049	2018-11-14 13:24:56 +01:00
Zachary Tong	c346a0f027	[Rollup] Add `wait_for_completion` option to StopRollupJob API (#34811 ) This adds a `wait_for_completion` flag which allows the user to block the Stop API until the task has actually moved to a stopped state, instead of returning immediately. If the flag is set, a `timeout` parameter can be specified to determine how long (at max) to block the API call. If unspecified, the timeout is 30s. If the timeout is exceeded before the job moves to STOPPED, a timeout exception is thrown. Note: this is just signifying that the API call itself timed out. The job will remain in STOPPING and evenutally flip over to STOPPED in the background. If the user asks the API to block, we move over the the generic threadpool so that we don't hold up a networking thread.	2018-11-13 16:37:17 -05:00
Julie Tibshirani	bc799e4a6f	Ignore warnings related to types deprecation in REST tests. (#35395 )	2018-11-13 11:56:01 -08:00
David Turner	8e40a2bbe2	[Zen2] Introduce vote withdrawal (#35446 ) If shutting down half or more of the master-eligible nodes, their votes must first be explicitly withdrawn to ensure that the cluster doesn't lose its quorum. This works via _voting tombstones_, stored in the cluster state, which tell the reconfigurator to remove nodes from the voting configuration. This change introduces voting tombstones to the cluster state, together with transport APIs for adding and removing them, and makes use of these APIs in `InternalTestCluster` to support tests which remove at least half of the master-eligible nodes at once (e.g. shrinking from two master-eligible nodes to one).	2018-11-13 19:32:32 +00:00
David Turner	fbd3cab410	[Zen2] Remove AbstractComponent usage (#35483 ) AbstractComponent was deprecated in #35140 and is looking like it will be removed at some point by #34888. Today all it does is provide a logger. This change removes the usages of AbstractComponent that live solely in the zen2 feature branch to avoid some future merge pain, and replaces it where necessary with some directly-created loggers.	2018-11-13 15:20:49 +00:00
Yannick Welsch	fe29b18c26	Fix compilation	2018-11-12 11:05:11 +01:00
Yannick Welsch	4e6c58c942	Merge remote-tracking branch 'elastic/master' into zen2	2018-11-12 10:03:59 +01:00
Tim Brooks	ba478827ad	Improve MockTcpTransport memory usage (#35402 ) The MockTcpTransport is not friendly in regards to memory usage. It must allocate multiple byte arrays for every message. This improves the memory situation by failing fast if the message is improperly formatted. Additionally, it uses reusable big arrays for at least half of the allocated byte arrays.	2018-11-09 10:12:49 -07:00
Jim Ferenczi	7054e289fa	Add trace log of the request for the query and fetch phases (#34479 ) This change adds a logger for the query and fetch phases that prints all requests before their execution at the trace level. This will help debugging cases where an issue occurs during the execution since only completed queries are logged by the slow logs.	2018-11-09 09:41:51 +01:00
Tim Brooks	93c2c604e5	Move compression config to ConnectionProfile (#35357 ) This is related to #34483. It introduces a namespaced setting for compression that allows users to configure compression on a per remote cluster basis. The transport.tcp.compress remains as a fallback setting. If transport.tcp.compress is set to true, then all requests and responses are compressed. If it is set to false, only requests to clusters based on the cluster.remote.cluster_name.transport.compress setting are compressed. However, after this change regardless of any local settings, responses will be compressed if the request that is received was compressed.	2018-11-08 10:37:59 -07:00
Yannick Welsch	c315ead0ac	Zen2: Add diff-based publishing (#35290 ) Enables diff-based publishing, which is an optimization where only the changing parts of the cluster state are published to the nodes in the cluster, falling back to full cluster state publishing if the receiver does not have the previous cluster state.	2018-11-08 17:16:09 +01:00
David Turner	6885a7cb0f	Introduce transport API for cluster bootstrapping (#34961 ) - Introduces a transport API for bootstrapping a Zen2 cluster - Introduces a transport API for requesting the set of nodes that a master-eligible node has discovered and for waiting until this comprises the expected number of nodes. - Alters ESIntegTestCase to use these APIs when forming a cluster, rather than injecting the initial configuration directly.	2018-11-08 16:09:37 +00:00
Zachary Tong	54b445d74b	[Test] Remove obsolete job/cluster cleanup code Also makes sure the awaitBusy for job stoppage is checked, so that we can fail if we timed out waiting for a job to stop. Closes #35295	2018-11-08 10:23:23 -05:00
David Turner	77789a733d	Merge branch 'master' into 2018-11-08-merge-master	2018-11-08 13:38:18 +00:00
Simon Willnauer	0cc0fd2d15	Add a frozen engine implementation (#34357 ) This change adds a `frozen` engine that allows lazily open a directory reader on a read-only shard. The engine wraps general purpose searchers in a LazyDirectoryReader that also allows to release and reset the underlying index readers after any and before secondary search phases. Relates to #34352	2018-11-07 20:23:35 +01:00
Alpar Torok	8a85b2eada	Remove build qualifier from server's Version (#35172 ) With this change, `Version` no longer carries information about the qualifier, we still need a way to show the "display version" that does have both qualifier and snapshot. This is now stored by the build and red from `META-INF`.	2018-11-07 14:01:05 +02:00
Tim Brooks	f395b1eace	Open node connections asynchronously (#35144 ) This is related to #29023. Additionally at other points we have discussed a preference for removing the need to unnecessarily block threads for opening new node connections. This commit lays the groudwork for this by opening connections asynchronously at the transport level. We still block, however, this work will make it possible to eventually remove all blocking on new connections out of the TransportService and Transport.	2018-11-06 17:58:20 -07:00
David Turner	7e356ac29b	[Zen2] Introduce auto_shrink_voting_configuration setting (#35217 ) Today we allow the user to set the minimum size of a voting configuration. On reflection we would rather this was simply '3' where possible, and we can use the retirement API to control the removal of nodes more explicitly. This change replaces the old reconfigurator setting with a new one, `cluster.auto_shrink_voting_configuration`, which determines whether Elasticsearch should automatically remove nodes from the voting configuration or not.	2018-11-06 18:10:29 +00:00
Nick Knize	a5e1f4d3a2	Upgrade to lucene-8.0.0-snapshot-31d7dfe6b1 (#35224 )	2018-11-06 11:55:23 +01:00
David Turner	2fb3d1a465	[Zen2] Fix some rarely-failing tests (#35198 ) Recent changes have left a few Zen2 tests occasionally failing. This commit fixes them.	2018-11-05 21:54:53 +00:00
Boaz Leskes	28078642b3	Engine.newChangesSnapshot may cause unneeded refreshes if called concurrently (#35169 ) When the engine is asked for historical operations, we check if some of the requested operations are not yet refreshed and if so we refresh before returning the operations. The refresh check is based on capturing the local checkpoint before each refresh and comparing that value to the one requested when `newChangesSnapshot` was called. If the requested range is above the captured local checkpoint we issue a refresh. This can currently cause unneeded extra refreshes if the method is called concurrently which may cause unwanted degradation in indexing performance. This is especially relevant for CCR where we always ask for a range below the global checkpoint. That range is guaranteed to be below the local checkpoint of the shard and one refresh is enough to serve multiple changes requests. This commit fixes this by introducing a dedicated mutex to make sure the test for whether a refresh is needed actually wait for concurrents for concurrent refreshes that were caused by another change refresh. Note that this is not a big change in semantics as refreshes are serialized by lucene anyway. I also opted not to keep the synchronization to the changes snapshot request only even if in theory we can apply it to all refreshes, not matter where they come from.	2018-11-04 13:43:33 +01:00
Nhat Nguyen	855ab3fa1e	Add equals/hashCode to SeqNoStats (#35223 ) This commit adds equals/hashCode to SeqNoStats so we can verify it wholly in tests.	2018-11-02 21:31:36 -04:00
Tim Brooks	0166388d74	Use single netty event loop group for transports (#35181 ) Currently we create a new netty event loop group for client connections and all server profiles. Each new group creates new threads for io processing. This means 2 * num of processors new threads for each group. A single group should be able to handle all io processing (for the transports). This also brings the netty module inline with what we do for nio. Additionally, this PR renames the worker threads to be the same for netty and nio.	2018-11-02 16:31:19 -06:00
Colin Goodheart-Smithe	fc6e1f7f3f	Merge branch 'master' into index-lifecycle	2018-11-02 10:56:35 +00:00
Alpar Torok	f22700812e	Introduce build qualifier parameter (#35155 ) * Introduce property to set version qualifier - VersionProperties.elasticsearch is now a string which can have qualifier and snapshot too - The Version class in the build no longer cares about snapshot and qualifier.	2018-11-02 05:27:40 +02:00
Julie Tibshirani	746d94e299	Unmute AbstractQueryTestCase#testToQuery. The RangeQueryBuilderTests#testToQuery failures were fixed in #34868 and #35145.	2018-11-01 12:06:36 -07:00
Tal Levy	c3cf7dd305	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 10:13:02 -07:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Igor Motov	b5e5e93c46	Fixes randomDateTimeZone method (#35145 ) The randomDateTimeZone method shouldn't return deprecated timezones this causes some tests to fail with deprecation warning.	2018-10-31 20:32:18 -04:00
Seong-hyun, Oh	9ef4788c13	Make XContentBuilder in AliasActions build `is_write_index` field (#35071 ) Make XContentBuilder in AliasesActions build `is_write_index` field	2018-10-31 14:15:46 -07:00
Armin Braun	e6f9f0666e	NETWORKING: MockTransportService Wait for Close (#35038 ) * NETWORKING: MockTransportService Wait for Close * Make `MockTransportService` wait `30s` for close listeners to run before failing the assertion * Closes #34990	2018-10-31 21:33:49 +01:00
David Turner	0072c90e2a	Pre-populate unicast hosts files (#35136 ) Today when ESIntegTestCase starts some nodes it writes out the unicast hosts files each time a node starts its transport service. This does mean that a number of nodes can start and perform their first pinging round without any unicast hosts which, if the timing is unlucky and a lot of nodes are all started at the same time, can lead to a split brain as in #35052. Prior to #33554 this was unlikely to happen since the MockUncasedHostsProvider would always have yielded the existing hosts, so the timing would have to have been implausibly unlucky. Since #33554, however, it's more likely because the race occurs between the start of the first round of pinging and the writing of the unicast hosts file. It is realistic that new nodes will be configured with the existing nodes from startup, so this change reinstates that behaviour. Closes #35052.	2018-10-31 19:21:24 +00:00
Tal Levy	d5d28420b6	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-31 10:47:07 -07:00
Luca Cavanna	674225aaa1	[TEST] Enforce skip headers when needed (#34735 ) The java yaml test runner supports sending request headers, yet not all clients support headers. This commit makes sure that we enforce adding a skip section with feature "headers" whenever headers are used in a do section as part of a test. That decreases the chance for new tests to break client builds due to the missing skip section. Closes #34650	2018-10-31 13:07:02 +01:00
Tal Levy	5141084048	rename CRUD api REST path prefix _ilm to _ilm/policy (#35056 ) This PR renames the CRUD APIS for ILM GET _ilm/<policy>, _ilm -> _ilm/policy/<policy>, _ilm/policy PUT _ilm/<policy> -> _ilm/policy/<policy> DELETE _ilm/<policy> -> _ilm/policy/<policy> closes #34929.	2018-10-30 16:19:05 -07:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00
Ryan Ernst	512319cef7	Test: Filter out deprecated joda tzs in tests (#34868 ) This commit filters out usage of deprecated tzs by tests. These are tested separately and should not require checking for warnings on any test using random timezones. closes #34188	2018-10-30 11:15:34 -07:00
Tal Levy	18c72e86c5	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-30 08:09:57 -07:00
Luca Cavanna	7ef65dedc3	[TEST] improve validation of yaml suites (#34957 ) Validation of test sections and suites consists of checking that the proper skip features sections are in place depending on the features used in tests. The validation logic was previously only performed on do sections included in each test section, and the skip needed to be present in the same test section. What happens often though is that the skip is added to the setup section, or the teardown section. This commit improves the validation of test suites by validating setup and teardown section first, then looking at each test section while still eventually reading the skip section from setup or teardown. We are also making SkipSection, SetupSection, TearDownSection, ClientYamlTestSection and ClientYamlTestSuite immutable. Previously it was possible to utilize constants like SetupSection.EMPTY, which were modifiable and affect every other future users by modifiying them. This has been corrected. Also, validation has been improved to cumulate errors so that all the errors from a suite will be listed at once. Relates to #34735	2018-10-30 16:06:31 +01:00
Andy Bristol	b8280ea7cc	median absolute deviation agg (#34482 ) This commit adds a new single value metric aggregation that calculates the statistic called median absolute deviation, which is a measure of variability that works on more types of data than standard deviation Our calculation of MAD is approximated using t-digests. In the collect phase, we collect each value visited into a t-digest. In the reduce phase, we merge all value t-digests, then create a t-digest of deviations using the first t-digest's median and centroids	2018-10-30 07:22:52 -07:00
Andrey Ershov	97f74c5a38	Merge branch 'master' into 'zen2' Conflicts during the merge: 1. >=140 chars line length fixed for a lot of project files and warnings for those files are no longer suppressed 2. Node name is removed from AbstractComponent, it’s no longer taken from settings, but is explicitly passed as constructor argument and there were quite a few new classes on zen2 branch that require this change 3. TransportResponseHandler interface changed (new method added) and Zen2 makes a lot of subclasses in tests 4. Deprecated way of obtaining logger was changed	2018-10-30 14:39:48 +03:00
Przemyslaw Gomulka	995bf0ee66	Bulk Api support for global parameters (#34528 ) Bulk Request in High level rest client should be consistent with what is possible in Rest API, therefore should support global parameters. Global parameters are passed in URL in Rest API. Some parameters are mandatory - index, type - and would fail validation if not provided before before the bulk is executed. Optional parameters - routing, pipeline. The usage of these should be consistent across sync/async execution, bulk processor and BulkRequestBuilder closes #26026	2018-10-30 09:08:12 +01:00
Tal Levy	c9e4d26a53	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-29 14:03:55 -07:00
Pratik Sanglikar	f1135ef0ce	Core: Replace deprecated Loggers calls with LogManager. (#34691 ) Replace deprecated Loggers calls with LogManager. Relates to #32174	2018-10-29 15:52:30 -04:00
Nik Everett	b093116a1e	Logging: Drop another deprecated Loggers method (#34520 ) Drop a method from `Loggers` that we deprecated because it just delegated to `LogManager`.	2018-10-29 10:05:24 -04:00
Alpar Torok	baa144e844	Enforce a [skip] when using [contains] (#34840 ) Be friendly to other runners	2018-10-29 14:54:22 +02:00
Tal Levy	d8322ca069	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-26 12:46:21 -07:00
Nik Everett	10295b306d	Core: Drop nodeName from AbstractComponent (#34487 ) `AbstractComponent` is trouble because its name implies that everything should extend from it. It is useful, but maybe too broadly useful. The things it offers access too, the `Settings` instance for the entire server and a logger are nice to have around, but not really needed everywhere. The `Settings` instance especially adds a fair bit of ceremony to testing without any value. This removes the `nodeName` method from `AbstractComponent` so it is more clear where we actually need the node name.	2018-10-26 15:26:14 -04:00
Tal Levy	e737ea7d4a	remove old doc placeholder and migrate ilm docs to top-level (#34615 ) we are restructuring the docs, this migrates ILM docs outside of the x-pack doc structure.	2018-10-26 12:19:52 -07:00
Igor Motov	02a342eb8c	Tests: remove possibly unnecessary rollup job logging (#34883 ) It seems that this statement is a debug leftover since it currently adds an error message `{"jobs":[]}` after each successful REST test.	2018-10-26 14:23:10 -04:00
Jay Modi	a0279bc069	Responses can use Writeable.Reader interface (#34655 ) In order to remove Streamable from the codebase, Response objects need to be read using the Writeable.Reader interface which this change enables. This change enables the use of Writeable.Reader by adding the `Action#getResponseReader` method. The default implementation simply uses the existing `newResponse` method and the readFrom method. As responses are migrated to the Writeable.Reader interface, Action classes can be updated to throw an UnsupportedOperationException when `newResponse` is called and override the `getResponseReader` method. Relates #34389	2018-10-26 09:21:54 -06:00
Tal Levy	810cd46a30	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-25 14:35:33 -07:00
Nik Everett	59df6e8689	Test: Lookup node versions on rest test start (#34657 ) This is a forward port of a change made to clean up backwards compatibility for the rollup cleanups. It makes the version of each node available very early on in test execution. The 6.x version of the change used those versions to control the cleanup backwards compatibility but that isn't needed in this branch. But having the versions around is useful. So this makes them available. Closes #34629	2018-10-25 16:43:33 -04:00
Tim Brooks	cf9aff954e	Reduce channels in AbstractSimpleTransportTestCase (#34863 ) This is related to #30876. The AbstractSimpleTransportTestCase initiates many tcp connections. There are normally over 1,000 connections in TIME_WAIT at the end of the test. This is because every test opens at least two different transports that connect to each other with 13 channel connection profiles. This commit modifies the default connection profile used by this test to 6. One connection for each type, except for REG which gets 2 connections.	2018-10-25 13:37:49 -06:00
Lee Hinman	3e7042832a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-25 11:00:36 -06:00
lipsill	185c06bb7f	Logging: tests: clean up logging (#34606 ) Replace internal deprecated calls to `Loggers.getLogger(Class)` with direct calls to log4j `LogManager.getLogger(Class)`	2018-10-25 09:52:41 -04:00
Alpar Torok	59536966c2	Add a new "contains" feature (#34738 ) The contains syntax was added in #30874 but the skips were not properly put in place. The java runner has the feature so the tests will run as part of the build, but language clients will be able to support it at their own pace.	2018-10-25 08:50:50 +03:00
Ryan Ernst	687dc1eb11	Scripting: Remove SearchScript (#34730 ) This commit removes the last non context based script class.	2018-10-24 15:03:38 -07:00
Luca Cavanna	d51bc05dce	[TEST] Improve validation of do sections (#34734 ) We throw parsing exception when an unknown array is found, but we don't when an unknown top-level field is found. This commit makes sure that unsupported top-level fields are not ignored in a do section. Closes #34651	2018-10-24 21:27:07 +02:00
lipsill	d5ad3de42e	[test] Introduce strict deprecation mode for REST tests (#34338 ) #33708 introduced a strict deprecation mode that makes a REST request fail if there is a warning header in the response returned by Elasticsearch (usually a deprecation message signaling that a feature or a field has been deprecated). This change adds the strict deprecation mode into the REST integration tests, and makes the tests fail if a deprecated feature is used. Also any test using a deprecated feature has been modified to pass the build. The YAML integration tests already analyzed HTTP warnings so they do not use this mode, keeping their "expected vs actual" behavior.	2018-10-24 08:21:24 -04:00
Nhat Nguyen	52266d8b11	TEST: Clone replicas list when compute replication targets (#34728 ) In #34407, we supposed to clone the list of replicas of ReplicationGroup when computing replication targets, but somehow we missed it. If we don't clone the list, a WriteReplicationAction may use an old ReplicationTargets which consists replicas which are removed from the current list of replicas Relates #34407 Closes #33457	2018-10-23 21:08:34 -04:00
Zachary Tong	299d044bfc	Collapse pipeline aggs into single package (#34658 ) - Restrict visibility of Aggregators and Factories - Move PipelineAggregatorBuilders up a level so it is consistent with AggregatorBuilders - Checkstyle line length fixes for a few classes - Minor odds/ends (swapping to method references, formatting, etc)	2018-10-23 16:01:01 -04:00
Tal Levy	62ac2fa5ec	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-23 09:43:46 -07:00
Zachary Tong	4dbf498721	[Rollup] Job deletion should be invoked on the allocated task (#34574 ) We should delete a job by directly talking to the allocated task and telling it to shutdown. Today we shut down a job via the persistent task framework. This is not ideal because, while the job has been removed from the persistent task CS, the allocated task continues to live until it gets the shutdown message. This means a user can delete a job, immediately delete the rollup index, and then see new documents appear in the just-deleted index. This happens because the indexer in the allocated task is still running and indexes a few more documents before getting the shutdown command. In this PR, the transport action is changed to a TransportTasksAction, and we invoke onCancelled() directly on the matching job. The race condition still exists after this PR (albeit less likely), but this was a precursor to fixing the issue and a self-contained chunk of code. A second PR will followup to fix the race itself.	2018-10-23 12:23:22 -04:00
Tal Levy	67bfdb16ad	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-22 13:09:37 -07:00
Yannick Welsch	6d6ac74a08	Zen2: Fail fast on disconnects (#34503 ) Integrates the failure detectors with the Connection lifecycle, to fail nodes as soon as: - a leader detects one of his followers disconnecting. - a follower detects its leader disconnecting.	2018-10-22 17:20:12 +02:00
Jason Tedor	243335e2ba	Allow set section in setup section of REST tests (#34678 ) This commit enables using a set section in the setup section of REST tests.	2018-10-22 11:14:27 -04:00
Jason Tedor	7af19b8f81	Migrate wait for pending tasks helper to server (#34675 ) In some of our X-Pack REST tests we have to wait for pending tasks to complete. We are now needing this functionality in ESRestTestCase for the docs tests where we run against X-Pack features. This commit moves the helper method that we have in X-Pack to ESRestTestCase, and removes duplicate logic from waiting for rollup tasks to complete.	2018-10-22 11:14:02 -04:00
Ryan Ernst	222652dfce	Scripting: Convert script fields to use script context (#34164 ) This commit removes the use of SearchScript for script fields and adds a new FieldScript.	2018-10-20 16:33:49 -07:00
David Turner	bfd24fc030	[Zen2] Reconfigure cluster as its membership changes (#34592 ) As master-eligible nodes join or leave the cluster we should give them votes or take them away, in order to maintain the optimal level of fault-tolerance in the system. #33924 introduced the `Reconfigurator` to calculate the optimal configuration of the cluster, and in this change we add the plumbing needed to actually perform the reconfigurations needed as the cluster grows or shrinks.	2018-10-19 19:24:54 +01:00
Nhat Nguyen	bd92a28cfc	CCR: Replicate existing ops with old term on follower (#34412 ) Since #34288, we might hit deadlock if the FollowTask has more fetchers than writers. This can happen in the following scenario: Suppose the leader has two operations [seq#0, seq#1]; the FollowTask has two fetchers and one writer. 1. The FollowTask issues two concurrent fetch requests: {from_seq_no: 0, num_ops:1} and {from_seq_no: 1, num_ops:1} to read seq#0 and seq#1 respectively. 2. The second request which fetches seq#1 completes before, and then it triggers a write request containing only seq#1. 3. The primary of a follower fails after it has replicated seq#1 to replicas. 4. Since the old primary did not respond, the FollowTask issues another write request containing seq#1 (resend the previous write request). 5. The new primary has seq#1 already; thus it won't replicate seq#1 to replicas but will wait for the global checkpoint to advance at least seq#1. The problem is that the FollowTask has only one writer and that writer is waiting for seq#0 which won't be delivered until the writer completed. This PR proposes to replicate existing operations with the old primary term (instead of the current term) on the follower. In particular, when the following primary detects that it has processed an process already, it will look up the term of an existing operation with the same seq_no in the Lucene index, then rewrite that operation with the old term before replicating it to the following replicas. This approach is wait-free but requires soft-deletes on the follower. Relates #34288	2018-10-19 13:56:00 -04:00
David Turner	3de266e3cf	Merge branch 'master' into zen2	2018-10-19 14:30:07 +01:00
Colin Goodheart-Smithe	84ef91529c	Merge branch 'master' into index-lifecycle	2018-10-19 13:24:04 +01:00
Daniel Mitterdorfer	dbb6fe58fa	Remove hand-coded XContent duplicate checks With this commit we cleanup hand-coded duplicate checks in XContent parsing. They were necessary previously but since we reconfigured the underlying parser in #22073 and #22225, these checks are obsolete and were also ineffective unless an undocumented system property has been set. As we also remove this escape hatch, we can remove the additional checks as well. Closes #22253 Relates #34588	2018-10-19 10:13:13 +02:00
Tal Levy	09067c8942	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-17 15:37:11 -07:00
Nhat Nguyen	eb36f10394	TEST: Capture replication targets when replication group ready (#34407 ) Today, WriteReplicationAction uses a set of replication targets directly from the primary shard of ReplicationGroup. It should be fine except when we add/remove or promote a shard while a write action is executing. We have encountered these two issues: 1. Replicas are not found in the replication targets. This happens because we remove replicas but the WriteReplicationAction still uses the old replication targets which include the removed replicas. 2. Access ReplicationGroup from a primary shard which hasn't activated the primary-mode yet. This is because we won't activate the primary-mode for a promoting shard after bumping the primary term which is executed asynchronously. This commit captures the replication targets when the replication group is ready and continue using those targets until we re-compute the new targets after the group is changed. Closes #33457	2018-10-17 17:37:52 -04:00
Armin Braun	08d4bf6e84	TESTS: Remove Dead Code in Test Infra. (#34548 ) * None of this infrastructure is used * Some redundant throws and resulting catch code removed	2018-10-17 20:08:39 +01:00
Colin Goodheart-Smithe	90f7cec7a5	Merge branch 'master' into index-lifecycle	2018-10-17 18:22:23 +01:00
Nik Everett	139bbc3f03	Rollup: Consolidate rollup cleanup for http tests (#34342 ) This moves the rollup cleanup code for http tests from the high level rest client into the test framework and then entirely removes the rollup cleanup code for http tests that lived in x-pack. This is nice because it consolidates the cleanup into one spot, automatically invokes the cleanup without the test having to know that it is "about rollup", and should allow us to run the rollup docs tests. Part of #34530	2018-10-17 09:32:16 -04:00
Andrey Ershov	93bb24e1f8	Merge branch 'master' into zen2	2018-10-17 14:37:53 +02:00
Armin Braun	3954d041a0	SCRIPTING: Move sort Context to its Own Class (#33717 ) * SCRIPTING: Move sort Context to its own Class	2018-10-17 10:02:44 +01:00
Tal Levy	fbe8dc014c	Merge branch 'master' into index-lifecycle	2018-10-16 13:58:53 -07:00
Armin Braun	ea576a8ca2	Disc: Move AbstractDisruptionTC to filebased D. (#34461 ) * Discovery: Move AbstractDisruptionTestCase to file-based discovery. * Relates #33675 * Simplify away ClusterDiscoveryConfiguration	2018-10-16 15:28:40 +01:00
David Turner	950ca3adda	Merge branch 'master' into zen2	2018-10-16 14:41:14 +01:00
Simon Willnauer	d43a1fac33	Lock down Engine.Searcher (#34363 ) `Engine.Searcher` is non-final today which makes it error prone in the case of wrapping the underlying reader or lucene `IndexSearcher` like we do in `IndexSearcherWrapper`. Yet, there is no subclass of it yet that would be dramatic to just drop on the floor. With the start of development of frozen indices this changed since in #34357 functionality was added to a subclass which would be dropped if a `IndexSearcherWrapper` is installed on an index. This change locks down the `Engine.Searcher` to prevent such a functionality trap.	2018-10-16 14:53:07 +02:00

1 2 3 4 5 ...

1915 Commits