OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tanguy Leroux	d70ebfd1d6	Merge branch 'master' into close-index-api-refactoring	2019-01-08 09:17:48 +01:00
Jason Tedor	c8c596cead	Introduce retention lease expiration (#37195 ) This commit implements a straightforward approach to retention lease expiration. Namely, we inspect which leases are expired when obtaining the current leases through the replication tracker. At that moment, we clean the map that persists the retention leases in memory.	2019-01-07 22:03:52 -08:00
Julie Tibshirani	c5aac4705d	Revert "Stop automatically nesting mappings in index creation requests. (#36924 )" This reverts commit `ac1c6940d2`.	2019-01-07 17:56:40 -08:00
Tanguy Leroux	97bf4d7176	Merge branch 'master' into close-index-api-refactoring	2019-01-07 18:38:27 +01:00
David Turner	9d0e0eb0f3	[Zen2] Remove initial master node count setting (#37150 ) The `cluster.unsafe_initial_master_node_count` setting was introduced as a temporary measure while the design of `cluster.initial_master_nodes` was being finalised. This commit removes this temporary setting, replacing it with usages of `cluster.initial_master_nodes` where appropriate.	2019-01-07 16:05:00 +00:00
Tanguy Leroux	e149b0852e	[Close Index API] Add unique UUID to ClusterBlock (#36775 ) This commit adds a unique id to cluster blocks, so that they can be uniquely identified if needed. This is important for the Close Index API where multiple concurrent closing requests can be executed at the same time. By adding a UUID to the cluster block, we can generate unique "closing block" that can later be verified on shards and then checked again from the cluster state before closing the index. When the verification on shard is done, the closing block is replaced by the regular INDEX_CLOSED_BLOCK instance. If something goes wrong, calling the Open Index API will remove the block. Related to #33888	2019-01-07 16:44:59 +01:00
Jason Tedor	c0f8c89172	Introduce shard history retention leases (#37167 ) This commit is the first in a series which will culminate with fully-functional shard history retention leases. Shard history retention leases are aimed at preventing shard history consumers from having to fallback to expensive file copy operations if shard history is not available from a certain point. These consumers include following indices in cross-cluster replication, and local shard recoveries. A future consumer will be the changes API. Further, index lifecycle management requires coordinating with some of these consumers otherwise it could remove the source before all consumers have finished reading all operations. The notion of shard history retention leases that we are introducing here will also be used to address this problem. Shard history retention leases are a property of the replication group managed under the authority of the primary. A shard history retention lease is a combination of an identifier, a retaining sequence number, a timestamp indicating when the lease was acquired or renewed, and a string indicating the source of the lease. Being leases they have a limited lifespan that will expire if not renewed. The idea of these leases is that all operations above the minimum of all retaining sequence numbers will be retained during merges (which would otherwise clear away operations that are soft deleted). These leases will be periodically persisted to Lucene and restored during recovery, and broadcast to replicas under certain circumstances. This commit is merely putting the basics in place. This first commit only introduces the concept and integrates their use with the soft delete retention policy. We add some tests to demonstrate the basic management is correct, and that the soft delete policy is correctly influenced by the existence of any retention leases. We make no effort in this commit to implement any of the following: - timestamps - expiration - persistence to and recovery from Lucene - handoff during primary relocation - sharing retention leases with replicas - exposing leases in shard-level statistics - integration with cross-cluster replication These will occur individually in follow-up commits.	2019-01-07 07:43:57 -08:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Simon Willnauer	ac2e09b25a	Fix suite scope random initializaation (#37163 ) The initialization of a suite scope cluster had some sideffects on subsequent runs which causes issues when tests must be reproduced. This moves the suite scope initialization to a privte random context. Closes #36202	2019-01-07 14:20:17 +01:00
Tanguy Leroux	f5af79b9cd	Merge branch 'master' into close-index-api-refactoring	2019-01-07 12:43:03 +01:00
Armin Braun	31c33fdb9b	MINOR: Remove some Deadcode in Gradle (#37160 )	2019-01-07 09:21:25 +01:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Julie Tibshirani	ac1c6940d2	Stop automatically nesting mappings in index creation requests. (#36924 ) Now that we unwrap mappings in DocumentMapperParser#extractMappings, it is not necessary for the mapping definition to always be nested under the type. This leniency around the mapping format was added in `2341825358`.	2019-01-03 17:41:28 -08:00
David Findley	d4e7660248	Fix weighted_avg parser not found for RestHighLevelClient (#37027 ) Add integration test for weighted avg sub aggregation Add weighted avg parser to DefaultNamedXContents Fixes #36861	2019-01-02 15:53:21 -06:00
Josh Soref	1df66d21fe	Spelling: replace uknown with unknown (#37056 )	2019-01-02 17:33:02 +01:00
Josh Soref	d3e98278c3	Spelling: replace cachable with cacheable (#37047 )	2019-01-02 14:10:30 +01:00
Armin Braun	85be9d6a89	SNAPSHOT: Deterministic ClusterState Tests (#36644 ) * Use `DeterministicTaskQueue` infrastructure to test `SnapshotsService`	2018-12-31 11:17:21 +01:00
Luca Cavanna	51fe20e0c3	Add support for local cluster alias to SearchRequest (#36997 ) With the upcoming cross-cluster search alternate execution mode, the CCS node will be able to split a CCS request into multiple search requests, one per remote cluster involved. In order to do that, the CCS node has to be able to signal to each remote cluster that such sub-requests are part of a CCS request. Each cluster does not know about the other clusters involved, and does not know either what alias it is given in the CCS node, hence the CCS coordinating node needs to be able to provide the alias as part of the search request so that it is used as index prefix in the returned search hits. The cluster alias is a notion that's already supported in the search shards iterator and search shard target, but it is currently used in CCS as both index prefix and connection lookup key when fanning out to all the shards. With CCS alternate execution mode the provided cluster alias needs to be used only as index prefix, as shards are local to each cluster hence no cluster alias should be used for connection lookups. The local cluster alias can be set to the SearchRequest at the transport layer only, and its constructor/getter methods are package private. Relates to #32125	2018-12-28 12:43:25 +01:00
Andrey Ershov	a02cfdf6e4	Switch InternalTestClusterTests to zen2 (#36977 ) Today InternalTestClusterTests is still using zen1. This commit fixes it. Two types of changes were required: 1. Explicitly pass file discovery host provider setting. It's done in ESIntegTestCase as a part of the Zen2 feature and should be done here as well. 2. For the test, that uses autoManageMinMasterNodes = false perform cluster bootstrap.	2018-12-27 22:21:37 +01:00
Nhat Nguyen	7580d9d925	Make SourceToParse immutable (#36971 ) Today the routing of a SourceToParse is assigned in a separate step after the object is created. We can easily forget to set the routing. With this commit, the routing must be provided in the constructor of SourceToParse. Relates #36921	2018-12-24 14:06:50 -05:00
Tim Brooks	c8a8391dfa	Only compress responses if request was compressed (#36867 ) This is a follow-up to some discussions around #36399. Currently we have relatively confusing compression behavior where compression can be configured for requests based on transport.compress or a specific setting for a remote cluster. However, we can only compress responses based on transport.compress as we do not know where a request is coming from (currently). This commit modifies the behavior to NEVER compress responses based on settings. Instead, a response will only be compressed if the request was compressed. This commit also updates the documentation to more clearly described transport level compression.	2018-12-21 10:14:00 -07:00
Tanguy Leroux	bd2af2c400	Merge branch 'master' into close-index-api-refactoring	2018-12-21 12:22:24 +01:00
Andrey Ershov	ca92d74e7e	[Zen2] Change unsafe bootstrap nodes count to nodes list in tests (#36559 ) This commit modifies ESSingleNodeTestCase and ESIntegTestCase and several concrete test classes to use node names when bootstrapping the cluster. Today ClusterBootstrapService.INITIAL_MASTER_NODE_COUNT_SETTING setting is used to bootstrap clusters in tests. Instead, we want to use ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING and get rid of the former setting eventually. There were two main problems when refactoring InternalTestCluster: 1. Nodes are created one-by-one in buildNode method. And node.name is created in this method as well. It's not suitable for bootstrapping, because we need to have the names of all master eligible nodes in advance, before creating the node with bootstrapping configuration set. We address this issue by separating buildNode into two methods: getNodeSettings and buildNode. We first iterate over all nodes to get nodes settings, then change the setting for the bootstrapping node and then proceed with building the node. 2. If autoManageMinMasterNodes = false, there is no way for the test to set the list of bootstrapping nodes because node names are not known in advance. This problem is solved by adding updateNodesSettings method to NodeConfigurationSource and ESIntegTestCase (which could be overridden by concrete integration test class). Once we have the list of settings for all nodes, the integration test class is allowed to update it. In our case, we update the ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING setting.	2018-12-20 15:20:33 +01:00
Tanguy Leroux	fb24469fe7	Merge branch 'master' into close-index-api-refactoring	2018-12-19 16:17:26 +01:00
Yannick Welsch	487a1c4f71	Fix cluster state persistence for single-node discovery (#36825 ) Single-node discovery is not persisting cluster states, which was caused by a recent 7.0-only refactoring. This commit ensures that the cluster state is properly persisted when using single-node discovery and adds a corresponding test.	2018-12-19 13:26:04 +01:00
Alan Woodward	344917efab	Add script filter to intervals (#36776 ) This commit adds the ability to filter out intervals based on their start and end position, and internal gaps: ``` POST _search { "query": { "intervals" : { "my_text" : { "match" : { "query" : "hot porridge", "filter" : { "script" : { "source" : "interval.start > 10 && interval.end < 20 && interval.gaps == 0" } } } } } } } ```	2018-12-19 11:12:18 +00:00
Tanguy Leroux	c99fd6a53b	Merge branch 'master' into close-index-api-refactoring	2018-12-19 09:34:59 +01:00
Alpar Torok	e9ef5bdce8	Converting randomized testing to create a separate unitTest task instead of replacing the builtin test task (#36311 ) - Create a separate unitTest task instead of Gradle's built in - convert all configuration to use the new task - the built in task is now disabled	2018-12-19 08:25:20 +02:00
Tim Brooks	aaf466ff5e	Revert transport.port change for tests (#36809 ) Commit #36786 updated docs and strings to reference transport.port instead of transport.tcp.port. However, this breaks backwards compatibility tests as the tests rely on string configurations and transport.port does not exist prior to 6.6. This commit reverts the places were we reference transport.tcp.port for tests. This work will need to be reintroduced in a backwards compatible way.	2018-12-18 19:01:13 -07:00
Tim Brooks	47a9a8de49	Update transport docs and settings for changes (#36786 ) This is related to #36652. In 7.0 we plan to deprecate a number of settings that make reference to the concept of a tcp transport. We mostly just have a single transport type now (based on tcp). Settings should only reference tcp if they are referring to socket options. This commit updates the settings in the docs. And removes string usages of the old settings. Additionally it adds a missing remote compress setting to the docs.	2018-12-18 13:09:58 -07:00
Ryan Ernst	8ec8342a52	Internal: Remove originalSettings from Node (#36569 ) This commit removes the originalSettings member from Node. It was only needed to allows test clusters to recreate the node in certain situations. Instead, the test cluster now keeps track of these settings.	2018-12-18 10:05:27 -08:00
Tanguy Leroux	0a0c969517	Merge branch 'master' into close-index-api-refactoring	2018-12-18 09:27:35 +01:00
Luca Cavanna	b57e12aa44	Add raw sort values to SearchSortValues transport serialization (#36617 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of each `SearchHit`. Sort values are already present but they are formatted according to the provided `DocValueFormat` provided. The CCS node needs to be able to reconstruct the lucene `FieldDoc` to include in the `TopFieldDocs` and `CollapseTopFieldDocs` which will feed the `mergeTopDocs` method used to reduce multiple search responses (one per cluster) into one. This commit adds such information to the `SearchSortValues` and exposes it through a new getter method added to `SearchHit` for retrieval. This info is only serialized at transport and never printed out at REST.	2018-12-18 09:20:51 +01:00
Nicholas Knize	96d279ed83	Revert "[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 )" This reverts commit `5bc7822562`.	2018-12-17 20:09:46 -06:00
Nick Knize	5bc7822562	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 ) This commit exposes lucene's LatLonShape field as the default type in GeoShapeFieldMapper. To use the new indexing approach, simply set "type" : "geo_shape" in the mappings without setting any of the strategy, precision, tree_levels, or distance_error_pct parameters. Note the following when using the new indexing approach: * geo_shape query does not support querying by MULTIPOINT. * LINESTRING and MULTILINESTRING queries do not yet support WITHIN relation. * CONTAINS relation is not yet supported. The tree, precision, tree_levels, distance_error_pct, and points_only parameters are deprecated.	2018-12-17 14:38:14 -06:00
Luca Cavanna	f1e1f93943	[TEST] fix float comparison in RandomObjects#getExpectedParsedValue This commit fixes a test bug introduced with #36597. This caused some test failure as stored field values comparisons would not work when CBOR xcontent type was used. Closes #29080	2018-12-17 21:19:59 +01:00
Tanguy Leroux	79999d37d4	Merge branch 'master' into close-index-api-refactoring	2018-12-17 10:14:38 +01:00
Boaz Leskes	733a6d34c1	Add seq no powered optimistic locking support to the index and delete transport actions (#36619 ) This commit add support for using sequence numbers to power [optimistic concurrency control](http://en.wikipedia.org/wiki/Optimistic_concurrency_control) in the delete and index transport actions and requests. A follow up will come with adding sequence numbers to the update and get results. Relates #36148 Relates #10708	2018-12-15 17:59:57 +01:00
Tim Brooks	3065300434	Unify transport settings naming (#36623 ) This commit updates our transport settings for 7.0. It generally takes a few approaches. First, for normal transport settings, it usestransport. instead of transport.tcp. Second, it uses transport.tcp, http.tcp, or network.tcp for all settings that are proxies for OS level socket settings. Third, it marks the network.tcp.connect_timeout setting for removal. Network service level settings are only settings that apply to both the http and transport modules. There is no connect timeout in http. Fourth, it moves all the transport settings to a single class TransportSettings similar to the HttpTransportSettings class. This commit does not actually remove any settings. It just adds the new renamed settings and adds todos for settings that will be deprecated.	2018-12-14 14:41:04 -07:00
Tim Brooks	fbf88b2ab7	Remove the `MockTcpTransport` (#36628 ) This commit removes all remaining usages of the `MockTcpTransport`. Additionally it removes the `MockTcpTransport` and its test case.	2018-12-14 10:59:07 -07:00
Luca Cavanna	bb3ae18da5	Increase coverage in SearchSortValuesTests (#36597 ) SearchSortValuesTests extends now `AbstractSerializingTestCase` which removes some code duplication and standardizes the way we test `fromXContent`, serialization and equals/hashcode. Also, we were never creating `SearchSortValues` through their public constructor that accept an array of `DocValueFormat` together with the array of raw sort values. That is covered now, which involved some conversion from `BytesRef` to String in the test. Also, the previous test was not using doing any equality check against the original and parsed versions in `testFromXContent` due to values being parsed with different types in some cases, which is now covered by converting those values using a new method added to `RandomObjects`. The code was already there as part of `randomStoredFieldValues`, but it is now exposed to be used in other scenarios.	2018-12-14 18:57:37 +01:00
Luca Cavanna	7dc3d3b78b	Add sort and collapse info to SearchHits transport serialization (#36555 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of the `SearchHits`, specifically: - lucene `SortField[]` which contains info about the fields that sorting was performed on and their type, which depends on mappings (that the CCS node does not know about) - collapse field (`String`) that field collapsing was executed on, if requested - collapse values (`Object[]`) that field collapsing was based on, if requested This info is needed to be able to reconstruct the `TopFieldDocs` or `CollapseFieldTopDocs` in the CCS coordinating node to feed the `mergeTopDocs` method and reduce multiple search responses received (one per cluster) into one. This commit adds such information to the `SearchHits` class. It's nullable info that is not serialized through the REST layer. `SearchPhaseController` sets such info at the end of the hits reduction phase.	2018-12-14 12:22:54 +01:00
Armin Braun	c5b3ac5578	SNAPSHOTS: Allow Parallel Restore Operations (#36397 ) * Enable parallel restore operations * Add uuid to restore in progress entries to uniquely identify them * Adjust restore in progress entries to be a map in cluster state * Added tests for: * Parallel restore from two different snapshots * Parallel restore from a single snapshot to different indices to test uuid identifiers are correctly used by `RestoreService` and routing allocator * Parallel restore with waiting for completion to test transport actions correctly use uuid identifiers	2018-12-14 11:39:23 +01:00
Tanguy Leroux	8e5dd20efb	[Close Index API] Refactor MetaDataIndexStateService (#36354 ) The commit changes how indices are closed in the MetaDataIndexStateService. It now uses a 3 steps process where writes are blocked on indices to be closed, then some verifications are done on shards using the TransportVerifyShardBeforeCloseAction added in #36249, and finally indices states are moved to CLOSE and their routing tables removed. The closing process also takes care of using the pre-7.0 way to close indices if the cluster contains mixed version of nodes and a node does not support the TransportVerifyShardBeforeCloseAction. It also closes unassigned indices. Related to #33888	2018-12-13 17:36:23 +01:00
Boaz Leskes	f6b5d7e013	Add sequence numbers based optimistic concurrency control support to Engine (#36467 ) This commit add support to engine operations for resolving and verifying the sequence number and primary term of the last modification to a document before performing an operation. This is infrastructure to move our (optimistic concurrency control)[http://en.wikipedia.org/wiki/Optimistic_concurrency_control] API to use sequence numbers instead of internal versioning. Relates #36148 Relates #10708	2018-12-13 08:08:40 +01:00
Tal Levy	cd1bec3a06	[refactor] add Environment in BootstrapContext (#36573 ) There are certain BootstrapCheck checks that may need access environment-specific values. Watcher's EncryptSensitiveDataBootstrapCheck passes in the node's environment via a constructor to bypass the shortcoming in BootstrapContext. This commit pulls in the node's environment into BootstrapContext. Another case is found in #36519, where it is useful to check the state of the data-path. Since PathUtils.get and Paths.get are forbidden APIs, we rely on the environment to retrieve references to things like node data paths. This means that the BootstrapContext will have the same Settings used in the Environment, which currently differs from the Node's settings.	2018-12-12 21:07:21 -08:00
Julie Tibshirani	71a39d10be	Make sure that BWC tests run successfully, even with types deprecation messages. (#36511 )	2018-12-12 12:57:32 -08:00
Tim Brooks	7f612d5dd8	Always compress based on the settings (#36522 ) Currently TransportRequestOptions allows specific requests to request compression. This commit removes this and always compresses based on the settings. Additionally, it removes TransportResponseOptions as they are unused. This closes #36399.	2018-12-12 09:39:15 -07:00
Simon Willnauer	ff5dd14753	Fix test failures related to file corruption (#36530 ) * Fix CorruptFileIT to also take last DV generation into account We currently only prune old .liv generations. With soft_deletes it's important to also prune DV generations. * Fix CorruptionUtils to skip the footer bytes after the checksum is read. Today we read a broken checksum since we also checksum the 8 footer bytes that include the checksum algorithm and the footer magic. Closes #36526	2018-12-12 16:21:02 +01:00
Tim Brooks	e63d52af63	Move page size constants to PageCacheRecycler (#36524 ) `PageCacheRecycler` is the class that creates and holds pages of arrays for various uses. `BigArrays` is just one user of these pages. This commit moves the constants that define the page sizes for the recycler to be on the recycler class.	2018-12-12 07:00:50 -07:00
Alpar Torok	c00d0fc814	Test fixtures improovements (#36037 ) * Upgrae plugin to latest and expose udp * Explicit check for windows * Rename the properties for the port numbers * Tasks for pre and pos container actions	2018-12-12 12:00:47 +02:00
Nik Everett	03daad9812	Re-deprecate xpack rollup endpoints (#36451 ) Redeprecates the `/_xpack/rollup` endpoints in favor of `/_rollup`. When we cleanup the rollup in a cluster containing 6.x nodes we need to use `/_xpack/rollup` instead of `/_rollup` because the 6.x nodes don't know about `/_rollup`. In those cases we must ignore the deprecation warnings that the 7.0 node will return for the end point. Closes #36044	2018-12-11 19:43:17 -05:00
Tim Brooks	797f985067	Add version to handshake requests (#36171 ) Currently our handshake requests do not include a version. This is unfortunate as we cannot rely on the stream version since it is not the sending node's version. Instead it is the minimum compatibility version. The handshake request is currently empty and we do nothing with it. This should allow us to add data to the request without breaking backwards compatibility. This commit adds the version to the handshake request. Additionally, it allows "future data" to be added to the request. This allows nodes to craft a version compatible response. And will properly handle additional data in future handshake requests. The proper handling of "future data" is useful as this is the only request where we do not know the other node's version. Finally, it renames the TcpTransportHandshaker to TransportHandshaker.	2018-12-11 16:09:28 -07:00
Mayya Sharipova	2f18325384	Deprecate types in update_by_query and delete_by_query (#36365 ) Relates to #35190	2018-12-11 17:09:59 -05:00
Tim Brooks	790f8102e9	Modify `BigArrays` to take name of circuit breaker (#36461 ) This commit modifies BigArrays to take a circuit breaker name and the circuit breaking service. The default instance of BigArrays that is passed around everywhere always uses the request breaker. At the network level, we want to be using the inflight request breaker. So this change will allow that. Additionally, as this change moves away from a single instance of BigArrays, the class is modified to not be a Releasable anymore. Releasing big arrays was always dispatching to the PageCacheRecycler, so this change makes the PageCacheRecycler the class that needs to be managed and torn-down. Finally, this commit closes #31435 be making the serialization of transport messages use the inflight request breaker. With this change, we no longer push the global BigArrays instnace to the network level.	2018-12-11 11:55:41 -07:00
markharwood	a9eccbcd02	Tests- added helper methods to ESRestTestCase for checking warnings (#36443 ) Added helper methods to ESRestTestCase for checking warnings in mixed and current-version-only clusters. This is supported by a new VersionSpecificWarningsHandler class with associated unit test. Closes #36251	2018-12-11 17:30:15 +00:00
Andrey Ershov	8b821706cc	Switch more tests to zen2 (#36367 ) 1. CCR tests work without any changes 2. `testDanglingIndices` require changes the source code (added TODO). 3. `testIndexDeletionWhenNodeRejoins` because it's using just two nodes, adding the node to exclusions is needed on restart. 4. `testCorruptTranslogTruncationOfReplica` starts dedicated master one, because otherwise, the cluster does not form, if nodes are stopped and one node is started back. 5. `testResolvePath` needs TEST cluster, because all nodes are stopped at the end of the test and it's not possible to perform checks needed by SUITE cluster. 6. `SnapshotDisruptionIT`. Without changes, the test fails because Zen2 retries snapshot creation as soon as network partition heals. This results into the race between creating snapshot and test cleanup logic (deleting index). Zen1 on the other hand, also schedules retry, but it takes some time after network partition heals, so cleanup logic executes latter and test passes. The check that snapshot is eventually created is added to the end of the test.	2018-12-11 17:12:17 +01:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Jernej Klancic	d615add1b1	Add pipeline parent validation for auto date histogram (#35670 ) Allow `auto_date_histogram` as a valid parent agg for derivative, cumulative sum, moving average, moving function and serial differencing pipeline aggregations. Since all these aggs share the same requirement (sequentially ordered parent aggs), this commit also refactors to share the same validation code so that any newly added aggs won't be forgotten. Closes #35578	2018-12-10 16:02:49 -05:00
Jim Ferenczi	75392adf60	[TEST] Convert SearchHitsTests to AbstractStreamableXContentTestCase (#36313 ) This change adds a way to provide the content type of the rest serialization tests when creating random instances. This is used by SearchHitsTests to ensure that the internal members of the class are created with the same xContentType and that equals can be used to compare an instances created from an XContent view.	2018-12-10 20:41:20 +01:00
Nik Everett	9626e700ce	LLRC: Make warning behavior pluggable per request (#36345 ) This allows you to plug the behavior that the LLRC uses to handle warnings on a per request basis. We entertained the idea of allowing you to set the warnings behavior to strict mode on a per request basis but that wouldn't allow the high level rest client to fail when it sees an unexpected warning. We also entertained the idea of adding a list of "required warnings" to the `RequestOptions` but that won't work well with failures that occur sometimes like those we see in mixed clusters. Adding a list of "allowed warnings" to the `RequestOptions` would work for mixed clusters but it'd leave many of the assertions in our tests weaker than we'd like. This behavior plugging implementation allows us to make a "required warnings" option when we need it and an "allowed warnings" behavior when we need it. I don't think this behavior is going to be commonly used by used outside of the Elasticsearch build, but I expect they'll be a few commendably paranoid folks who could use this behavior.	2018-12-10 08:32:00 -05:00
David Turner	9f86e996fe	[Zen2] Support rolling upgrades from Zen1 (#35737 ) We support rolling upgrades from Zen1 by keeping the master as a Zen1 node until there are no more Zen1 nodes in the cluster, using the following principles: - Zen1 nodes will never vote for Zen2 nodes - Zen2 nodes will, while not bootstrapped, vote for Zen1 nodes - Zen2 nodes that were previously part of a mixed cluster will automatically (and unsafely) bootstrap themselves when the last Zen1 node leaves.	2018-12-08 07:33:35 +00:00
Tim Brooks	8a53f2b464	Implement basic `CcrRepository` restore (#36287 ) This is related to #35975. It implements a basic restore functionality for the CcrRepository. When the restore process is kicked off, it configures the new index as expected for a follower index. This means that the index has a different uuid, the version is not incremented, and the Ccr metadata is installed. When the restore shard method is called, an empty shard is initialized.	2018-12-07 15:27:04 -07:00
Tim Brooks	5556204f81	Use MockNioTransport in MockTransportService (#36346 ) The default transport used in the MockTransportService is the MockTcpTransport. This commit changes that to be the MockNioTransport.	2018-12-07 11:17:11 -07:00
Nhat Nguyen	f2df0a5be4	Remove LocalCheckpointTracker#resetCheckpoint (#34667 ) In #34474, we added a new assertion to ensure that the LocalCheckpointTracker is always consistent with Lucene index. However, we reset LocalCheckpoinTracker in testDedupByPrimaryTerm cause this assertion to be violated. This commit removes resetCheckpoint from LocalCheckpointTracker and rewrites testDedupByPrimaryTerm without resetting the local checkpoint. Relates #34474	2018-12-07 12:22:20 -05:00
David Turner	ed1c5a0241	Introduce `zen2` discovery type (#36298 ) With this change it is now possible to start a node running Zen2.	2018-12-06 16:20:08 +00:00
David Turner	38ab15c6fb	Avoid shutting down the only master (#36272 ) Today the InternalTestClusterTests sometimes set up a cluster with a single master, start some other ndoes, shut the original master down, and then reset the cluster. This doesn't really work, because the original master may be stale. This change avoids shutting down the only master in this situation.	2018-12-06 08:27:38 +01:00
Yannick Welsch	a0ae1cc987	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 23:13:12 +01:00
Yannick Welsch	03d0ea91ef	Zen2: Rename tombstones to exclusions (#36226 ) Renames the withdrawal / tombstones APIs to voting configuration exclusions.	2018-12-05 23:12:28 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Yannick Welsch	b20497560c	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 14:06:38 +01:00
Yannick Welsch	0b9efff5cb	Zen2: Persist cluster states the old way on non-master-eligible nodes (#36247 ) The shard deletion logic (triggered by IndicesStore), which also leads to index metadata deletion on non-master-eligible data nodes, currently races against the new cluster state persistence logic triggered by accepting cluster states. One thread is writing the index metadata while another one is deleting the index metadata, leading to exceptions and assertions tripping (see below). The solution proposed by this PR is to move the cluster state persistence of non-master-eligible nodes back to the cluster applier service, just as it used to be for Zen1. This ensures that the index metadata deletion logic, which is triggered by the shard deletion logic, runs on the same thread on which we persist the cluster state.	2018-12-05 14:04:45 +01:00
Alpar Torok	60e45cd81d	Testing conventions task part 2 (#36107 ) Closes #35435 - make it easier to add additional testing tasks with the proper configuration and add some where they were missing. - mute or fix failing tests - add a check as part of testing conventions to find classes not included in any testing task.	2018-12-05 14:20:01 +02:00
Yannick Welsch	70c361ea5a	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 21:26:11 +01:00
Adrien Grand	0df08dd458	Set Lucene version upon index creation. (#36038 ) It is important that all shards of a given index have the same `indexCreatedVersionMajor` to Lucene, or eg. merging those shards is going to be considered illegal. At the moment, we use the latest Lucene version when creating a shard, which could cause shards to have different created versions eg. in case of forced allocation. This commit makes sure to reuse the appropriate Lucene version in order to avoid such issues. Closes #33826	2018-12-04 17:53:20 +01:00
Nhat Nguyen	b59deb573e	Always set soft-deletes field of IndexWriterConfig (#36196 ) Today we configure the soft-deletes field iff soft-deletes enabled. Although this choice was correct, it prevents an engine with soft-deletes disabled from opening a Lucene index with soft-deletes. Moreover, this change should not have any side-effect if a Lucene index does not have any soft-deletes. Relates #36141	2018-12-04 11:15:34 -05:00
Andrey Ershov	35e3d77e2c	[Zen2] Implement state recovery (#36013 ) This commit implements proper metadata recovery for Zen2. GatewayService is responsible for the recovery. In Zen1 GatewayService creates an instance of Gateway, that is used to reach out to other cluster nodes, get their state and calculate the most up-to-date state based on versions. After that Gateway performs upgrade and archival of ClusterSettings and closes bad indices. Then recovered state is passed to GatewayService.GatewayRecoveryListener that mixes up current state and restored state, removes state not recovered block, creates the routing table and performs re-routing. In Zen2 we should perform this kind of logic on cluster startup, except mixing state (because there is nothing to mix) and opening routing table. This commit refactors out all `ClusterUpdate` functions in a separate class `ClusterStateUpdaters`, which is used by `Gateway` and `GatewayService` in case of Zen1, and by `GatewayMetaState` and `GatewayService` in case of Zen2. This commit also switches all integration tests that are already using Zen2 from InMemoryPersistedState to GatewayMetaState.	2018-12-04 14:45:45 +01:00
Yannick Welsch	80ee7943c9	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 09:37:09 +01:00
Alpar Torok	d036e0ca89	Testclusters: implement starting, waiting for and stopping single cluster nodes (#35599 )	2018-12-04 10:16:51 +02:00
David Turner	034c7655b7	[Zen2] Reduce cluster scope in NodeDisconnectIT (#36168 ) This test suite can stop all the shared master-eligible nodes, which breaks the cluster since any non-shared master-eligible nodes are stopped first in the reset process between tests. Since this test suite can leave the cluster in this somewhat broken state, it seems best that it uses a new cluster for each test.	2018-12-04 07:48:56 +00:00
Armin Braun	433a506d06	SNAPSHOT: Improve Resilience SnapshotShardService (#36113 ) * Resolve the index in the snapshotting thread * Added test for routing table - snapshot state mismatch	2018-12-03 16:39:29 +01:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354 ) When building a query Lucene distinguishes two cases, queries that require to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in es that require this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context.. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
David Turner	8011438ea8	Use correct source of randomness This fixes a failure of InternalTestClusterTests#testBeforeTest which checks that the cluster is set up the same when starting from the same seed. Trappily, using ESTestCase#randomIntBetween() is no good, we have to use InternalTestCluster#random via RandomNumbers#randomIntBetween() instead.	2018-12-02 09:39:43 +00:00
David Turner	8191348d6b	[Zen2] Only bootstrap a single node (#36119 ) Today, we allow all nodes in an integration test to bootstrap. However this seems to lead to test failures due to post-election instability. The change avoids this instability by only bootstrapping a single node in the cluster.	2018-12-01 06:43:11 +00:00
Luca Cavanna	0ebc17743a	Histogram aggs: add empty buckets only in the final reduce step (#35921 ) Empty buckets don't need to be added when performing an incremental reduction step, they can be added later in the final reduction step. This will allow us to later remove the max buckets limit when performing non final reduction.	2018-11-30 20:33:09 +01:00
Tim Brooks	ea7ea51050	Make `TcpTransport#openConnection` fully async (#36095 ) This is a follow-up to #35144. That commit made the underlying connection opening process in TcpTransport asynchronous. However the method still blocked on the process being complete before returning. This commit moves the blocking to the ConnectionManager level. This is another step towards the top-level TransportService api being async.	2018-11-30 11:30:42 -07:00
Tim Brooks	26dcbcc8cc	Remove `MockTcpTransport` for ESIntegTestCase (#36089 ) This commit removes the `MockTcpTransport` as a transport option for `ESIntegTestCase`. It is the first step in replacing the usages of `MockTcpTransport` with `MockNioTransport`.	2018-11-30 09:04:51 -07:00
Adrien Grand	fa3d365ee8	Fix CompositeBytesReference#slice to not throw AIOOBE with legal offsets. (#35955 ) CompositeBytesReference#slice has two bugs: - One that makes it fail if the reference is empty and an empty slice is created, this is #35950 and is fixed by special-casing empty-slices. - One performance bug that makes it always create a composite slice when creating a slice that ends on a boundary, this is fixed by computing `limit` as the index of the sub reference that holds the last element rather than the next element after the slice. Closes #35950	2018-11-30 10:32:46 +01:00
Zachary Tong	61c2db5ebb	Revert "Deprecate X-Pack centric rollup endpoints (#35962 )" This reverts commit `b84f1f6a3a`.	2018-11-29 12:58:23 -05:00
Zachary Tong	40c5445480	Revert "[TEST] Use deprecated form of rollup endpoint in mixed cluster (#36000 )" This reverts commit `85cdf4f913`.	2018-11-29 12:56:25 -05:00
Tim Brooks	c305f9dc03	Make keepalive pings bidirectional and optimizable (#35441 ) This is related to #34405 and a follow-up to #34753. It makes a number of changes to our current keepalive pings. The ping interval configuration is moved to the ConnectionProfile. The server channel now responds to pings. This makes the keepalive pings bidirectional. On the client-side, the pings can now be optimized away. What this means is that if the channel has received a message or sent a message since the last pinging round, the ping is not sent for this round.	2018-11-29 08:55:53 -07:00
Zachary Tong	85cdf4f913	[TEST] Use deprecated form of rollup endpoint in mixed cluster (#36000 ) When wiping rollup jobs, if we are in a mixed cluster with < v7.0 nodes we need to fall back to the deprecated endpoint because we may talk to a 6.x node.	2018-11-29 07:37:33 -05:00
David Turner	7f257187af	[Zen2] Update default for USE_ZEN2 to true (#35998 ) Today the default for USE_ZEN2 is false and it is overridden in many places. By defaulting it to true we can be sure that the only places in which Zen2 does not work are those in which it is explicitly set to false.	2018-11-29 12:18:35 +00:00
Jason Tedor	b84f1f6a3a	Deprecate X-Pack centric rollup endpoints (#35962 ) This commit is part of our plan to deprecate and ultimately remove the use of _xpack in the REST APIs.	2018-11-27 20:34:17 -05:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Andrey Ershov	0e283f9670	[Zen2] PersistedState interface implementation (#35819 ) Today GatewayMetaState is capable of atomically storing MetaData to disk. We've also moved fields that are needed to be persisted in Zen2 from ClusterState to ClusterState.MetaData.CoordinationMetaData. This commit implements PersistedState interface. version and currentTerm are persisted as a part of Manifest. GatewayMetaState now implements both ClusterStateApplier and PersistedState interfaces. We started with two descendants Zen1GatewayMetaState and Zen2GatewayMetaState, but it turned out to be not easy to glue it. GatewayMetaState now constructs previousClusterState (including MetaData) and previousManifest inside the constructor so that all PersistedState methods are usable as soon as GatewayMetaState instance is constructed. Also, loadMetaData is renamed to getMetaData, because it just returns previousClusterState.metaData(). Sadly, we don't have access to localNode (obtained from TransportService in the constructor, so getLastAcceptedState should be called, after setLocalNode method is invoked. Currently, when deciding whether to write IndexMetaData to disk, we're comparing current IndexMetaData version and received IndexMetaData version. This is not safe in Zen2 if the term has changed. So updateClusterState now accepts incremental write method parameter. When it's set to false, we always write IndexMetaData to disk. Things that are not covered by GatewayMetaStateTests are covered by GatewayMetaStatePersistedStateTests. This commit also adds an option to use GatewayMetaState instead of InMemoryPersistedState in TestZenDiscovery. However, by default InMemoryPersistedState is used and only one test in PersistedStateIT used GatewayMetaState. In order to use it for other tests, proper state recovery should be implemented.	2018-11-27 15:04:52 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Andrey Ershov	a056bd8c1c	[Zen2] Move ClusterState fields to be persisted to ClusterState.MetaData (#35625 ) Today we have a way to atomically persist global MetaData and IndexMetaData to disk when new ClusterState is received. All other ClusterState fields are not persisted. However, there are other parts of ClusterState that should be persisted, namely: version term lastCommittedConfiguration lastAcceptedConfiguration votingTombstones version is changed frequently, other fields are not. We decided to group term, lastCommittedConfiguration, lastAcceptedConfiguration and votingTombstones into CoordinationMetaData class and make CoordinationMetaData a field inside MetaData. MetaData.toXContent and MetaData.fromXContent should take care of CoordinationMetaData. version stays as a top level field in ClusterState and will be persisted as part of Manifest in a follow-up commit. Also MetaData.isGlobalStateEquals should be extended to include coordinationMetaData in comparison. This commit favors exposing getters, such as getTerm directly in ClusterState to avoid massive code changes. An example of CoordinationMetaState.toXContent: { "term": 1, "last_committed_config": [ "TiIuBcbBtpuXyDDVHXeD", "ZIAoVbkjjLPLUuYLaTkw" ], "last_accepted_config": [ "OwkXbXZNOZPJqccdFHdz", "LouzsGYwmQzpeQMrboZe", "fCKGRZdjLTqzXAqPUtGL", "pLoxshjpJXwDhbgjfYJy", "SjINLwFIlIEFZCbjrSFo", "MDkVncJEVyZLJktopWje" ] }	2018-11-21 17:03:26 +01:00
Andrey Ershov	6ac0cb1842	Merge branch master into zen2 2 types of conflicts during the merge: 1) Line length fix 2) Classes no longer extend AbstractComponent	2018-11-21 15:36:49 +01:00
Yannick Welsch	8939a7894f	Zen2: Move disruption tests to Zen2 (#35724 ) - Moves disruption tests to Zen2 - Registers a few missing settings - Removes .put(TestZenDiscovery.USE_ZEN2.getKey(), true) from tests where Zen2 is now enabled by default through the parent test class - Moves QuorumGatewayIT back to Zen1, as it is not stable with Zen2 as it currently relies on dangling indices due to the lack of proper CS persistence, which triggers secondary failures	2018-11-21 14:43:33 +01:00
Armin Braun	7a210342ab	TESTS: Remove Dead Code in Disruption Tests (#35768 ) * Neither this class nor the constructor are used anywhere	2018-11-21 10:33:50 +01:00
Christoph Büscher	5847f8379c	Move ScoreAccessor to test-framework (#35766 ) This class is only used by RandomScoreFunctionIT and the MockScriptEngine, so it shouldn't be part of the server codebase.	2018-11-21 10:28:31 +01:00
Armin Braun	33c713ba60	TESTS: More Logging in LongGcDisruptionTests (#35702 ) * The existing logging is not helpful enough to track down which threads hang, we need the hanging thread's stacktraces too * Relates #35686	2018-11-20 15:36:01 +01:00
Alpar Torok	8659af68e0	Auto skip license headers on no source (#35640 ) * Unmute BuildExamplePluginsIT * Skip licenseHeaders when there are no sources	2018-11-20 13:02:33 +02:00
Simon Willnauer	29ef442841	Add a `_freeze` / `_unfreeze` API (#35592 ) This commit adds a rest endpoint for freezing and unfreezing an index. Among other cleanups mainly fixing an issue accessing package private APIs from a plugin that got caught by integration tests this change also adds documentation for frozen indices. Note: frozen indices are marked as `beta` and available as a basic feature. Relates to #34352	2018-11-20 08:03:24 +01:00
Yannick Welsch	47ada69c46	Zen2: Move most integration tests to Zen2 (#35678 ) Zen2 is now feature-complete enough to run most ESIntegTestCase tests. The changes in this PR are as follows: - ClusterSettingsIT is adapted to not be Zen1 specific anymore (it was using Zen1 settings). - Some of the integration tests require persistent storage of the cluster state, which is not fully implemented yet (see #33958). These tests keep running with Zen1 for now but will be switched over as soon as that is fully implemented. - Some very few integration tests are not running yet with Zen2 for other reasons, depending on some of the other open points in #32006.	2018-11-19 21:15:29 +01:00
Gordon Brown	b2057138a7	Remove AbstractComponent from AbstractLifecycleComponent (#35560 ) AbstractLifecycleComponent now no longer extends AbstractComponent. In order to accomplish this, many, many classes now instantiate their own logger.	2018-11-19 09:51:32 -07:00
Arthur Gavlyukovskiy	022726011c	Remove use of AbstractComponent in server (#35444 ) Removed extending of AbstractComponent and changed logger usage to explicit declaration. Abstract classes still have logger declaration using this.getClass() in order to show implementation class name in its logs. See #34488	2018-11-16 16:10:32 -05:00
Jernej Klancic	baf33b3162	Removes AbstractComponent from several classes (#35566 ) Removes inhertiting from AbstractComponent for some classes (mostly in the plugins module). Relates to #34488	2018-11-16 20:50:18 +01:00
Lee Hinman	ce35d049e9	[TEST] Fix ClusterApplierServiceTests.testClusterStateUpdateLogging This changes the test to not use a `CountDownlatch`, instead adding an assertion for the final logging message and waiting until the `MockAppender` has seen it before proceeding. Resolves #23739	2018-11-15 14:15:23 -07:00
David Turner	86ef041539	[Zen2] Introduce ClusterBootstrapService (#35488 ) Today, the bootstrapping of a Zen2 cluster is driven externally, requiring something else to wait for discovery to converge and then to inject the initial configuration. This is hard to use in some situations, such as REST tests. This change introduces the `ClusterBootstrapService` which brings the bootstrap retry logic within each node and allows it to be controlled via an (unsafe) node setting.	2018-11-15 20:09:22 +00:00
Tanguy Leroux	c9b4ef0dfd	Use RunOnce when appropriate (#35553 ) This pull request replaces some blocks of code that must be run once and that are currently based on AtomicBoolean by the convient RunOnce class added in #35489.	2018-11-15 09:24:40 +01:00
Andrey Ershov	045fdd0d3b	Merge master into zen2	2018-11-14 15:37:13 +03:00
Yannick Welsch	4cfdb0609e	Adapt InternalCluster#fullRestart to call onNodeStopped when all nodes are stopped (#35494 ) Refactors and simplifies the logic around stopping nodes, making sure that for a full cluster restart onNodeStopped is only called after the nodes are actually all stopped (and in particular not while starting up some nodes again). This change also ensures that a closed node client is not being used anymore (which required a small change to a test). Relates to #35049	2018-11-14 13:24:56 +01:00
Zachary Tong	c346a0f027	[Rollup] Add `wait_for_completion` option to StopRollupJob API (#34811 ) This adds a `wait_for_completion` flag which allows the user to block the Stop API until the task has actually moved to a stopped state, instead of returning immediately. If the flag is set, a `timeout` parameter can be specified to determine how long (at max) to block the API call. If unspecified, the timeout is 30s. If the timeout is exceeded before the job moves to STOPPED, a timeout exception is thrown. Note: this is just signifying that the API call itself timed out. The job will remain in STOPPING and evenutally flip over to STOPPED in the background. If the user asks the API to block, we move over the the generic threadpool so that we don't hold up a networking thread.	2018-11-13 16:37:17 -05:00
Julie Tibshirani	bc799e4a6f	Ignore warnings related to types deprecation in REST tests. (#35395 )	2018-11-13 11:56:01 -08:00
David Turner	8e40a2bbe2	[Zen2] Introduce vote withdrawal (#35446 ) If shutting down half or more of the master-eligible nodes, their votes must first be explicitly withdrawn to ensure that the cluster doesn't lose its quorum. This works via _voting tombstones_, stored in the cluster state, which tell the reconfigurator to remove nodes from the voting configuration. This change introduces voting tombstones to the cluster state, together with transport APIs for adding and removing them, and makes use of these APIs in `InternalTestCluster` to support tests which remove at least half of the master-eligible nodes at once (e.g. shrinking from two master-eligible nodes to one).	2018-11-13 19:32:32 +00:00
David Turner	fbd3cab410	[Zen2] Remove AbstractComponent usage (#35483 ) AbstractComponent was deprecated in #35140 and is looking like it will be removed at some point by #34888. Today all it does is provide a logger. This change removes the usages of AbstractComponent that live solely in the zen2 feature branch to avoid some future merge pain, and replaces it where necessary with some directly-created loggers.	2018-11-13 15:20:49 +00:00
Yannick Welsch	fe29b18c26	Fix compilation	2018-11-12 11:05:11 +01:00
Yannick Welsch	4e6c58c942	Merge remote-tracking branch 'elastic/master' into zen2	2018-11-12 10:03:59 +01:00
Tim Brooks	ba478827ad	Improve MockTcpTransport memory usage (#35402 ) The MockTcpTransport is not friendly in regards to memory usage. It must allocate multiple byte arrays for every message. This improves the memory situation by failing fast if the message is improperly formatted. Additionally, it uses reusable big arrays for at least half of the allocated byte arrays.	2018-11-09 10:12:49 -07:00
Jim Ferenczi	7054e289fa	Add trace log of the request for the query and fetch phases (#34479 ) This change adds a logger for the query and fetch phases that prints all requests before their execution at the trace level. This will help debugging cases where an issue occurs during the execution since only completed queries are logged by the slow logs.	2018-11-09 09:41:51 +01:00
Tim Brooks	93c2c604e5	Move compression config to ConnectionProfile (#35357 ) This is related to #34483. It introduces a namespaced setting for compression that allows users to configure compression on a per remote cluster basis. The transport.tcp.compress remains as a fallback setting. If transport.tcp.compress is set to true, then all requests and responses are compressed. If it is set to false, only requests to clusters based on the cluster.remote.cluster_name.transport.compress setting are compressed. However, after this change regardless of any local settings, responses will be compressed if the request that is received was compressed.	2018-11-08 10:37:59 -07:00
Yannick Welsch	c315ead0ac	Zen2: Add diff-based publishing (#35290 ) Enables diff-based publishing, which is an optimization where only the changing parts of the cluster state are published to the nodes in the cluster, falling back to full cluster state publishing if the receiver does not have the previous cluster state.	2018-11-08 17:16:09 +01:00
David Turner	6885a7cb0f	Introduce transport API for cluster bootstrapping (#34961 ) - Introduces a transport API for bootstrapping a Zen2 cluster - Introduces a transport API for requesting the set of nodes that a master-eligible node has discovered and for waiting until this comprises the expected number of nodes. - Alters ESIntegTestCase to use these APIs when forming a cluster, rather than injecting the initial configuration directly.	2018-11-08 16:09:37 +00:00
Zachary Tong	54b445d74b	[Test] Remove obsolete job/cluster cleanup code Also makes sure the awaitBusy for job stoppage is checked, so that we can fail if we timed out waiting for a job to stop. Closes #35295	2018-11-08 10:23:23 -05:00
David Turner	77789a733d	Merge branch 'master' into 2018-11-08-merge-master	2018-11-08 13:38:18 +00:00
Simon Willnauer	0cc0fd2d15	Add a frozen engine implementation (#34357 ) This change adds a `frozen` engine that allows lazily open a directory reader on a read-only shard. The engine wraps general purpose searchers in a LazyDirectoryReader that also allows to release and reset the underlying index readers after any and before secondary search phases. Relates to #34352	2018-11-07 20:23:35 +01:00
Alpar Torok	8a85b2eada	Remove build qualifier from server's Version (#35172 ) With this change, `Version` no longer carries information about the qualifier, we still need a way to show the "display version" that does have both qualifier and snapshot. This is now stored by the build and red from `META-INF`.	2018-11-07 14:01:05 +02:00
Tim Brooks	f395b1eace	Open node connections asynchronously (#35144 ) This is related to #29023. Additionally at other points we have discussed a preference for removing the need to unnecessarily block threads for opening new node connections. This commit lays the groudwork for this by opening connections asynchronously at the transport level. We still block, however, this work will make it possible to eventually remove all blocking on new connections out of the TransportService and Transport.	2018-11-06 17:58:20 -07:00
David Turner	7e356ac29b	[Zen2] Introduce auto_shrink_voting_configuration setting (#35217 ) Today we allow the user to set the minimum size of a voting configuration. On reflection we would rather this was simply '3' where possible, and we can use the retirement API to control the removal of nodes more explicitly. This change replaces the old reconfigurator setting with a new one, `cluster.auto_shrink_voting_configuration`, which determines whether Elasticsearch should automatically remove nodes from the voting configuration or not.	2018-11-06 18:10:29 +00:00
Nick Knize	a5e1f4d3a2	Upgrade to lucene-8.0.0-snapshot-31d7dfe6b1 (#35224 )	2018-11-06 11:55:23 +01:00
David Turner	2fb3d1a465	[Zen2] Fix some rarely-failing tests (#35198 ) Recent changes have left a few Zen2 tests occasionally failing. This commit fixes them.	2018-11-05 21:54:53 +00:00
Boaz Leskes	28078642b3	Engine.newChangesSnapshot may cause unneeded refreshes if called concurrently (#35169 ) When the engine is asked for historical operations, we check if some of the requested operations are not yet refreshed and if so we refresh before returning the operations. The refresh check is based on capturing the local checkpoint before each refresh and comparing that value to the one requested when `newChangesSnapshot` was called. If the requested range is above the captured local checkpoint we issue a refresh. This can currently cause unneeded extra refreshes if the method is called concurrently which may cause unwanted degradation in indexing performance. This is especially relevant for CCR where we always ask for a range below the global checkpoint. That range is guaranteed to be below the local checkpoint of the shard and one refresh is enough to serve multiple changes requests. This commit fixes this by introducing a dedicated mutex to make sure the test for whether a refresh is needed actually wait for concurrents for concurrent refreshes that were caused by another change refresh. Note that this is not a big change in semantics as refreshes are serialized by lucene anyway. I also opted not to keep the synchronization to the changes snapshot request only even if in theory we can apply it to all refreshes, not matter where they come from.	2018-11-04 13:43:33 +01:00
Nhat Nguyen	855ab3fa1e	Add equals/hashCode to SeqNoStats (#35223 ) This commit adds equals/hashCode to SeqNoStats so we can verify it wholly in tests.	2018-11-02 21:31:36 -04:00
Tim Brooks	0166388d74	Use single netty event loop group for transports (#35181 ) Currently we create a new netty event loop group for client connections and all server profiles. Each new group creates new threads for io processing. This means 2 * num of processors new threads for each group. A single group should be able to handle all io processing (for the transports). This also brings the netty module inline with what we do for nio. Additionally, this PR renames the worker threads to be the same for netty and nio.	2018-11-02 16:31:19 -06:00
Colin Goodheart-Smithe	fc6e1f7f3f	Merge branch 'master' into index-lifecycle	2018-11-02 10:56:35 +00:00
Alpar Torok	f22700812e	Introduce build qualifier parameter (#35155 ) * Introduce property to set version qualifier - VersionProperties.elasticsearch is now a string which can have qualifier and snapshot too - The Version class in the build no longer cares about snapshot and qualifier.	2018-11-02 05:27:40 +02:00
Julie Tibshirani	746d94e299	Unmute AbstractQueryTestCase#testToQuery. The RangeQueryBuilderTests#testToQuery failures were fixed in #34868 and #35145.	2018-11-01 12:06:36 -07:00
Tal Levy	c3cf7dd305	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 10:13:02 -07:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Igor Motov	b5e5e93c46	Fixes randomDateTimeZone method (#35145 ) The randomDateTimeZone method shouldn't return deprecated timezones this causes some tests to fail with deprecation warning.	2018-10-31 20:32:18 -04:00
Seong-hyun, Oh	9ef4788c13	Make XContentBuilder in AliasActions build `is_write_index` field (#35071 ) Make XContentBuilder in AliasesActions build `is_write_index` field	2018-10-31 14:15:46 -07:00
Armin Braun	e6f9f0666e	NETWORKING: MockTransportService Wait for Close (#35038 ) * NETWORKING: MockTransportService Wait for Close * Make `MockTransportService` wait `30s` for close listeners to run before failing the assertion * Closes #34990	2018-10-31 21:33:49 +01:00
David Turner	0072c90e2a	Pre-populate unicast hosts files (#35136 ) Today when ESIntegTestCase starts some nodes it writes out the unicast hosts files each time a node starts its transport service. This does mean that a number of nodes can start and perform their first pinging round without any unicast hosts which, if the timing is unlucky and a lot of nodes are all started at the same time, can lead to a split brain as in #35052. Prior to #33554 this was unlikely to happen since the MockUncasedHostsProvider would always have yielded the existing hosts, so the timing would have to have been implausibly unlucky. Since #33554, however, it's more likely because the race occurs between the start of the first round of pinging and the writing of the unicast hosts file. It is realistic that new nodes will be configured with the existing nodes from startup, so this change reinstates that behaviour. Closes #35052.	2018-10-31 19:21:24 +00:00
Tal Levy	d5d28420b6	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-31 10:47:07 -07:00
Luca Cavanna	674225aaa1	[TEST] Enforce skip headers when needed (#34735 ) The java yaml test runner supports sending request headers, yet not all clients support headers. This commit makes sure that we enforce adding a skip section with feature "headers" whenever headers are used in a do section as part of a test. That decreases the chance for new tests to break client builds due to the missing skip section. Closes #34650	2018-10-31 13:07:02 +01:00
Tal Levy	5141084048	rename CRUD api REST path prefix _ilm to _ilm/policy (#35056 ) This PR renames the CRUD APIS for ILM GET _ilm/<policy>, _ilm -> _ilm/policy/<policy>, _ilm/policy PUT _ilm/<policy> -> _ilm/policy/<policy> DELETE _ilm/<policy> -> _ilm/policy/<policy> closes #34929.	2018-10-30 16:19:05 -07:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00

1 2 3 4 5 ...

1964 Commits