OpenSearch

Commit Graph

Author	SHA1	Message	Date
Yannick Welsch	d5139e0590	Only bootstrap and elect node in current voting configuration (#37712 ) Adapts bootstrapping and leader election to only trigger on nodes that are actually part of the voting configuration.	2019-01-23 13:10:11 +01:00
Simon Willnauer	4ec3a6d922	Ensure either success or failure path for SearchOperationListener is called (#37467 ) Today we have several implementations of executing SearchOperationListener in SearchService. While all of them seem to be safe at least on, the one that executes scroll searches can cause illegal execution of SearchOperationListener that can then in-turn trigger assertions in ShardSearchStats. This change adds a SearchOperationListenerExecutor that uses try-with blocks to ensure listeners are called in a safe way. Relates to #37185	2019-01-23 12:38:44 +01:00
Tanguy Leroux	6130d15172	Adapt SyncedFlushService (#37691 )	2019-01-23 11:08:54 +01:00
Alexander Reelsen	701d89caa2	Mute FilterAggregatorTests#testRandom Relates #37743	2019-01-23 11:00:37 +01:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Boaz Leskes	52ba407931	Expose sequence number and primary terms in search responses (#37639 ) Users may require the sequence number and primary terms to perform optimistic concurrency control operations. Currently, you can get the sequence number via the `docvalues_fields` API but the primary term is not accessible because it is maintained by the `SeqNoFieldMapper` and the infrastructure can't find it. This commit adds a dedicated sub fetch phase to return both numbers that is connected to a new `seq_no_primary_term` parameter.	2019-01-23 09:01:58 +01:00
Andrey Ershov	7c6566e14c	Migrate SpecificMasterNodesIT to Zen2 (#37532 ) 1. testSimpleOnlyMasterNodeElection - requires cluster bootstrap when the first master node is started. 2. testElectOnlyBetweenMasterNodes - requires cluster bootstrap when the first master node is started and requires adding voting exclusion before shutting down the first master node. 3. testAliasFilterValidation - requires cluster bootstrap when the first master node is started.	2019-01-23 07:22:41 +01:00
Andrey Ershov	e2e00cd245	Fix MetaStateFormat tests It's not safe to continue writing state using MetaDataStateFormat after dirty WriteStateException occurred if it's not recovered by successful subsequent state write. We've encountered test failure of testFailRandomlyAndReadAnyState. The test breaks in the following way. There are 3 state paths. And what happens next Successful write at the beginning of the test yields 0 0 0 state files in the directories. 1st write in the loop is unsuccessful, but not dirty - 0 0 0. 2nd write in the loop is not successful and dirty (failure during fsync), however before removing new files we have 1 1 1. But now during deletion, the first deletion fails and we get - 1 0 0. 3rd write in the loop is unsuccessful, but not dirty - so we want to keep old generation, which happens to be the 1st generation, so now we have 1 x x in state folders. Now we assert that we either load 0 or 1 state from the state folders and select only 2rd and 3th folder to emulate disk failures - this results in NPE because there is nothing in these folders. Fortunately, this won’t be a problem in real life, because if there is a dirty exception, we shut down the node and make sure we perform a successful write on the node startup.	2019-01-23 07:21:26 +01:00
Zachary Tong	2ba9e361ab	Add helper classes to determine if aggs have a value (#36020 ) This adds a set of helper classes to determine if an agg "has a value". This is needed because InternalAggs represent "empty" in different manners according to convention. Some use `NaN`, `+/- Inf`, `0.0`, etc. A user can pass the Internal agg type to one of these helper methods and it will report if the agg contains a value or not, which allows the user to differentiate "empty" from a real `NaN`. These helpers are best-effort in some cases. For example, several pipeline aggs share a single return class but use different conventions to mark "empty", so the helper uses the loosest definition that applies to all the aggs that use the class. Sums in particular are unreliable. The InternalSum simply returns 0.0 if the agg is empty (which is correct, no values == sum of zero). But this also means the helper cannot differentiate from "empty" and `+1 + -1`.	2019-01-22 12:38:55 -05:00
Jason Tedor	715719ee3b	Remove warn-date from warning headers (#37622 ) This commit removes the warn-date from warning headers. Previously we were stamping every warning header with when the request occurred. However, this has a severe performance penalty when deprecation logging is called frequently, as obtaining the current time and formatting it properly is expensive. A previous change moved to using the startup time as the time to stamp on every warning header, but this was only to prove that the timestamping was expensive. Since the warn-date is optional, we elect to remove it from the warning header. Prior to this commit, we worked in Kibana to make the warn-date treated as optional there so that we can follow-up in Elasticsearch and remove the warn-date. This commit does that.	2019-01-22 12:29:24 -05:00
Yannick Welsch	23ba900840	Publish to masters first (#37673 ) Prefer publishing to master-eligible nodes first, so that cluster state updates are committed more quickly, and master-eligible nodes also turned more quickly into followers after a leader election.	2019-01-22 13:53:10 +01:00
David Kyle	3fad1eeaed	Un-assign persistent tasks as nodes exit the cluster (#37656 ) PersistentTasksClusterService decides if a task should be reassigned by checking there is a node in the cluster with the same Id. If a node is restarted PersistentTasksClusterService may not observe the change and decide the task still has a valid assignment because the node's ephemeral Id is not used in that decision. This change un-assigns tasks as the nodes in the cluster change.	2019-01-22 12:44:45 +00:00
Henning Andersen	228611843c	Fail start of non-data node if node has data (#37347 ) * Fail start of non-data node if node has data Check that nodes started with node.data=false cannot start if they have shard data to avoid (old) indexes being resurrected into the cluster in red status. Issue #27073	2019-01-22 13:27:12 +01:00
Yannick Welsch	2a7b7ccf1c	Use cancel instead of timeout for aborting publications (#37670 ) When publications were cancelled because a node turned to follower or candidate, it would still show as time out, which can be confusing in the logs. This change adapts the improper call of onTimeout by generalizing it to a cancel method.	2019-01-22 12:51:03 +01:00
Christoph Büscher	0a93a0358b	Remove deprecated FieldNamesFieldMapper.Builder#index (#37305 ) The method calls "enabled" in addition to what the super.index() does, but this seems to be done explicitely now in the TypeParsers `parse` method. The removed method has been deprecated since at least 6.0. Also making some of the Builders methods and ctos private since they are only used internally in this class.	2019-01-22 12:12:21 +01:00
David Turner	5db7ed22a0	Bootstrap a Zen2 cluster once quorum is discovered (#37463 ) Today when bootstrapping a Zen2 cluster we wait for every node in the `initial_master_nodes` setting to be discovered, so that we can map the node names or addresses in the `initial_master_nodes` list to their IDs for inclusion in the initial voting configuration. This means that if any of the expected master-eligible nodes fails to start then bootstrapping will not occur and the cluster will not form. This is not ideal, and we would prefer the cluster to bootstrap even if some of the master-eligible nodes do not start. Safe bootstrapping requires that all pairs of quorums of all initial configurations overlap, and this is particularly troublesome to ensure given that nodes may be concurrently and independently attempting to bootstrap the cluster. The solution is to bootstrap using an initial configuration whose size matches the size of the expected set of master-eligible nodes, but with the unknown IDs replaced by "placeholder" IDs that can never belong to any node. Any quorum of received votes in any of these placeholder-laden initial configurations is also a quorum of the "true" initial set of master-eligible nodes, giving the guarantee that it intersects all other quorums as required. Note that this change means that the initial configuration is not necessarily robust to any node failures. Normally the cluster will form and then auto-reconfigure to a more robust configuration in which the placeholder IDs are replaced by the IDs of genuine nodes as they join the cluster; however if a node fails between bootstrapping and this auto-reconfiguration then the cluster may become unavailable. This we feel to be less likely than a node failing to start at all. This commit also enormously simplifies the cluster bootstrapping process. Today, the cluster bootstrapping process involves two (local) transport actions in order to support a flexible bootstrapping API and to make it easily accessible to plugins. However this flexibility is not required for the current design so it is adding a good deal of unnecessary complexity. Here we remove this complexity in favour of a much simpler ClusterBootstrapService implementation that does all the work itself.	2019-01-22 11:03:51 +00:00
Adrien Grand	e9fcb25a28	Upgrade to lucene-8.0.0-snapshot-83f9835. (#37668 ) This snapshot uses a new file format for doc-values which is expected to make advance/advanceExact perform faster on sparse fields: https://issues.apache.org/jira/browse/LUCENE-8585	2019-01-22 11:44:29 +01:00
Alpar Torok	74d1cfbf7e	Mute failing test Tracking ##37687	2019-01-22 10:50:27 +02:00
Alexander Reelsen	4fb68ea195	Fix java time formatters that round up (#37604 ) In order to be able to parse epoch seconds and epoch milli seconds own java time fields had been introduced. These fields are however not compatible with the way that java time allows one to configure default fields (when a part of a timestamp cannot be read then a default value is added), which is used for the formatters that are rounding up to the next value. This commit allows java date formatters to configure its round up parsing by setting default values via a consumer. By default all formats are setting JavaDateFormatter.ROUND_UP_BASE_FIELDS for rounding up. The epoch however parsers both need to set different fields. The merged date formatters do not set any fields, they just append all the round up formatters. Also the formatter now properly copies the locale and the timezone, fractional parsing has been set to nano seconds with proper width.	2019-01-22 09:42:17 +01:00
Alpar Torok	17d704347e	Mute failing test Tracking #37685	2019-01-22 10:31:23 +02:00
Tanguy Leroux	0290547ad7	Ensure that max seq # is equal to the global checkpoint when creating ReadOnlyEngines (#37426 ) Since version 6.7.0 the Close Index API guarantees that all translog operations have been correctly flushed before the index is closed. If the index is reopened as a Frozen index (which uses a ReadOnlyEngine) we can verify that the maximum sequence number from the last Lucene commit is indeed equal to the last known global checkpoint and refuses to open the read only engine if it's not the case. In this PR the check is only done for indices created on or after 6.7.0 as they are guaranteed to be closed using the new Close Index API. Related #33888	2019-01-22 09:22:33 +01:00
Alpar Torok	a713183cab	Mute failing discovery disruption tests Tracking #37539	2019-01-22 10:16:04 +02:00
Nhat Nguyen	7394892b4c	Make prepare engine step of recovery source non-blocking (#37573 ) Relates #37174	2019-01-21 21:35:10 -05:00
Tim Brooks	21838d73b5	Extract message serialization from `TcpTransport` (#37034 ) This commit introduces a NetworkMessage class. This class has two subclasses - InboundMessage and OutboundMessage. These messages can be serialized and deserialized independent of the transport. This allows more granular testing. Additionally, the serialization mechanism is now a simple Supplier. This builds the framework to eventually move the serialization of transport messages to the network thread. This is the one serialization component that is not currently performed on the network thread (transport deserialization and http serialization and deserialization are all on the network thread).	2019-01-21 14:14:18 -07:00
Tim Brooks	f516d68fb2	Share `NioGroup` between http and transport impls (#37396 ) Currently we create dedicated network threads for both the http and transport implementations. Since these these threads should never perform blocking operations, these threads could be shared. This commit modifies the nio-transport to have 0 http workers be default. If the default configs are used, this will cause the http transport to be run on the transport worker threads. The http worker setting will still exist in case the user would like to configure dedicated workers. Additionally, this commmit deletes dedicated acceptor threads. We have never had these for the netty transport and they can be added back if a need is determined in the future.	2019-01-21 13:50:56 -07:00
Armin Braun	3a3f5b39c3	Fix Race in Concurrent Snapshot Delete and Create (#37612 ) * The repo id was determined wrong when the delete picked up on an in progress snapshot * NOTE: This solution is still a best-effort fix and there's a slight chance of running into concurrency issues here when multiple create and delete requests for the same snapshot name are happening concurrently, but these require a sequence of multiple cluster state updates between the changed method reading the genId and submitting its cluster state update task * Added test reproduced the issue reliably in about 50% of runs * Closes #37581	2019-01-21 13:10:33 +01:00
Luca Cavanna	09a6ba50ef	Add support for merging multiple search responses into one (#37566 ) This will be used in cross-cluster search when reduction will be performed locally on each cluster. The CCS coordinating node will send one search request per remote cluster involved and will get one search response back from each one of them. Such responses contain all the info to be able to perform an additional reduction and return results back to the user. Relates to #32125	2019-01-21 11:51:47 +01:00
Jason Tedor	adae233f77	Add some deprecation optimizations (#37597 ) This commit optimizes some of the performance issues from using deprecation logging: - we optimize encoding the deprecation value - we optimize formatting the deprecation string - we optimize away getting the current time (by using cached startup time)	2019-01-18 16:42:25 -05:00
Tal Levy	106f900dfb	refactor inner geogrid classes to own class files (#37596 ) To make further refactoring of GeoGrid aggregations easier (related: #30320), splitting out these inner class dependencies into their own files makes it easier to map the relationship between classes	2019-01-18 13:40:00 -08:00
Julie Tibshirani	8da7a27f3b	Deprecate types in the put mapping API. (#37280 ) From #29453 and #37285, the `include_type_name` parameter was already present and defaulted to false. This PR makes the following updates: - Add deprecation warnings to `RestPutMappingAction`, plus tests in `RestPutMappingActionTests`. - Add a typeless 'put mappings' method to the Java HLRC, and deprecate the old typed version. To do this cleanly, I opted to create a new `PutMappingRequest` object that differs from the existing server one.	2019-01-18 12:28:31 -08:00
Jack Conradson	de55b4dfd1	Add types deprecation to script contexts (#37554 ) This adds deprecation to _type in the script contexts for ingest and update. This adds a DeprecationMap that wraps the ctx Map containing _type for these specific contexts.	2019-01-18 09:13:49 -08:00
Yannick Welsch	377d96e376	Remove initial_master_nodes on node restart (#37580 ) Some tests (e.g. testRestoreIndexWithShardsMissingInLocalGateway) were split-braining since being switched to Zen2 because the bootstrap setting was left around when nodes got restarted with data folders wiped. The test in question here was starting one node (which autobootstrapped to that single node), then another node. The first node was then shut down (after excluding it from the voting configuration), its data folder wiped, and restarted. After restart, the node had an empty data folder yet initial_master_nodes set to itself (i.e. same name). This made the node sometimes form a cluster of its own, and not rejoin the existing cluster with the other node.	2019-01-18 16:36:42 +01:00
Jason Tedor	ed297b7369	Only update response headers if we have a new one (#37590 ) Currently when adding a response header, we do some de-duplication, and maybe drop the header on the floor if we have reached capacity. Yet, we still update the thread local tracking the response headers. This is really expensive because under the hood there is a shared reference that we synchronize on. In the case of a request processed across many shards in a tight loop, this contention can be detrimental to performance. We can avoid updating the thread local in these cases though, when the response header is duplicate of one that we have already seen, or when it's dropped on the floor. This commit addresses these performance issues by avoiding the unnecessary set.	2019-01-18 08:20:05 -05:00
Tanguy Leroux	29d3a708da	Fix BulkWithUpdatesIT and CloseIndexIT As of today the Close Index API does its best to close indices, but closing an index with ongoing recoveries might or might not be acknowledged depending of the values of the max seq number and global checkpoint at the time the TransportVerifyShardBeforeClose action is executed. These tests failed because they always expect that the index is correctly closed on the first try, which is not always the case. Instead we need to retry the closing until it succeed. Closes #37571	2019-01-18 10:54:35 +01:00
David Turner	65e76b3f6f	Migrate RecoveryFromGatewayIT to Zen2 (#37520 ) * Fixes `testTwoNodeFirstNodeCleared` by manipulating voting config exclusions. * Removes `testRecoveryDifferentNodeOrderStartup` since state recovery is now handled entirely on the elected master, so the order in which the data nodes start is irrelevant.	2019-01-18 09:15:51 +00:00
David Turner	699d881739	Migrate IndicesExistsIT to Zen2 (#37526 ) This test was actually passing, for the wrong reason: it asserts a `MasterNotDiscoveredException` is thrown, expecting this to be due to a failure to perform state recovery, but in fact it's thrown because the node is not correctly bootstrapped.	2019-01-18 09:15:30 +00:00
Christoph Büscher	2f0e0b2426	Allow indices.get_mapping response parsing without types (#37492 ) This change adds deprecation warning to the indices.get_mapping API in case the "inlcude_type_name" parameter is set to "true" and changes the parsing code in GetMappingsResponse to parse the type-less response instead of the one containing types. As a consequence the HLRC client doesn't need to force "include_type_name=true" any more and the GetMappingsResponseTests can be adapted to the new format as well. Also removing some "include_type_name" parameters in yaml test and docs where not necessary.	2019-01-18 09:33:36 +01:00
Armin Braun	62ddc8c776	Reenable UnicastZenPingTests#testSimplePings * This was muted needlessly, the problem in #26701 only applies to `6.x` * Relates #26701	2019-01-18 08:36:22 +01:00
Tim Brooks	b6f06a48c0	Implement follower rate limiting for file restore (#37449 ) This is related to #35975. This commit implements rate limiting on the follower side using a new class `CombinedRateLimiter`.	2019-01-17 14:58:46 -07:00
Armin Braun	381d035cd6	Remove Redundant RestoreRequest Class (#37535 ) * Same as #37464 but for the restore side	2019-01-17 22:23:23 +01:00
Tal Levy	a0c504e4a3	Create specific exception for when snapshots are in progress (#37550 ) delete and close index actions threw IllegalArgumentExceptions when attempting to run against an index that has a snapshot in progress. This change introduces a dedicated SnapshotInProgressException for these scenarios. This is done to explicitly signal to clients that this is the reason the action failed, and it is a retryable error. relates to #37541.	2019-01-17 13:21:12 -08:00
James Baiera	5782a5bbbc	Mute UnicastZenPingTests#testSimplePings relates #26701	2019-01-17 15:13:09 -05:00
Yannick Welsch	68de2edb14	Fix assertion at end of forceRefreshes (#37559 ) This commit ensures that we only change refreshListeners to a list if we're actually adding something to the list.	2019-01-17 19:18:47 +01:00
Yannick Welsch	6d64a2a901	Propagate Errors in executors to uncaught exception handler (#36137 ) This is a continuation of #28667 and has as goal to convert all executors to propagate errors to the uncaught exception handler. Notable missing ones were the direct executor and the scheduler. This commit also makes it the property of the executor, not the runnable, to ensure this property. A big part of this commit also consists of vastly improving the test coverage in this area.	2019-01-17 17:46:35 +01:00
Nhat Nguyen	20ed3dd1a8	Make recovery source send operations non-blocking (#37503 ) Relates #37458	2019-01-17 09:59:05 -05:00
Jim Ferenczi	4351a5e537	Allow field types to optimize phrase prefix queries (#37436 ) This change adds a way to customize how phrase prefix queries should be created on field types. The match phrase prefix query is exposed in field types in order to allow optimizations based on the options set on the field. For instance the text field uses the configured prefix field (if available) to build a span near that mixes the original field and the prefix field on the last position. This change also contains a small refactoring of the match/multi_match query that simplifies the interactions between the builders. Closes #31921	2019-01-17 15:10:28 +01:00
Yannick Welsch	d9fa4e4ada	Fix testRelocateWhileContinuouslyIndexingAndWaitingForRefresh (#37560 ) This test failed because the refresh at the end of the test is not guaranteed to run before the indexing is completed, and therefore there's no guarantee that the refresh will free all operations. This triggers an assertion failure in the test clean-up, which asserts that there are no more pending operations.	2019-01-17 13:59:09 +01:00
Yannick Welsch	6fe2d6da03	Mute TransportClientNodesServiceTests#testListenerFailures Relates to #37567	2019-01-17 13:54:48 +01:00
Martijn van Groningen	da799306a8	Decreased time out in test Relates to #37378	2019-01-17 11:51:17 +01:00
Torgeir Thoresen	676e1b1a13	Fix erroneous docstrings for abstract bulk by scroll request (#37517 )	2019-01-17 10:22:49 +01:00
Przemyslaw Gomulka	b6e5ccaf8a	Remove the AbstracLifecycleComponent constructor with Settings (#37523 ) Adding the migration guide and removing the deprecated in 6.x constructor relates #35560 relates #34488	2019-01-17 09:10:09 +01:00
Jason Tedor	18a3e48a4a	Change file descriptor limit to 65535 (#37537 ) Some systems default to a nofile ulimit of 65535. To reduce the pain of deploying Elasticsearch to such systems, this commit lowers the required limit from 65536 to 65535.	2019-01-16 17:19:12 -05:00
Nhat Nguyen	655103de58	Increase timeout for testAddNewReplicas We flush quite often in testAddNewReplicas to create the safe index commit with gaps in sequence numbers. This test is failing recently because CI is too slow to complete 5 small flushes in 10 seconds. This commit increases timeout for this test and also ensures to always terminate the background indexing. The latter is to eliminate unrelated failures if this test fails again. Closes #37183	2019-01-16 13:17:10 -05:00
Andrey Ershov	4e72f3c5c6	DedicatedClusterSnapshotRestoreIT to Zen2 (#37489 ) All tests except testRestorePersistentSettings (renamed to testExceptionWhenRestoringPersistentSettings) worked fine. testExceptionWhenRestoringPersistentSettings re-written to use a custom setting, because "minimum master node" setting is no longer available in Zen2. It turns out there is no good replacement for "minimum master node" setting for this test, that's why the custom setting is introduced. Unfortunately, there is #37485 bug and currently RestoreService does not perform setting validation. That's why the test is annotated with @AwaitsFix, the idea is to merge this commit and then fix the issue and enable the test. (The test passes with a simple fix, that adds a single line to RestoreService).	2019-01-16 11:14:16 -05:00
Jack Conradson	3d8c04659c	Deprecate _type from LeafDocLookup (#37491 ) * Deprecate _type from LeafDocLookup * Response to PR comments. * Response to PR comments.	2019-01-16 07:05:09 -08:00
Tim Brooks	0b5af276a8	Allow system privilege to execute proxied actions (#37508 ) Currently all proxied actions are denied for the `SystemPrivilege`. Unfortunately, there are use cases (CCR) where we would like to proxy actions to a remote node that are normally performed by the system context. This commit allows the system context to perform proxy actions if they are actions that the system context is normally allowed to execute.	2019-01-16 07:52:38 -07:00
Nhat Nguyen	0160ba2539	AwaitsFix testAddNewReplicas Tracked at #37183	2019-01-16 09:48:35 -05:00
Adrien Grand	9d8afe68a5	IndexMetaData#mappingOrDefault doesn't need to take a type argument. (#37480 ) Currently it takes a type, but this isn't really needed now that indices can have at most one type. The only downside is that we might return a different error when trying to index into a type that doesnt't exist yet.	2019-01-16 14:01:09 +01:00
Armin Braun	21a88d5505	Simplify + Cleanup Dead Code in Settings (#37341 ) * Remove dead code * Simplify some overly complex code, this class is long enough already	2019-01-16 13:57:16 +01:00
Jason Tedor	687978b7d1	Reject all requests that have an unconsumed body (#37504 ) This commit removes some leniency from REST handling where we move to reject all requests that have a body where the body is not used during the course of handling the request. For example, DELETE /index { "query" : { "term" : { "field" : "value" } } } is now rejected.	2019-01-16 07:29:25 -05:00
Dimitrios Liappis	347cbaf0ed	Fix line length for aliases and remove suppression (#37455 ) Relates #34884	2019-01-16 13:06:29 +02:00
Armin Braun	5a5e44d1de	Simplify Snapshot Create Request Handling (#37464 ) * The internal create request is absolutely redundant, the only difference to the transport request is that we resolved the snapshot name when moving from the transport to the internal version * Removed it and passed the transport request into the snapshot service instead * nicer way of resolve snapshot name in callback	2019-01-16 11:08:48 +01:00
Przemyslaw Gomulka	5e94f384c4	Remove the use of AbstracLifecycleComponent constructor #37488 (#37488 ) The AbstracLifecycleComponent used to extend AbstractComponent, so it had to pass settings to the constractor of its supper class. It no longer extends the AbstractComponent so there is no need for this constructor There is also no need for AbstracLifecycleComponent subclasses to have Settings in their constructors if they were only passing it over to super constructor. This is part 1. which will be backported to 6.x with a migration guide/deprecation log. part 2 will have this constructor removed in 7 relates #35560 relates #34488	2019-01-16 09:05:30 +01:00
Julie Tibshirani	0a3bff2ca9	Only log one types warning per bulk search request. (#37446 )	2019-01-15 12:38:32 -08:00
Andrey Ershov	42fd68ed38	Use GatewayMetaState in CoordinatorTests rarely (#36897 ) This commit adds one more underlying implementation of MockPersistedState. Previously only InMemoryPersistentState was used, not GatewayMetaState is used rarely. When adding GatewayMetaState support the main question was: do we want to emulate exceptions as we do today in MockPersistedState before delegating to GatewayMetaState or do we want these exceptions to propagate from the lower level, i.e. file system exceptions? On the one hand, lower level exception propagation is already tested in GatewayMetaStateTests, so this won't improve the coverage. On the other hand, the benefit of low-level exceptions is to see how all these components work in conjunction. Finally, we abandoned the idea of low-level exceptions because we don't have a way to deal with IOError today in CoordinatorTests, but hacking GatewayMetaState not to throw IOError seems unnatural. So MockPersistedState rarely throws an exception before delegating to GatewayMetaState, which is not supposed to throw the exception. This commit required two changes: Move GatewayMetaStateUT to upper-level from GatewayMetaStatePersistedStateTests, because otherwise, it's not easy to construct GatewayMetaState instance in CoordinatorTests. Move addition of STATE_NOT_RECOVERED_BLOCK from GatewayMetaState constructor to GatewayMetaState.applyClusterUpdaters, because CoordinatorTests class assumes that there is no such block and most of them fail.	2019-01-15 13:33:25 -05:00
Jim Ferenczi	f8d80dff7c	Fix duplicate removal when merging completion suggestions (#36996 ) The completion suggester ignores the original weight of the suggestion when duplicates are removed. This change fixes this bug and keeps the best weighted suggestion among the duplicates. It also removes the custom implementation of the top docs suggest collector now that https://issues.apache.org/jira/browse/LUCENE-8529 is committed in Lucene. Closes #35836	2019-01-15 19:27:31 +01:00
Nhat Nguyen	6647122f1c	Prepare to make send translog of recovery non-blocking (#37458 ) This commit prepares the required infra to make send a translog snapshot of the recovery source non-blocking. I'll make a follow-up to make the send snapshot method non-blocking. Relates #37291	2019-01-15 13:17:25 -05:00
Andrey Ershov	02d4d8b409	MinimumMasterNodesIT changed for Zen2 (#37428 ) There were 5 tests in MinimumMasterNodesIT. 2 of them removed, 3 of them changed and renamed. 1) testSimpleMinimumMasterNodes -> testTwoNodesNoMasterBlock. The flow of this test is left intact but in order to make it work on Zen2, additional work for the cluster bootstrapping and voting exclusions is needed. 2) testDynamicUpdateMinimumMasterNodes -> removed, there is nothing that corresponds to the dynamic change of the minimum master nodes setting. 3) testCanNotBringClusterDown -> removed, it also plays with changing minimum master nodes dynamically. 4) testMultipleNodesShutdownNonMasterNodes -> testThreeNodesNoMasterBlock. Previously this test was checking that there would be no master block, if min_master_nodes=3 and 4 nodes are started, then 2 nodes are brought down. Zen2 dynamically accommodates to the number of nodes in the cluster, so it's possible that there still will be a master in 2 nodes cluster. For Zen2, we start up 3 nodes. And shut down 2 of them (w/o voting exclusions), which results in no master block. 5) testCanNotPublishWithoutMinMastNodes -> testCanNotCommitStateThreeNodes. Test flow is not changed. But previously there was no check that nodes in the bigger part of network partition will elect the master, before healing the network partition. For Zen2 it does not work, because persistent setting addition is accepted on the old master and if it's elected new master again, this setting will appear in the cluster state. Also, I have a feeling that we need to remove this class, but could not come up with a good name.	2019-01-15 13:09:48 -05:00
Dimitrios Liappis	63793499bd	Fix line length for `node` and remove suppresion (#37454 ) Relates #34884	2019-01-15 19:57:24 +02:00
David Turner	a2a40c50a0	Report terms and version if cluster does not form (#37473 ) Adds the node's current term and the term and version of the the last-accepted cluster state to the message reported by the `ClusterFormationFailureHelper`, since these values may be of importance when tracking down a cluster formation failure.	2019-01-15 17:32:08 +00:00
Nhat Nguyen	68e2d36fa3	Adjust bwc version for max_concurrent_file_chunks Relates #36981	2019-01-15 11:18:55 -05:00
Luca Cavanna	0b396a0c5e	Restore assertion on discount overlaps in SimilarityTests This assertion was commented out as the getDiscountOverlaps getter was missing from LegacyBm25Similarity. That has been fixed in lucene.	2019-01-15 16:43:09 +01:00
Julie Tibshirani	1a1dbf705f	Make sure to use the resolved type in DocumentMapperService#extractMappings. (#37451 ) * Pull out a shared method MapperService#resolveDocumentType. * Make sure to resolve the type when extracting the mappings. Addresses #36811.	2019-01-15 07:32:47 -08:00
Fabricio Archanjo Fonseca	3cc8f39532	New mapping signature and mapping string source fixed. (#37401 ) * New mapping signature and mapping string source fixed. * Keep compatibility with CreateIndexRequest class.	2019-01-15 08:06:32 -07:00
David Roberts	7cdf7f882b	[ML] Fix ML datafeed CCS with wildcarded cluster name (#37470 ) The test that remote clusters used by ML datafeeds have a license that allows ML was not accounting for the possibility that the remote cluster name could be wildcarded. This change fixes that omission. Fixes #36228	2019-01-15 14:19:05 +00:00
Dimitrios Liappis	19fc59f089	Fix line length for monitor and remove suppressions (#37456 ) Relates #34884	2019-01-15 14:18:15 +02:00
Simon Willnauer	147c5e65d3	Remove dead code from ShardSearchStats (#37421 ) The clear methodsa are unused and unsafe at this point. This commit removes the dead code.	2019-01-15 09:39:53 +01:00
Nhat Nguyen	bf49f54456	Simplify testSendSnapshotSendsOps (#37445 ) The test testSendSnapshotSendsOps is currently using a mock instance of RecoveryTargetHandler which will be hard to modify when we make the RecoveryTargetHandler non-blocking. This commit prepares for the incoming changes by replacing the mock instance with a stub.	2019-01-15 03:07:56 -05:00
Tim Vernum	b97245cfcd	Restore lost @Inject annotation (#37452 ) The Inject Annotation was removed from IndicesClusterStateService as part of reformatting in `e11a32e`, but this causes CreationException on cluster startup.	2019-01-15 18:20:22 +11:00
Jason Tedor	43bfdd32ee	Add run under primary permit method (#37440 ) This commit adds a simple method for executing a runnable against a shard under a primary permit. Today there is only a single caller for this method, but this there are two upcoming use-cases for which having this method will help keep the code simpler.	2019-01-14 21:54:42 -05:00
Jason Tedor	e11a32eda8	Reformat some classes in the index universe This commit reformats some classes in the index universe with the purpose of breaking some long method definitions and invocations into a line per parameter. This has the advantage that for an upcoming change to these definitions and invocations, the diff for that change will be a single line per definition or invocation. That makes these sorts of changes easier to read.	2019-01-14 21:45:24 -05:00
Jason Tedor	3bc0711b90	Add simple method to write collection of writeables (#37448 ) This commit adds a simple convenience method for writing a collection of writeables, and replaces existing call sites with the new method.	2019-01-14 21:28:28 -05:00
Jason Tedor	eb86b9f284	Fix retention lease commit test This commit fixes an issue with testing committed retention leases when they are not any retention leases (a deliberate edge case). Closes #37420	2019-01-14 21:16:49 -05:00
Jason Tedor	74640d0ba7	Introduce retention lease serialization (#37447 ) This commit is a simple introduction of the serialization of retention leases, which will be needed when they are sent across the wire while synchronizing retention leases to replicas.	2019-01-14 21:06:44 -05:00
Nhat Nguyen	397f315f56	Make finalize step of recovery source non-blocking (#37388 ) Relates #37291	2019-01-14 18:20:54 -05:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Nhat Nguyen	15aa3764a4	Reduce recovery time with compress or secure transport (#36981 ) Today file-chunks are sent sequentially one by one in peer-recovery. This is a correct choice since the implementation is straightforward and recovery is network bound in most of the time. However, if the connection is encrypted, we might not be able to saturate the network pipe because encrypting/decrypting are cpu bound rather than network-bound. With this commit, a source node can send multiple (default to 2) file-chunks without waiting for the acknowledgments from the target. Below are the benchmark results for PMC and NYC_taxis. - PMC (20.2 GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| -------- \| -------- \| -------- \| -------- \| \| Plain \| 184s \| 137s \| 106s \| 105s \| 106s \| \| TLS \| 346s \| 294s \| 176s \| 153s \| 117s \| \| Compress \| 1556s \| 1407s \| 1193s \| 1183s \| 1211s \| - NYC_Taxis (38.6GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| ---------\| ---------\| ---------\| -------- \| \| Plain \| 321s \| 249s \| 191s \| * \| * \| \| TLS \| 618s \| 539s \| 323s \| 290s \| 213s \| \| Compress \| 2622s \| 2421s \| 2018s \| 2029s \| n/a \| Relates #33844	2019-01-14 15:14:46 -05:00
Tim Brooks	5c68338a1c	Implement ccr file restore (#37130 ) This is related to #35975. It implements a file based restore in the CcrRepository. The restore transfers files from the leader cluster to the follower cluster. It does not implement any advanced resiliency features at the moment. Any request failure will end the restore.	2019-01-14 13:07:55 -07:00
Christoph Büscher	c801b89072	Fix Eclipse specific compilation issue (#37419 ) Without pulling out the supplier function to the enclosing class, Eclipse 4.8 complains with the following error "No enclosing instance of type CoordinatorTests.Cluster is available due to some intermediate constructor invocation"	2019-01-14 20:39:04 +01:00
markharwood	92c6c98e8d	Performance fix. Reduce deprecation calls for the same bulk request (#37415 ) DeprecationLogger has warning de-duplication logic but it is expensive to run as it involves parsing existing warning headers. This PR changes the upstream bulk indexing code to do its own "event thinning" rather than relying on DeprecationLogger's trimming. Closes #37411	2019-01-14 17:51:49 +00:00
David Kyle	1abe5df09c	Mute IndexShardRetentionLeaseTests.testCommit #37420	2019-01-14 14:17:11 +00:00
Daniel Mitterdorfer	abe35fb99b	Remove unused index store in directory service With this commit we remove the unused field `indexStore` from all implementations of `FsDirectoryService`. Relates #37097	2019-01-14 13:44:32 +01:00
Tanguy Leroux	07dc8c7eee	Improve CloseWhileRelocatingShardsIT (#37348 )	2019-01-14 13:14:36 +01:00
Tanguy Leroux	6ca076bf74	Fix ClusterBlock serialization and Close Index API logic after backport to 6.x (#37360 ) This commit changes the versions in the serialization logic of ClusterBlock after the backport to 6.x of the Close Index API refactoring (#37359).	2019-01-14 13:13:15 +01:00
Christoph Büscher	89b45f1fc6	Remove deprecated pipeline request contructors (#37366 ) The constructors in PutPipelineRequest and SimulatePipelineRequest that guess the xContent type from the provided source are deprecated since 6.0 and each have a counterpart that takes the xContent type as an explicit argument. Removing these ctors together with the builders and methods in ClusterAdminClient that don't have the xContent type as argument.	2019-01-14 11:14:38 +01:00
Nhat Nguyen	d44a6f9fbc	Simplify SyncedFlushService flow with StepListener (#37383 ) Today the SyncedFlushService flow is written with multiple nested callbacks which are hard to read. This commit replaces them with sequential step listeners.	2019-01-14 03:54:34 -05:00
Luca Cavanna	d54f88f62c	Remove unused empty constructors from suggestions classes (#37295 ) We recently migrated suggestions to `Writeable`. That allows us to also clean up empty constructors and methods that called them as they are no longer needed. They are replaced by constructors that accept a `StreamInput` instance.	2019-01-14 08:32:45 +01:00
Jason Tedor	03be4dbaca	Introduce retention lease persistence (#37375 ) This commit introduces the persistence of retention leases by persisting them in index commits and recovering them when recovering a shard from store.	2019-01-12 14:43:19 -08:00
Nhat Nguyen	44a1071018	Make recovery source partially non-blocking (#37291 ) Today a peer-recovery may run into a deadlock if the value of node_concurrent_recoveries is too high. This happens because the peer-recovery is executed in a blocking fashion. This commit attempts to make the recovery source partially non-blocking. I will make three follow-ups to make it fully non-blocking: (1) send translog operations, (2) primary relocation, (3) send commit files. Relates #36195	2019-01-12 12:49:48 -05:00
Armin Braun	63fe3c6ed6	Fix PrimaryAllocationIT Race Condition (#37355 ) * Fix PrimaryAllocationIT Race Condition * Forcing a stale primary allocation on a green index was tripping the assertion that was removed * Added a test that this case still errors out correctly * Made the ability to wipe stopped datanode's data public on the internal test cluster and used it to ensure correct behaviour on the fixed test * Previously it simply passed because the test finished before the index went green and would NPE when the index was green at the time of the shard store status request, that would then come up empty * Closes #37345	2019-01-11 23:26:04 +01:00
Nhat Nguyen	70cee18e56	Introduce StepListener (#37327 ) This commit introduces StepListener which provides a simple way to write a flow consisting of multiple asynchronous steps without having nested callbacks. Relates #37291	2019-01-11 13:06:17 -05:00
Christoph Büscher	bb6d8784e7	Switch indices.get rest after backport of `include_type_name` (#37351 ) With the `include_type_name` available now for indices.get on 6.x after the backport, the corresponsing yaml test can include anything from 6.7 on. Also changing the RestGetIndicesActionTests base test class.	2019-01-11 17:24:12 +01:00
Armin Braun	1eba1d1df9	Fix SnapshotDisruptionIT Race Condition (#37358 ) * Due to a race between retrying the snapshot creation and the failed snapshot create trying to delete the snapshot there is no guarantee that the snapshot is eventually created by retries * Adjusted the assertion accordingly * Closes #36779	2019-01-11 16:09:26 +01:00
Yannick Welsch	f4abf9628a	Mock connections more accurately in DisruptableMockTransport (#37296 ) This commit moves DisruptableMockTransport to use a more accurate representation of connection management, which allows to use the full connection manager and does not require mocking out any behavior. With this, we can implement restarting nodes in CoordinatorTests.	2019-01-11 16:06:48 +01:00
Boaz Leskes	d21df2a17a	Use Sequence number powered OCC for processing updates (#37308 ) Updates perform realtime get, perform the requested update and then index the document again using optimistic concurrency control. This PR changes the logic to use sequence numbers instead of versioning. Note that the current versioning logic isn't suffering from the same problem as external OCC requests because the get and indexing is always done on the same primary. Relates #36148 Relates #10708	2019-01-11 06:23:55 -08:00
Yannick Welsch	4d3928d444	Increase timeouts in UnicastZenPingTests Relates to #37268	2019-01-11 11:33:55 +01:00
Yannick Welsch	3a929c7aea	Increase assertBusy timeouts for RefreshListenersTests	2019-01-11 10:15:39 +01:00
Alpar Torok	82ca2d62de	Mute CloseWhileRelocatingShardsIT.testCloseWhileRelocatingShards Tracked by #37274	2019-01-11 10:54:53 +02:00
Alpar Torok	3e73911cbe	Mute PrimaryAllocationIT.testForceStaleReplicaToBePromotedToPrimaryOnWrongNode Tracking issue: #37345	2019-01-11 10:49:25 +02:00
Ignacio Vera	0a50821bb2	Geo: Do not normalize the longitude with value -180 for Lucene shapes (#37299 ) Lucene based shapes should not normalize the longitude value -180 to 180.	2019-01-11 09:37:18 +01:00
Nhat Nguyen	360c430ad7	Add runAfter and notifyOnce wrapper to ActionListener (#37331 ) Relates #37291	2019-01-11 03:33:06 -05:00
Alexander Reelsen	9f3da013d8	Date/Time parsing: Use java time API instead of exception handling (#37222 ) * Add benchmark * Use java time API instead of exception handling when several formatters are used, the existing way of parsing those is to throw an exception catch it, and try the next one. This is is considerably slower than the approach taken in joda time, so that indexing is reduced when a date format like `x\|\|y` is used and y is the date format being used. This commit now uses the java API to parse the date by appending the date time formatters to each other and does not rely on exception handling. * fix benchmark * fix tests by changing formatter, also expose printer * restore optional printing logic to fix tests * fix tests * incorporate review comments	2019-01-11 09:25:05 +01:00
Jason Tedor	822626dadf	Make consistent empty retention lease supplier This commit makes the use of empty retention lease suppliers to always be an empty list as opposed to in some cases an empty set. This commit is solely for consistency reasons, there is no functional change here.	2019-01-10 18:34:55 -08:00
Jason Tedor	edc95c8a8e	Add validation for retention lease construction (#37312 ) This commit adds some simple validation that the values input to the retention lease constructor our valid values. We will later rely on these values being within the validated range.	2019-01-10 18:13:05 -08:00
Jack Conradson	b5b93a2746	Rename ParameterMap to DeprecationMap (#37317 ) Mechanical change to rename ParameterMap to DeprecationMap as this seems more appropriate for an extended Map to issue deprecation warnings.	2019-01-10 13:58:00 -08:00
markharwood	434430506b	Type removal - added deprecation warnings to _bulk apis (#36549 ) Added warnings checks to existing tests Added “defaultTypeIfNull” to DocWriteRequest interface so that Bulk requests can override a null choice of document type with any global custom choice. Related to #35190	2019-01-10 21:35:19 +00:00
Julie Tibshirani	a433c4012c	Support include_type_name in the field mapping and index template APIs. (#37210 ) * Add include_type_name to the get field mappings API. * Make sure the API specification lists include_type_name as a boolean. * Add include_type_name to the get index templates API. * Add include_type_name to the put index templates API.	2019-01-10 09:24:08 -08:00
Ryan Ernst	fcf7df3eda	Core: Handle security manager permission for deprecation log rolling (#37281 ) When the deprecation log is written to within scripting support code like ScriptDocValues, it runs under the reduces privileges of scripts. Sometimes this can trigger log rolling, which then causes uncaught security errors, as was handled in #28485. While doing individual deprecation handling within each deprecation scripting location is possible, there are a growing number of deprecations in scripts. This commit wraps the logging call within the deprecation logger use a doPrivileged block, just was we would within individual logging call sites for scripting utilities.	2019-01-10 07:44:40 -08:00
Armin Braun	46237faa97	Fail Stale Primary Alloc. Req. without Data (#37226 ) * Get indices shard store status before enqueuing the reallocation state update task to prevent tasks that would fail because a node does not hold a stale copy of the shard on a best effort basis * Closes #37098	2019-01-10 16:28:38 +01:00
Przemyslaw Gomulka	c812e6aea6	Fix line length in org.elasticsearch.routing (#37253 ) Remove the line length suppression for this package and fix offending lines relates: #34884	2019-01-10 15:23:34 +01:00
Armin Braun	26cb7466ef	SNAPSHOT+TESTS: Stabilize SnapshotDisruptionIT (#37289 ) * Ensure retry by busy assert on SnapshotMissingException * Closes #36739	2019-01-10 14:13:20 +01:00
Luca Cavanna	61b54196c4	[TEST] Fixed compile issue in SnapshotsServiceTests Relates to #37203	2019-01-10 13:38:39 +01:00
Alexander Reelsen	71287b0759	Remove unused EpochMillisDateFormatter (#37293 ) This class has been superceded by a custom java time epoch millis date parser.	2019-01-10 13:20:24 +01:00
Yannick Welsch	d499233068	Zen2: Add join validation (#37203 ) Adds join validation to Zen2, which prevents a node from joining a cluster when the node does not have the right ES version or does not satisfy any other of the join validation constraints.	2019-01-10 12:57:50 +01:00
Christoph Büscher	cd608848e7	Remove deprecated QUERY_AND_FETCH SearchType (#37257 ) This SearchType was deprecated since at least 6.0 and according to the documentation is only kept around for pre-5.3 requests. Removing and leaving a comment as placeholder so we don't reuse the byte value associated with it without further consideration.	2019-01-10 12:50:07 +01:00
Alexander Reelsen	eb12de550a	Java Time: Fix timezone parsing (#37262 ) * Java Time: Fix timezone parsing An independent test uncovered an issue when parsing a timezone containing a colon like `01:00` - some formats did not properly support this. This commit adds test for all formats in the dueling tests and fixes a few issues with existing date formatters. * fix tests, so they run under java8	2019-01-10 09:26:01 +01:00
Alexander Reelsen	b2e8437424	Tests: Add ElasticsearchAssertions.awaitLatch method (#36777 ) * Tests: Add ElasticsearchAssertions.awaitLatch method Some tests are using assertTrue(latch.await(...)) in their code. This leads to an assertion error without any error message. This adds a method which has a nicer error message and can be used in tests. * fix forbidden apis * fix spaces	2019-01-10 09:25:36 +01:00
Michael Basnight	d625b79df2	Add getZone to JodaCompatibleZonedDateTime (#37084 ) The ZonedDateTime#getZone() was not accessible via the Joda shim. This commit adds getZone() and exposes it through painless.	2019-01-09 22:09:34 -06:00
Jim Ferenczi	586093ec5e	Handle TopFieldDocs copy in TopDocsCollectorContext This commit fixes the clone of TopFieldDocs. Relates #37179 Relates #37266	2019-01-10 00:26:55 +01:00
Simon Willnauer	234059d2c0	Enable Bulk-Merge if all source remains (#37269 ) Today we still wrap recovery source readers on merge even if we keep all documents recovery source. This basically disables bulk merging for stored fields. This change skips wrapping if all docs sources are kept anyway.	2019-01-09 23:46:31 +01:00
Jim Ferenczi	95479f1766	Ensure that a non static top docs is created during the search phase This change fixes an unreleased bug that trips an assertion because a static instance shared among threads is modified during the search. This commit copies the static instance in order to ensure that each thread can modify the value without modifying the other instances. Closes #37179 Closes #37266	2019-01-09 22:57:34 +01:00
Jake Landis	195873002b	ingest: compile mustache template only if field includes '{{'' (#37207 ) * ingest: compile mustache template only if field includes '{{'' Prior to this change, any field in an ingest node processor that supports script templates would be compiled as mustache template regardless if they contain a template or not. Compiling normal text as mustache templates is harmless. However, each compilation counts against the script compilation circuit breaker. A large number of processors without any templates or scripts could un-intuitively trip the too many script compilations circuit breaker. This change simple checks for '{{' in the text before it attempts to compile. fixes #37120	2019-01-09 14:47:47 -06:00
Jack Conradson	95eef77ad4	[Style] Fix line length violations for threadpool, indexing, and script packages (#37205 )	2019-01-09 10:55:52 -08:00
Evangelos Chatzikalymnios	85a603ee61	Use List instead of priority queue for stable sorting in bucket sort aggregator (#36748 ) Update BucketSortPipelineAggregator to use a List and Collections.sort() for sorting instead of a priority queue. This preserves the order for equal values. Closes #36322.	2019-01-09 18:01:39 +02:00
Armin Braun	eacc63b032	TESTS: Real Coordinator in SnapshotServiceTests (#37162 ) * TESTS: Real Coordinator in SnapshotServiceTests * Introduce real coordinator in SnapshotServiceTests to be able to test network disruptions realistically * Make adjustments to cluster applier service so that we can pass a mocked single threaded executor for tests	2019-01-09 16:53:49 +01:00
Alpar Torok	ae086ebcc4	Muting SnapshotDisruptionIT Tracked in #36779	2019-01-09 16:55:11 +02:00
Christoph Büscher	c149bb8cc2	Support 'include_type_name' in RestGetIndicesAction (#37149 ) This change adds support for the 'include_type_name' parameter for the indices.get API. This parameter, which defaults to `false` starting in 7.0, changes the response to not include the indices type names any longer. If the parameter is set in the request, we additionally emit a deprecation warning since using the parameter should be only temporarily necessary while adapting to the new response format and we will remove it with the next major version.	2019-01-09 14:17:17 +01:00
Jim Ferenczi	84d520b0e5	Throw an ISE rather than an hard assertion in SearchPhaseController#getTotalHits This change turns an assertion into an IllegalStateException in SearchPhaseController#getTotalHits. The goal is to help identify the cause of the failures in https://github.com/elastic/elasticsearch/issues/37179 which seems to fail only in CI. The assertion will be restored when the issue is solved (NORELEASE).	2019-01-09 13:36:48 +01:00
Tanguy Leroux	dc371ef593	[Tests] Fix ReopenWhileClosingIT with correct min num shards The test intercepts TransportVerifyShardBeforeCloseAction shard requests, so it needs a minimum of 2 primary shards on 2 different nodes to correctly intercepts requests.	2019-01-09 13:34:38 +01:00
Alpar Torok	7ff846ffb2	mute failing test: org.elasticsearch.index.shard.RefreshListenersTests.testConcurrentRefresh	2019-01-09 13:43:21 +02:00
Tanguy Leroux	096a83183e	Merge branch 'master' into close-index-api-refactoring	2019-01-09 10:52:46 +01:00
Armin Braun	224475e6aa	MINOR: Some Cleanups in Snapshots (#37175 ) * Removed pre v6 Bwc logic + comments * Removed some dead code	2019-01-09 09:45:37 +01:00
Yannick Welsch	9040a96daf	Zen2: Add test that checks that CoordinatorTests are reproducible (#37225 ) Ensures reproducibility of the tests in CoordinatorTests.	2019-01-09 09:32:24 +01:00
Tanguy Leroux	7f6fe14b66	Merge branch 'master' into close-index-api-refactoring	2019-01-09 09:26:05 +01:00
Tanguy Leroux	0a40564068	[Tests] Change cluster scope in CorruptedFileIT and FlushIT (#37229 ) These tests failed on CI multiple times in the past weeks because they use a test cluster with a SUITE scope that recreates nodes between tests. With such a scope, nodes can be recreated in between test executions and can inherit a node id from a previous test execution, while they are assigned a random data path. With the successive node recreations it is possible that a newly recreated node shares the same node id (but different data path) as a non recreated node. This commit changes the cluster scope of the CorruptedFileIT and FlushIT tests which often fail. The failure is reproducable with : ./gradlew :server:integTest -Dtests.seed=EF3A50C225CF377 -Dtests.class=org.elasticsearch.index.store.CorruptedFileIT -Dtests.security.manager=true -Dtests.locale=th-TH-u-nu-thai-x-lvariant-TH -Dtests.timezone=America/Rio_Branco -Dcompiler.java=11 -Druntime.java=8	2019-01-09 09:24:56 +01:00
Jun Ohtani	38b698d455	[Analysis] Deprecate Standard Html Strip Analyzer in master (#26719 ) * [Analysis] Deprecate Standard Html Strip Analyzer Deprecate only Standard Html Strip Analyzer If user create index with the analyzer since 7.0, es throws an exception. If an index was created before 7.0, es issue deprecation log We will remove it in 8.0 Related #4704	2019-01-09 12:42:00 +09:00
Mayya Sharipova	ec32e66088	Deprecate reference to _type in lookup queries (#37016 ) Relates to #35190	2019-01-08 18:46:41 -08:00
Nhat Nguyen	87ac3103f4	Do not mutate RecoveryResponse (#37204 ) Today we create a global instance of RecoveryResponse then mutate it when executing each recovery step. This is okay for the current sequential recovery flow but not suitable for an asynchronous recovery which we are targeting. With this commit, we return the result of each step separately, then construct a RecoveryResponse at the end. Relates #37174	2019-01-08 16:12:18 -05:00
Ryan Ernst	86b71930f6	Remove more checkstyle suppressions (#36735 ) This commit removes several more line length violations from checkstyle suppression. relates #34884	2019-01-08 11:04:54 -08:00
Jim Ferenczi	054c3bb04f	Fix topDocs.totalHits assignment on scroll queries (#37180 ) This change fixes an unreleased bug that assigns the wrong totalHits to scroll queries. Closes #37179	2019-01-08 13:31:53 +01:00
Alpar Torok	6344e9a3ce	Testing conventions: add support for checking base classes (#36650 )	2019-01-08 13:39:03 +02:00
Tanguy Leroux	d70ebfd1d6	Merge branch 'master' into close-index-api-refactoring	2019-01-08 09:17:48 +01:00
Tim Brooks	b0665963e8	Make remote ping and compress settings dynamic (#37200 ) Traditionally remote clusters can be configured dynamically. However, the compress and ping settings are not currently set to be configured dynamically. This commit changes that.	2019-01-07 23:35:22 -07:00
Jason Tedor	c8c596cead	Introduce retention lease expiration (#37195 ) This commit implements a straightforward approach to retention lease expiration. Namely, we inspect which leases are expired when obtaining the current leases through the replication tracker. At that moment, we clean the map that persists the retention leases in memory.	2019-01-07 22:03:52 -08:00
Julie Tibshirani	c5aac4705d	Revert "Stop automatically nesting mappings in index creation requests. (#36924 )" This reverts commit `ac1c6940d2`.	2019-01-07 17:56:40 -08:00
Ryan Ernst	55d3ca3aa8	Core: Rework epoch time parsing for java time (#36914 ) This commit converts the epoch time parsing implementation which uses the java time api to create DateTimeFormatters instead of DateFormatter implementations. This will allow multi formats for java time to be implemented in a single DateTimeFormatter in a future change.	2019-01-07 16:15:30 -08:00
Christoph Büscher	56e472bfbc	Removing unused methods in Numbers (#37186 ) Remove several unused helper methods. Most of them are one-liners and should be easier to be used from the corresponding primitive wrapper classes. The bytes array conversion methods are unused as well, it should be easy to re-create them if needed.	2019-01-08 00:14:50 +01:00
Jason Tedor	3b48b99861	Fix setting by time unit (#37192 ) This commit fixes an issue with a settings builder method that allows setting a duration by time unit. In particular, this method can suffer from a loss of precision. For example, if the input duration is 1500 microseconds then internally we are converting this to "1ms", demonstrating the loss of precision. Instead, we should internally convert this to a TimeValue that correctly represents the input duration, and then convert this to a string using a method that does not lose the unit. That is what this commit does.	2019-01-07 14:59:24 -08:00
Luca Cavanna	e34658edba	Update version in SearchRequest and related test Version needs to be updated after backporting #36997 & #37142 where we added support for providing and serializing localClusterAlias as well ass absoluteStartMillis. Relates to #36997 & #37142	2019-01-07 18:47:17 +01:00
Tanguy Leroux	97bf4d7176	Merge branch 'master' into close-index-api-refactoring	2019-01-07 18:38:27 +01:00
Christophe Bismuth	9602d794c6	Separate out validation of groups of settings (#34184 ) Today, a setting can declare that its validity depends on the values of other related settings. However, the validity of a setting is not always checked against the correct values of its dependent settings because those settings' correct values may not be available when the validator runs. This commit separates the validation of a settings updates into two phases, with separate methods on the `Setting.Validator` interface. In the first phase the setting's validity is checked in isolation, and in the second phase it is checked again against the values of its related settings. Most settings only use the first phase, and only the few settings with dependencies make use of the second phase.	2019-01-07 16:12:58 +00:00
David Turner	9d0e0eb0f3	[Zen2] Remove initial master node count setting (#37150 ) The `cluster.unsafe_initial_master_node_count` setting was introduced as a temporary measure while the design of `cluster.initial_master_nodes` was being finalised. This commit removes this temporary setting, replacing it with usages of `cluster.initial_master_nodes` where appropriate.	2019-01-07 16:05:00 +00:00
Tanguy Leroux	e149b0852e	[Close Index API] Add unique UUID to ClusterBlock (#36775 ) This commit adds a unique id to cluster blocks, so that they can be uniquely identified if needed. This is important for the Close Index API where multiple concurrent closing requests can be executed at the same time. By adding a UUID to the cluster block, we can generate unique "closing block" that can later be verified on shards and then checked again from the cluster state before closing the index. When the verification on shard is done, the closing block is replaced by the regular INDEX_CLOSED_BLOCK instance. If something goes wrong, calling the Open Index API will remove the block. Related to #33888	2019-01-07 16:44:59 +01:00
Jason Tedor	c0f8c89172	Introduce shard history retention leases (#37167 ) This commit is the first in a series which will culminate with fully-functional shard history retention leases. Shard history retention leases are aimed at preventing shard history consumers from having to fallback to expensive file copy operations if shard history is not available from a certain point. These consumers include following indices in cross-cluster replication, and local shard recoveries. A future consumer will be the changes API. Further, index lifecycle management requires coordinating with some of these consumers otherwise it could remove the source before all consumers have finished reading all operations. The notion of shard history retention leases that we are introducing here will also be used to address this problem. Shard history retention leases are a property of the replication group managed under the authority of the primary. A shard history retention lease is a combination of an identifier, a retaining sequence number, a timestamp indicating when the lease was acquired or renewed, and a string indicating the source of the lease. Being leases they have a limited lifespan that will expire if not renewed. The idea of these leases is that all operations above the minimum of all retaining sequence numbers will be retained during merges (which would otherwise clear away operations that are soft deleted). These leases will be periodically persisted to Lucene and restored during recovery, and broadcast to replicas under certain circumstances. This commit is merely putting the basics in place. This first commit only introduces the concept and integrates their use with the soft delete retention policy. We add some tests to demonstrate the basic management is correct, and that the soft delete policy is correctly influenced by the existence of any retention leases. We make no effort in this commit to implement any of the following: - timestamps - expiration - persistence to and recovery from Lucene - handoff during primary relocation - sharing retention leases with replicas - exposing leases in shard-level statistics - integration with cross-cluster replication These will occur individually in follow-up commits.	2019-01-07 07:43:57 -08:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Jason Tedor	a233db7367	Fix handling of fractional byte size value settings (#37172 ) This commit addresses an issue when setting a byte size value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a byte size value setting to a value of 1536 bytes is problematic because internally this is converted to the string "1.5k". When we go to get this setting, we try to parse "1.5k" back to a byte size value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the byte size value to the string and back to the byte size value.	2019-01-07 07:13:50 -08:00
Armin Braun	82b1f10eb1	SNAPSHOT+TESTS: Rem. Mock Atomic Writes Randomness (#37011 ) * Randomly doing non-atomic writes causes rare 0 byte reads from `index-N` files in tests * Removing this randomness fixes these random failures and is valid because it does not reproduce a real-world failure-mode: * Cloud-based Blob stores (S3, GCS, and Azure) do not have inconsistent partial reads of a blob, either you read a complete blob or nothing on them * For file system based blob stores the atomic move we do (to atomically write a file) by setting `java.nio.file.StandardCopyOption#ATOMIC_MOVE` would throw if the file system does not provide for atomic moves * Closes #37005	2019-01-07 15:24:23 +01:00
Tanguy Leroux	f5af79b9cd	Merge branch 'master' into close-index-api-refactoring	2019-01-07 12:43:03 +01:00
Armin Braun	617e294133	SNAPSHOT: Make Atomic Blob Writes Mandatory (#37168 ) * With #37066 introducing atomic writes to HDFS repository we can enforce atomic write capabilities on this interface * The overrides on the other three cloud implementations are ok because: * https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html states that "Amazon S3 never adds partial objects; if you receive a success response, Amazon S3 added the entire object to the bucket." * https://cloud.google.com/storage/docs/consistency states that GCS has strong read-after-write consistency * https://docs.microsoft.com/en-us/rest/api/storageservices/put-block#remarks Azure has the concept of committing blobs, so there's no partial content here either * Relates #37011	2019-01-07 12:11:19 +01:00
David Kyle	7cc749dced	Mute SearchStatsIT.testOpenContexts	2019-01-07 09:47:58 +00:00
Christoph Büscher	60742b12e9	Remove deprecated CardinalityAggregationBuilder#rehash method (#37154 ) It has been deprecated since at least 6.0, is a no-op and unused in the rest of our code.	2019-01-07 10:42:07 +01:00
Christoph Büscher	d18c3d651d	Introduce an `include_type_name` constant (#37155 ) I started referring to this parameter name from various places in #37149 so I think it's a good idea to simplify things by referring to a common constant.	2019-01-07 10:41:40 +01:00
Christoph Büscher	12a105e5ef	Remove deprecated PutIndexTemplateRequestBuilder#setTemplate (#37151 ) The method has been removed since 6.0, there is a direct replacement and it is only used in tests still.	2019-01-07 10:41:04 +01:00
Luca Cavanna	2f4dafa69f	Add support for providing absolute start time to SearchRequest (#37142 ) We have recently added support for providing a local cluster alias to a SearchRequest through a package protected constructor. When executing cross-cluster search requests with local reduction on each cluster, the CCS coordinating node will have to provide such cluster alias to each remote cluster, as well as the absolute start time of the search action in milliseconds from the time epoch, to be used when evaluating date math expressions both while executing queries / scripts as well as when resolving index names. This commit adds support for providing the start time together with the cluster alias. It is a final member in the search request, which will only be set when using cross-cluster search with local reduction (also known as alternate execution mode). When not provided, the coordinating node will determine the current time and pass it through (by calling `System.currentTimeMillis`). Relates to #32125	2019-01-07 10:28:31 +01:00
Tanguy Leroux	19593884ee	[Close Index API] Propagate tasks ids between Freeze, Close and Verify Shard actions (#36630 ) This pull request changes the Freeze Index and Close Index actions so that these actions always requires a Task. The task's id is then propagated from the Freeze action to the Close action, and then to the Verify shard action. This way it is possible to track which Freeze task initiates the closing of an index, and which consecutive verifiy shard are executed for the index closing.	2019-01-07 09:43:50 +01:00
Tanguy Leroux	6347461146	Rename ClusterBlocks.hasGlobalBlock methods (#36941 ) As suggested in #36775, this pull request renames the following methods: ClusterBlocks.hasGlobalBlock(int) ClusterBlocks.hasGlobalBlock(RestStatus) ClusterBlocks.hasGlobalBlock(ClusterBlockLevel) to something that better reflects the property of the ClusterBlock that is searched for: ClusterBlocks.hasGlobalBlockWithId(int) ClusterBlocks.hasGlobalBlockWithStatus(RestStatus) ClusterBlocks.hasGlobalBlockWithLevel(ClusterBlockLevel)	2019-01-07 09:42:27 +01:00
Jason Tedor	bf5bc88f50	Fix handling of fractional time value settings (#37171 ) This commit addresses an issue when setting a time value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a time value setting to a value of 1500ms is problematic because internally this is converted to the string "1.5s". When we go to get this setting, we try to parse "1.5s" back to a time value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the time value to the string and back to the time value.	2019-01-06 22:34:52 -08:00
Armin Braun	b34e7d4f19	SNAPSHOT+TESTS: Relax Assertion in DisruptionIT (#37144 ) * The retries on the failing master can lead to concurrently trying to create and delete a snapshot, catch this for now to fix this test * closes #36779	2019-01-05 17:52:24 +01:00
Simon Willnauer	0cc877026f	Subclass NIOFSDirectory instead of using FileSwitchDirectory (#37140 ) We don't want two FSDirectories manage pending deletes separately and optimize file listing. This confuses IndexWriter and causes exceptions when files are deleted twice but are pending for deletion. This change move to using a NIOFS subclass that only delegates to MMAP for opening files all metadata and pending deletes are managed on top. Closes #37111 Relates to #36668	2019-01-05 10:15:33 +01:00
Julie Tibshirani	0bac64fbd3	Deprecate the _type field in aggregations. (#37131 )	2019-01-04 13:05:52 -08:00
Michael Basnight	e40193ae66	HLRC: Fix Reindex from remote query logic (#36908 ) The query object was incorrectly added to the remote object in the xcontent. This fix moves the query back into the source, if it was passed in as part of the RemoteInfo. It also adds a IPv6 test for reindex from remote such that we can properly validate this.	2019-01-04 13:37:59 -06:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Simon Willnauer	b4f113d3ea	Don't block on peer recovery on the target side (#37076 ) Today we block using the generic thread-pool on the target side until the source side has fully executed the recovery. We still block on the source side executing the recovery in a blocking fashion but there is no reason to block on the target side. This will release generic threads early if there are many concurrent recoveries happen. Relates to #36195	2019-01-04 13:51:06 +01:00
Simon Willnauer	41d7e3a2fe	Expose `search.throttled` on `_cat/indices` (#37073 ) Today it's very difficult to see which indices are frozen or rather throttled via the commonly used monitoring APIs. This change adds a cell to the `_cat/indices` API to render if an index is `search.throttled` Relates to #34352	2019-01-04 13:49:40 +01:00
Luca Cavanna	21d52f0dab	Ensure that local cluster alias is never treated as remote (#37121 ) With #36997 we added support for providing a local cluster alias with a `SearchRequest`. We intended to make sure that when provided as part of a search request, the cluster alias would never be used for connection lookups. Yet due to a bug we would still end up looking up the connection from the remote ones. This commit adds a test to make sure that whenever we set the cluster alias to the `SearchRequest` (which can only be done at transport), such alias is used as index prefix in the returned hits. No errors are thrown despite no remote clusters are configured indicating that such alias is never used for connection look-ups. Also, we add explicit support for the empty cluster alias when printing out index names through `RemoteClusterAware#buildRemoteIndexName`. In fact we don't want to print out `:index` when the cluster alias is set to empty string, but rather `index`. Yet, the semantic of empty string is different compared to `null` as it will still disable final reduction. This will be used in CCS when searching against remote clusters as well as the local one, the local one will have empty prefix yet it will need to disable final reduction so that its results will be properly merged with the ones coming from the remote clusters.	2019-01-04 12:19:31 +01:00
David Turner	3f7d6a989a	[Zen2] Elect freshest master in upgrade (#37122 ) Today when electing a master in Zen2 we use the cluster state version to determine whether a node has a fresh-enough cluster state to become master. However the cluster state version is not a reliable measure of freshness in the Zen1 world; furthermore in 6.x the cluster state version is not persisted. This means that when upgrading from 6.x via a full cluster restart a cluster state update may be lost if a stale master wins the initial election. This change fixes this by using the metadata version as a measure of freshness when in term 0, since this is persisted in 6.x and does more reliably indicate the freshness of nodes. It also makes changes parallel to elastic/elasticsearch-formal-models#40 to support situations in which nodes accept cluster state versions in term 0: this does not happen in a pure Zen2 cluster, but can happen in mixed clusters and during upgrades.	2019-01-04 09:09:16 +00:00
Julie Tibshirani	ac1c6940d2	Stop automatically nesting mappings in index creation requests. (#36924 ) Now that we unwrap mappings in DocumentMapperParser#extractMappings, it is not necessary for the mapping definition to always be nested under the type. This leniency around the mapping format was added in `2341825358`.	2019-01-03 17:41:28 -08:00
Armin Braun	7686ee7631	TESTS: Shutdown ThreadPool after TestNodes (#37123 ) * If the threadpool gets shut down before the testnodes we run into an error => fixed by moving to single `After` method * Relates #36976	2019-01-03 22:35:44 +01:00
Julie Tibshirani	54f53d2a51	Make sure to accept empty unnested mappings in create index requests. (#37089 )	2019-01-03 11:53:08 -08:00
Nick Knize	e613bcae43	Remove XLatLonShape classes (#37094 ) This commit removes local XLatLonShape classes and replaces with current LatLonShape classes in latest lucene snapshot	2019-01-03 12:48:36 -06:00
Nicholas Knize	de962b2f39	Revert "Adjust Lucene version for 6.7" This reverts commit `b7f6ee72a6`.	2019-01-03 11:52:31 -06:00
Armin Braun	675ea4c59c	TESTS: Remove Static Threadpool in TaskManagerTest (#36976 ) * The static threadpool leaks a lot of memory in these tests because it prevents things like the connect listeners from `org.elasticsearch.transport.TcpTransport#initiateConnection` to be GCed between tests (since they keep being referenced by the threadpool) which in turn reference channels and their underlying buffers * I could not find any slowdown in executing these tests from this change, if anything they are slightly faster now on my machine * Relates #36906 (which may be caused by slowness from leaking memory and also becomes testable in a loop by this change)	2019-01-03 15:19:21 +01:00
Christoph Büscher	046f86f274	Deprecate use of type in reindex request body (#36823 ) Types can be used both in the source and dest section of the body which will be translated to search and index requests respectively. Adding a deprecation warning for those cases and removing examples using more than one type in reindex since support for this is going to be removed.	2019-01-03 10:29:14 +01:00
Christoph Büscher	e21054d176	Remove two unused methods in Iterables (#37075 ) These helper methods are unused in the rest of the codebase.	2019-01-03 10:28:47 +01:00
Nhat Nguyen	b7f6ee72a6	Adjust Lucene version for 6.7 Relates #37088	2019-01-03 04:20:47 -05:00
Jim Ferenczi	78ba1889cf	Replace the TreeMap in the composite aggregation (#36675 ) The `composite` aggregation uses a TreeMap to keep track of the best buckets. This ensures a log(n) time cost to insert new buckets but also to retrieve buckets that are already present in the map. In order to speed up the retrieval of buckets this change replaces the TreeMap with a priority queue and a HashMap. The insertion cost is still log(n) but the retrieval of buckets through the HashMap is now done in constant time. This optimization can bring significant improvement since each document needs to check if its associated buckets are already present in the current best buckets.	2019-01-03 09:51:35 +01:00
Daniel Mitterdorfer	75f3443c62	Rename setting to enable mmap With this commit we rename `node.store.allow_mmapfs` to `node.store.allow_mmap`. Previously this setting has controlled whether `mmapfs` could be used as a store type. With the introduction of `hybridfs` which also relies on memory-mapping, `node.store.allow_mmapfs` also applies to `hybridfs` and thus we rename it in order to convey that it is actually used to allow memory-mapping but not a specific store type. Relates #36668 Relates #37070	2019-01-03 07:10:34 +01:00
Nick Knize	b2aa655f46	Upgrade master to lucene-8.0.0-snapshot-a1c6e642aa (#37091 ) Updates the master branch to the latest snapshot of Lucene 8.0.	2019-01-02 20:18:19 -06:00
David Findley	d4e7660248	Fix weighted_avg parser not found for RestHighLevelClient (#37027 ) Add integration test for weighted avg sub aggregation Add weighted avg parser to DefaultNamedXContents Fixes #36861	2019-01-02 15:53:21 -06:00
Ke Li	62ece69b92	Remove system property es.enforce_max_shards_per_node (#36968 ) The system property es.enforce_max_shards_per_node was not needed any more, because we always enforce cluster-wide shard limit now.	2019-01-02 14:01:12 -07:00
Alan Woodward	7a0047744d	`query_string` should use indexed prefixes (#36895 ) The QueryStringQueryBuilder does not currently delegate to the field mapper's prefixQuery method, so does not use indexed prefixes. This commit corrects this. It also fixes a bug where a query a* would not match the word a if indexed prefixes were used with a minchar setting of 2.	2019-01-02 20:12:24 +00:00
Luca Cavanna	42ea644903	Remove single shard optimization when suggesting shard_size (#37041 ) When executing terms aggregations we set the shard_size, meaning the number of buckets to collect on each shard, to a value that's higher than the number of requested buckets, to guarantee some basic level of precision. We have an optimization in place so that we leave shard_size set to size whenever we are searching against a single shard, in which case maximum precision is guaranteed by definition. Such optimization requires us access to the total number of shards that the search is executing against. In the context of cross-cluster search, once we will introduce multiple reduction steps (one per cluster) each cluster will only know the number of local shards, which is problematic as we should only optimize if we are searching against a single shard in a single cluster. It could be that we are searching against one shard per cluster in which case the current code would optimize number of terms causing a loss of precision. While discussing how to address the CCS scenario, we decided that we do not want to introduce further complexity caused by this single shard optimization, as it benefits only a minority of cases, especially when the benefits are not so great. This commit removes the single shard optimization, meaning that we will always have heuristic enabled on how many number of buckets to collect on the shards, even when searching against a single shard. This will cause more buckets to be collected when searching against a single shard compared to before. If that becomes a problem for some users, they can work around that by setting the shard_size equal to the size. Relates to #32125	2019-01-02 17:45:49 +01:00
Josh Soref	f6e4b9a014	Spelling: correct wrong spellings of likelihood (#37052 )	2019-01-02 17:37:03 +01:00
Josh Soref	5d5d5c26bc	Spelling: replace interruptable with interruptible (#37049 )	2019-01-02 17:35:53 +01:00
Josh Soref	1df66d21fe	Spelling: replace uknown with unknown (#37056 )	2019-01-02 17:33:02 +01:00
Przemyslaw Gomulka	15c4d5b184	Fix line length in org.elasticsearch.get (#37071 ) Remove the line suppression for this package and fix offedning lines relates #34844	2019-01-02 17:22:15 +01:00
Luca Cavanna	450d3014f6	[TEST] Fix testLimitConcurrentShardRequests failure With #36221 we introduced shards counting to address a rare failure. This caused a worse problem in this test when replicas were allocated and shards failures were randomly returned. The latch has to take into account additional attempts caused by the shard failures, which means that in order for run to be called, performPhaseOnShard will be called (numShards + numFailures) times. To address this, we need to decide upfront which shard is going to fail, making sure that at least one shards is successful otherwise the whole request fails. Closes #37074	2019-01-02 16:46:41 +01:00
Luca Cavanna	b10a33054e	Mute failing testLimitConcurrentShardRequests relates to #37074	2019-01-02 16:35:24 +01:00
Josh Soref	565a95c382	Spelling: replace respositories with repositories (#37053 )	2019-01-02 14:13:54 +01:00
Josh Soref	d3e98278c3	Spelling: replace cachable with cacheable (#37047 )	2019-01-02 14:10:30 +01:00
Daniel Mitterdorfer	f0052b1a7a	Add hybridfs store type With this commit we introduce a new store type `hybridfs` that is a hybrid between `mmapfs` and `niofs`. This store type chooses different strategies to read Lucene files based on the read access pattern (random or linear) in order to optimize performance. This store type has been available in earlier versions of Elasticsearch as `default_fs`. We have chosen a different name now in order to convey the intent of the store type instead of tying it to the fact whether it is the default choice. Relates #36668	2019-01-02 10:10:32 +01:00
Luca Cavanna	9e70696628	[TEST] Address rejected execution in SearchAsyncActionTests (#37028 ) SearchAsyncActionTests may fail with RejectedExecutionException as InitialSearchPhase may try to execute a runnable after the test has successfully completed, and the corresponding executor was already shut down. The latch was located in getNextPhase that is almost correct, but does not cover for the last finishAndRunNext round that gets executed after onShardResult is invoked. This commit moves the latch to count the number of shards, and allowing the test to count down later, after finishAndRunNext has been potentially forked. This way nothing else will be executed once the executor is shut down at the end of the tests. Closes #36221 Closes #33699	2019-01-02 09:47:13 +01:00
Alexis Wilke	35c09adbe1	Replaced the word 'shards' with 'replicas' in an error message. (#36234 ) (#36275 ) Closes #36234	2019-01-01 16:37:15 +01:00
Luca Cavanna	fd7cde88db	Mute failing RolloverIT#testRolloverWithDateMath Relates to #37037	2018-12-31 11:51:50 +01:00
Armin Braun	85be9d6a89	SNAPSHOT: Deterministic ClusterState Tests (#36644 ) * Use `DeterministicTaskQueue` infrastructure to test `SnapshotsService`	2018-12-31 11:17:21 +01:00
Luca Cavanna	adb957b5aa	Mute failing DateMathExpressionResolverTests tests Relates to #37037	2018-12-31 10:37:10 +01:00
Luca Cavanna	d3f1fe46d3	Increase await timeouts in RemoteClusterServiceTests Closes #33852	2018-12-28 17:03:40 +01:00
Armin Braun	4ac8fc6906	Force Refresh Listeners when Acquiring all Operation Permits (#36835 ) * Fixes the issue reproduced in the added tests: * When having open index requests on a shard that are waiting for a refresh, relocating that shard becomes blocked until that refresh happens (which could be never as in the test scenario).	2018-12-28 16:42:51 +01:00
Luca Cavanna	cb6bac3f88	Skip final reduction if SearchRequest holds a cluster alias (#37000 ) With #36997 we added the ability to provide a cluster alias with a SearchRequest. The next step is to disable the final reduction whenever a cluster alias is provided with the SearchRequest. A cluster alias will be provided when executing a cross-cluster search request with alternate execution mode, where each cluster does its own reduction locally. In order for the CCS node to be able to later perform an additional reduction of the results, we need to make sure that all the needed info stays available. This means that terms aggregations can be reduced but not pruned, and pipeline aggs should not be executed. The final reduction will happen later in the CCS coordinating node. Relates to #36997 & #32125	2018-12-28 14:58:20 +01:00
Armin Braun	34d22f378d	TESTS: Mute testSnapshotCanceledOnRemovedShard * relates #37005	2018-12-28 14:20:45 +01:00
Luca Cavanna	51fe20e0c3	Add support for local cluster alias to SearchRequest (#36997 ) With the upcoming cross-cluster search alternate execution mode, the CCS node will be able to split a CCS request into multiple search requests, one per remote cluster involved. In order to do that, the CCS node has to be able to signal to each remote cluster that such sub-requests are part of a CCS request. Each cluster does not know about the other clusters involved, and does not know either what alias it is given in the CCS node, hence the CCS coordinating node needs to be able to provide the alias as part of the search request so that it is used as index prefix in the returned search hits. The cluster alias is a notion that's already supported in the search shards iterator and search shard target, but it is currently used in CCS as both index prefix and connection lookup key when fanning out to all the shards. With CCS alternate execution mode the provided cluster alias needs to be used only as index prefix, as shards are local to each cluster hence no cluster alias should be used for connection lookups. The local cluster alias can be set to the SearchRequest at the transport layer only, and its constructor/getter methods are package private. Relates to #32125	2018-12-28 12:43:25 +01:00
Yannick Welsch	935c2e98b0	Zen2: Turn to follower on follower check when no state accepted yet from new leader (#37003 ) Improves on #36449 which did not cover the situation where a node had bumped its term during the election, and not when receiving the first follower check. This was uncovered while refactoring NodeJoinTests so that they don't need to access to an internal field of Coordinator anymore (which can now be made private).	2018-12-28 08:37:04 +01:00
Andrey Ershov	76da29ba84	Switch testClusterJoinDespiteOfPublishingIssues to Zen2 (#36979 )	2018-12-27 22:19:20 +01:00
Andrey Ershov	6259ccb1cf	Move ensureAtomicMoveSupported to NodeEnvironment (#36975 ) Atomic move support is needed not only for GatewayMetaState to work correctly	2018-12-27 22:18:47 +01:00
Armin Braun	6aae7c8516	SNAPSHOT+TESTS: Speed up Snapshot IT (#36990 ) * This speeds up the test from an average 25s down to 7s runtime * There is no need for artificially slowing down the snapshot to reproduce the issue of an out of sync routing table in practice. Over hundreds of test runs the test's snapshot shard service still runs in the index not found exception every time reproducing this issue. * Relates #36294	2018-12-27 12:18:17 +01:00
Jim Ferenczi	b0d2957e10	[TEST] Fix possible identical mutation in SearchHitTests#mutate	2018-12-27 12:00:22 +01:00
Itamar Benjamin	48b0908fc6	Make InternalComposite key comparable Keys are compared in BucketSortPipelineAggregation so making key type (ArrayMap) implement Comparable. Maps are compared using the entry set's iterator so ordered maps order is maintain. For each entry first comparing key then value. Assuming all keys are strings. When comparing entries' values if type is not identical and\or type not implementing Comparable, throwing exception. Not implementing equals() and hashCode() functions as parent's ones are sufficient. Tests included.	2018-12-27 08:51:24 +01:00
Nhat Nguyen	7580d9d925	Make SourceToParse immutable (#36971 ) Today the routing of a SourceToParse is assigned in a separate step after the object is created. We can easily forget to set the routing. With this commit, the routing must be provided in the constructor of SourceToParse. Relates #36921	2018-12-24 14:06:50 -05:00
Andrey Ershov	531ae8b3ab	Remove ensureNoPre019State check (#36974 ) This check is no longer needed because it's already ES v7	2018-12-24 16:26:08 +01:00
Nhat Nguyen	40c7ae6181	Rewrite SourceToParse with resolved docType (#36921 ) We introduce a typeless API in #35790 where we translate the default docType "_doc" to the user-defined docType. However, we do not rewrite the SourceToParse with the resolved docType. This leads to a situation where we have two translog operations for the same document with different types: - prvOp [Index{id='9LCpwGcBkJN7eZxaB54L', type='_doc', seqNo=1, primaryTerm=1, version=1, autoGeneratedIdTimestamp=1545125562123}] - newOp [Index{id='9LCpwGcBkJN7eZxaB54L', type='not_doc', seqNo=1, primaryTerm=1, version=1, autoGeneratedIdTimestamp=-1}] Closes #36769	2018-12-23 15:14:21 -05:00
Nhat Nguyen	d238b2934c	TEST: Fix Engine#testRebuildLocalCheckpointTracker In this test, we verify that the LocalCheckpointTracker is initialized with the operations of the safe commit. And the test fails because Engine#Index does not implement the equals method (should not implement as it consists of a mutable ParsedDocument). Closes #36470	2018-12-22 18:00:42 -05:00
Tim Brooks	c8a8391dfa	Only compress responses if request was compressed (#36867 ) This is a follow-up to some discussions around #36399. Currently we have relatively confusing compression behavior where compression can be configured for requests based on transport.compress or a specific setting for a remote cluster. However, we can only compress responses based on transport.compress as we do not know where a request is coming from (currently). This commit modifies the behavior to NEVER compress responses based on settings. Instead, a response will only be compressed if the request was compressed. This commit also updates the documentation to more clearly described transport level compression.	2018-12-21 10:14:00 -07:00
Tanguy Leroux	bd2af2c400	Merge branch 'master' into close-index-api-refactoring	2018-12-21 12:22:24 +01:00
David Turner	cfea2fd68c	RecoveryMonitor#lastSeenAccessTime should be volatile (#36781 ) This local field is accessed on multiple threads and is nonvolatile so theoretically could yield stale values. Not sure it does in practice.	2018-12-21 11:10:48 +00:00
Michael Basnight	7cbf03c001	Scripting: Remove deprecated params.ctx (#36848 ) When the script contexts were created in 6, the use of params.ctx was deprecated. This commit cleans up that code and ensures that params.ctx is null in both watcher script contexts. Relates: #34059	2018-12-20 21:30:24 -06:00
Julie Tibshirani	fba710469a	Refactor the REST actions to clarify what endpoints are deprecated. (#36869 )	2018-12-20 18:06:41 -08:00
Tim Brooks	d9b2ed6135	Send clear session as routable remote request (#36805 ) This commit adds a RemoteClusterAwareRequest interface that allows a request to specify which remote node it should be routed to. The remote cluster aware client will attempt to route the request directly to this node. Otherwise it will send it as a proxy action to eventually end up on the requested node. It implements the ccr clean_session action with this client.	2018-12-20 17:43:12 -07:00
David Turner	d6d5134890	Fix name of SettingsBasedHostsProviderIT (#36778 ) It is not a _host_ provider because it provides more than one host.	2018-12-20 17:11:12 +00:00
David Turner	52d34e45e7	[Zen2] Minor logging improvements (#36818 ) * Adds term number and greppable phrase 'coordinator becoming' to Coordinator mode changes * Adds term and version to messages from the ClusterApplier about master changes * Reduces some LeaderChecker messages to TRACE level	2018-12-20 15:31:52 +00:00
Andrey Ershov	ca92d74e7e	[Zen2] Change unsafe bootstrap nodes count to nodes list in tests (#36559 ) This commit modifies ESSingleNodeTestCase and ESIntegTestCase and several concrete test classes to use node names when bootstrapping the cluster. Today ClusterBootstrapService.INITIAL_MASTER_NODE_COUNT_SETTING setting is used to bootstrap clusters in tests. Instead, we want to use ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING and get rid of the former setting eventually. There were two main problems when refactoring InternalTestCluster: 1. Nodes are created one-by-one in buildNode method. And node.name is created in this method as well. It's not suitable for bootstrapping, because we need to have the names of all master eligible nodes in advance, before creating the node with bootstrapping configuration set. We address this issue by separating buildNode into two methods: getNodeSettings and buildNode. We first iterate over all nodes to get nodes settings, then change the setting for the bootstrapping node and then proceed with building the node. 2. If autoManageMinMasterNodes = false, there is no way for the test to set the list of bootstrapping nodes because node names are not known in advance. This problem is solved by adding updateNodesSettings method to NodeConfigurationSource and ESIntegTestCase (which could be overridden by concrete integration test class). Once we have the list of settings for all nodes, the integration test class is allowed to update it. In our case, we update the ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING setting.	2018-12-20 15:20:33 +01:00
Dimitris Athanasiou	08bcd83757	[ML] Reduce persistent tasks periodic reassignment interval in ... (#36845 ) ... MlDistributedFailureIT.testLoseDedicatedMasterNode. An intermittent failure has been observed in `MlDistributedFailureIT. testLoseDedicatedMasterNode`. The test launches a cluster comprised by a dedicated master node and a data and ML node. It creates a job and datafeed and starts them. It then shuts down and restarts the master node. Finally, the test asserts that the two tasks have been reassigned within 10s. The intermittent failure is due to the assertions that the tasks have been reassigned failing. Investigating the failure revealed that the `assertBusy` that performs that assertion times out. Furthermore, it appears that the job task is not reassigned because the memory tracking info is stale. Memory tracking info is refreshed asynchronously when a job is attempted to be reassigned. Tasks are attempted to be reassigned either due to a relevant cluster state change or periodically. The periodic interval is controlled by a cluster setting called `cluster.persistent_tasks.allocation.recheck_interval` and defaults to 30s. What seems to be happening in this test is that if all cluster state changes after the master node is restarted come through before the async memory info refresh completes, then the job might take up to 30s until it is attempted to reassigned. Thus the `assertBusy` times out. This commit changes the test to reduce the periodic check that reassigns persistent tasks to `200ms`. If the above theory is correct, this should eradicate those failures. Closes #36760	2018-12-20 14:53:36 +02:00
Ryan Ernst	cfc0a47232	Core: Deprecate negative epoch timestamps (#36793 ) Negative timestamps are currently supported in joda time. These are dates before epoch. However, it doesn't really make sense to have a negative timestamp, since this is a modern format. Any dates before epoch can be represented with normal date formats, like ISO8601. Additionally, implementing negative epoch timestamp parsing in java time has an edge case which would more than double the code required. This commit deprecates use of negative epoch timestamps.	2018-12-20 00:17:06 -08:00
Tim Brooks	a7f344cc7f	Use ByteBuffer#slice in BytesReference wrapper (#36862 ) Currently the ByteBufferReference does not duplicate the buffer. This means that any changes to the buffer's limit or position will impact the reference. This can lead to unexpected behavior. This commit uses the ByteBuffer#slice method to ensure that the reference retains its own ByteBuffer.	2018-12-19 16:58:43 -07:00
Julie Tibshirani	ecb822c666	Deprecate the document create endpoint. (#36863 )	2018-12-19 15:20:20 -08:00
Mayya Sharipova	9c1e47d434	Add 6.5.5 version constants	2018-12-19 17:02:14 -05:00
Julie Tibshirani	32ef80f3d4	Avoid duplicate types deprecation messages in search-related APIs. (#36802 )	2018-12-19 12:59:25 -08:00
David Causse	3412627efe	QueryRescorer should keep the window size when rewriting (#36836 ) This attribute being controlled by the parent class it's easy to miss it during rewrites.	2018-12-19 21:42:18 +01:00
Ryan Ernst	c85c16bd94	Core: Revert back to joda's multi date formatters (#36814 ) This commit partially reverts #36447 by using the ability of Joda time's DateTimeFormatterBuilder to append multiple parsers instead of using the MergedDateFormatter. The MergedDateFormatter will be removed in a future change, as it is not as performant due to creating potentially many exceptions during heavy date parsing. This change is a stop-gap until that followup is ready. closes #36602	2018-12-19 11:20:35 -08:00
Christoph Büscher	9058698d9d	Removing some deprecated methods (#36829 ) Changes: * Removed deprecated method in InnerHitBuilder * Removed fields() from SearchRequestBuilder * Removed deprecated GeoDistanceSortBuilder#geohashes	2018-12-19 18:45:43 +01:00
Christoph Büscher	de052ae533	Bump laster 6.x version in VersionTests (#36851 )	2018-12-19 18:19:16 +01:00
Nhat Nguyen	b63f9b967c	Fix translog bwc serialization (#36676 ) Serializing of a Translog#Index from v7.0.0 to 6.x is broken since #29224 where we removed the _parent field. Relates #29224	2018-12-19 11:19:25 -05:00
Tanguy Leroux	fb24469fe7	Merge branch 'master' into close-index-api-refactoring	2018-12-19 16:17:26 +01:00
Ignacio Vera	a34a3532ce	[Javadoc]: Remove lucene tags (#36834 ) Remove lucene tags as they break gradle javadoc task Relates #36794	2018-12-19 15:29:09 +01:00
Jason Tedor	f2a5373495	Add the 6.7.0 constant to the master branch Now that the 6.x branch has been bumped to the 6.7.0 version, this commit adds knowledge of the 6.7.0 version to the master branch.	2018-12-19 08:24:10 -05:00
Yannick Welsch	8f141b8a41	Fix ClusterInfoServiceIT timeouts (#36758 ) The test testClusterInfoServiceInformationClearOnError relies on timing behavior. It sets InternalClusterInfoService.INTERNAL_CLUSTER_INFO_TIMEOUT_SETTING to 1s and relies on the fact that the stats request completes within that timeframe (which our ever-so-slow CI seems to violate at times). Unfortunately the logging has been misimplemented in InternalClusterInfoService, so the corresponding log messages showing that the requests have timed out are missing for this. The issue can be locally reproduced by reducing the timeout to something lower. Closes #36554	2018-12-19 13:59:58 +01:00
David Turner	3f8f907606	Extend time allowed to detect disconnection (#36827 ) If the master sends both its follower checks just before disconnection then neither will receive a response, meaning it must wait for the checks to time out and send another in order to detect the disconnection and stand down. Closes #36788	2018-12-19 12:51:14 +00:00
Yannick Welsch	487a1c4f71	Fix cluster state persistence for single-node discovery (#36825 ) Single-node discovery is not persisting cluster states, which was caused by a recent 7.0-only refactoring. This commit ensures that the cluster state is properly persisted when using single-node discovery and adds a corresponding test.	2018-12-19 13:26:04 +01:00
Boaz Leskes	216b154107	Replace 0L with an UNASSIGNED_PRIMARY_TERM constant (#36819 ) * Replace 0L with an UNASSIGNED_PRIMARY_TERM constant 0 is an illegal value for a primary term that is often used to indicate the primary term isn't assigned or is irrelevant. This PR replaces the usage of 0 with a constant, to improve readability and so it can be tracked and if needed, replaced. * feedback	2018-12-19 13:15:05 +01:00
Armin Braun	978713a67c	SNAPSHOTS+TESTS: Correctly Wait for Clean State (#36801 ) * Test must wait until there is no in-progress deletion as well here since the master failover leads to a snapshot deletion * Closes #36779	2018-12-19 13:14:36 +01:00
Alan Woodward	344917efab	Add script filter to intervals (#36776 ) This commit adds the ability to filter out intervals based on their start and end position, and internal gaps: ``` POST _search { "query": { "intervals" : { "my_text" : { "match" : { "query" : "hot porridge", "filter" : { "script" : { "source" : "interval.start > 10 && interval.end < 20 && interval.gaps == 0" } } } } } } } ```	2018-12-19 11:12:18 +00:00
Przemyslaw Gomulka	1345dff507	Fix line length in org.elasticsearch.snapshots (#36646 ) Remove the line length suppression for this package and fix offending lines in both main and test relates #34884	2018-12-19 11:29:21 +01:00
Luca Cavanna	d2ce576c8c	Use SearchRequest copy constructor in ExpandSearchPhase (#36772 ) Relates to #36641	2018-12-19 10:47:17 +01:00
Alan Woodward	dd540ef618	Use index-prefix fields for terms of length min_chars - 1 (#36703 ) The default index_prefix settings will index prefixes of between 2 and 5 characters in length. Currently, if a prefix search falls outside of this range at either end we fall back to a standard prefix expansion, which is still very expensive for single character prefixes. However, we have an option here to use a wildcard expansion rather than a prefix expansion, so that a query of a* gets remapped to a? against the _index_prefix field - likely to be a very small set of terms, and certain to be much smaller than a* against the whole index. This commit adds this extra level of mapping for any prefix term whose length is one less than the min_chars parameter of the index_prefixes field.	2018-12-19 08:55:05 +00:00
Tanguy Leroux	c99fd6a53b	Merge branch 'master' into close-index-api-refactoring	2018-12-19 09:34:59 +01:00
Nhat Nguyen	1e9d0bb01e	AwaitsFix testRebuildLocalCheckpointTracker Tracked at #36470	2018-12-19 03:12:34 -05:00
Simon Willnauer	8e5db90eec	Never corrupt fully deleted segments in tests (#36741 ) Today we might corrupt a fully deleted segment which is then pruned once a snapshot is taken. This causes random test failures in CorruptedFileIT. This change hardens the selection of files to corrupt and removes some fragile code preventing fully deleted segments to be taken into account. Closes #36526	2018-12-19 07:39:10 +01:00
Alpar Torok	e9ef5bdce8	Converting randomized testing to create a separate unitTest task instead of replacing the builtin test task (#36311 ) - Create a separate unitTest task instead of Gradle's built in - convert all configuration to use the new task - the built in task is now disabled	2018-12-19 08:25:20 +02:00
Alpar Torok	8015560bd4	Remove line length supressions from plugins (#36708 )	2018-12-19 08:06:48 +02:00
Tim Brooks	aaf466ff5e	Revert transport.port change for tests (#36809 ) Commit #36786 updated docs and strings to reference transport.port instead of transport.tcp.port. However, this breaks backwards compatibility tests as the tests rely on string configurations and transport.port does not exist prior to 6.6. This commit reverts the places were we reference transport.tcp.port for tests. This work will need to be reintroduced in a backwards compatible way.	2018-12-18 19:01:13 -07:00
Nick Knize	20b58f0b0f	[GEO] Fork Lucene's LatLonShape Classes to local lucene package (#36794 ) Lucene 7.6 uses a smaller encoding for LatLonShape. This commit forks the LatLonShape classes to Elasticsearch's local lucene package. These classes will be removed on the release of Lucene 7.6.	2018-12-18 16:48:30 -06:00
Tim Brooks	47a9a8de49	Update transport docs and settings for changes (#36786 ) This is related to #36652. In 7.0 we plan to deprecate a number of settings that make reference to the concept of a tcp transport. We mostly just have a single transport type now (based on tcp). Settings should only reference tcp if they are referring to socket options. This commit updates the settings in the docs. And removes string usages of the old settings. Additionally it adds a missing remote compress setting to the docs.	2018-12-18 13:09:58 -07:00
Boaz Leskes	10f6408814	enable bwc tests and switch transport serialization version to 6.6.0 for CAS features Relates to #36148	2018-12-18 20:03:07 +01:00
Yannick Welsch	e35de2ea2c	Synchronize WriteReplicaResult callbacks (#36770 ) TransportWriteAction.WriteReplicaResult is not properly synchronized, which can lead to a data race between the thread that calls respond and the AsyncAfterWriteAction that calls either onSuccess or onFailure. This data race results in the response listener not being called, which ultimately results in a stuck replication task on the replica.	2018-12-18 19:23:35 +01:00
Tim Brooks	1fa105658e	Add CcrRestoreSourceService to track sessions (#36578 ) This commit is related to #36127. It adds a CcrRestoreSourceService to track Engine.IndexCommitRef need for in-process file restores. When a follower starts restoring a shard through the CcrRepository it opens a session with the leader through the PutCcrRestoreSessionAction. The leader responds to the request by telling the follower what files it needs to fetch for a restore. This is not yet implemented. Once, the restore is complete, the follower closes the session with the DeleteCcrRestoreSessionAction action.	2018-12-18 11:23:13 -07:00
Ryan Ernst	8ec8342a52	Internal: Remove originalSettings from Node (#36569 ) This commit removes the originalSettings member from Node. It was only needed to allows test clusters to recreate the node in certain situations. Instead, the test cluster now keeps track of these settings.	2018-12-18 10:05:27 -08:00
Ryan Ernst	0b22ca3a0f	Core: Deprecate use of scientific notation in epoch time parsing (#36691 ) The joda epoch parsing code currently supports passing epoch time as a number in scientific notation. However, no systems appear to exist which output timestamps in scientific notation. In java time, it is particularly complex to implement scientific notation timestamp parsing within a DateTimeFormatter. This commit adds a deprecation warning when the epoch time parsers in joda parse scientific notation, so that it can be removed when switching to java time. joda are passed a time in scientific notation.	2018-12-18 09:51:03 -08:00
Alexander Reelsen	ea9b08dee1	Tests: Mute SnapshotDisruptionIT.testDisruptionOnSnapshotInitialization Relates #36779	2018-12-18 18:25:06 +01:00
Tanguy Leroux	103c4d4762	[Close Index API] Mark unavailable shard copy as stale during verification (#36755 ) This pull request modifies the TransportVerifyShardBeforeCloseAction so that it marks unavailable shards as stale.	2018-12-18 17:43:14 +01:00
Luca Cavanna	bd12e00b53	Update versions in SearchSortValues transport serialization Now that #36617 is backported to 6.x, the version in the transport serialization conditionals for the search sort values can be updated to 6.6.0	2018-12-18 17:03:00 +01:00
Luca Cavanna	1aad08c0c0	Update version in SearchHits transport serialization Now that #36555 is backported to 6.x, the version in the transport serialization conditionals for sort and collapse info can be updated to 6.6.0	2018-12-18 17:03:00 +01:00
Nick Knize	ec0dc2c0e9	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#36751 ) * [Geo] Expose BKDBackedGeoShapes as new VECTOR strategy This commit exposes lucene's LatLonShape field as a new strategy in GeoShapeFieldMapper. To use the new indexing approach, strategy should be set to "vector" in the geo_shape field mapper. If the tree parameter is set the mapper will throw an IAE. Note the following: When using vector strategy: * geo_shape query does not support querying by POINT, MULTIPOINT, or GEOMETRYCOLLECTION. * LINESTRING and MULTILINESTRING queries do not support WITHIN relation. * CONTAINS relation is not supported. * The tree, precision, tree_levels, distance_error_pct, and points_only parameters will not throw an exception but they have no effect and will be marked as deprecated.. All other features are supported. * revert change to PercolatorFieldMapper * fix ExistsQuery for geo_shape vector strategy * add deprecation logging for tree, precision, tree_levels, distance_error_pct, and points_only * initial update to geoshape docs, including mapping migration updates * initial support for GeoCollection queries * fix docs and javadoc errors * clean up geocollection queries * set deprecated mapping tests to NOTCONSOLE * fix geo-shape mapper asciidoc mapping and test warnings * add support for point queries using LatLonShapeBoundingBoxQuery * update GeoShapeQueryBuilderTests to include POINT queries for VECTOR strategy. Other comment cleanups * add lucene geometry build testing to ShapeBuilder tests * remove deprecated prefix tree mapping from geo-shape.asciidoc * refactor GeoShapeFieldMapper into LegacyGeoShapeFieldMapper and GeoShapeFieldMapper Both classes derive from BaseGeoShapeFieldMapper that provides shared parameters: coerce, ignoreMalformed, ignore_z_value, orientation. * update docs to remove vector strategy * fix GeometryCollectionBuilder#buildLucene to return the object created by the shape builder * fix LineLength failure in GeoJsonShapeParserTests * ShapeMapper refactor changes from PR feedback * fix typo in geo-shape.asciidoc * ignore circle test in docs * update indexing-approach ref to geoshape-indexing-approach * add warnings check for LegacyGeoShapeFieldMapper to AbstractBuilderTestCase * fix deprecatedParameters setup * update indexing approach * fixing unexpected warnings failures * move orientation back to field type * remove if in LegacyGeoShapeFieldMapper#doXContent. Fix GeoShapeFieldMapper to work with double array as a point * fix indexing-approach link in circle section of geoshape docs * add strategy to deprecation warnings check * fix test failures * fix typo in QueryStringQueryBuilderTests * fix total hits to totalHits().value * fix version number * add version check to BaseGeoShapeFieldMapper * fix line length! * revert version check in BaseGeoShapeFieldMapper * Fix serialization of mappings of legacy shapes.	2018-12-18 09:54:56 -06:00
Tanguy Leroux	cd3a1af77b	Merge branch 'master' into close-index-api-refactoring	2018-12-18 15:52:39 +01:00
Mayya Sharipova	f884b2b1cd	Deprecate types in index API (#36575 ) * Deprecate types in index API - deprecate type-based constructors of IndexRequest - update tests to use typeless IndexRequest constructors - no yaml tests as they have been already added in #35790 Relates to #35190	2018-12-18 08:53:49 -05:00
Boaz Leskes	5f76f39386	Rename seq# powered optimistic concurrency control parameters to ifSeqNo/ifPrimaryTerm (#36757 ) This PR renames the parameters previously introduce to the following: ### URL Parameters ``` PUT twitter/_doc/1?if_seq_no=501&if_primary_term=1 { "user" : "kimchy", "post_date" : "2009-11-15T14:12:12", "message" : "trying out Elasticsearch" } DELETE twitter/_doc/1?if_seq_no=501&if_primary_term=1 ``` ### Bulk API ``` POST _bulk { "index" : { "_index" : "test", "_type" : "_doc", "_id" : "1", "if_seq_no": 501, "if_primary_term": 1 } } { "field1" : "value1" } { "delete" : { "_index" : "test", "_type" : "_doc", "_id" : "2", "if_seq_no": 501, "if_primary_term": 1 } } ``` ### Java API ``` IndexRequest.ifSeqNo(long seqNo) IndexRequest.ifPrimaryTerm(long primaryTerm) DeleteRequest.ifSeqNo(long seqNo) DeleteRequest.ifPrimaryTerm(long primaryTerm) ``` Relates #36148 Relates #10708	2018-12-18 14:35:18 +01:00
Boaz Leskes	9087c98a5a	Expose Sequence Number based Optimistic Concurrency Control in the rest layer (#36721 ) Relates #36148 Relates #10708	2018-12-18 10:56:02 +01:00
Luca Cavanna	8a412c6a26	Ensure MapperService#getAllMetaFields elements order is deterministic (#36739 ) MapperService#getAllMetaFields returns an array, which is created out of an `ObjectHashSet`. Such set does not guarantee deterministic hash ordering. The array returned by its toArray may be sorted differently at each run. This caused some repeatability issues in our tests (see #29080) as we pick random fields from the array of possible metadata fields, but that won't be repeatable if the input array is sorted differently at every run. Once setting the tests seed, hppc picks that up and the sorting is deterministic, but failures don't repeat with the seed that gets printed out originally (as a seed was not originally set). See also https://issues.carrot2.org/projects/HPPC/issues/HPPC-173. With this commit, we simply create a static sorted array that is used for `getAllMetaFields`. The change is in production code but really affects only testing as the only production usage of this method was to iterate through all values when parsing fields in the high-level REST client code. Anyways, this seems like a good change as returning an array would imply that it's deterministically sorted.	2018-12-18 10:43:14 +01:00
Tanguy Leroux	0a0c969517	Merge branch 'master' into close-index-api-refactoring	2018-12-18 09:27:35 +01:00
Tanguy Leroux	7372529a95	[Tests] Reduce randomization in CloseWhileRelocatingShardsIT (#36694 )	2018-12-18 09:25:47 +01:00
Luca Cavanna	b57e12aa44	Add raw sort values to SearchSortValues transport serialization (#36617 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of each `SearchHit`. Sort values are already present but they are formatted according to the provided `DocValueFormat` provided. The CCS node needs to be able to reconstruct the lucene `FieldDoc` to include in the `TopFieldDocs` and `CollapseTopFieldDocs` which will feed the `mergeTopDocs` method used to reduce multiple search responses (one per cluster) into one. This commit adds such information to the `SearchSortValues` and exposes it through a new getter method added to `SearchHit` for retrieval. This info is only serialized at transport and never printed out at REST.	2018-12-18 09:20:51 +01:00
Nicholas Knize	96d279ed83	Revert "[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 )" This reverts commit `5bc7822562`.	2018-12-17 20:09:46 -06:00
Christoph Büscher	2f5300e3a6	Deprecate types in get_source and exist_source (#36426 ) This change adds a new untyped endpoint `{index}/_source/{id}` for both the GET and the HEAD methods to get the source of a document or check for its existance. It also adds deprecation warnings to RestGetSourceAction that emit a warning when the old deprecated "type" parameter is still used. Also updating documentation and tests where appropriate. Relates to #35190	2018-12-18 00:57:42 +01:00
Jason Tedor	f0f2b26159	Fix duplicate phrase in shrink/split error message (#36734 ) This commit removes a duplicate "must be a" from the shrink/split error messages.	2018-12-17 17:42:58 -05:00
Jake Landis	384757deff	ingest: support default pipelines + bulk upserts (#36618 ) This commit adds support to enable bulk upserts to use an index's default pipeline. Bulk upsert, doc_as_upsert, and script_as_upsert are all supported. However, bulk script_as_upsert has slightly surprising behavior since the pipeline is executed _before_ the script is evaluated. This means that the pipeline only has access the data found in the upsert field of the script_as_upsert. The non-bulk script_as_upsert (existing behavior) runs the pipeline _after_ the script is executed. This commit does _not_ attempt to consolidate the bulk and non-bulk behavior for script_as_upsert. This commit also adds additional testing for the non-bulk behavior, which remains unchanged with this commit. fixes #36219	2018-12-17 16:25:11 -06:00
Armin Braun	75bfbe92ee	TESTS:Debug Log. IndexStatsIT#testFilterCacheStats	2018-12-17 22:35:30 +01:00
Nick Knize	5bc7822562	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 ) This commit exposes lucene's LatLonShape field as the default type in GeoShapeFieldMapper. To use the new indexing approach, simply set "type" : "geo_shape" in the mappings without setting any of the strategy, precision, tree_levels, or distance_error_pct parameters. Note the following when using the new indexing approach: * geo_shape query does not support querying by MULTIPOINT. * LINESTRING and MULTILINESTRING queries do not yet support WITHIN relation. * CONTAINS relation is not yet supported. The tree, precision, tree_levels, distance_error_pct, and points_only parameters are deprecated.	2018-12-17 14:38:14 -06:00
Jake Landis	7bf822bbbb	ingest: fix on_failure with Drop processor (#36686 ) This commit allows a document to be dropped when a Drop processor is used in the on_failure fork of the processor chain. Fixes #36151	2018-12-17 14:10:13 -06:00
Armin Braun	4d0bb9dd0a	SNAPSHOTS: Adjust BwC Versions in Restore Logic (#36718 ) * Re-enables bwc tests with adjusted version conditions now that #36397 enables concurrent snapshots in 6.6+	2018-12-17 21:02:59 +01:00
Jason Tedor	cd632de116	Do not resolve addresses in remote connection info (#36671 ) The remote connection info API leads to resolving addresses of seed nodes when invoked. This is problematic because if a hostname fails to resolve, we would not display any remote connection info. Yet, a hostname not resolving can happen across remote clusters, especially in the modern world of cloud services with dynamically chaning IPs. Instead, the remote connection info API should be providing the configured seed nodes. This commit changes the remote connection info to display the configured seed nodes, avoiding a hostname resolution. Note that care was taken to preserve backwards compatibility with previous versions that expect the remote connection info to serialize a transport address instead of a string representing the hostname.	2018-12-17 13:24:31 -05:00
Evgenia Badyanova	27ad733dce	Fixing line length for EnvironmentTests and RecoveryTests (#36657 ) Relates #34884	2018-12-17 11:59:45 -05:00
Boaz Leskes	e356b8cb95	Add doc's sequence number + primary term to GetResult and use it for updates (#36680 ) This commit adds the last sequence number and primary term of the last operation that have modified a document to `GetResult` and uses it to power the Update API. Relates #36148 Relates #10708	2018-12-17 15:22:13 +01:00
Tanguy Leroux	ef6ae699cf	[Close Index API] Adapt MetaDataIndexStateServiceTests after merge	2018-12-17 10:28:26 +01:00
Tanguy Leroux	79999d37d4	Merge branch 'master' into close-index-api-refactoring	2018-12-17 10:14:38 +01:00
David Turner	f7567b865e	Improve assertions in testStaleMasterNotHijackingMajority (#36658 ) Investigating #36556 was made a little trickier because the feedback from the failing assertion wasn't very informative, and the messages attached to other nearby assertions were misleading. This commit improves the feedback from these assertions and tidies up a few other issues in the test suite.	2018-12-17 08:34:34 +00:00
Armin Braun	e82a647740	TEST: Fix Assertion for Shard Allocation Counts (#36685 ) * If removing half the nodes completely removes a shard from the cluster we can't count it in the assertion * Also: * Remove unused logger parameter * Fix typo in var name * Closes #35365	2018-12-17 09:20:21 +01:00
Boaz Leskes	733a6d34c1	Add seq no powered optimistic locking support to the index and delete transport actions (#36619 ) This commit add support for using sequence numbers to power [optimistic concurrency control](http://en.wikipedia.org/wiki/Optimistic_concurrency_control) in the delete and index transport actions and requests. A follow up will come with adding sequence numbers to the update and get results. Relates #36148 Relates #10708	2018-12-15 17:59:57 +01:00
Tim Brooks	4e4022b7ef	Register the TCP_CONNECT_TIMEOUT setting (#36663 ) This commit registers the TCP_CONNECT_TIMEOUT setting in the ClusterSettings.	2018-12-14 18:06:48 -07:00
Tim Brooks	3065300434	Unify transport settings naming (#36623 ) This commit updates our transport settings for 7.0. It generally takes a few approaches. First, for normal transport settings, it usestransport. instead of transport.tcp. Second, it uses transport.tcp, http.tcp, or network.tcp for all settings that are proxies for OS level socket settings. Third, it marks the network.tcp.connect_timeout setting for removal. Network service level settings are only settings that apply to both the http and transport modules. There is no connect timeout in http. Fourth, it moves all the transport settings to a single class TransportSettings similar to the HttpTransportSettings class. This commit does not actually remove any settings. It just adds the new renamed settings and adds todos for settings that will be deprecated.	2018-12-14 14:41:04 -07:00
Michael Basnight	dae422fb2b	Update joda compat methods to use compat class (#36654 ) The existing joda compat methods isEquals isAfter and isBefore all took in a ZonedDateTime, but since all of the scripting is now using the new JodaCompatZonedDateTime, these are changed to take that in instead.	2018-12-14 15:38:51 -06:00
João Barbosa	276726aea2	Added keyed response to pipeline percentile aggregations 22302 (#36392 ) Closes #22302	2018-12-14 16:22:54 -05:00
Alan Woodward	c7ac9ef826	Upgrade to lucene snapshot 774e9aefbc (#36637 ) Includes LUCENE-8607: improvement to MatchAllDocsQuery	2018-12-14 20:30:07 +00:00
Julie Tibshirani	ccd1beb9b3	Deprecate types in update requests. (#36181 ) The following updates were made: * Add deprecation warnings to `RestUpdateAction`, plus a test in `RestUpdateActionTests`. * Deprecate relevant methods on the Java HLRC requests/ responses. * Add HLRC integration tests for the typed APIs. * Update documentation (for both the REST API and Java HLRC). * Fix failing integration tests. Because of an earlier PR, the REST yml tests were already updated (one version without types, and another legacy version that retains types).	2018-12-14 10:47:27 -08:00
Tim Brooks	fbf88b2ab7	Remove the `MockTcpTransport` (#36628 ) This commit removes all remaining usages of the `MockTcpTransport`. Additionally it removes the `MockTcpTransport` and its test case.	2018-12-14 10:59:07 -07:00
Luca Cavanna	bb3ae18da5	Increase coverage in SearchSortValuesTests (#36597 ) SearchSortValuesTests extends now `AbstractSerializingTestCase` which removes some code duplication and standardizes the way we test `fromXContent`, serialization and equals/hashcode. Also, we were never creating `SearchSortValues` through their public constructor that accept an array of `DocValueFormat` together with the array of raw sort values. That is covered now, which involved some conversion from `BytesRef` to String in the test. Also, the previous test was not using doing any equality check against the original and parsed versions in `testFromXContent` due to values being parsed with different types in some cases, which is now covered by converting those values using a new method added to `RandomObjects`. The code was already there as part of `randomStoredFieldValues`, but it is now exposed to be used in other scenarios.	2018-12-14 18:57:37 +01:00
Luca Cavanna	8f04536a35	Add copy constructor to SearchRequest (#36641 ) For cross cluster search alternate execution mode (see #32125), we will need to take a search request that spans across multiple clusters (based on index prefixes e.g. cluster1:index, cluster2:index etc.) and split it into multiple search requests to be sent to each cluster. A copy constructor added to `SearchRequest` would make that easy and well maintainable in the future. Something along the same lines already happens in `BulkByScrollParallelizationHelper`, but the corresponding code went outdated as some new fields were added to `SearchRequest` which were not added to the bulk by scroll code. A copy constructor helps making the task of copying a search request maintainable over time.	2018-12-14 18:30:29 +01:00
Alan Woodward	09bf93dc2a	Add intervals query (#36135 ) * Add IntervalQueryBuilder with support for match and combine intervals * Add relative intervals * feedback * YAML test - broekn * yaml test; begin to add block source * Add block; make disjunction its own source * WIP * Extract IntervalBuilder and add tests for it * Fix eq/hashcode in Disjunction * New yaml test * checkstyle * license headers * test fix * YAML format * YAML formatting again * yaml tests; javadoc * Add OR test -> requires fix from LUCENE-8586 * Add docs * Re-do API * Clint's API * Delete bash script * doc fixes * imports * docs * test fix * feedback * comma * docs fixes * Tidy up doc references to old rule	2018-12-14 15:14:00 +00:00
David Turner	278cc4c3de	[Zen2] Improve assertion when creating fake node ID (#36544 ) Today we assert that the fake node ID is greater than the real node's ID. In fact we want to assert that it's greater than _all_ proper UUIDs. This adds assertions to that effect.	2018-12-14 14:16:26 +00:00
Jeff Hajewski	f1f3b28f5c	Delete deprecated getValues from ScriptDocValues (#36183 ) * Adds deprecation logging to ScriptDocValues#getValues. First commit addressing issue #22919. `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. * Fixes two build errors in #34279 * Removes unused import in ScriptDocValuesDatesTest * Removes used of `.values` in example in diversified-sampler-aggregation.asciidoc * Removes use of .values from painless test. Part of #34279 * Updates tests to use `doc[foo]` syntax rather than `doc[foo].values`. * Removes use of `getValues()` and replaces use of `doc[foo].values` with `doc[foo]`. * Indentation fix. * Remove unnecessary list construction at previous `getValues()` callsite in ScriptDocValues.GeoPoints. * Update migration doc and add link to `getValue` in ScriptDocValues javadoc. * Fix compile * Fix javadoc issue * Removes ScriptDocValues#getValues usage from painless whitelist.	2018-12-14 07:56:47 -05:00
David Turner	245cdd8c34	Register ResyncTask.Status as a NamedWriteable (#36610 ) Today, ResyncTask.Status is not registered, but appears as a task status sometimes, leading to `Failed to deserialize response from handler` exceptions: java.lang.IllegalArgumentException: Unknown NamedWriteable [org.elasticsearch.tasks.Task$Status][resync] This commit adds the missing registration.	2018-12-14 12:16:09 +00:00
David Turner	44ba9ab04d	Fix racy use of ConcurrentHashMap (#36603 ) ConcurrentHashMap does not always behave correctly if removing elements and concurrently checking for its emptyiness. Work around this by protecting all usages with a mutex (there was only one usage unprotected by the mutex anyway) and then we don't even need a ConcurrentHashMap at all.	2018-12-14 12:15:42 +00:00
Luca Cavanna	7dc3d3b78b	Add sort and collapse info to SearchHits transport serialization (#36555 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of the `SearchHits`, specifically: - lucene `SortField[]` which contains info about the fields that sorting was performed on and their type, which depends on mappings (that the CCS node does not know about) - collapse field (`String`) that field collapsing was executed on, if requested - collapse values (`Object[]`) that field collapsing was based on, if requested This info is needed to be able to reconstruct the `TopFieldDocs` or `CollapseFieldTopDocs` in the CCS coordinating node to feed the `mergeTopDocs` method and reduce multiple search responses received (one per cluster) into one. This commit adds such information to the `SearchHits` class. It's nullable info that is not serialized through the REST layer. `SearchPhaseController` sets such info at the end of the hits reduction phase.	2018-12-14 12:22:54 +01:00
Armin Braun	c5b3ac5578	SNAPSHOTS: Allow Parallel Restore Operations (#36397 ) * Enable parallel restore operations * Add uuid to restore in progress entries to uniquely identify them * Adjust restore in progress entries to be a map in cluster state * Added tests for: * Parallel restore from two different snapshots * Parallel restore from a single snapshot to different indices to test uuid identifiers are correctly used by `RestoreService` and routing allocator * Parallel restore with waiting for completion to test transport actions correctly use uuid identifiers	2018-12-14 11:39:23 +01:00
Nhat Nguyen	1ae53e6b20	TEST: Disable merges in testRebuildLocalCheckpointTracker (#36625 ) In this test, we keep track of a list of index commits then verify that we reload exactly every operation from the safe commit. If a background merge is triggered, then we might have a new index commit which is not recorded in the tracking list. This change disables merges in the test. Closes #36470	2018-12-14 05:25:36 -05:00
Christoph Büscher	573b6325e7	[Tests] Fix edge case in SimpleQueryStringBuilderTests (#36611 ) A previous fix of a similar problem in #35201 wasn't general enough, we also need to catch cases where the randomly generated query string starts with some version of "now" and hits a date field. Closes #36595	2018-12-14 11:10:45 +01:00
Daniel Mitterdorfer	b7ba2fa74b	Fix line length for VersionTests Relates #34884 Relates #36585	2018-12-14 10:07:27 +01:00
Daniel Mitterdorfer	42f094952b	Fix line length for IndexLifecycleActionIT Relates #34884 Relates #36586	2018-12-14 10:00:37 +01:00
Ryan Ernst	254d1e8f22	Core: Add backcompat for joda time formats (#36531 ) This commit adds deprecation warnings when using format specifiers with joda data formats that will change with java time. It also adds the "8" prefix which may be used to force the new java time format parsing.	2018-12-13 12:26:51 -08:00
Christoph Büscher	b33ff16d62	Remove deprecated `useDisMax` from MultiMatchQuery (#36488 ) The getters and setters for useDisMax() have been deprecated since at least 6.0, also there hasn't been any reference to the query parameter in the documentation. Removing it from the builder and tests and replacing it with `tieBreaker(1.0f)` where necessary.	2018-12-13 20:13:16 +01:00
Tanguy Leroux	1f6f236d44	Merge branch 'master' into close-index-api-refactoring	2018-12-13 17:40:06 +01:00
Tanguy Leroux	8e5dd20efb	[Close Index API] Refactor MetaDataIndexStateService (#36354 ) The commit changes how indices are closed in the MetaDataIndexStateService. It now uses a 3 steps process where writes are blocked on indices to be closed, then some verifications are done on shards using the TransportVerifyShardBeforeCloseAction added in #36249, and finally indices states are moved to CLOSE and their routing tables removed. The closing process also takes care of using the pre-7.0 way to close indices if the cluster contains mixed version of nodes and a node does not support the TransportVerifyShardBeforeCloseAction. It also closes unassigned indices. Related to #33888	2018-12-13 17:36:23 +01:00
Tal Levy	b820d7c617	fix MultiValuesSourceFieldConfig toXContent (#36525 ) This commit turns MultiValuesSourceFieldConfig into a proper ToXContentObject for easy testing and verification of its to/from XContent methods. Closes #36474.	2018-12-13 08:17:36 -08:00
Tim Brooks	f998e04c34	Remove usages of `MockTcpTransport` from zen tests (#36579 ) We are attempting to replace the usage of the `MockTcpTransport` with the `MockNioTransport`. This commit replaces usages of `MockTcpTransport` in two zen test cases.	2018-12-13 09:10:18 -07:00
Jason Tedor	2afa7faefd	Override the JVM DNS cache policy (#36570 ) When a security manager is present, the JVM will cache positive hostname lookups indefinitely. This can be problematic, especially in the modern world with cloud services where DNS addresses can change, or environments using Docker containers where IP addresses could be considered ephemeral. This behavior impacts cluster discovery, cross-cluster replication and cross-cluster search, reindex from remote, snapshot repositories, webhooks in Watcher, external authentication mechanisms, and the Elastic Stack Monitoring Service. The experience of watching a DNS lookup change yet not be reflected within Elasticsearch is a poor experience for users. The reason the JVM has this is guard against DNS cache posioning attacks. Yet, there is already a defense in the modern world against such attacks: TLS. With proper certificate validation, even if a resolver falls prey to a DNS cache poisoning attack, using TLS would neuter the attack. Therefore we have a policy with dubious security value that significantly impacts usability. As such we make the usability/security tradeoff towards usability, since the security risks are very low. This commit introduces new system properties that Elasticsearch observes to override the JVM DNS cache policy.	2018-12-13 10:23:45 -05:00
Tanguy Leroux	40953d70ca	Merge branch 'master' into close-index-api-refactoring	2018-12-13 15:45:56 +01:00
Igor Motov	7446f75346	Geo: Adds a name of the field to geopoint parsing errors (#36529 ) Adds the field name and type to geo_point parsing errors. Closes #15965	2018-12-13 17:32:45 +04:00
Armin Braun	088d3f39c3	MINOR: Cleanup Dead Code in o.e.c.metadata Package (#36587 )	2018-12-13 12:54:22 +01:00
David Roberts	13cb0fb98b	Periodically try to reassign unassigned persistent tasks (#36069 ) Previously persistent task assignment was checked in the following situations: - Persistent tasks are changed - A node joins or leaves the cluster - The routing table is changed - Custom metadata in the cluster state is changed - A new master node is elected However, there could be situations when a persistent task that could not be assigned to a node could become assignable due to some other change, such as memory usage on the nodes. This change adds a timed recheck of persistent task assignment to account for such situations. The timer is suspended while checks triggered by cluster state changes are in-flight to avoid adding burden to an already busy cluster. Closes #35792	2018-12-13 09:15:27 +00:00
Armin Braun	34d7cc13c1	SETTINGS: Correctly Identify Noop Updates (#36560 ) * We should compare the target value with the to be applied value before interpreting the update as a change * This speeds up the test failing in #36496 considerably by preventing state updates on noop setting updates	2018-12-13 10:02:54 +01:00
Boaz Leskes	f6b5d7e013	Add sequence numbers based optimistic concurrency control support to Engine (#36467 ) This commit add support to engine operations for resolving and verifying the sequence number and primary term of the last modification to a document before performing an operation. This is infrastructure to move our (optimistic concurrency control)[http://en.wikipedia.org/wiki/Optimistic_concurrency_control] API to use sequence numbers instead of internal versioning. Relates #36148 Relates #10708	2018-12-13 08:08:40 +01:00
Tal Levy	cd1bec3a06	[refactor] add Environment in BootstrapContext (#36573 ) There are certain BootstrapCheck checks that may need access environment-specific values. Watcher's EncryptSensitiveDataBootstrapCheck passes in the node's environment via a constructor to bypass the shortcoming in BootstrapContext. This commit pulls in the node's environment into BootstrapContext. Another case is found in #36519, where it is useful to check the state of the data-path. Since PathUtils.get and Paths.get are forbidden APIs, we rely on the environment to retrieve references to things like node data paths. This means that the BootstrapContext will have the same Settings used in the Environment, which currently differs from the Node's settings.	2018-12-12 21:07:21 -08:00
Mayya Sharipova	d40037c91e	Deprecate uses of _type as a field name in queries (#36503 )	2018-12-12 21:21:53 -05:00
Tal Levy	9c1cdea839	[cleanup] remove deprecated references to dataWithClusterFiles (#36574 ) data files under the cluster name subdirectory has been deprecated and was meant to be removed in 6.0. This commit removes some leftover referrences to these paths.	2018-12-12 16:28:11 -08:00
Julie Tibshirani	71a39d10be	Make sure that BWC tests run successfully, even with types deprecation messages. (#36511 )	2018-12-12 12:57:32 -08:00
Alan Woodward	9ac7359643	Update lucene to snapshot-7e4555a2fd (#36563 ) Includes the following: * Reversion of doc-values changes in LUCENE-8374; we are interested in seeing if this has an effect on benchmarks for node-stats and index-stats * More improvements to docvalues updates	2018-12-12 20:18:32 +00:00
Julie Tibshirani	33152f648f	Fix some inconsistencies in the types deprecation code. (#36517 ) * Make sure to test conversion for both typed and typeless HLRC requests. * Update a few more statements to deprecatedAndMaybeLog. * Make sure Rest*SearchTemplateActionTests extend RestActionTestCase.	2018-12-12 10:38:02 -08:00
Tim Brooks	7f612d5dd8	Always compress based on the settings (#36522 ) Currently TransportRequestOptions allows specific requests to request compression. This commit removes this and always compresses based on the settings. Additionally, it removes TransportResponseOptions as they are unused. This closes #36399.	2018-12-12 09:39:15 -07:00
Armin Braun	02d0f163a4	MINOR: Cleanup Dead Code in Snapshots Package (#36528 )	2018-12-12 16:48:34 +01:00
Tanguy Leroux	dbcafbefc9	Merge branch 'master' into close-index-api-refactoring	2018-12-12 16:45:31 +01:00
Simon Willnauer	ff5dd14753	Fix test failures related to file corruption (#36530 ) * Fix CorruptFileIT to also take last DV generation into account We currently only prune old .liv generations. With soft_deletes it's important to also prune DV generations. * Fix CorruptionUtils to skip the footer bytes after the checksum is read. Today we read a broken checksum since we also checksum the 8 footer bytes that include the checksum algorithm and the footer magic. Closes #36526	2018-12-12 16:21:02 +01:00
Tim Brooks	3a56bb0924	Do not zero out pages for bytes output stream (#36523 ) Currently, the `BytesStreamOutput` always zeros out the underlying byte pages when they are acquired. This should not be necessary as the stream overwrites the underlying bytes as serialization occurs.	2018-12-12 07:55:17 -07:00
Tim Brooks	e63d52af63	Move page size constants to PageCacheRecycler (#36524 ) `PageCacheRecycler` is the class that creates and holds pages of arrays for various uses. `BigArrays` is just one user of these pages. This commit moves the constants that define the page sizes for the recycler to be on the recycler class.	2018-12-12 07:00:50 -07:00
Jason Tedor	bdb1e0e04e	Make node field in JoinRequest private (#36405 ) This commit moves the node field in the JoinRequest object to be a private field, adding a dedicated accessor. This is a minor breaking change in that it is no longer possible for all callers to overwrite this field, but that is a feature.	2018-12-12 08:50:21 -05:00
David Turner	aa43e0b2cc	[Zen2] Migrate no-master-block integration tests (#36502 ) This change follows up on #36478 by migrating the affected integration tests to use Zen2.	2018-12-12 12:52:34 +00:00
Ryan Ernst	c4f4378006	Core: Rework multi date formatter merging (#36447 ) This commit moves the MergedDateFormatter to a package private class and reworks joda DateFormatter instances to use that instead of a single DateTimeFormatter with multiple parsers. This will allow the java and joda multi formats to share the same format parsing method in a followup.	2018-12-11 23:47:44 -08:00
Nhat Nguyen	1bb6f844fe	AwaitsFix testCorruptFileThenSnapshotAndRestore Tracked at #36526	2018-12-12 02:03:23 -05:00
Nhat Nguyen	3fb5a12b30	Upgrade to Lucene-8.0.0-snapshot-61e448666d (#36518 ) Includes: - LUCENE-8602: Share TermsEnum if possible while applying DV updates	2018-12-12 06:47:40 +01:00
Nhat Nguyen	51800de2a8	Enable soft-deletes by default on 7.0.0 or later (#36141 ) This change enables soft-deletes by default on ES 7.0.0 or later. Relates #33222 Co-authored-by: Jason Tedor <jason@tedor.me>	2018-12-11 18:58:49 -05:00
Tim Brooks	797f985067	Add version to handshake requests (#36171 ) Currently our handshake requests do not include a version. This is unfortunate as we cannot rely on the stream version since it is not the sending node's version. Instead it is the minimum compatibility version. The handshake request is currently empty and we do nothing with it. This should allow us to add data to the request without breaking backwards compatibility. This commit adds the version to the handshake request. Additionally, it allows "future data" to be added to the request. This allows nodes to craft a version compatible response. And will properly handle additional data in future handshake requests. The proper handling of "future data" is useful as this is the only request where we do not know the other node's version. Finally, it renames the TcpTransportHandshaker to TransportHandshaker.	2018-12-11 16:09:28 -07:00
Mayya Sharipova	2f18325384	Deprecate types in update_by_query and delete_by_query (#36365 ) Relates to #35190	2018-12-11 17:09:59 -05:00
Luca Cavanna	dafea3cc23	Minor search controller changes (#36479 ) This commit contains a few minor changes to our search code: - adjust the visibility of a couple of methods in our search code to package private from public or protected. - make some of the `SearchPhaseController` methods static where possible - rename one of the `SearchPhaseController#reducedQueryPhase` methods (used only for scroll requests) to `reducedScrollQueryPhase` without the `isScrollRequest` argument which was always set to `true` - replace leniency in `SearchPhaseController#setShardIndex` with an assert to make sure that we never set the shard index twice - remove two null checks where the checked field can never be null - resolve an unchecked warning - replace `List#toArray` invocation that creates an array providing the true size with array creation of length 0 - correct a couple of typos in comments	2018-12-11 20:24:29 +01:00
Luca Cavanna	fb18b35347	Add default methods to DocValueFormat (#36480 ) The different `DocValueFormat` implementors throw `UnsupportedOperationException` for methods that they don't support. That is perfectly fine, and quite common as not all implementors support all of the possible formats. This makes it hard though to trace back which implementors support which formats as they all implement the same methods. This commit introduces default methods in the `DocValueFormat` interface so that all methods throw `UnsupportedOperationException` by default. This way implementors can override only the methods that they specifically support.	2018-12-11 20:19:01 +01:00
Tim Brooks	790f8102e9	Modify `BigArrays` to take name of circuit breaker (#36461 ) This commit modifies BigArrays to take a circuit breaker name and the circuit breaking service. The default instance of BigArrays that is passed around everywhere always uses the request breaker. At the network level, we want to be using the inflight request breaker. So this change will allow that. Additionally, as this change moves away from a single instance of BigArrays, the class is modified to not be a Releasable anymore. Releasing big arrays was always dispatching to the PageCacheRecycler, so this change makes the PageCacheRecycler the class that needs to be managed and torn-down. Finally, this commit closes #31435 be making the serialization of transport messages use the inflight request breaker. With this change, we no longer push the global BigArrays instnace to the network level.	2018-12-11 11:55:41 -07:00
Martijn van Groningen	405a38131b	added 6.5.4 version	2018-12-11 19:38:59 +01:00
David Turner	21d91f1268	[Zen2] Respect the no_master_block setting (#36478 ) Today the Zen2 coordinator only applies a write block when there is no known elected master, ignoring the `discovery.zen.no_master_block` setting. This commit resolves this, applying the correct block according to the configuration instead.	2018-12-11 16:40:19 +00:00
Nhat Nguyen	084e06e481	Require soft-deletes when access changes snapshot (#36446 ) Today we do not enforce soft-deletes when accessing the Lucene changes snapshot. This might lead to internal errors because we assume soft-deletes are enabled in that code path.	2018-12-11 11:18:10 -05:00
Andrey Ershov	8b821706cc	Switch more tests to zen2 (#36367 ) 1. CCR tests work without any changes 2. `testDanglingIndices` require changes the source code (added TODO). 3. `testIndexDeletionWhenNodeRejoins` because it's using just two nodes, adding the node to exclusions is needed on restart. 4. `testCorruptTranslogTruncationOfReplica` starts dedicated master one, because otherwise, the cluster does not form, if nodes are stopped and one node is started back. 5. `testResolvePath` needs TEST cluster, because all nodes are stopped at the end of the test and it's not possible to perform checks needed by SUITE cluster. 6. `SnapshotDisruptionIT`. Without changes, the test fails because Zen2 retries snapshot creation as soon as network partition heals. This results into the race between creating snapshot and test cleanup logic (deleting index). Zen1 on the other hand, also schedules retry, but it takes some time after network partition heals, so cleanup logic executes latter and test passes. The check that snapshot is eventually created is added to the end of the test.	2018-12-11 17:12:17 +01:00
David Turner	c3a6d1998a	Log [initial_master_nodes] on formation failure (#36466 ) Today we log a slightly cryptic "cluster bootstrapping is disabled on this node" message if bootstrapping hasn't been configured. Since there is today only one way to bootstrap the cluster it seems preferable to spell out exactly which setting is missing.	2018-12-11 12:53:05 +00:00
Christian Schneider	00eadd93eb	converting ForbiddenPatternsTask to .java (#36194 ) * converting ForbiddenPatternsTask to java impl & unit tests	2018-12-11 14:15:44 +02:00
Yu	d01b30acba	lower fielddata circuit breaker's default limit (#27162 ) * Lower fielddata circuit breaker default limit Lower fielddata circuit breaker default limit from 60% to 40% as we have moved to doc_values for most of the cases. * merge master in * update tests * update docs	2018-12-11 11:30:58 +01:00
Yannick Welsch	71cf6faaad	Only turn to follower when term bumping on follower check (#36449 ) Deals with a situation where a follower becomes disconnected from the leader, but only for such a short time where it becomes candidate and puts up a NO_MASTER_BLOCK, but then receives a follower check from the leader. If the leader does not notice the node disconnecting, it is important for the node not to be turned back into a follower but try and join the leader again. We still should prefer the node into a follower on a follower check when this follower check triggers a term bump as this can help during a leader election to quickly have a leader turn all other nodes into followers, even before the leader has had the chance to transfer a possibly very large cluster state. Closes #36428	2018-12-11 11:22:23 +01:00
Yannick Welsch	d8e3d97a7d	Add discovery types to cluster stats (#36442 ) Adds information about the used discovery types to the cluster stats, similar as we have for the network types.	2018-12-11 10:35:58 +01:00
David Turner	2bf0583fec	Exclude Zen2 fault detection traffic from tracer (#36444 ) Today we exclude `internal:discovery/zen/fd*` from the transport tracer by default. This commit excludes the new fault detection mechanism similarly.	2018-12-11 09:02:01 +00:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Ryan Ernst	a0da390df2	Scripting: Switch watcher to use joda bwc time objects (#35966 ) This commit converts the watcher execution context to use the joda compat java time objects. It also again removes the joda methods from the painless whitelist.	2018-12-10 17:29:25 -08:00
Julie Tibshirani	99f89cd3b4	Deprecate types in get, exists, and multi get. (#35930 ) For each API, the following updates were made: - Add deprecation warnings to `RestAction`, plus tests in `RestActionTests`. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 17:22:48 -08:00
Nhat Nguyen	2a7edca59f	Upgrade to Lucene-8.0.0-snapshot-ef61b547b1 (#36450 ) Includes: - LUCENE-8598: Improve field updates packed values - LUCENE-8599: Use sparse bitset to store docs in SingleValueDocValuesFieldUpdates	2018-12-10 16:33:49 -05:00
Jernej Klancic	d615add1b1	Add pipeline parent validation for auto date histogram (#35670 ) Allow `auto_date_histogram` as a valid parent agg for derivative, cumulative sum, moving average, moving function and serial differencing pipeline aggregations. Since all these aggs share the same requirement (sequentially ordered parent aggs), this commit also refactors to share the same validation code so that any newly added aggs won't be forgotten. Closes #35578	2018-12-10 16:02:49 -05:00
Jim Ferenczi	75392adf60	[TEST] Convert SearchHitsTests to AbstractStreamableXContentTestCase (#36313 ) This change adds a way to provide the content type of the rest serialization tests when creating random instances. This is used by SearchHitsTests to ensure that the internal members of the class are created with the same xContentType and that equals can be used to compare an instances created from an XContent view.	2018-12-10 20:41:20 +01:00
David Turner	ca3f5c1e2e	Cancel GetDiscoveredNodesAction when bootstrapped (#36423 ) Today the `GetDiscoveredNodesAction` waits, possibly indefinitely, to discover enough nodes to bootstrap the cluster. However it is possible that the cluster forms before a node has discovered the expected collection of nodes, in which case the action will wait indefinitely despite the fact that it is no longer required. This commit changes the behaviour so that the action fails once a node receives a cluster state with a nonempty configuration, indicating that the cluster has been successfully bootstrapped and therefore the `GetDiscoveredNodesAction` need wait no longer. Relates #36380 and #36381; reverts `558f4ec278`.	2018-12-10 17:23:03 +00:00
Ryan Ernst	1a36046421	Core: Convert FormatDateTimeFormatter to DateFormatter (#36396 ) This commit creates JodaDateFormatter to replace FormatDateTimeFormatter. It converts all uses of the old class to DateFormatter to allow a future change to use JavaDateFormatter when appropriate.	2018-12-10 08:06:41 -08:00
Yannick Welsch	f79e602d9f	Mute ClusterDisruptionIT.testSendingShardFailure Relates to #36428	2018-12-10 13:53:43 +01:00
Igor Suhorukov	0d9e3adc95	Loop replace with Collection.removeIf() (#36351 )	2018-12-10 11:26:24 +01:00
Yannick Welsch	6e6e63d01d	Zen2: Move all mixed-version REST tests to Zen2 (#36398 ) Moves all remaining (rolling-upgrade and mixed-version) REST tests to use Zen2. To avoid adding extra configuration, it relies on Zen2 being set as the default discovery type. This required a few smaller changes in other tests. I've removed AzureMinimumMasterNodesTests which tests Zen1 functionality and dates from a time where host providers were not configurable and each cloud plugin had its own discovery.type, subclassing the ZenDiscovery class. I've also adapted a few tests which were unnecessarily adding addTestZenDiscovery = false for the same legacy reasons. Finally, this also moves the unconfigured-node-name REST test to Zen2, testing the auto-bootstrapping functionality in development mode when no discovery configuration is provided.	2018-12-10 11:00:57 +01:00
Nhat Nguyen	902d6f579a	Exposed engine must have all ops below gcp during rollback (#36159 ) Today we expose a new engine immediately during Lucene rollback. The new engine is started with a safe commit which might not include all acknowledged operation. With this change, we won't expose the new engine until it has recovered from the local translog. Note that this solution is not complete since it's able to reserve only acknowledged operations before the global checkpoint. This is because we replay translog up to the global checkpoint during rollback. A per-doc Lucene rollback would solve this issue entirely. Relates #32867	2018-12-08 21:27:44 -05:00
Nhat Nguyen	d41cf6ac9f	Use delCount of SegmentInfos to calculate numDocs (#36323 ) Today, we iterate the bitset of hardLiveDocs to calculate the number of live docs. This calculation might be expensive if we enable soft-deletes (by default) for old indices whose soft-deletes was disabled previously and had hard-deletes. Once soft-deletes is enabled, we no longer hard-update or hard-delete documents directly. We have hard-deletes in two scenarios: (1) from old segments where soft-deletes was disabled, (2) when IndexWriter hits non-aborted exceptions. These two cases, IW flushes SegmentInfos before exposing the hard-deletes; thus we can use the hard-delete count of SegmentInfos.	2018-12-08 21:24:36 -05:00
Tim Brooks	cc3872d934	Reduce composite references at network level (#36400 ) Creating `CompositeBytesReference` has more overhead than a single `ByteBufferReference`. Many of our messages will be contained to a single `ByteBuffer`. This commit avoids creating composite instances when there is 0 or 1 underlying `ByteBuffers`.	2018-12-08 14:24:38 -07:00
David Turner	9f86e996fe	[Zen2] Support rolling upgrades from Zen1 (#35737 ) We support rolling upgrades from Zen1 by keeping the master as a Zen1 node until there are no more Zen1 nodes in the cluster, using the following principles: - Zen1 nodes will never vote for Zen2 nodes - Zen2 nodes will, while not bootstrapped, vote for Zen1 nodes - Zen2 nodes that were previously part of a mixed cluster will automatically (and unsafely) bootstrap themselves when the last Zen1 node leaves.	2018-12-08 07:33:35 +00:00
Ryan Ernst	a27f2efca5	Core: Converge FormatDateTimeFormatter and DateFormatter apis (#36390 ) This commit makes FormatDateTimeFormatter and DateFormatter apis close to each other, so that the former can be removed in favor of the latter. This PR does not change the uses of FormatDateTimeFormatter yet, so that that future change can be purely mechanical.	2018-12-07 17:23:41 -08:00
Julie Tibshirani	b15d1aebcf	For msearch templates, make sure to use the right name for deprecation logging. (#36344 )	2018-12-07 14:50:47 -08:00
Tim Brooks	8a53f2b464	Implement basic `CcrRepository` restore (#36287 ) This is related to #35975. It implements a basic restore functionality for the CcrRepository. When the restore process is kicked off, it configures the new index as expected for a follower index. This means that the index has a different uuid, the version is not incremented, and the Ccr metadata is installed. When the restore shard method is called, an empty shard is initialized.	2018-12-07 15:27:04 -07:00
Ryan Ernst	a998f4dec6	Core: Remove parseDefaulting from DateFormatter (#36386 ) This commit removes the parseDefaulting method from DateFormatter, bringing it more inline with the joda equivalent FormatDateTimeFormatter. This method was only needed for the java time implementation of DateMathParser. Instead, a DateFormatter now returns an implementation of DateMathParser for the given format, allowing the java time implementation to construct the appropriate date math parser internally.	2018-12-07 12:40:14 -08:00
Julie Tibshirani	51e1d40dca	Small improvements related to types deprecation. (#36328 ) * Make sure to use deprecatedAndMaybeLog for types deprecation messages. * Introduce a common base class for Rest*Action tests.	2018-12-07 11:21:24 -08:00
Tim Brooks	5556204f81	Use MockNioTransport in MockTransportService (#36346 ) The default transport used in the MockTransportService is the MockTcpTransport. This commit changes that to be the MockNioTransport.	2018-12-07 11:17:11 -07:00
David Turner	9d417984bd	[Zen2] Add warning if cluster fails to form fast enough (#35993 ) * Add warning if cluster fails to form fast enough Today if a leader is not discovered or elected then nodes are essentially silent at INFO and above, and log copiously at DEBUG and below. A short delay when electing a leader is not unusual, for instance if other nodes have not yet started, but a persistent failure to elect a leader is a problem worthy of log messages in the default configuration. With this change, while there is no leader each node outputs a WARN-level log message every 10 seconds (by default) indicating as such, describing the current discovery state and the current quorum(s). * Add note about whether the discovered nodes form a quorum or not * Introduce separate ClusterFormationFailureHelper ... and back out the unnecessary changes elsewhere * It can be volatile	2018-12-07 17:23:20 +00:00
Nhat Nguyen	f2df0a5be4	Remove LocalCheckpointTracker#resetCheckpoint (#34667 ) In #34474, we added a new assertion to ensure that the LocalCheckpointTracker is always consistent with Lucene index. However, we reset LocalCheckpoinTracker in testDedupByPrimaryTerm cause this assertion to be violated. This commit removes resetCheckpoint from LocalCheckpointTracker and rewrites testDedupByPrimaryTerm without resetting the local checkpoint. Relates #34474	2018-12-07 12:22:20 -05:00
Jim Ferenczi	a53e8653f2	Add support for inlined user dictionary in Nori (#36123 ) Add support for inlined user dictionary in Nori This change adds a new option called `user_dictionary_rules` to the Nori a tokenizer`. It can be used to set additional tokenization rules to the Korean tokenizer directly in the settings (instead of using a file). Closes #35842	2018-12-07 15:26:08 +01:00
David Turner	c32e4fb83f	[Zen2] Best-effort cluster formation if unconfigured (#36215 ) In real deployments it is important that clusters are properly configured to avoid accidentally forming multiple independent clusters at cluster bootstrapping time. However we also expect to be able to unpack Elasticsearch and start up one or more nodes without any up-front configuration, and have them do their best to find each other and form a cluster after a few seconds. This change adds a delayed automatic bootstrapping process to nodes that start up with no relevant settings set to support the desired out-of-the-box experience without compromising safety in properly-configured deployments.	2018-12-07 12:47:09 +00:00
Christoph Büscher	879397df70	Remove deprecated methods in SimpleQueryStringBuilder (#36341 ) The `useAllFields`methods have already been deprecated in 6.0 and are save to remove in 7.0.	2018-12-07 11:15:16 +01:00
Christoph Büscher	562479b5a5	Remove deprecated methods in MoreLikeThisQueryBuilder (#36340 ) The ids method is deprecated since 6.0 (at least) and only used in a few tests internally, so we can safely remove it.	2018-12-07 11:14:41 +01:00
Adrien Grand	266b9bc796	Inner hits fail to propagate doc-value format. (#36310 ) If you pass a doc-value format to inner hits, it gets ignored.	2018-12-07 09:57:12 +01:00
Alexander Reelsen	6a987415f8	Scripting: Properly support no-offset date formatting (#36316 ) The conversion of timezones in JodaCompatibleZonedDateTime from joda to java time requires the use of the DateUtils class to cater for corner cases. Closes #36306	2018-12-07 09:40:58 +01:00
Tanguy Leroux	593d8637c4	Merge branch 'master' into close-index-api-refactoring	2018-12-07 09:31:35 +01:00
Tanguy Leroux	3ca885e9ac	[Close Index API] Add TransportShardCloseAction for pre-closing verifications (#36249 ) This pull request adds the TransportShardCloseAction which is a transport replication action that acquires all index shard permits for its execution. This action will be used in the future by the MetaDataIndexStateService in a new index closing process, where we need to execute some sanity checks before closing an index. The action executes the following verifications on the primary and replicas: * there is no other on going operation active on the shard * the data node holding the shard knows that the index is blocked for writes * the shard's max sequence number is equal to the global checkpoint When the verifications are done and successful, the shard is flushed. Relates #33888	2018-12-07 09:30:27 +01:00
Tanguy Leroux	9691d0508c	Remove unused class DelegatingActionListener (#36257 )	2018-12-07 09:20:24 +01:00
Armin Braun	e6d190613f	[ZEN2] Use Zen2 in REST Tests (#36300 )	2018-12-07 09:15:11 +01:00
Nhat Nguyen	10feb75eb7	Upgrade to Lucene-8.0.0-snapshot-aaa64d70159 (#36335 ) Includes: LUCENE-8594: DV update are broken for updates on new field LUCENE-8590: Optimize DocValues update datastructures LUCENE-8593: Specialize single value numeric DV updates Relates #36286	2018-12-06 20:33:25 -05:00
Nhat Nguyen	a7edf1ed1f	TEST: Ensure a manual flush always occur (#36188 ) The test testLookupSeqNoByIdInLucene fails because it expects if any change should be visible after a flush. However, that flush might be ignored if the waitIfOngoing parameter is false (the default value), and there is an ongoing flush triggered after merge is running. Closes #35823	2018-12-06 18:37:54 -05:00
Ryan Ernst	37b3fc383f	Build: Use explicit deps on test tasks for check (#36325 ) This commit moves back to use explicit dependsOn for test tasks on check. Not all tasks extending RandomizedTestingTask should be run by check directly.	2018-12-06 14:13:49 -08:00
Julie Tibshirani	3f3cde41d3	Deprecate types in termvector and mtermvector requests. (#36182 ) * Add deprecation warnings to `RestTermVectorsAction`, plus tests in `RestTermVectorsActionTests`. * Deprecate relevant methods on the Java HLRC requests/ responses. * Update documentation (for both the REST API and Java HLRC). * For each REST yml test, create one version without types, and another legacy version that retains types (called *_with_types.yml).	2018-12-06 10:23:15 -08:00
Armin Braun	96ee6e0d2a	TEST: Reenable RemoveCorruptedShardDataCommandIT.testCorruptIndex * Merging the zen2 branch has made it impossible to reproduce the test failure here, likely as a result of the state now being written atomically * Closes #36189	2018-12-06 17:55:40 +01:00
Christoph Büscher	65e15f07fb	Add comments about need for explicit cast Some IDEs (specifically Eclipse 4.8.0) needs some explicit casting for type inference. This was added in a recent change but we should also have comments so we remember why this needs to be there (at least for now).	2018-12-06 17:40:59 +01:00
David Turner	ed1c5a0241	Introduce `zen2` discovery type (#36298 ) With this change it is now possible to start a node running Zen2.	2018-12-06 16:20:08 +00:00
Andrey Ershov	f0340d6d32	[Zen2] Storage layer WriteStateException propagation (#36052 ) Currently, we only log that WriteStateException has occurred, in GatewayMetaState. This PR goal is to re-throw WriteStateException to upper layers. If dirty flag is set to false, we wrap WriteStateException in UncheckedIOException, we prefer unchecked exception not to add explicit throws in the multitude of layers. If dirty flag is set to true - the world is broken. And we need to halt the JVM. Instead of explicit halting in GatewayMetaState, we prefer to throw IOError, which will be subsequently handled by ElasticsearchUncaughtExceptionHandler and JVM will be halted. This PR also adds tests for WriteStateException.	2018-12-06 17:02:02 +01:00
Adrien Grand	b08cffad48	Fix line length offenders in the o.e.search package. (#36223 ) Relates #34884	2018-12-06 16:31:12 +01:00
Christoph Büscher	f5fa1f4746	Help Eclipse with type inference for functions (#36301 ) The Eclipse IDE java compiler seems to need some special hints about what types some functions used in the tests return. Correcting this for some test that were newly merged to master.	2018-12-06 16:04:25 +01:00
Armin Braun	98e66c6afe	SNAPSHOT: Increase Timeout to Stabilize Test (#36294 ) * 30s is pretty close here, most runs take 25s of the timeout making the test unstable * closes #36283	2018-12-06 13:43:28 +01:00
Yannick Welsch	ee05ef1312	Merge branch 'zen2'	2018-12-06 08:31:46 +01:00
Jim Ferenczi	3ac64c1ec2	Fix total hits serialization of the search response (#36290 ) This change removes the custom serialization of the total hits and reuses the shard's serialization of Lucene#read/writeTopDocs in the client code. This also removes the incorrect assertion that trips randomly in bwc tests. Closes #36284	2018-12-05 22:02:22 -08:00
Christoph Büscher	75635251ab	Make sure test don't use Math.random for reproducability (#36241 ) Currently we use Math.random() in a few places in the tests which makes these tests not reproducable with the random seed mechanism that comes with ESTestCase. The change removes those instances.	2018-12-06 00:07:15 +01:00
Yannick Welsch	38d2c093ad	Fix compilation	2018-12-05 23:44:33 +01:00
Jake Landis	190ac8e9bf	ingest: support default pipeline through an alias (#36231 ) This commit allows writes that go through an alias to use the default pipeline defined on the backing index. Fixes #35817	2018-12-05 16:25:50 -06:00
Yannick Welsch	a0ae1cc987	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 23:13:12 +01:00
Yannick Welsch	03d0ea91ef	Zen2: Rename tombstones to exclusions (#36226 ) Renames the withdrawal / tombstones APIs to voting configuration exclusions.	2018-12-05 23:12:28 +01:00
Andrey Ershov	5d6602120f	[Zen2] Hide not recovered state (#36224 ) This commit hides ClusterStates that have a STATE_NOT_RECOVERED_BLOCK from ClusterStateAppliers. This is needed, because some appliers, such as IngestService, rely on the fact, that cluster states with STATE_NOT_RECOVERED_BLOCK won't contain anything useful. Once the state is recovered it's fully available for the appliers. This commit also switches many of the remaining tests that require state persistence/recovery from Zen1 to Zen2.	2018-12-05 23:11:20 +01:00
jaymode	2d03eeb632	Test: mute testDataNodeRestartWithBusyMasterDuringSnapshot This commit mutes DedicatedClusterSnapshotRestoreIT# testDataNodeRestartWithBusyMasterDuringSnapshot which fails reproducibly. See #36283	2018-12-05 14:06:38 -07:00
Jay Modi	dba3f4ec9f	add version 6.5.3 (#36268 )	2018-12-05 12:15:09 -07:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Mark Tozzi	ce7b1886b8	Unit test for very large percentile aggs (#36122 ) Closes #19528 * Port a test Colin wrote for the TDigest library to validate TDigests storing over 2*MAXINT values. This appears to have been fixed in version 3.2 of TDigest, which Elasticsearch has been using for some time, so no changes were necessary to resolve this issue.	2018-12-05 10:56:48 -05:00
Andrey Ershov	e8289167ee	Fix FAIL_ON_METHOD in MetaDataStateFormatTests fail method should be computed before eval, not inside it	2018-12-05 14:59:41 +01:00
Yannick Welsch	b20497560c	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 14:06:38 +01:00
Yannick Welsch	0b9efff5cb	Zen2: Persist cluster states the old way on non-master-eligible nodes (#36247 ) The shard deletion logic (triggered by IndicesStore), which also leads to index metadata deletion on non-master-eligible data nodes, currently races against the new cluster state persistence logic triggered by accepting cluster states. One thread is writing the index metadata while another one is deleting the index metadata, leading to exceptions and assertions tripping (see below). The solution proposed by this PR is to move the cluster state persistence of non-master-eligible nodes back to the cluster applier service, just as it used to be for Zen1. This ensures that the index metadata deletion logic, which is triggered by the shard deletion logic, runs on the same thread on which we persist the cluster state.	2018-12-05 14:04:45 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
Alpar Torok	60e45cd81d	Testing conventions task part 2 (#36107 ) Closes #35435 - make it easier to add additional testing tasks with the proper configuration and add some where they were missing. - mute or fix failing tests - add a check as part of testing conventions to find classes not included in any testing task.	2018-12-05 14:20:01 +02:00
Martijn van Groningen	11935cd480	Replace Streamable w/ Writeable in BaseTasksResponse and subclasses (#36176 ) This commit replaces usages of Streamable with Writeable for the BaseTasksResponse / TransportTasksAction classes and subclasses of these classes. Note that where possible response fields were made final. Relates to #34389	2018-12-05 13:14:10 +01:00
Yannick Welsch	42457b5960	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 11:39:38 +01:00
Adrien Grand	4c2e0d76f3	Fix test failures when using the `termvectors` endpoint with a type name.	2018-12-05 11:28:12 +01:00
Tim Brooks	8bde608979	Register CcrRepository based on settings update (#36086 ) This commit adds an empty CcrRepository snapshot/restore repository. When a new cluster is registered in the remote cluster settings, a new CcrRepository is registered for that cluster. This is implemented using a new concept of "internal repositories". RepositoryPlugin now allows implementations to return factories for "internal repositories". The "internal repositories" are different from normal repositories in that they cannot be registered through the external repository api. Additionally, "internal repositories" are local to a node and are not stored in the cluster state. The repository will be unregistered if the remote cluster is removed.	2018-12-04 14:36:50 -07:00
Yannick Welsch	70c361ea5a	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 21:26:11 +01:00
Armin Braun	3c54b413ad	SNAPSHOT: Repo Creation out of ClusterStateTask (#36157 ) * Move `createRepository` call out of cluster state tasks * Now only `RepositoriesService#applyClusterState` manipulates `this.repositories` * Closes #9488	2018-12-04 20:53:51 +01:00
Adrien Grand	d24b40f688	Make typeless APIs usable with indices whose type name is different from `_doc` (#35790 ) This commit makes `document`, `update`, `explain`, `termvectors` and `mapping` typeless APIs work on indices that have a type whose name is not `_doc`. Unfortunately, this needs to be a bit of a hack since I didn't want calls with random type names to see documents with the type name that the user had chosen upon type creation. The `explain` and `termvectors` do not support being called without a type for now so the test is just using `_doc` as a type for now, we will need to fix tests later but this shouldn't require further changes server-side since passing `_doc` as a type name is what typeless APIs do internally anyway. Relates #35190	2018-12-04 19:22:17 +01:00
Yannick Welsch	9026f98aca	Remove trace logging from CoordinatorTests	2018-12-04 18:53:32 +01:00
Adrien Grand	0df08dd458	Set Lucene version upon index creation. (#36038 ) It is important that all shards of a given index have the same `indexCreatedVersionMajor` to Lucene, or eg. merging those shards is going to be considered illegal. At the moment, we use the latest Lucene version when creating a shard, which could cause shards to have different created versions eg. in case of forced allocation. This commit makes sure to reuse the appropriate Lucene version in order to avoid such issues. Closes #33826	2018-12-04 17:53:20 +01:00
Nhat Nguyen	b59deb573e	Always set soft-deletes field of IndexWriterConfig (#36196 ) Today we configure the soft-deletes field iff soft-deletes enabled. Although this choice was correct, it prevents an engine with soft-deletes disabled from opening a Lucene index with soft-deletes. Moreover, this change should not have any side-effect if a Lucene index does not have any soft-deletes. Relates #36141	2018-12-04 11:15:34 -05:00
Andrey Ershov	35e3d77e2c	[Zen2] Implement state recovery (#36013 ) This commit implements proper metadata recovery for Zen2. GatewayService is responsible for the recovery. In Zen1 GatewayService creates an instance of Gateway, that is used to reach out to other cluster nodes, get their state and calculate the most up-to-date state based on versions. After that Gateway performs upgrade and archival of ClusterSettings and closes bad indices. Then recovered state is passed to GatewayService.GatewayRecoveryListener that mixes up current state and restored state, removes state not recovered block, creates the routing table and performs re-routing. In Zen2 we should perform this kind of logic on cluster startup, except mixing state (because there is nothing to mix) and opening routing table. This commit refactors out all `ClusterUpdate` functions in a separate class `ClusterStateUpdaters`, which is used by `Gateway` and `GatewayService` in case of Zen1, and by `GatewayMetaState` and `GatewayService` in case of Zen2. This commit also switches all integration tests that are already using Zen2 from InMemoryPersistedState to GatewayMetaState.	2018-12-04 14:45:45 +01:00
Tanguy Leroux	5d684ca473	Combine the execution of an exclusive replica operation with primary term update (#36116 ) This commit changes how an operation which requires all index shard operations permits is executed when a primary term update is required: the operation and the update are combined so that the operation is executed after the primary term update under the same blocking operation. Closes #35850 Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2018-12-04 14:39:31 +01:00
Yannick Welsch	80ee7943c9	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 09:37:09 +01:00
David Turner	034c7655b7	[Zen2] Reduce cluster scope in NodeDisconnectIT (#36168 ) This test suite can stop all the shared master-eligible nodes, which breaks the cluster since any non-shared master-eligible nodes are stopped first in the reset process between tests. Since this test suite can leave the cluster in this somewhat broken state, it seems best that it uses a new cluster for each test.	2018-12-04 07:48:56 +00:00
Andy Bristol	01b8f99c17	[test] mute RemoveCorruptedShardDataCommandIT	2018-12-03 16:07:16 -08:00
João Barbosa	d27aa72b17	Added soft limit to open scroll contexts #25244 (#36009 ) This change adds a soft limit to open scroll contexts that can be controlled with the dynamic cluster setting `search.max_open_scroll_context` (defaults to 500).	2018-12-03 19:57:10 +01:00
Julie Tibshirani	59ee8b5c69	Remove the deprecated _termvector endpoint. (#36131 )	2018-12-03 10:22:42 -08:00
David Turner	c01aecb4b1	[Zen2] Do not probe non-master nodes back (#36160 ) Today if a node `A` sends a peers request to another node `B` then `B` will react by sending a peers request back to `A`. However if `A` is not master-eligible then this reaction is pointless and fails with an exception saying `non-master-eligible node found`, adding noise to the logs. This change suppresses this response to non-master-eligible nodes.	2018-12-03 17:19:18 +00:00
Armin Braun	433a506d06	SNAPSHOT: Improve Resilience SnapshotShardService (#36113 ) * Resolve the index in the snapshotting thread * Added test for routing table - snapshot state mismatch	2018-12-03 16:39:29 +01:00
Nhat Nguyen	9c1c46a02f	TEST: Adjust min_retained_seq_no expectation min_retained_seq_no is non-negative, however, if the number of retained operations is greater than 0, then the expectation may be negative.	2018-12-03 08:59:13 -05:00
Luca Cavanna	b5cae0af58	Enforce max_buckets limit only in the final reduction phase (#36152 ) Given that we check the max buckets limit on each shard when collecting the buckets, and that non final reduction cannot add buckets (see #35921), there is no point in counting and checking the number of buckets as part of non final reduction phases. Such check is still needed though in the final reduction phases to make sure that the number of returned buckets is not above the allowed threshold. Relates somehow to #32125 as we will make use of non final reduction phases in CCS alternate execution mode and that increases the chance that this check trips for nothing when reducing aggs in each remote cluster.	2018-12-03 13:55:18 +01:00
Boaz Leskes	36ddca7d0c	Disable merges in testReuseInFileBasedPeerRecovery The test assumes lucene files don't change. Closes #35772	2018-12-03 13:45:19 +01:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354 ) When building a query Lucene distinguishes two cases, queries that require to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in es that require this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context.. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
Armin Braun	328d022ddd	MINOR: Some Cleanups around Store (#36139 ) * Moved method `canOpenIndex` is only used in tests -> moved to test CP * Simplify `org.elasticsearch.index.store.Store#renameTempFilesSafe` * Delete some dead methods	2018-12-03 11:21:42 +01:00
Armin Braun	f763037b03	MINOR: BlobstoreRepository Cleanups (#36140 ) * Removed redundant private getter * Removed unused `version` field	2018-12-03 11:11:10 +01:00
Armin Braun	9c49aacbcf	MINOR: Remove Dead Code in QueryCache (#36147 )	2018-12-03 10:02:35 +01:00
Martijn van Groningen	43773a32a4	Replace Streamable w/ Writeable in BaseTasksRequest and subclasses (#35854 ) * Replace Streamable w/ Writeable in BaseTasksRequest and subclasses This commit replaces usages of Streamable with Writeable for the BaseTasksRequest / TransportTasksAction classes and subclasses of these classes. Relates to #34389	2018-12-03 08:04:29 +01:00
Armin Braun	9c0a429709	TESTS: Fix IndexStatsIT#testFilterCacheStats (#36143 ) * Test randomly failed because of background merges * Fixed by force merging down to a single segment * Closes #32506	2018-12-03 06:16:12 +01:00
David Turner	8bb1952975	Fix NodeJoinTests again (#36133 ) In #36033 we removed a catch block because we thought we were preventing exceptions by avoiding concurrent elections, missing the obvious fact that some joins are supposed to be failing. As a quick fix the catch was reinstated in `3a5dab6d8e` but this change adds finesse by only catching exceptions from the joins that we expect to fail. It also inlines an always-false parameter to `initialState()`.	2018-12-01 09:54:01 +00:00
David Turner	9cc416bc46	Weaken assertion in PeerFinder It can be inactive with no leader if it's handling an incoming PeersRequest before being activated for the first time.	2018-12-01 07:20:19 +00:00
David Turner	3a5dab6d8e	Reinstate catch removed in error in #36033	2018-12-01 07:10:19 +00:00
Julie Tibshirani	0e1ddfd825	Deprecate types in document delete requests. (#36087 ) * Make sure to use _doc as a type name in the CRUD HLRC tests. * Deprecate types in document delete requests.	2018-11-30 15:11:29 -08:00
Julie Tibshirani	98b290637d	Deprecate the _termvector endpoint. (#36098 )	2018-11-30 13:11:58 -08:00
Nik Everett	df56f0734e	Tasks: Retry if task can't be written (#35054 ) Adds about a minute worth of backoffs and retries to saving task results so it is much more likely that a busy cluster won't lose task results. This isn't an ideal solution to losing task results, but it is an incremental improvement. If all of the retries fail when still log the task result, but that is far from ideal. Closes #33764	2018-11-30 16:06:58 -05:00
Luca Cavanna	0ebc17743a	Histogram aggs: add empty buckets only in the final reduce step (#35921 ) Empty buckets don't need to be added when performing an incremental reduction step, they can be added later in the final reduction step. This will allow us to later remove the max buckets limit when performing non final reduction.	2018-11-30 20:33:09 +01:00
Tim Brooks	ea7ea51050	Make `TcpTransport#openConnection` fully async (#36095 ) This is a follow-up to #35144. That commit made the underlying connection opening process in TcpTransport asynchronous. However the method still blocked on the process being complete before returning. This commit moves the blocking to the ConnectionManager level. This is another step towards the top-level TransportService api being async.	2018-11-30 11:30:42 -07:00
Armin Braun	986bf52d1f	[Zen2] Allow Setting a List of Bootstrap Nodes to Wait for (#35847 )	2018-11-30 18:53:08 +01:00
Tim Brooks	da100c5479	Remove `Lifecycle` from `ConnectionManager` (#36092 ) Prior to #35441 `ConnectionManager` had a `Lifecycle` object to support the ping runnable. After that commit, the connection amanger only needs the existing `AtomicBoolean` to indicate if it is running.	2018-11-30 09:04:32 -07:00
Luca Cavanna	43ea498f2f	[TEST] Reduce number of buckets created in InternalDateHistogramTests New that we test with min_doc_count set to 0 as well, we may end up generating a lot more buckets. This commit adjusts the min bound and max bound, as well as the offset for each randomly generated agg instance so that we don't end up hitting the 10.000 max buckets limit. Relates to #36064	2018-11-30 14:16:32 +01:00
Luca Cavanna	e3eb05b14b	[TEST] Increase InternalDateHistogramTests coverage (#36064 ) In this test we were randomizing different values but minDocCount was hardcoded to 1. It's important to test other values, especially `0` as it's the default. The test needed some adapting in the way buckets are randomly generated: all aggs need to share the same interval, minDocCount and emptyBucketInfo. Also assertions need to take into account that more (or less) buckets are expected depending on minDocCount.	2018-11-30 11:21:46 +01:00
Adrien Grand	fa3d365ee8	Fix CompositeBytesReference#slice to not throw AIOOBE with legal offsets. (#35955 ) CompositeBytesReference#slice has two bugs: - One that makes it fail if the reference is empty and an empty slice is created, this is #35950 and is fixed by special-casing empty-slices. - One performance bug that makes it always create a composite slice when creating a slice that ends on a boundary, this is fixed by computing `limit` as the index of the sub reference that holds the last element rather than the next element after the slice. Closes #35950	2018-11-30 10:32:46 +01:00
Jake Landis	f8636e58f9	Support content type `application/x-ndjson` in DeprecationRestHandler (#36025 ) org.elasticsearch.rest.RestController#hasContentType checks to see if the RestHandler supports the `application/x-ndjson` Content-Type. DeprecationRestHandler is a wrapper around the real RestHandler, and prior to this change would always return `false` due to the interface's default supportsContentStream(). This prevents API's that use multi-line JSON from properly being deprecated resulting in an HTTP 406 error. This change ensures that the DeprecationRestHandler honors the supportsContentStream() of the wrapped RestHandler. Relates to #35958	2018-11-29 11:45:45 -06:00
Jim Ferenczi	8a7f3f75f3	Add support for rest_total_hits_as_int (#36051 ) The support for rest_total_hits_as_int has already been merged to 6x in #35848 so this change adds this new option to master. The plan was to add this new option as part of #35848 but we've decided to wait a few days before merging this breaking change so this commit just handles the new option as a noop exactly like 6x for now. This will allow users to migrate to this parameter before #35848 is merged. Relates #33028	2018-11-29 18:36:16 +01:00
Tim Brooks	c305f9dc03	Make keepalive pings bidirectional and optimizable (#35441 ) This is related to #34405 and a follow-up to #34753. It makes a number of changes to our current keepalive pings. The ping interval configuration is moved to the ConnectionProfile. The server channel now responds to pings. This makes the keepalive pings bidirectional. On the client-side, the pings can now be optimized away. What this means is that if the channel has received a message or sent a message since the last pinging round, the ping is not sent for this round.	2018-11-29 08:55:53 -07:00
Jim Ferenczi	ecd29089a8	Cache the score of the parent document in the nested agg (#36019 ) The nested agg can defer the collection of children if it is nested under another aggregation. In such case accessing the score in the children aggregation throws an error because the scorer has already advanced to the next parent. This change fixes this error by caching the score of the parent in the nested aggregation. Children aggregations that work on nested documents will be able to access the _score. Also note that the _score in this case is always the parent's score, there is no way to retrieve the score of a nested docs in aggregations. Closes #35985 Closes #34555	2018-11-29 14:35:25 +01:00
Armin Braun	48dc6c3442	[Zen2] Implement Tombstone REST APIs (#36007 ) * [Zen2] Implement Tombstone REST APIs * Adds REST API for withdrawing votes and clearing vote withdrawls * Tests added to Netty4 module since we need a real Network impl. for Http endpoints	2018-11-29 14:34:10 +01:00
David Turner	7f257187af	[Zen2] Update default for USE_ZEN2 to true (#35998 ) Today the default for USE_ZEN2 is false and it is overridden in many places. By defaulting it to true we can be sure that the only places in which Zen2 does not work are those in which it is explicitly set to false.	2018-11-29 12:18:35 +00:00
David Turner	277ccba3bd	[Zen2] fix NodeJoinTests#testConcurrentJoining() (#36033 ) Today we sometimes create a setup in which the node is a quorum on its own, which allows it to win a pre-voting round and schedule an election essentially at will, causing it to discard all the joins it just received and fail the test. This change excludes this case, preventing stray elections from ruining things.	2018-11-29 12:11:08 +00:00
David Turner	87408b04d4	[Zen2] Only elect master-eligible nodes (#35996 ) Today any node can win an election. However, the whole point of master-eligibility is that master-ineligible nodes should not be elected as the leader; furthermore master-ineligible nodes do not have any outgoing STATE channels so cannot publish cluster states, so their leadership is ineffective and disruptive. This change ensures that the elected leader is master-eligible by preventing master-ineligible nodes from scheduling an election.	2018-11-29 12:10:43 +00:00
Alan Woodward	a646f85a99	Ensure TokenFilters only produce single tokens when parsing synonyms (#34331 ) A number of tokenfilters can produce multiple tokens at the same position. This is a problem when using token chains to parse synonym files, as the SynonymMap requires that there are no stacked tokens in its input. This commit ensures that when used to parse synonyms, these tokenfilters either produce a single version of their input token, or that they throw an error when mappings are generated. In indexes created in elasticsearch 6.x deprecation warnings are emitted in place of the error. * asciifolding and cjk_bigram produce only the folded or bigrammed token * decompounders, synonyms and keyword_repeat are skipped * n-grams, word-delimiter-filter, multiplexer, fingerprint and phonetic throw errors Fixes #34298	2018-11-29 10:35:38 +00:00
Tanguy Leroux	0967620641	ActiveShardCount should not fail when closing the index (#35936 ) The ActiveShardCount is used by cluster state observers to wait for a given number of shards to be active before returning to the caller. The current implementation does not work when an index is closed while an observer is waiting on shards to be active. In this case, a NPE is thrown and the observer is never notified that the shards won't become active. This commit fixes the ActiveShardCount.enoughShardsActive() so that it does not fail when an index is closed, similarly to what is done when an index is deleted.	2018-11-29 09:08:30 +01:00
Alpar Torok	e0a678f0c4	Remove version.qualified from MainResponse (#35412 ) The fully qualified version will be returned as `version.number`	2018-11-29 08:41:39 +02:00
Ryan Ernst	afd42df15f	Core: Deguice RepositoriesService (#36016 ) This commit moves the RepositoriesService to be created outside of guice.	2018-11-28 21:20:44 -08:00
Jim Ferenczi	9ca3a06475	Remove custom QueryBuilder#analyzeGraphPhrase (#35983 ) Now that https://issues.apache.org/jira/browse/LUCENE-8479 is fixed we can remove the custom implementation of QueryBuilder#analyzeGraphPhrase in the match QueryBuilder.	2018-11-28 20:15:27 +01:00
Luca Cavanna	4b85769d24	Increase InternalHistogramTests coverage (#36004 ) In `InternalHistogramTests` we were randomizing different values but `minDocCount` was hardcoded to `1`. It's important to test other values, especially `0` as it's the default. To make this possible, the test needed some adapting in the way buckets are randomly generated: all aggs need to share the same `interval`, `minDocCount` and `emptyBucketInfo`. Also assertions need to take into account that more (or less) buckets are expected depending on `minDocCount`. This was originated by #35921 and its need to test adding empty buckets as part of the reduce phase. Also relates to #26856 as one more key comparison needed to use `Double.compare` to properly handle `NaN` values, which was triggered by the increased test coverage.	2018-11-28 20:06:40 +01:00
Nik Everett	0588dad80b	Tasks: Only require task permissions (#35667 ) Right now using the `GET /_tasks/<taskid>` API and causing a task to opt in to saving its result after being completed requires permissions on the `.tasks` index. When we built this we thought that that was fine, but we've since moved towards not leaking details like "persisting task results after the task is completed is done by saving them into an index named `.tasks`." A more modern way of doing this would be to save the tasks into the index "under the hood" and to have APIs to manage the saved tasks. This is the first step down that road: it drops the requirement to have permissions to interact with the `.tasks` index when fetching task statuses and when persisting statuses beyond the lifetime of the task. In particular, this moves the concept of the "origin" of an action into a more prominent place in the Elasticsearch server. The origin of an action is ignored by the server, but the security plugin uses the origin to make requests on behalf of a user in such a way that the user need not have permissions to perform these actions. It can be made to be fairly precise. More specifically, we can create an internal user just for the tasks API that just has permission to interact with the `.tasks` index. This change doesn't do that, instead, it uses the ubiquitus "xpack" user which has most permissions because it is simpler. Adding the tasks user is something I'd like to get to in a follow up change. Instead, the majority of this change is about moving the "origin" concept from the security portion of x-pack into the server. This should allow any code to use the origin. To keep the change managable I've also opted to deprecate rather than remove the "origin" helpers in the security code. Removing them is almost entirely mechanical and I'd like to that in a follow up as well. Relates to #35573	2018-11-28 09:28:27 -05:00
Christoph Büscher	51a7dc54ec	Fix custom AUTO issue with Fuzziness#toXContent (#35807 ) Currently when a Fuzziness instance with custom AUTO distance values gets written to XContent, the customized lower and upper distance values are ommited and can consequently not be parsed back. This changes this to write the String including the optional custom values when writing to XContent and fixes the tests that should have caught this in the first place, e.g. by adding the custom low and high distance values to the equality check.	2018-11-28 15:07:11 +01:00
Andrey Ershov	0b45fb98b9	[Zen2] Generate coordinationMetaData with different configs (#35991 ) This PR fixes test failure, which is caused by equal randomly generated lastAcceptedConfiguration and lastCommittedConfguration.	2018-11-28 14:49:07 +01:00
Yannick Welsch	5f0c036183	Disable testDeleteCreateInOneBulk on Zen2 This test needs adaptation to run with Zen2	2018-11-28 13:34:48 +01:00
Christoph Büscher	2f547bac65	Remove deprecated methods from QueryStringQueryBuilder (#35912 ) This change removes the deprecated useDisMax() and useAllFields() methods from the QueryStringQueryBuilder and related tests. The disMax parameter has already been a no-op since 6.0 and also the useAllFields has been deprecated since 6.0 and there is a direct replacement via defaultField.	2018-11-28 11:09:03 +01:00
Jeff Hajewski	49087f16f5	Adds deprecation logging to ScriptDocValues#getValues. (#34279 ) `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. Closes #22919	2018-11-27 14:30:13 -05:00
Julie Tibshirani	25c416b12d	Deprecate types in search and multi search templates. (#35669 ) This PR adds deprecation warnings to the relevant `RestAction` classes, plus tests in `RestActionTests`. No updates to REST tests, the Java HLRC, or documentation were necessary, since they didn't make use of types.	2018-11-27 10:19:19 -08:00
Simon Willnauer	ad1f0dccd4	Validate metdata on `_msearch` (#35938 ) MultiSearchRequests issues through `_msearch` now validate all keys in the metadata section. Previously unknown keys were ignored while now an exception is thrown. Closes #35869	2018-11-27 17:08:24 +01:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Christophe Bismuth	adc0b560c0	Raise a 404 exception when document source is not found (#33384 ) (#34083 ) This pull request makes the `RestGetSourceAction` return a `ResourceNotFoundException` with a proper JSON response when source or document itself is missing (see issue #33384). Here is below a sample JSON output: ``` { "error": { "root_cause": [ { "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" } ], "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" }, "status": 404 } ```	2018-11-27 10:35:45 -05:00
Andrey Ershov	0e283f9670	[Zen2] PersistedState interface implementation (#35819 ) Today GatewayMetaState is capable of atomically storing MetaData to disk. We've also moved fields that are needed to be persisted in Zen2 from ClusterState to ClusterState.MetaData.CoordinationMetaData. This commit implements PersistedState interface. version and currentTerm are persisted as a part of Manifest. GatewayMetaState now implements both ClusterStateApplier and PersistedState interfaces. We started with two descendants Zen1GatewayMetaState and Zen2GatewayMetaState, but it turned out to be not easy to glue it. GatewayMetaState now constructs previousClusterState (including MetaData) and previousManifest inside the constructor so that all PersistedState methods are usable as soon as GatewayMetaState instance is constructed. Also, loadMetaData is renamed to getMetaData, because it just returns previousClusterState.metaData(). Sadly, we don't have access to localNode (obtained from TransportService in the constructor, so getLastAcceptedState should be called, after setLocalNode method is invoked. Currently, when deciding whether to write IndexMetaData to disk, we're comparing current IndexMetaData version and received IndexMetaData version. This is not safe in Zen2 if the term has changed. So updateClusterState now accepts incremental write method parameter. When it's set to false, we always write IndexMetaData to disk. Things that are not covered by GatewayMetaStateTests are covered by GatewayMetaStatePersistedStateTests. This commit also adds an option to use GatewayMetaState instead of InMemoryPersistedState in TestZenDiscovery. However, by default InMemoryPersistedState is used and only one test in PersistedStateIT used GatewayMetaState. In order to use it for other tests, proper state recovery should be implemented.	2018-11-27 15:04:52 +01:00
Martijn van Groningen	447e5d212a	Changed versions in serialization code after backporting #35535	2018-11-27 08:00:06 +01:00
Gordon Brown	119835decd	Always enforce cluster-wide shard limit (#34892 ) This removes the option to run a cluster without enforcing the cluster-wide shard limit, making strict enforcement the default and only behavior. The limit can still be adjusted as desired using the cluster settings API.	2018-11-26 17:05:12 -07:00
Igor Motov	663563f64b	Geo: better handling of malformed geo_points (#35554 ) Improves handling of malformed geo_points when `ignore_malformed` is set to true Closes #35419	2018-11-26 09:44:42 -10:00
Jim Ferenczi	900caa20ef	Handles exists query in composite aggs (#35758 ) This commit adds the support for exists query in the sorted execution mode of the `composite` aggregation. We'll execute the aggregation from the sorted points and use early termination if the main query is an `exists` query over the first source of the `composite` aggregation.	2018-11-26 19:08:14 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Simon Willnauer	ca9b2b9931	Repsect indices options on _msearch (#35887 ) Today we don't respect the indices options when they are passed as request parameters to the `_msearch` endpoint. This is unintuitive and doesn't cause any errors. This changes uses the top-level indices options as the defaults for each sub search-request. Closes #35851	2018-11-26 14:26:39 +01:00
Christophe Bismuth	04ebc63e34	RoutingMissingException in more like this (#33974 ) More like this query allows to provide identifiers of documents to be retrieved as like/unlike items. It can happen that at retrieval time an error is thrown, for instance caused by missing routing value when `_routing` is set required in the mapping. Instead of ignoring such error and returning no documents for the query, the error should be re-thrown and returned to users. As part of this change also mget and mtermvectors are unified in the way they throw such exception like it happens in other places, so that a `RoutingMissingException` is raised. Closes #29678	2018-11-26 13:57:57 +01:00
Tanguy Leroux	9bdbba23f8	[Tests] Fix IndexShardTests.testAcquirePrimaryAllOperationsPermits() This test fails on CI because of an inappropriate assertion, which is I think a leftover and has no real value.	2018-11-26 13:44:12 +01:00
Luca Cavanna	e44390ac20	InitialSearchPhase minor cleanups (#35864 ) This commit simplifies the throttling logic in InitialSearchPhase and removes some asserts from it. Also, a few formatting changes are applied to its code and surrounding classes.	2018-11-26 13:42:41 +01:00
David Turner	a68a46450b	[Zen2] Add lag detector (#35685 ) A publication can succeed and complete before all nodes have applied the published state and acknowledged it, thanks to the publication timeout; however we need every node eventually either to apply the published state (or a later state) or be removed from the cluster. This change introduces the LagDetector which achieves this liveness property by removing any lagging nodes from the cluster.	2018-11-26 10:52:49 +00:00
iverase	401b814d1a	[CI] Muting method testOperationPermitOnReplicaShards in IndexShardTests Relates to #35850	2018-11-26 09:33:23 +01:00
Martijn van Groningen	7624734f14	Added wait_for_metadata_version parameter to cluster state api. (#35535 ) The `wait_for_metadata_version` parameter will instruct the cluster state api to only return a cluster state until the metadata's version is equal or greater than the version specified in `wait_for_metadata_version`. If the specified `wait_for_timeout` has expired then a timed out response is returned. (a response with no cluster state and wait for timed out flag set to true) In the case metadata's version is equal or higher than `wait_for_metadata_version` then the api will immediately return. This feature is useful to avoid external components from constantly polling the cluster state to whether somethings have changed in the cluster state's metadata.	2018-11-26 08:50:08 +01:00
Simon Willnauer	4711c5cdf3	Always return false from `refreshNeeded` on ReadOnlyEngine (#35837 ) Acquiring a searcher is unnecessary to determine if a refresh is necessary since read-only engines never refresh. Closes #35785	2018-11-24 09:25:42 +01:00
Simon Willnauer	e46e44ce38	Wrap can_match reader with ElasticsearchDirectoryReader (#35857 ) Code that operates on-top of the engine requires all readers returned to be unwrapped into ElasticsearchDirectoryReader. The special reader the FrozenEngine uses wasn't wrapped.	2018-11-24 09:23:53 +01:00
Andrey Ershov	f47636b254	[Zen2] Introduce VotingTombstone class (#35832 ) Today voting tombstones are stored in CoordinationMetaData as Set<DiscoveryNode>. DiscoveryNode is not a lightweight object and have a lot of fields. It also has toXContent method, but no fromXContent method and the output of toXContent is not enough to re-create DiscoveryNode object. And votingTombstone set should be persisted as a part of MetaData. On the other hand, the only thing required from the tombstone is the nodeId. This PR adds VotingTombstone class for voting tombstones, which consists of two fields for now - nodeId and nodeName. It could be extended/shrank in the future if needed. This PR also resolves TODO's related to the voting tombstones xcontent story. Example of CoordinationMetaData.toXContent with voting tombstones: { "term": 1, "last_committed_config": [ "fkwLdOBvXSlgRTBfgNAL", "tmQiPGHvUxXzPkkCDSJo", "HhOmtQBZAThpHIGWhxpz", "qZHWGpoDNPYRNIiqKsDl" ], "last_accepted_config": [ "lhqacKmriwhHGFZcvqbx", "MYysmBuROkvJRlDcusyd" ], "voting_tombstones": [ { "node_id": "McjbZbRkEz", "node_name": "pdKIWeNJUO" }, { "node_id": "cpXkVibGwo", "node_name": "UnCvFgdVsc" }, { "node_id": "EylRNOztbc", "node_name": "ohOhkbMWZX" } ] }	2018-11-23 18:34:06 +01:00
Yannick Welsch	51d2e986c5	Remove BWC conditions after backport of #35731 This PR was backported to 6.x, so the extra BWC conditions are not needed anymore	2018-11-23 17:11:06 +01:00
Adrien Grand	5b370316a6	Remove some legacy code from when indices could have multiple types. (#35815 ) This code is only necessary up to indices created with version 5.x while 7.0 only supports indices created with 6.x or 7.0.	2018-11-23 15:15:26 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Christoph Büscher	88d862e69f	[CI] Muting two methods in IndexShardTests Relates to #35850	2018-11-23 14:29:26 +01:00
David Turner	d01436de3c	Copy checkpoint atomically when rolling generation (#35407 ) Today when rolling a transog generation we copy the checkpoint from `translog.ckp` to `translog-nnnn.ckp` using a simple `Files.copy()` followed by appropriate `fsync()` calls. The copy operation is not atomic, so if we crash at the wrong moment we can leave an incomplete checkpoint file on disk. In practice the checkpoint is so small that it's either empty or fully written. However, we do not correctly handle the case where it's empty when the node restarts. In contrast, in `recoverFromFiles()` we _do_ copy the checkpoint atomically. This commit extracts the atomic copy operation from `recoverFromFiles()` and re-uses it in `rollGeneration()`.	2018-11-23 08:43:34 +00:00
Jim Ferenczi	be69a774df	Fix analyzed prefix query in query_string (#35756 ) This change fixes analyzed prefix queries in `query_string` to be ignored if all terms are removed during the analysis. Closes #31702	2018-11-23 09:42:23 +01:00
Tanguy Leroux	2e37f17a7d	Expose all permits acquisition in IndexShard and TransportReplicationAction (#35540 ) This pull request exposes two new methods in the IndexShard and TransportReplicationAction classes in order to allow transport replication actions to acquire all index shard operation permits for their execution. It first adds the acquireAllPrimaryOperationPermits() and the acquireAllReplicaOperationsPermits() methods to the IndexShard class which allow to acquire all operations permits on a shard while exposing a Releasable. It also refactors the TransportReplicationAction class to expose two protected methods (acquirePrimaryOperationPermit() and acquireReplicaOperationPermit()) that can be overridden when a transport replication action requires the acquisition of all permits on primary and/or replica shard during execution. Finally, it adds a TransportReplicationAllPermitsAcquisitionTests which illustrates how a transport replication action can grab all permits before adding a cluster block in the cluster state, making subsequent operations that requires a single permit to fail). Related to elastic #33888	2018-11-23 09:26:38 +01:00
Dimitris Athanasiou	43d6ec8bcd	Remove unnecessary throws IOException in CompressedXContent.string() (#35821 )	2018-11-22 15:08:46 +00:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Albert Zaharovits	4fc911a129	Mute test InternalEngineTests Relates #35823	2018-11-22 15:34:53 +02:00
Tanguy Leroux	f9f7261d60	Revert "Revert "[RCI] Check blocks while having index shard permit in TransportReplicationAction (#35332 )"" This reverts commit `d3d7c01`	2018-11-22 12:13:19 +01:00
Mayya Sharipova	b6014d971c	Forbid negative scores in functon_score query (#35709 ) * Forbid negative scores in functon_score query - Throw an exception when scores are negative in field_value_factor function - Throw an exception when scores are negative in script_score function Relates to #33309	2018-11-22 06:08:48 -05:00
Tanguy Leroux	11052b75c7	TransportResyncReplicationAction should not honour blocks (#35795 ) After #35332 has been merged, we noticed some test failures like #35597 in which one or more replica shards failed to be promoted as primaries because the primary replica re-synchronization never succeed. After some digging it appeared that the execution of the resync action was blocked because of the presence of a global cluster block in the cluster state (in this case, the "no master" block), making the resync action to fail when executed on the primary. Until #35332 such failures never happened because the TransportResyncReplicationAction is skipping the reroute phase, the only place where blocks were checked. Now with #35332 blocks are checked during reroute and also during the execution of the transport replication action on the primary. After some internal discussion, we decided that the TransportResyncReplicationAction should never be blocked. This action is part of the replica to primary promotion and makes sure that replicas are in sync and should not be blocked when the cluster state has no master or when the index is read only. This commit changes the TransportResyncReplicationAction to make obvious that it does not honor blocks. It also adds a simple test that fails if the resync action is blocked during the primary action execution. Closes #35597	2018-11-22 10:50:12 +01:00
David Turner	cfdf666672	[Zen2] Fix test failures in diff-based publishing (#35684 ) `testIncompatibleDiffResendsFullState` sometimes makes a 2-node cluster and then partitions one of the nodes from the leader, which makes the leader stand down. Then when the partition is removed the cluster re-forms but does so by sending full cluster states, not diffs, causing the test to fail. Additionally `testDiffBasedPublishing` sometimes fails if a publication is delivered out-of-order, wiping out a fresher last-received cluster state with a less-fresh one. This is fixed here by passing the received cluster state to the coordinator before recording it as the last-received one, relying on the coordinator's freshness checks.	2018-11-22 09:08:52 +00:00
Igor Motov	39789d0a10	GEO: More robust handling of ignore_malformed in geoshape parsing (#35603 ) Adds an XContent sub parser class that can to wrap another XContent parser at the beginning of an object and allow skiping all children in case of the parsing failure. It also uses this subparser to ignore the rest of the GeoJson shape if the parsing fails and we need to ignore the geoshape due to the ignore_malformed flag. Supersedes #34498 Closes #34047	2018-11-21 11:04:01 -10:00
Armin Braun	1a5553d495	MINOR: Cleanup Runnables in SnapshotsService (#35796 ) * Simplify complex `Runnable` by moving to `AbstractRunnable`	2018-11-21 19:58:51 +01:00
Nick Knize	3bee25cb70	[GEO] Add support to ShapeBuilders for building Lucene geometry (#35707 ) * [GEO] Add support to ShapeBuilders for building Lucene geometry This commit adds support for building lucene geometry from the ShapeBuilders. This is needed for integrating LatLonShape as the primary indexing approach for geo_shape field types. All unit and integration tests are updated to add randomization for testing both jts/s4j shapes and lucene shapes.	2018-11-21 11:15:01 -06:00
Yannick Welsch	c816347253	Disable testClusterJoinDespiteOfPublishingIssues for Zen2 This test is failing sometimes with Zen2 due to the lack of lag detection. Zen1 does not have this problem as it only considers a join as valid if the corresponding cluster state update is successfully published and committed on the joining node.	2018-11-21 17:21:44 +01:00
Andrey Ershov	a056bd8c1c	[Zen2] Move ClusterState fields to be persisted to ClusterState.MetaData (#35625 ) Today we have a way to atomically persist global MetaData and IndexMetaData to disk when new ClusterState is received. All other ClusterState fields are not persisted. However, there are other parts of ClusterState that should be persisted, namely: version term lastCommittedConfiguration lastAcceptedConfiguration votingTombstones version is changed frequently, other fields are not. We decided to group term, lastCommittedConfiguration, lastAcceptedConfiguration and votingTombstones into CoordinationMetaData class and make CoordinationMetaData a field inside MetaData. MetaData.toXContent and MetaData.fromXContent should take care of CoordinationMetaData. version stays as a top level field in ClusterState and will be persisted as part of Manifest in a follow-up commit. Also MetaData.isGlobalStateEquals should be extended to include coordinationMetaData in comparison. This commit favors exposing getters, such as getTerm directly in ClusterState to avoid massive code changes. An example of CoordinationMetaState.toXContent: { "term": 1, "last_committed_config": [ "TiIuBcbBtpuXyDDVHXeD", "ZIAoVbkjjLPLUuYLaTkw" ], "last_accepted_config": [ "OwkXbXZNOZPJqccdFHdz", "LouzsGYwmQzpeQMrboZe", "fCKGRZdjLTqzXAqPUtGL", "pLoxshjpJXwDhbgjfYJy", "SjINLwFIlIEFZCbjrSFo", "MDkVncJEVyZLJktopWje" ] }	2018-11-21 17:03:26 +01:00
Andrey Ershov	6ac0cb1842	Merge branch master into zen2 2 types of conflicts during the merge: 1) Line length fix 2) Classes no longer extend AbstractComponent	2018-11-21 15:36:49 +01:00
Yannick Welsch	8939a7894f	Zen2: Move disruption tests to Zen2 (#35724 ) - Moves disruption tests to Zen2 - Registers a few missing settings - Removes .put(TestZenDiscovery.USE_ZEN2.getKey(), true) from tests where Zen2 is now enabled by default through the parent test class - Moves QuorumGatewayIT back to Zen1, as it is not stable with Zen2 as it currently relies on dangling indices due to the lack of proper CS persistence, which triggers secondary failures	2018-11-21 14:43:33 +01:00
Armin Braun	bdf632b6f9	SNAPSHOTING+MINOR: Simplify SnapshortShardService (#35769 )	2018-11-21 13:50:17 +01:00
Christoph Büscher	6638708b56	Remove deprecated QueryStringQueryBuilder#splitOnWhiteSpace (#35763 ) This parameter has been deprecated and was ignored since 6.0, so its Java API methods can be removed.	2018-11-21 10:29:08 +01:00
Christoph Büscher	5847f8379c	Move ScoreAccessor to test-framework (#35766 ) This class is only used by RandomScoreFunctionIT and the MockScriptEngine, so it shouldn't be part of the server codebase.	2018-11-21 10:28:31 +01:00
Christoph Büscher	ff03443ab9	Fix problem with MatchNoDocsQuery in disjunction queries (#35726 ) Queries across multiple fields generate MatchNoDocsQuerys for fields that are unmapped. In certain situation this can lead to erroneous behaviour, for example when an umapped field is used in a query_string query across several fields. If some of the tokens in the query string get eliminated by an analyzer on the mapped fields, the same token will currently generate MatchNoDocsQuerys combined into a disjunction, which in turn leads to no matches in the overall query. Instead we should simply not add MatchNoDocsQuerys to those disjunctions. Closes #34708	2018-11-21 03:49:49 +01:00
Mayya Sharipova	643bb20137	Add a new query type - ScriptScoreQuery (#34533 ) * Add a new query type - ScriptScoreQuery script_score query uses script to calculate document scores. Added as a substitute for function_score with an intentation to deprecate function_scoreq query. ```http GET /_search { "query": { "script_score" : { "query": { "match": { "message": "elasticsearch" } }, "script" : { "source": "Math.log(2 + doc['likes'].value)" }, "min_score" : 2 } } } ``` Add several functions to painless to be used inside script_score: double rational(double, double) double sigmoid(double, double, double) double randomNotReproducible() double randomReproducible(String, int) double decayGeoLinear(String, String, String, double, GeoPoint) double decayGeoExp(String, String, String, double, GeoPoint) double decayGeoGauss(String, String, String, double, GeoPoint) double decayNumericLinear(String, String, String, double, double) double decayNumericExp(String, String, String, double, double) double decayNumericGauss(String, String, String, double, double) double decayDateLinear(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateExp(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateGauss(String, String, String, double, JodaCompatibleZonedDateTime) Date functions only works on dates in the default format and default time zone	2018-11-20 16:10:06 -05:00
Ryan Ernst	c07ad67718	Add 6.5.2 version	2018-11-20 12:33:26 -08:00
Gordon Brown	4bda469861	Fix line lengths in misc other files (#35650 )	2018-11-20 12:29:48 -07:00
Vladimir Dolzhenko	953c8586df	Fix PrimaryAllocationIT#testForceStaleReplicaToBePromotedToPrimary (#35728 ) Closes 35497	2018-11-20 20:05:23 +01:00
Christoph Büscher	e91f404d16	Remove remains of 'auto_generate_phrase_queries' (#35735 ) This parameter in the `query_string` query was deprecated in 6.0 and ignored since then. Its API methods and remaining uses can be removed in the upcoming major version. Relates to #35734	2018-11-20 16:07:17 +01:00
Jim Ferenczi	a5f5ceb869	Remove remaining line length violations for o.e.index (#35652 ) This commit removes line length violations in the classes under org.elasticsearch.index.	2018-11-20 08:09:14 +01:00
Simon Willnauer	29ef442841	Add a `_freeze` / `_unfreeze` API (#35592 ) This commit adds a rest endpoint for freezing and unfreezing an index. Among other cleanups mainly fixing an issue accessing package private APIs from a plugin that got caught by integration tests this change also adds documentation for frozen indices. Note: frozen indices are marked as `beta` and available as a basic feature. Relates to #34352	2018-11-20 08:03:24 +01:00
Benjamin Trent	7a779a9988	[TEST] fixing ClusterApplierService#testClusterStateUpdateLogging() (#35714 )	2018-11-19 14:23:49 -06:00
Yannick Welsch	47ada69c46	Zen2: Move most integration tests to Zen2 (#35678 ) Zen2 is now feature-complete enough to run most ESIntegTestCase tests. The changes in this PR are as follows: - ClusterSettingsIT is adapted to not be Zen1 specific anymore (it was using Zen1 settings). - Some of the integration tests require persistent storage of the cluster state, which is not fully implemented yet (see #33958). These tests keep running with Zen1 for now but will be switched over as soon as that is fully implemented. - Some very few integration tests are not running yet with Zen2 for other reasons, depending on some of the other open points in #32006.	2018-11-19 21:15:29 +01:00
Andrey Ershov	f9ecd0c49e	[Zen2] Write manifest file (#35049 ) Elasticsearch node is responsible for storing cluster metadata. There are 2 types of metadata: global metadata and index metadata. `GatewayMetaState` implements `ClusterStateApplier` and receives all `ClusterStateChanged` events and is responsible for storing modified metadata to disk. When new `ClusterStateChanged` event is received, `GatewayMetaState` checks if global metadata has changed and if it's the case writes new global metadata to disk. After that `GatewayMetaState` checks if index metadata has changed or there are new indices assigned to this node and if it's the case writes new index metadata to disk. Atomicity of global metadata and index metadata writes is ensured by `MetaDataStateFormat` class. Unfortunately, there is no atomicity when more than one metadata changes (global and index, or metadata for two indices). And atomicity is important for Zen2 correctness. This commit adds atomicity by adding a notion of manifest file, represented by `MetaState` class. `MetaState` contains pointers to current metadata. More precisely, it stores global state generation as long and map from `Index` to index metadata generation as long. Atomicity of writes for manifest file is ensured by `MetaStateFormat` class. The algorithm of writing changes to the disk would be the following: 1. Write global metadata state file to disk and remember it's generation. 2. For each new/changed index write state file to disk and remember it's generation. For each not-changed index use generation from previous manifest file. If index is removed or this node is no longer responsible for this index - forget about the index. 3. Create `MetaState` object using previously remembered generations and write it to disk. 4. Remove old state files for global metadata, indices metadata and manifest. Additonally new implementation relies on enhanced `MetaDataStateFormat` failure semantics, `applyClusterState` throws IOException, whose descendant `WriteStateException` could be (and should be in Zen2) explicitly handled.	2018-11-19 19:49:44 +01:00
Gordon Brown	b2057138a7	Remove AbstractComponent from AbstractLifecycleComponent (#35560 ) AbstractLifecycleComponent now no longer extends AbstractComponent. In order to accomplish this, many, many classes now instantiate their own logger.	2018-11-19 09:51:32 -07:00
Nik Everett	036c15d8a3	Build: Fix official plugins list (#35661 ) The list of official plugins accidentally included `qa` projects like, well, `qa` and `amazon-ec2`. This changes the mechanism that we use to build the list and adds a test to catch this. Closes #35623	2018-11-19 10:17:05 -05:00

... 9 10 11 12 13 ...

2818 Commits