OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	7b49beb9b0	Fix threshold frequency computation in Suggesters (#34312 ) The `term` and `phrase` suggesters have different options to filter candidates based on their frequencies. The `popular` mode for instance filters candidate terms that occur in less docs than the original term. However when we compute this threshold we use the total term frequency of a term instead of the document frequency. This is not inline with the actual filtering which is always based on the document frequency. This change fixes this discrepancy and clarifies the meaning of the different frequencies in use in the suggesters. It also ensures that the threshold doesn't overflow the maximum allowed value (Integer.MAX_VALUE). Closes #34282	2018-10-19 13:33:19 +02:00
markharwood	fe623acf66	Docs - removed experimental/beta markers from adjacency matrix aggregation (#34599 )	2018-10-19 09:33:59 +01:00
Daniel Mitterdorfer	dbb6fe58fa	Remove hand-coded XContent duplicate checks With this commit we cleanup hand-coded duplicate checks in XContent parsing. They were necessary previously but since we reconfigured the underlying parser in #22073 and #22225, these checks are obsolete and were also ineffective unless an undocumented system property has been set. As we also remove this escape hatch, we can remove the additional checks as well. Closes #22253 Relates #34588	2018-10-19 10:13:13 +02:00
Alexander Reelsen	e498b7d437	Core: Parse floats in epoch millis parser (#34504 ) In order to stay BWC compatible with joda time, the epoch millis date formatter needs to parse dates with a dot like `123.45`. This adds this functionality for the epoch millis parser in the same way as for the epoch seconds parser. It also adds support for scientific notations like `1.0e3` and fixes parsing of negative values for epoch seconds and epoch millis.	2018-10-19 10:02:45 +02:00
Christoph Büscher	4f7895800e	Remove unused methods in ValueType (#34624 ) The removed methods seem unused in the rest of the project.	2018-10-19 09:50:45 +02:00
David Turner	e13ce66a3c	[Zen2] Calculate optimal cluster configuration (#33924 ) We wish to commit a cluster state update after having received a response from more than half of the master-eligible nodes in the cluster. This is optimal: requiring either more or fewer votes than half harms resilience. For instance if we have three master nodes then, we want to be able to commit a cluster state after receiving responses from any two nodes; requiring responses from all three is clearly not resilient to the failure of any node, and if we could commit an update after a response from just one node then that node would be required for every commit, which is also not resilient. However, this means we must adjust the configuration (the set of voting nodes in the cluster) whenever a master-eligible node joins or leaves. The calculation of the best configuration for the cluster is the job of the Reconfigurator, introduced here.	2018-10-18 13:19:27 +01:00
Christoph Büscher	7bcf496315	[Tests] Correct map lookup in ReplicationTrackerTests (#34565 )	2018-10-18 11:23:53 +02:00
Tal Levy	09067c8942	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-17 15:37:11 -07:00
Ryan Ernst	8734540345	Ensure map keys cannot be self referencing (#34569 ) This commit improves self reference checking to map keys, as well as adds it to ingest script processing.	2018-10-17 15:16:13 -07:00
Jason Tedor	9be87adb95	Increment settings version when upgrading index (#34566 ) When we upgrade an index, we set the settings version upgraded setting. This should be considered a settings change, and therefore we need to increment the settings version. This commit addresses that.	2018-10-17 18:00:17 -04:00
Nik Everett	b6aa42777a	Search: Wrap lucene classes at 140 columns (#34491 ) Applies our line length guidance for all classes in the server in `lucene` directories except `XMoreLikeThis`. The only long line in `XMoreLikeThis` says "remove this when we upgrade to Lucene 5. Given that we're on Lucene 8, this is a little terrifying and deserves another look.	2018-10-17 15:54:35 -04:00
Armin Braun	08d4bf6e84	TESTS: Remove Dead Code in Test Infra. (#34548 ) * None of this infrastructure is used * Some redundant throws and resulting catch code removed	2018-10-17 20:08:39 +01:00
Colin Goodheart-Smithe	90f7cec7a5	Merge branch 'master' into index-lifecycle	2018-10-17 18:22:23 +01:00
Simon Willnauer	b0e98cbce2	Pass the host name on as `server_name` if proxy mode is on (#34559 ) In remote cluster setup if we see a configured proxy we should set the seed nodes host name as the `server_name` to trigger SNI based routing even for seed nodes. Since remote cluster connections are plain TCP connections we have to set the host manually since the other side can't take it from the request URL like in the HTTP case. This also adds some more informative logging to remote cluster connection.	2018-10-17 19:11:50 +02:00
Andrey Ershov	51f38ddc0c	Switch MetaDataStateFormat to Lucene directory abstraction (#33989 ) Switch MetaDataStateFormat to Lucene directory abstraction This commit switches MetaDataStateFormat class to Lucene directory abstraction to make it easier to test MetaDataStateFormat for different IO failures. This commits also adds different IO failures tests to MetaDataStateFormatTests.	2018-10-17 18:17:17 +02:00
Andrey Ershov	93bb24e1f8	Merge branch 'master' into zen2	2018-10-17 14:37:53 +02:00
Armin Braun	3954d041a0	SCRIPTING: Move sort Context to its Own Class (#33717 ) * SCRIPTING: Move sort Context to its own Class	2018-10-17 10:02:44 +01:00
Tal Levy	fbe8dc014c	Merge branch 'master' into index-lifecycle	2018-10-16 13:58:53 -07:00
Simon Willnauer	a93aefb4a4	Assume that rollover datemath tests run on the same day. (#34527 ) in #28741 RolloverIT fails because we are cutting over to the next day while the test executes. We assume that this doesn't happen based on the assertions in the test. This adds a assumeTrue to ensure we are at least 5 min away form a date-flip. Closes #28741	2018-10-16 20:22:32 +02:00
David Turner	303575f742	Fix up merge of master	2018-10-16 15:29:47 +01:00
Armin Braun	ea576a8ca2	Disc: Move AbstractDisruptionTC to filebased D. (#34461 ) * Discovery: Move AbstractDisruptionTestCase to file-based discovery. * Relates #33675 * Simplify away ClusterDiscoveryConfiguration	2018-10-16 15:28:40 +01:00
David Turner	950ca3adda	Merge branch 'master' into zen2	2018-10-16 14:41:14 +01:00
Simon Willnauer	d43a1fac33	Lock down Engine.Searcher (#34363 ) `Engine.Searcher` is non-final today which makes it error prone in the case of wrapping the underlying reader or lucene `IndexSearcher` like we do in `IndexSearcherWrapper`. Yet, there is no subclass of it yet that would be dramatic to just drop on the floor. With the start of development of frozen indices this changed since in #34357 functionality was added to a subclass which would be dropped if a `IndexSearcherWrapper` is installed on an index. This change locks down the `Engine.Searcher` to prevent such a functionality trap.	2018-10-16 14:53:07 +02:00
Martijn van Groningen	a1ec91395c	Changed CCR internal integration tests to use a leader and follower cluster instead of a single cluster (#34344 ) The `AutoFollowTests` needs to restart the clusters between each tests, because it is using auto follow stats in assertions. Auto follow stats are only reset by stopping the elected master node. Extracted the `testGetOperationsBasedOnGlobalSequenceId()` test to its own test, because it just tests the shard changes api. * Renamed AutoFollowTests to AutoFollowIT, because it is an integration test. Renamed ShardChangesIT to IndexFollowingIT, because shard changes it the name of an internal api and isn't a good name for an integration test. * move creation of NodeConfigurationSource to a seperate method * Fixes issues after merge, moved assertSeqNos() and assertSameDocIdsOnShards() methods from ESIntegTestCase to InternalTestCluster, so that ccr tests can use these methods too.	2018-10-16 14:45:46 +02:00
Jason Tedor	05911fb499	Adjust settings version BWC version after backport This commit adjusts the settings version BWC version after backporting the change to the 6.x branch which currently is versioned as 6.5.0.	2018-10-16 06:38:38 -04:00
Jim Ferenczi	544de13d8e	Disallow negative query boost (#34486 ) This change disallows negative query boosts. Negative scores are not allowed in Lucene 8 so it is easier to just disallow negative boosts entirely. We should also deprecate negative boosts in 6x in order to ensure that users are aware when they'll upgrade to ES 7. Relates #33309	2018-10-16 11:31:53 +01:00
Jason Tedor	4b2052c683	Introduce index settings version (#34429 ) This commit introduces settings version to index metadata. This value is monotonically increasing and is updated on settings updates. This will be useful in cross-cluster replication so that we can request settings updates from the leader only when there is a settings update.	2018-10-16 06:22:20 -04:00
Daniel Mitterdorfer	92b2e1a209	Remove lenient boolean handling With this commit we remove some leftovers from #26389 which cleaned up lenient boolean handling. Relates #26389 Relates #22298 Relates #34467	2018-10-16 06:30:00 +02:00
Jason Tedor	55dee53046	Do not update number of replicas on no indices (#34481 ) Today when submitting an update settings request to update the number of replicas with a wildcard that does not match any indices and allow no indices is set to true, the request ends up being interpreted as updating the number of replicas for all indices. That is, consider the following sequence: PUT /test-index { "settings": { "index.number_of_replicas": 0 } } PUT /non-existent-*/_settings?expand_wildcards=open&allow_no_indices=true { "settings": { "index.number_of_replicas": 1 } } GET /test-index/_settings The latter will show that the number of replicas on test-index is now one. This is surprising, and should be considered a bug. The underlying problem here is treating no indices in the underlying methods used to update the routing table and the metadata as meaning all indices. This commit takes away this assumption. Tests that relied on this behavior have been changed to no longer rely on this. A test for this situation is added in UpdateNumberOfReplicasIT.	2018-10-15 19:49:58 -04:00
Nik Everett	23ece922c9	Core: Remove two methods from AbstractComponent (#34336 ) This removes another two methods from `AbstractComponent`. One isn't used at all and another is only used in a single class in watcher. I've moved the method that watcher uses into the single class that uses it.	2018-10-15 16:05:14 -04:00
Nik Everett	a6d1cc6ca9	Revert "Search: Fix spelling mistake in Javadoc (#34480 )" This reverts commit `4e1d7baed0`.	2018-10-15 15:42:11 -04:00
fonxian	4e1d7baed0	Search: Fix spelling mistake in Javadoc (#34480 ) "iff" -> "if".	2018-10-15 15:38:37 -04:00
Ryan Ernst	26f1d7fc94	Tests: Handle epoch date formatters edge cases (#34437 ) This commit handles cases testing withLocale and withZone when the zone and locale in question is the same as the special base case. This can happen sometimes since the locale and zoneids are randomized.	2018-10-15 12:18:18 -07:00
Jim Ferenczi	67577fca56	Fix handling of empty keyword in terms aggregation (#34457 ) Empty values on keyword fields are filtered by the `map` execution mode of the `terms` aggregation. This commit restores them as valid buckets. Closes #34434	2018-10-15 19:33:52 +01:00
Armin Braun	ebca27371c	SCRIPTING: Move Aggregation Script Context to its own class (#33820 ) * SCRIPTING: Move Aggregation Script Context to its own class	2018-10-15 17:28:05 +01:00
Colin Goodheart-Smithe	0b42eda0e3	Merge branch 'master' into index-lifecycle	2018-10-15 16:03:37 +01:00
David Turner	9bb620eece	Mute PartitionedRoutingIT#testShrinking on Windows	2018-10-15 13:18:00 +01:00
Ryan Ernst	72d818c304	Tests: Fix DateFormatter equals tests with locale (#34435 ) This commit removes randomization of locale for DateFormatter equals tests, instead using explicit locales. The test framework already randomizes locales, so the random choice of the second locale can sometimes be equal to the already chosen locale. Randomization also does not provide any extra protection, as the equality of DateFormatter does not implement equality of the locales itself. closes #34337	2018-10-14 23:54:49 +01:00
Yannick Welsch	5fbead00a3	Zen2: Add infrastructure for integration tests (#34365 ) Adds the infrastructure to run integration tests against Zen2.	2018-10-14 20:55:04 +01:00
David Turner	8b9fa55c93	Add storage-layer disruptions to CoordinatorTests (#34347 ) Today we assume the storage layer operates perfectly in CoordinatorTests, which means we are not testing that the system's invariants are preserved if the storage layer fails for some reason. This change injects (rare) storage-layer failures during the safety phase to cover these cases.	2018-10-13 14:24:15 +01:00
David Turner	d98199df14	Extend duration of fixLag() (#34364 ) Today, fixLag() waits for a new cluster state to be committed. However, it does not account for the fact that a term bump may occur, requiring a new election to take place after the cluster state is committed. This change fixes this.	2018-10-11 23:24:08 +01:00
David Turner	a32e303b0c	Account for election duration (#34362 ) Today we may schedule two elections very close together, which can cause the first election to fail even if there are no other nodes. This change adds a delay in between subsequent elections on the same node, effectively allowing time for each election to complete before scheduling the next one.	2018-10-11 15:31:08 +01:00
Jay Modi	6d99d7dafc	ListenableFuture should preserve ThreadContext (#34394 ) ListenableFuture may run a listener on the same thread that called the addListener method or it may execute on another thread after the future has completed. Whenever the ListenableFuture stores the listener for execution later, it should preserve the thread context which is what this change does.	2018-10-11 15:24:38 +01:00
Nhat Nguyen	33791ac27c	CCR: Following primary should process operations once (#34288 ) Today we rewrite the operations from the leader with the term of the following primary because the follower should own its history. The problem is that a newly promoted primary may re-assign its term to operations which were replicated to replicas before by the previous primary. If this happens, some operations with the same seq_no may be assigned different terms. This is not good for the future optimistic locking using a combination of seqno and term. This change ensures that the primary of a follower only processes an operation if that operation was not processed before. The skipped operations are guaranteed to be delivered to replicas via either primary-replica resync or peer-recovery. However, the primary must not acknowledge until the global checkpoint is at least the highest seqno of all skipped ops (i.e., they all have been processed on every replica). Relates #31751 Relates #31113	2018-10-10 15:39:57 -04:00
Simon Willnauer	34b935ae57	Improve `getRestHandlerWrapper` JavaDocs (#34376 ) Questions on how to work with `ActionPlugin#getRestHandlerWrapper()` come up in discuss forums all the time. This change adds an example to the javadoc how this method should/could be used.	2018-10-10 17:28:07 +01:00
David Turner	52a3a19551	Add low-level bootstrap implementation (#34345 ) Today we inject the initial configuration of the cluster (i.e. the set of voting nodes) at startup. In reality we must support injecting the initial configuration after startup too. This commit adds low-level support for doing so as safely as possible.	2018-10-08 15:56:48 +01:00
Yannick Welsch	49cbcaff4f	Allow excluding folder names when scanning for dangling indices (#34349 ) ES is scanning for dangling indices on every cluster state update. For this, it lists the subfolders of the indices directory to determine which extra index directories exist on the node where there's no corresponding index in the cluster state. These are potential targets for dangling index import. On certain machine types, and with large number of indices, this subfolder listing can be horribly slow. This means that every cluster state update will be slowed down by potentially hundreds of milliseconds. One of the reasons for this poor performance is that Files.isDirectory() is a relatively expensive call on some OS and JDK versions. There is no need though to do all these isDirectory calls for folders which we know we are going to discard anyhow in the next step of the dangling indices logic. This commit allows adding an exclusion predicate to the availableIndexFolders methods which can dramatically speed up this method when scanning for dangling indices.	2018-10-08 15:35:50 +02:00
David Turner	ac99d1d66d	Fix bugs in fixLag() (#34346 ) The hack to work around lag detection had some issues: - it always called runFor(), even if no lag was detected - it looked at the last-accepted state not the last-applied state, so missed some lag situations. This fixes these issues.	2018-10-08 11:33:25 +01:00
Nik Everett	06993e0c35	Logging: Make ESLoggerFactory package private (#34199 ) Since all calls to `ESLoggerFactory` outside of the logging package were deprecated, it seemed like it'd simplify things to migrate all of the deprecated calls and declare `ESLoggerFactory` to be package private. This does that.	2018-10-06 09:54:08 -04:00
David Turner	03da4f6c51	Gather votes from all nodes (#34335 ) Today we accept that some nodes may vote for the wrong master in an election. This is mostly fine because they do end up joining the correct master in the end, but the lack of a vote from every follower may prevent a future desirable reconfiguration from taking place. The solution is to hold another election in a yet-higher term in order to collect a complete set of votes. Elections are somewhat disruptive so we should think carefully about when this election should take place. One option is to wait as late as possible (on the grounds that it might not ever be necessary). This unfortunately makes it harder to predict how an apparently-smoothly-running cluster will react to nodes leaving and joining. Instead we prefer to perform the election as soon as possible in the leader's term, adding "votes from all followers" to the invariants that we expect to hold in a stable cluster. The start of a leader's term is already a somewhat disrupted time for the cluster, so performing another election at this point does not materially change the cluster's behaviour. This change implements the logic needed to trigger a new election in order to satisfy this extra stabilisation condition.	2018-10-06 07:22:04 +01:00
Daniel Mitterdorfer	7d826916b9	Adjust size of BigArrays in circuit breaker test With this commit we restore the previous behavior in `BigArraysTests#testMaxSizeExceededOnResize` but lower the sizes that are tested to the range between 256 bytes to 16 kB so the test does not produce a whole lot of garbage. The previous attempt to reduce the amount of garbage produced by that test was to properly size the array initially but it failed to account for object alignment which lead to test failures in some cases. While it would be possible to account for object alignment, we would need to open up BigArrays or directly use the underlying Lucene API which would require us to allocate an array upfront only to find its size (incl. object alignment). Instead we have fixed this issue by conservatively sizing the array initially (so the initial allocation will never trip the circuit breaker) and reduce garbage by reducing the circuit breaker's upper bound as described previously. Closes #33750 Relates #34325	2018-10-05 15:39:08 +02:00
Jim Ferenczi	5c7b52e930	Adapt bwc version after backport Relates #33587	2018-10-05 13:07:39 +02:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
David Turner	29d7d1d503	Minor housekeeping of tests (#34315 ) From experience with #34257, here are a few things that help with analysing logs from test runs. Also we prevent trying to stabilise a cluster with raised delay variability, because lowering the delay variability requires time to allow all the extra-varied-scheduled tasks to work their way out of the system.	2018-10-05 07:57:03 +01:00
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is a beneficial for it is going to also be working once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (also it was renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
Nik Everett	09aaed4fe4	Tasks: Document that status is not semvered (#34270 ) The `status` part of the tasks API reflects the internal status of a running task. In general, we do not make backwards breaking changes to the `status` but because it is internal we reserve the right to do so. I suspect we will very rarely excercise that right but it is important that we have it so we're not boxed into any particular implementation for a request. In some sense this is policy making by documentation change. In another it is clarification of the way we've always thought of this field. I also reflect the documentation change into the Javadoc in a few places. There I acknowledge Kibana's "special relationship" with Elasticsearch. Kibana parses `_reindex`'s `status` field and, because we're friends with those folks, we should talk to them before we make backwards breaking changes to it. We want to be friends with everyone but there is only so much time in the day and we don't want to make backwards breaking fields to `status` at all anyway. So we hope that breaking changes documentation should be enough for other folks. Relates to #34245.	2018-10-04 14:42:37 -04:00
Yannick Welsch	b32abcbd00	Zen2: Add Cluster State Applier (#34257 ) Adds the cluster state applier to Coordinator, and adds tests for cluster state acking.	2018-10-04 20:33:28 +02:00
Vladimir Dolzhenko	dcfe64e0e4	[CI] Fix bogus ScheduleWithFixedDelayTests.testRunnableRunsAtMostOnceAfterCancellation Closes #34004	2018-10-04 16:31:56 +02:00
Armin Braun	3ccfc3de58	SCRIPTING: Terms set query expression (#33856 ) * SCRIPTING: Add Expr. Compile for TermSetQuery Ctx. * Follow up to #33602 adding the ability to compile TermsSetQuery scripts with the expressions engine in the same way we support SearchScript in Expressions * Duplicated the code here for now to make the change less complex, the only difference to SearchScript is that `_score` and `_value` are not handled for TermsSetQuery * remove redundant check	2018-10-04 16:03:57 +02:00
Nik Everett	ab8a5563f2	Logging: Drop remaining Settings log ctor (#34149 ) Drops the last logging constructor that takes `Settings` because it is no longer needed. Watcher goes through a lot of effort to pass `Settings` to `Logger` constructors and dropping `Settings` from all of those calls allowed us to remove quite a bit of log-based ceremony from watcher.	2018-10-04 09:18:04 -04:00
David Turner	c6b0f08472	Add safety phase to CoordinatorTests (#34241 ) Today's CoordinatorTests have a limited amount of randomisation in how things are scheduled. However, to be fully confident in Zen2's liveness we require the system to stabilise after any permitted sequence of events. We can achieve this by running the system in a much more random fashion for a while, with much larger variation in when things are scheduled (simulating GC pressure and network disruption) and then continuing to assert that the system stabilises as we expect. When running randomly, we do not expect to make significant progress and merely verify that no safety property is violated. This change introduces the runRandomly() test method which implements this idea. It also fixes a handful of liveness bugs that this first version of runRandomly() exposed.	2018-10-04 07:40:26 +01:00
Jim Ferenczi	e8b986cc37	Fix sporadic failure in NestedObjectMapperTests Relates #34225	2018-10-04 07:40:46 +02:00
Nhat Nguyen	6dd716b0c4	Replace version with reader cache key in IndicesRequestCache (#34189 ) Today we use the version of a DirectoryReader as a component of the key of IndicesRequestCache. This usage is perfectly fine since the version is advanced every time a new change is made into IndexWriter. In other words, two DirectoryReaders with the same version should have the same content. However, this invariant is only guaranteed in the context of a single IndexWriter because the version is reset to the committed version value when IndexWriter is re-opened. Since #33473, each IndexShard may have more than one IndexWriter, and using the version of a DirectoryReader as a part of the cache key can cause IndicesRequestCache to return stale cached values. For example, in #27650, we rollback the engine (i.e., re-open IndexWriter), index new documents, refresh, then make a count request, but the search layer mistakenly returns the count of the DirectoryReader of the previous IndexWriter because the current DirectoryReader has the same version of the old DirectoryReader even their documents are different. This is possible because these two readers come from different IndexWriters. This commit replaces the the version with the reader cache key of IndexReader as a component of the cache key of IndicesRequestCache. Closes #27650 Relates #33473	2018-10-03 21:03:24 -04:00
David Turner	cbe1cf98c6	Merge branch 'master' into zen2	2018-10-03 22:12:56 +01:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Jim Ferenczi	ee21067a41	Add early termination support for min/max aggregations (#33375 ) This commit adds the support to early terminate the collection of a leaf in the min/max aggregator. If the query matches all documents the min and max value for a numeric field can be retrieved efficiently in the points reader. This change applies this optimization when possible.	2018-10-03 18:33:39 +02:00
Lee Hinman	90c55f5e36	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-03 09:11:28 -06:00
albendz	f09190c14d	Require combine and reduce scripts in scripted metrics aggregation (#33452 ) * Make text message not required in constructor for slack * Remove unnecessary comments in test file * Throw exception when reduce or combine is not provided; update tests * Update integration tests for scripted metrics to always include reduce and combine * Remove some old changes from previous branches * Rearrange script presence checks to be earlier in build * Change null check order in script builder for aggregated metrics; correct test scripts in IT * Add breaking change details to PR	2018-10-03 15:22:01 +01:00
Jim Ferenczi	41528c0813	Adapt bwc version after backport (bis) Relates #34225	2018-10-03 14:24:01 +02:00
Jim Ferenczi	1aa8e72be7	Adapt bwc version after backport Relates #34225	2018-10-03 12:24:07 +02:00
Jim Ferenczi	5a3e031831	Preserve the order of nested documents in the Lucene index (#34225 ) Today we reverse the initial order of the nested documents when we index them in order to ensure that parents documents appear after their children. This means that a query will always match nested documents in the reverse order of their offsets in the source document. Reversing all documents is not needed so this change ensures that parents documents appear after their children without modifying the initial order in each nested level. This allows to match children in the order of their appearance in the source document which is a requirement to efficiently implement #33587. Old indices created before this change will continue to reverse the order of nested documents to ensure backwark compatibility.	2018-10-03 11:55:30 +02:00
Colin Goodheart-Smithe	2d64e3db9a	Adds trace logging to IndicesRequestCache (#34180 ) * Adds trace logging to IndicesRequestCache This change adds trace level logging to `IndicesrrequestCache` witht eh primary aim of helping to identify the cause of teh failures in https://github.com/elastic/elasticsearch/issues/32827. The cache will log at trace level when a cache hit or miss occurs including the reader version and the cache key. Note that this change adds a `cacheKeyRenderer` whcih supplies a human readable String of the cache key since the actual cache key itself is a `BytesReference` containing the wire protocol serialised form of the request. Logging is also added for the case where a search timeout occurs and fr that reason the cache entry is invalidated. * Adds comment to remaind us to remove cacheKeyRenderer	2018-10-03 08:58:33 +01:00
David Turner	a9eae1d068	Merge branch 'master' into zen2	2018-10-03 08:36:34 +01:00
Gordon Brown	fb907706ec	Merge branch 'master' into index-lifecycle	2018-10-02 13:43:46 -06:00
Dimitrios Liappis	f12e0a8398	Add ES version 6.4.3 (#34239 ) Version bump	2018-10-02 21:15:58 +03:00
David Turner	a7ce4b31ed	Fix logging of cluster state update descriptions (#34182 ) In #28941 we changed the computation of cluster state task descriptions but this introduced a bug in which we only log the empty descriptions (rather than the non-empty ones). This change fixes that.	2018-10-02 19:08:19 +01:00
Christoph Büscher	5183ea3d68	Use OptionalInt instead of Optional<Integer> (#34220 ) Optionals containing boxed primitive types are prohibitively costly because they have two level of boxing. For Optional<Integer> the analogous OptionalInt can be used to avoid the boxing of the contained int value.	2018-10-02 15:58:07 +02:00
Jim Ferenczi	ead6ffce54	Fix cross fields mode of the query_string query (#34216 ) This change fixes a bug in the cross fields mode of the `query_string` query. The multi fields query builder must be reseted before parsing in order to clear the list of expanded fields coming from the previous text block. Closes #34215	2018-10-02 14:53:26 +02:00
Przemyslaw Gomulka	3f8cc89c9f	Completion types with multi-fields support (#34081 ) Mappings with completion type and multi-fields, were not able to index array or object format on completion fields. Only string format was supported. This is fixed by providing multiField parser with externalValueContext with already parsed object closes #15115	2018-10-02 14:32:56 +02:00
Alexander Reelsen	b1b0f3276b	Core: Add methods to get locale/timezone in DateFormatter (#34113 ) This adds some method into the `DateFormatter` interface, namely * `withLocale()` to change the locale of a date formatter * `getLocale()` * `getZone()` * `hashCode()` * `equals()` These methods will be needed for aggregations and mapping changes, where zones and locales can be specified in the mapping or in search/aggs parts of a search request.	2018-10-02 14:13:30 +02:00
David Turner	a127805b4a	[Zen2] Simulate scheduling delays (#34181 ) Today we schedule tasks (both immediate and future ones) exactly when requested. In fact it is more realistic to allow for a small amount of delay in the scheduling of tasks, and this helps to exercise more interleavings of actions and therefore to improve test coverage. This change adds to the DeterministicTaskQueue the ability to add a random delay to the scheduling of tasks. This change also provides more explicit timeouts for stabilisation in the CoordinatorTests. Using the randomised scheduling feature in the CoordinatorTests also found a situation in which we could become a leader, then a candidate, and then a leader again very quickly, causing a clash of the _BECOME_MASTER_ and _FINISH_ELECTION_ tasks. We change their behaviour to not consider these duplicates to be problematic.	2018-10-02 11:22:05 +01:00
Jim Ferenczi	aba4a59d0d	Handle terms query when detecting if a query can match nested docs (#34072 ) When nested objects are present in the mappings, we add a filter in queries to exclude them if there is no evidence that the query cannot match in this space. In 6x we visit the query in order to find a mandatory clause that can match root documents only. If we find one we can omit the nested documents filter. Currently only `term` and `range` queries are checked, this change adds the support for `terms` query to effectively remove the nested filter if a mandatory `terms` clause targets a non-nested field. Closes #34067	2018-10-02 09:30:23 +02:00
David Turner	2aff005a69	Clean up TransportMasterNodeAction (#34076 ) Mainly this fixes a warning by replacing the unchecked `new ActionListener` with the checked `new ActionListener<Response>`, and it also fixes the line length violations in this class.	2018-10-02 03:17:55 +01:00
Lee Hinman	2d9cb21490	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-01 14:10:09 -06:00
Christophe Bismuth	2923fb5b31	Disallow "enabled" attribute change for types in mapping update (#33933 ) This commit adds a check for "enabled" attribute change for types when a RestPutMappingAction is received. A MappingException is thrown when such a change is detected. Change are prevented in both ways: "false -> true" and "true -> false". Closes #33566	2018-10-01 20:49:08 +02:00
Vladimir Dolzhenko	2e2ae19b97	drop elasticsearch-translog for 7.0 (#33373 ) #32281 adds elasticsearch-shard to provide bwc version of elasticsearch-translog for 6.x; have to remove elasticsearch-translog for 7.0 Relates to #31389	2018-10-01 16:21:14 +02:00
Christoph Büscher	17e6932bf3	[Tests] Rename DocumentMapperMergeTests (#34121 ) Renaming to simply DocumentMapperTests to indicate this is where other unit tests should go. Also removing outdates Todo in DocumentMapperParserTests.	2018-10-01 10:29:19 +02:00
Jason Tedor	e2bd2028d8	Allow specifying shard changes batch sizes in bytes (#34168 ) This commit changes the shard changes requests from using a raw byte value to being able to be specified using bytes units (e.g., 4mb).	2018-09-30 14:22:22 -04:00
Martijn van Groningen	b1a27b2e6b	[CCR] Add unfollow API (#34132 ) The unfollow API changes a follower index into a regular index, so that it will accept write requests from clients. For the unfollow api to work the index follow needs to be stopped and the index needs to be closed. Closes #33931	2018-09-30 19:19:34 +02:00
Nhat Nguyen	ad61398879	CCR: Optimize indexing ops using seq_no on followers (#34099 ) This change introduces the indexing optimization using sequence numbers in the FollowingEngine. This optimization uses the max_seq_no_updates which is tracked on the primary of the leader and replicated to replicas and followers. Relates #33656	2018-09-28 20:42:26 -04:00
Ryan Ernst	47cbae9b26	Scripting: Remove ExecutableScript (#34154 ) This commit removes the legacy ExecutableScript, which was no longer used except in tests. All uses have previously been converted to script contexts.	2018-09-28 17:13:08 -07:00
Lee Hinman	6ea396a476	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-28 15:40:12 -06:00
Yannick Welsch	412face402	Move NodeRemovalClusterStateTaskExecutor out of ZenDiscovery (#34147 ) Allows this class to be cleanly shared between Zen1 and Zen2. Follow-up to #33917	2018-09-28 23:12:59 +02:00
Armin Braun	76dd3948f3	TESTS: Relax Assertion About Deleting Shard Dir (#34120 ) * TESTS: Relax Assertion About Deleting Shard Dir * Allow empty state directory to prevent test from failing * Closes #32686	2018-09-28 19:09:49 +02:00
Ryan Ernst	95977f4db9	Scripting: Add watcher script contexts (#34059 ) This commit removes the use of ExecutableScript from watcher in favor of custom script contexts for both watcher condition scripts and transform scripts.	2018-09-28 07:58:17 -07:00
Hendrik Muhs	e2f310b56c	Fix AggregationFactories.Builder equality and hash regarding order (#34005 ) Fixes the equals and hash function to ignore the order of aggregations to ensure equality after serialization and deserialization. This ensures storing configs with aggregation works properly. This also addresses a potential issue in caching when the same query contains aggregations but in different order. 1st it will not hit in the cache, 2nd cache objects which shall be equal might end up twice in the cache.	2018-09-28 13:30:50 +02:00
David Turner	980cfc69d6	Integrate FollowerChecker with Coordinator (#34075 ) This change ensures that the leader node periodically checks that its followers are healthy, and that they are removed from the cluster if not.	2018-09-28 12:29:34 +01:00
Armin Braun	c4b831645c	MINOR: Remove some deadcode in NodeEnv and Related (#34133 )	2018-09-28 12:40:20 +02:00
Alexander Reelsen	bc7d69f74a	Core: Don't rely on java time for epoch seconds formatting (#34086 ) In order to be compatible with joda time, this adds an epoch seconds formatter, that is able to parse floating point values. However joda time discards the floating point values, but still parses the data, where as this one is able to parse the whole value including milliseconds.	2018-09-28 10:53:33 +02:00
Alan Woodward	f243d75f59	Remove special-casing of Synonym filters in AnalysisRegistry (#34034 ) The synonym filters no longer need access to the AnalysisRegistry in their constructors, so we can remove the special-case code and move them to the common analysis module. This commit means that synonyms are no longer available for `server` integration tests, so several of these are either rewritten or migrated to the common analysis module as rest-spec-api tests	2018-09-28 09:02:47 +01:00
Julie Tibshirani	9cd4f70a67	Support 'string'-style queries on metadata fields when reasonable. (#34089 ) * Make sure 'ignored' and 'routing' field types inherit from StringFieldType. * Add tests for prefix and regexp queries. * Support prefix and regexp queries on _index fields.	2018-09-27 20:59:03 -07:00
Ryan Ernst	a2c941806b	Tests: Add support for custom contexts to mock scripts (#34100 ) This commit adds the ability to plug in compilation of custom contexts in mock script engine. This is needed for testing plugins which add custom contexts like watcher.	2018-09-27 12:23:59 -07:00
Jake Landis	73ee721b29	ingest: correctly measure chained pipeline stats (#33912 ) Prior to this change when a pipeline processor called another pipeline, only the stats for the first processor were recorded. The stats for the subsequent pipelines were ignored. This change properly accounts for pipelines irregardless if they are the first or subsequently called pipelines. This change moves the state of the stats from the IngestService to the pipeline itself. Cluster updates are safe since the pipelines map is atomically swapped, and if a cluster update happens while iterating over stats (now read directly from the pipeline) a slightly stale view of stats may be shown.	2018-09-27 13:54:26 -05:00
Lee Hinman	a26cc1a242	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-27 11:00:37 -06:00
Jason Tedor	899a7c7d99	Fix remote cluster seeds fallback (#34090 ) Recently we introduced the settings cluster.remote to take the place of search.remote for configuring remote cluster connections. We made this change due to the fact that we have generalized the remote cluster infrastructure to also be used within cross-cluster replication and not only cross-cluster search. For backwards compatibility, when we made this change, we allowed that cluster.remote would fallback to search.remote. Alas, the initial change for this contained a bug for handling the proxy and seeds settings. The bug for the seeds settings arose because we were manually iterating over the concrete settings only for cluster.remote seeds but not for search.remote seeds. This commit addresses this by iterating over both cluster.remote seeds and search.remote seeds. Additionally, when checking for existence of proxy settings, we have to not only check cluster.remote proxy settings, but also fallback to search.remote proxy settings. This commit addresses both issues, and adds tests for these situations.	2018-09-27 09:47:51 -04:00
Jim Ferenczi	269ae0bc15	Handle MatchNoDocsQuery in span query wrappers (#34106 ) * Handle MatchNoDocsQuery in span query wrappers This change adds a new SpanMatchNoDocsQuery query that replaces MatchNoDocsQuery in the span query wrappers. The `wildcard` query now returns MatchNoDocsQuery if the target field is not in the mapping (#34093) so we need the equivalent span query in order to be able to pass it to other span wrappers. Closes #34105	2018-09-27 14:19:08 +02:00
Christoph Büscher	cb4cdf17f0	Update MovAvgIT AwaitsFix bug url	2018-09-27 11:11:21 +02:00
Simon Willnauer	bda7bc145b	Fold EngineSearcher into Engine.Searcher (#34082 ) EngineSearcher can be easily folded into Engine.Searcher which removes a level of inheritance that is necessary for most of it's subclasses. This change folds it into Engine.Searcher and removes the dependency on ReferenceManager.	2018-09-27 09:06:04 +02:00
Armin Braun	acd80a1e07	TESTS: Enable DEBUG Logging in Flaky Test (#34091 ) * This should surface what errors are thrown on CI and in org.elasticsearch.transport.RemoteClusterConnection.ConnectHandler#collectRemoteNodes (the sequence of caught error in the last catch block and moving on to the next seed node seems to be the only path by which the errors logged in #33756 could come about) * Relates #33756	2018-09-27 06:02:24 +02:00
Nhat Nguyen	ea9b33527e	TEST: Add engine is closed as expected failure msg This commit adds "engine is closed" as an expected failure message. This change is due to #33967 in which we might access a closed engine on promotion. Relates #33967	2018-09-26 22:38:55 -04:00
Nhat Nguyen	12d94e44b8	Adjust bwc version for max_seq_no_of_updates Relates #33967 Relates #33842	2018-09-26 22:12:19 -04:00
Simon Willnauer	ae8e54493d	Build DocStats from SegmentInfos in ReadOnlyEngine (#34079 ) This change is related to #33903 that ports the DocStats simplification to the master branch. This change builds the docStats in the ReadOnlyEngine from the last committed segment infos rather than the reader. Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2018-09-27 00:16:17 +02:00
Julie Tibshirani	1d08f63eff	When creating wildcard queries, use MatchNoDocsQuery when the field type doesn't exist. (#34093 )	2018-09-26 15:08:35 -07:00
Simon Willnauer	2b730d1b9d	Mute MovAvgIT#testHoltWintersNotEnoughData Relates to #34098	2018-09-26 23:50:31 +02:00
Mayya Sharipova	80c5d30f30	XContentBuilder to handle BigInteger and BigDecimal (#32888 ) Although we allow to index BigInteger and BigDecimal into a keyword field, source filtering on these fields would fail as XContentBuilder was not able to deserialize BigInteger and BigDecimal to json. This modifies XContentBuilder to allow to handle BigInteger and BigDecimal. Closes #32395	2018-09-26 14:24:31 -04:00
Julie Tibshirani	de8bfb908f	Delegate wildcard query creation to MappedFieldType. (#34062 ) * Delegate wildcard query creation to MappedFieldType. * Disallow wildcard queries on collation fields. * Disallow wildcard queries on non-string fields.	2018-09-26 09:36:41 -07:00
Nik Everett	ddce9704d4	Logging: Drop two deprecated methods (#34055 ) This drops two deprecated methods from `ESLoggerFactory`, switching all calls to those methods to calls to methods of the same name on `LogManager`.	2018-09-26 11:20:52 -04:00
Ryan Ernst	7800b4fa91	Core: Abstract DateMathParser in an interface (#33905 ) This commits creates a DateMathParser interface, which is already implemented for both joda and java time. While currently the java time DateMathParser is not used, this change will allow a followup which will create a DateMathParser from a DateFormatter, so the caller does not need to know the internals of the DateFormatter they have.	2018-09-26 07:56:25 -07:00
Zachary Tong	25d74bd0cb	Prefer mapped aggs to lead reductions (#33528 ) Previously, unmapped aggs try to delegate reduction to a sibling agg that is mapped. That delegated agg will run the reductions, and also reduce any pipeline aggs. But because delegation comes before running pipelines, the unmapped agg _also_ tries to run pipeline aggs. This causes the pipeline to run twice, and potentially double it's output in buckets which can create invalid JSON (e.g. same key multiple times) and break when converting to maps. This fixes by sorting the list of aggregations ahead of time so that mapped aggs appear first, meaning they preferentially lead the reduction. If all aggs are unmapped, the first unmapped agg simply creates a new unmapped object and returns that for the reduction. This means that unmapped aggs no longer defer and there is no chance for a secondary execution of pipelines (or other side effects caused by deferring execution). Closes #33514	2018-09-26 10:09:31 -04:00
Nik Everett	1871e7f7e9	Search: Simply SingleFieldsVisitor (#34052 ) `SingleFieldsVisitor` is meant to load a single stored field but it manages to be quite complex to reason about because it inherits from our "basic" `FieldsVisitor` which is designed to load many fields. This breaks that inheritance and adds logic to `SingleFieldsVisitor` so it can be properly stand alone. While this amounts to more lines of code they ought to be significantly easier to reason about.	2018-09-26 09:48:15 -04:00
David Roberts	1413ace74f	Mute testSplitFromOneToN and testCreateShrinkIndexToN on Windows Relates #34080	2018-09-26 14:02:14 +01:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases were we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
David Turner	d995fc85c6	Integrate LeaderChecker with Coordinator (#34049 ) This change ensures that follower nodes periodically check that their leader is healthy, and that they elect a new leader if not.	2018-09-26 12:18:13 +01:00
Jim Ferenczi	a255880497	Add nested and object fields to field capabilities response (#33803 ) This commit adds nested and object fields to the field capabilities response. Closes #33237	2018-09-26 08:59:41 +02:00
Ryan Ernst	be8475955e	Scripting: Use ParameterMap for deprecated ctx var in update scripts (#34065 ) This commit removes the sysprop controlling whether ctx is in params for update scripts and replaces it with use of the new ParameterMap, which outputs a deprecation warning whenever params.ctx is used.	2018-09-25 22:08:02 -07:00
Nhat Nguyen	8a56369f5b	Move max_unsafe_auto_id_timestamp constant to Engine (#34025 ) We should not access InternalEngine in other classes.	2018-09-25 19:20:00 -04:00
Jim Ferenczi	0f878eff19	Add a limit for graph phrase query expansion (#34031 ) Today query parsers throw TooManyClauses exception when a query creates too many clauses. However graph phrase queries do not respect this limit. This change adds a protection against crazy expansions that can happen when building a graph phrase query. This is a temporary copy of the fix available in https://issues.apache.org/jira/browse/LUCENE-8479 but not merged yet. This logic will be removed when we integrate the Lucene patch in a future release.	2018-09-25 21:38:47 +02:00
Igor Motov	1e6780d703	Mute AckClusterUpdateSettingsIT Tracked by #33673	2018-09-25 14:16:47 -04:00
Armin Braun	0ba1855740	INGEST: Tests for Drop Processor (#33430 ) * INGEST: Tests for Drop Processor * UT for behavior of dropped callback and drop processor * Moved drop processor to `server` project to enable this test * Simple IT * Relates #32278	2018-09-25 19:29:22 +02:00
Christoph Büscher	ecc087a5bb	Remove Join utility class (#34037 ) The functionality can be replaces with String.join in new Java versions.	2018-09-25 15:25:54 +02:00
David Turner	f886eebd99	Fix CoordinatorTests some more (#34039 ) Today the `CoordinatorTests` are not completely reliable. These changes make them more so, by removing a couple of assertions that we do not expect to pass (yet).	2018-09-25 14:04:22 +01:00
David Turner	7c63f5455b	Use a threadsafe map in SearchAsyncActionTests (#33700 ) Today `SearchAsyncActionTests#testFanOutAndCollect` uses a simple `HashMap` for the `nodeToContextMap` variable, which is then accessed from multiple threads without, apparently, explicit synchronisation. This provides an explanation for the test failure identified in #29242 in which `.toString()` returns `"[]"` just before `.isEmpty` returns `false`, without any concurrent modifications. This change converts `nodeToContextMap` to a `newConcurrentMap()` so that this cannot occur. It also fixes a race condition in the detection of double-calling the subsequent search phase. Closes #29242.	2018-09-25 13:58:05 +01:00
Nhat Nguyen	5166dd0a4c	Replicate max seq_no of updates to replicas (#33967 ) We start tracking max seq_no_of_updates on the primary in #33842. This commit replicates that value from a primary to its replicas in replication requests or the translog phase of peer-recovery. With this change, we guarantee that the value of max seq_no_of_updates on a replica when any index/delete operation is performed at least the max_seq_no_of_updates on the primary when that operation was executed. Relates #33656	2018-09-25 08:07:57 -04:00
Luca Cavanna	970407c663	[DOCS] add comment to clarify cluster name resolution (#34014 ) We currently fallback to local indices whenever a remote cluster is not found, as there may still be indices / aliases with the same name. Such behaviour is lenient but needs to be kept for backwards compatibility. Clarified that in the code so we don't forget. Relates to #26247	2018-09-25 14:03:07 +02:00
Adrien Grand	612201aee0	Fix created version for similarity validation. (#33890 ) It mistakenly uses the Elasticsearch major version instead of the Lucene major version. I noticed it when backporting, it is not noticeable on master because the only two Lucene versions that are supported, 7 and 8, encode norms the same way, unlike Lucene 6.	2018-09-25 13:48:25 +02:00
Yannick Welsch	679fb698d0	Zen2: Trigger join when active master detected (#34008 ) Triggers a join when an active master is detected. In order to avoid spamming joins, deduplicates join request based on <target, join> pair. This ensures that a new join is sent whenever the term is incremented or when a new master is found. Also changes the logging of join failures from DEBUG to INFO. These join failures should be happening rarely, and can either indicate a failed election (which should be rare) or a configuration issue.	2018-09-25 09:44:35 +02:00
David Turner	1d47c9582b	Fix CoordinatorTests (#34002 ) Today the CoordinatorTests are not very reliable if two elections are scheduled concurrently. Although we expect occasional failures due to this, in fact the failures are much more common than expected due to a handful of issues. This PR fixes these issues.	2018-09-25 08:43:47 +01:00
Hendrik Muhs	bf6cf6b6d9	refactor CompositeValuesSourceParserHelper for reusage by making it public (#33945 ) refactor CompositeValuesSourceParserHelper for reusage by making it public and moving toXContent into it	2018-09-25 09:15:52 +02:00
David Turner	3af8fc74c7	Make TransportService more test-friendly (#33869 ) Today, TransportService uses System.currentTimeMillis() to get the current time to report on things like timeouts, and enqueues lambdas for future execution. However, in tests it is useful to be able to fake out the current time and to see what all these enqueued lambdas are really for. This change alters the situation so that we can obtain the time from the more easily-faked ThreadPool#relativeTimeInMillis(), and implements some friendlier toString() methods on the various Runnables so we can see what they are later.	2018-09-25 07:50:18 +01:00
David Turner	02b483c372	Logging improvements in CoordinatorTests (#33991 ) Today, we know that CoordinatorTests sometimes fail to stabilise due to an election collision. This change improves the logging that occurs when an election collision occurs so it will be easier to see if this is happening when analysing a test failure. We also wrap the call to masterService.submitStateUpdateTask() in a context that logs the node on which it runs. We also introduce the InitialJoinAccumulator instead of using a placeholder CandidateJoinAccumulator at startup, which reduces the cases to consider in CandidateJoinAccumulator.close() and tightens up the assertions we can make here.	2018-09-24 20:07:32 +01:00
Lee Hinman	243e863f6e	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-24 10:33:51 -06:00
Armin Braun	25bc8c4b5a	Fix typo `NodeEnvironment#assertPathsDoNotExist` (#33996 ) * We want to check the individual paths here one by one to get a better to interpret assertion message	2018-09-24 17:57:27 +02:00
Julie Tibshirani	8e8bd56cc7	In MatchQuery, remove a check for fragile search analyzers. (#33927 ) As far as I can tell this guard against fragile analyzers is no longer relevant, since we stopped setting special analyzers on numeric fields (3bf6f4). Instead of removing the guard completely, I opted to keep a check for untokenized + unnormalized fields to avoid going through the analysis process unnecessarily. My motivation for simplifying this check is that I'd like to add support for `split_queries_on_whitespace` to the new 'queryable object' fields. As it stands, I would have to add a dedicated instanceof check for the new mapper, which is not optimal.	2018-09-24 08:56:13 -07:00
Yannick Welsch	2e774e146d	Zen2: Update PeerFinder term on term bump (#33992 ) Ensures that the PeerFinder always uses the correct term.	2018-09-24 17:47:15 +02:00
Tim Brooks	78e483e8d8	Introduce abstract security transport testcase (#33878 ) This commit introduces an AbstractSimpleSecurityTransportTestCase for security transports. This classes provides transport tests that are specific for security transports. Additionally, it fixes the tests referenced in #33285.	2018-09-24 09:44:44 -06:00
Ignacio Vera	df333ca305	TESTS: Make score Float#NaN when there is no max score (#33997 ) * TESTS: Make score Float#NaN when there is no max score Fixes test failure due to maxScore set to Float#MinValue instead on Float#NaN. In addition the initial value for maxScore is set to Float#NEGATIVE_INFINITY so it is an illegal value. Closes #33993	2018-09-24 17:36:48 +02:00
Luca Cavanna	e389d9e296	Clarify RemoteClusterService#groupIndices behaviour (#33899 ) When executing a cross-cluster search, we need to search against all local indices (and no remote indices) in case no indices are specified. Also, if only remote indices are specified, no local indices will be queried. We previously added empty local indices whenever they were not present in the map of the grouped indices, then we would act differently later based on the extracted remote indices. Instead, we now add the empty array for local indices only in case we need to search all local indices; the entry for local indices is not added when local indices should not be searched. This way the grouped indices reflect reality and provide a better indication of what indices will be searched.	2018-09-24 11:45:33 +02:00
Christophe Bismuth	47ed6c79ee	[TEST] Add validate query tests for empty and malformed queries (#33862 ) Relates to #33095	2018-09-24 11:21:47 +02:00
Simon Willnauer	7d703c2f92	Fix AutoQueueAdjustingExecutorBuilder settings validation (#33922 ) Settings validation in AutoQueueAdjustingExecutorBuilder always checked against a default value which means that we never can change a max queue size that is lower than the default. This change adds tests and fixes this validation.	2018-09-24 07:45:50 +02:00
Nhat Nguyen	432e61c971	Adjust bwc for resync request (#33964 ) Relates #33964	2018-09-22 19:29:38 -04:00
Nhat Nguyen	f2f08dd6c5	Adjust bwc for recovery request (#33693 ) Relates #33693	2018-09-22 19:28:20 -04:00
Nhat Nguyen	e7ae2f9d36	Propagate auto_id_timestamp in primary-replica resync (#33964 ) A follow-up of #33693 to propagate max_seen_auto_id_timestamp in a primary-replica resync. Relates #33693	2018-09-22 11:40:10 -04:00
Nhat Nguyen	7944a0cb25	Track max seq_no of updates or deletes on primary (#33842 ) This PR is the first step to use seq_no to optimize indexing operations. The idea is to track the max seq_no of either update or delete ops on a primary, and transfer this information to replicas, and replicas use it to optimize indexing plan for index operations (with assigned seq_no). The max_seq_no_of_updates on primary is initialized once when a primary finishes its local recovery or peer recovery in relocation or being promoted. After that, the max_seq_no_of_updates is only advanced internally inside an engine when processing update or delete operations. Relates #33656	2018-09-22 08:02:57 -04:00
David Turner	1761b6c85c	Introduce FollowersChecker (#33917 ) It is important that the leader periodically checks that its followers are still healthy and can remain part of its cluster. If these checks fail repeatedly then the leader should remove the faulty node from the cluster. The FollowerChecker, introduced in this commit, performs these periodic checks and deals with retries.	2018-09-22 11:34:16 +01:00
Yannick Welsch	a612dd1272	Zen2: Add node id to log output of CoordinatorTests (#33929 ) With recent changes to the logging framework, the node name can no longer be injected into the logging output using the node.name setting, which means that for the CoordinatorTests (which are simulating a cluster in a fully deterministic fashion using a single thread), as all the different nodes are running under the same test thread, we are not able to distinguish which log lines are coming from which node. This commit readds logging for node ids in the CoordinatorTests, making two very small changes to DeterministicTaskQueue and TestThreadInfoPatternConverter.	2018-09-21 18:40:12 +02:00
Vladimir Dolzhenko	9c0316869b	Store: keep IndexFormatTooOldException and IndexFormatTooNewException in corruption marker (#33920 ) Closes #33916	2018-09-21 14:00:02 +02:00
Nik Everett	cac93949fe	API: Drop deprecated methods from Retry (#33925 ) We deprecated the `Retry.withBackoff` flavors with `Settings` in 6.5 because they were no longer needed. This drops them form 7.0.	2018-09-21 07:55:50 -04:00
Christoph Büscher	b654d986d7	Add OneStatementPerLineCheck to Checkstyle rules (#33682 ) This change adds the OneStatementPerLineCheck to our checkstyle precommit checks. This rule restricts the number of statements per line to one. The resoning behind this is that it is very difficult to read multiple statements on one line. People seem to mostly use it in short lambdas and switch statements in our code base, but just going through the changes already uncovered some actual problems in randomization in test code, so I think its worth it.	2018-09-21 11:52:31 +02:00
Nhat Nguyen	5f7f793f43	Propagate max_auto_id_timestamp in peer recovery (#33693 ) Today we don't store the auto-generated timestamp of append-only operations in Lucene; and assign -1 to every index operations constructed from LuceneChangesSnapshot. This looks innocent but it generates duplicate documents on a replica if a retry append-only arrives first via peer-recovery; then an original append-only arrives via replication. Since the retry append-only (delivered via recovery) does not have timestamp, the replica will happily optimizes the original request while it should not. This change transmits the max auto-generated timestamp from the primary to replicas before translog phase in peer recovery. This timestamp will prevent replicas from optimizing append-only requests if retry counterparts have been processed. Relates #33656 Relates #33222	2018-09-20 19:53:30 -04:00
Vladimir Dolzhenko	dbe6405354	mute RemoveCorruptedShardDataCommandTests.testCorruptedIndex	2018-09-20 21:30:40 +02:00
David Turner	187f787f52	[Zen2] Introduce LeaderChecker (#33024 ) It is important that follower nodes periodically check that their leader is still healthy and that they remain part of its cluster. If these checks fail repeatedly then followers should attempt to find and join a new leader, possibly electing one in the process. The LeaderChecker, introduced in this commit, performs these periodic checks and deals with retries.	2018-09-20 20:05:55 +01:00
Nhat Nguyen	76a1a863e3	TEST: stop assertSeqNos if shards movement (#33875 ) Currently, assertSeqNos assumes that the cluster is stable at the end of the test (i.e., no more shard movement). However, this assumption does not always hold. In these cases, we can stop the assertion instead of failing a test. Closes #33704	2018-09-20 13:44:26 -04:00
Christoph Büscher	28b1d41007	Fix unused import checktyle issue	2018-09-20 19:42:15 +02:00
Nhat Nguyen	002f763c48	Restore local history from translog on promotion (#33616 ) If a shard was serving as a replica when another shard was promoted to primary, then its Lucene index was reset to the global checkpoint. However, if the new primary fails before the primary/replica resync completes and we are now being promoted, we have to restore the reverted operations by replaying the translog to avoid losing acknowledged writes. Relates #33473 Relates #32867	2018-09-20 13:21:11 -04:00
Nhat Nguyen	b13a434f59	Remove wrong assert in LocalCheckpointTrackerTests It's possible for the set "seqNos" to contain only the "unFinishedSeq" in the testConcurrentReplica test. If this is the case, the call `randomValueOtherThan` won't make any progress because the predicate will never be false. This commit removes this expectation because it's incorrect and it's no longer needed as we have a dedicated test to verify the contains method. Relates #33871	2018-09-20 13:12:19 -04:00
Alan Woodward	b33c18d316	Move SoraniNormalizationFilterFactory to the common analysis plugin (#33892 ) Follow up to #25715	2018-09-20 17:31:41 +01:00
Yannick Welsch	db327818dd	[TEST] Enable DEBUG logging on testCreateShrinkIndexToN	2018-09-20 18:16:20 +02:00
Nik Everett	f963c29876	Logging: Drop Settings from some logger lookups (#33859 ) Drops `Settings` from some of the methods to lookup loggers and deprecates another logger lookup that takes `Settings` because `Settings` is no longer required to build a logger.	2018-09-20 10:42:48 -04:00
David Turner	0b4a6ae97c	Merge commit '3522b9084b611c89ec4f06c1863542883840ed0e' into zen2	2018-09-20 15:17:47 +01:00
Jake Landis	e37e5dfc04	ingest: support simulate with verbose for pipeline processor (#33839 ) * ingest: support simulate with verbose for pipeline processor This change better supports the use of simulate?verbose with the pipeline processor. Prior to this change any pipeline processors executed with simulate?verbose would not show all intermediate processors for the inner pipelines. This changes also moves the PipelineProcess and TrackingResultProcessor classes to enable instance checks and to avoid overly public classes. As well this updates the error message for when cycles are detected in pipelines calling other pipelines.	2018-09-20 08:33:07 -05:00
Simon Willnauer	3522b9084b	Introduce a `search_throttled` threadpool (#33732 ) Today all searches happen on the search threadpool which is the correct behavior in almost any case. Yet, there are exceptions where for instance searches searches should be passed through a single-thread thread-pool to reduce impact on a node. This change adds a index-private setting that allows to mark an index as throttled for searches and forks off all non-stats searcher access to this thread-pool for indices that are marked as `index.search.throttled`	2018-09-20 13:43:11 +02:00
David Turner	c041e94349	Test that transient settings beat persistent ones (#33818 ) Transient settings override persistent settings, but in fact all of the tests that run as part of `:server:test` and `:server:integTest` will pass if the precedence is changed to be the other way round. This change adds a test that verifies the precedence is as documented.	2018-09-20 11:17:19 +01:00
Tim Vernum	8d50c10208	Mute ShrinkIndexIT.testCreateShrinkIndexToN on Windows Relates: #33857	2018-09-20 18:21:15 +10:00
Daniel Mitterdorfer	b1cc58e425	Allow to clear the fielddata cache per field With this commit we clear the fielddata cache per field as it is supposed to be. Previously we retrieved the proper field from the cache but then cleared the entire cache anyway. Closes #33798 Relates #33807	2018-09-20 08:59:53 +02:00
Tim Vernum	1f1ebb4656	Add additional null check in _cat/shards The target of the func lambda may be null (e.g. in a mixed cluster where older nodes lack some of the values) Relates: #33858 / 331caba Closes #33877	2018-09-20 06:44:13 +02:00
Nhat Nguyen	05bf9dc2e8	Add contains method to LocalCheckpointTracker (#33871 ) This change adds "contains" method to LocalCheckpointTracker. One of the use cases is to check if a given operation has been processed in an engine or not by looking up its seq_no in LocalCheckpointTracker. Relates #33656	2018-09-19 20:29:36 -04:00
Gordon Brown	90de436e55	Use custom index metadata for ILM state (#33783 ) Using index settings for ILM state is fragile and exposes too much information that doesn't need to be exposed. Using custom index metadata is more resilient and allows more controlled access to internal information. As part of these changes, moves away from using defaults for ILM-related values, in favor of using null values to clearly indicate that the value is not present.	2018-09-19 14:50:48 -06:00
Nik Everett	26c4f1fb6c	Core: Default node.name to the hostname (#33677 ) Changes the default of the `node.name` setting to the hostname of the machine on which Elasticsearch is running. Previously it was the first 8 characters of the node id. This had the advantage of producing a unique name even when the node name isn't configured but the disadvantage of being unrecognizable and not being available until fairly late in the startup process. Of particular interest is that it isn't available until after logging is configured. This forces us to use a volatile read whenever we add the node name to the log. Using the hostname is available immediately on startup and is generally recognizable but has the disadvantage of not being unique when run on machines that don't set their hostname or when multiple elasticsearch processes are run on the same host. I believe that, taken together, it is better to default to the hostname. 1. Running multiple copies of Elasticsearch on the same node is a fairly advanced feature. We do it all the as part of the elasticsearch build for testing but we make sure to set the node name then. 2. That the node.name defaults to some flavor of "localhost" on an unconfigured box feels like it isn't going to come up too much in production. I expect most production deployments to at least set the hostname. As a bonus, production deployments need no longer set the node name in most cases. At least in my experience most folks set it to the hostname anyway.	2018-09-19 15:21:29 -04:00
Simon Willnauer	a92dda2e7e	Move CompletionStats into the Engine (#33847 ) By moving CompletionStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibiltiy out of IndexShard which has quiet some complexity already. Relates to #33835	2018-09-19 20:35:57 +02:00
Simon Willnauer	0fa5758bc6	Fix potential NPE in `_cat/shards/` with partial CommonStats (#33858 ) Today if we fetch common stats from a shard we might get a partial response if the shard is closed while we fetch the stats. This causes hard to track and reproduce NPEs. This change streamlines null checking to ensure we only render stats we actually received.	2018-09-19 20:34:54 +02:00
Nik Everett	3ede13a454	Test framework fall cleaning (#33423 ) Wraps all lines in our test framework at 140 characters because that is our standard line length and removes all of the checkstyle suppressions for the test framework. Drops most of `ModuleTestCase` because it isn't used and we're moving away from using guice in the way that it wants to test anyway. Also switches a few classes that extend it but don't use it to extend `ESTestCase` instead.	2018-09-19 14:34:02 -04:00
Lee Hinman	81e9150c7a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-19 09:43:26 -06:00
Simon Willnauer	6ec12bef0d	Add missing IndexShard#readAllowed() This was lost in #33835	2018-09-19 17:07:13 +02:00
Alan Woodward	5107949402	Allow TokenFilterFactories to rewrite themselves against their preceding chain (#33702 ) We currently special-case SynonymFilterFactory and SynonymGraphFilterFactory, which need to know their predecessors in the analysis chain in order to correctly analyze their synonym lists. This special-casing doesn't work with Referring filter factories, such as the Multiplexer or Conditional filters. We also have a number of filters (eg the Multiplexer) that will break synonyms when they appear before them in a chain, because they produce multiple tokens at the same position. This commit adds two methods to the TokenFilterFactory interface. * `getChainAwareTokenFilterFactory()` allows a filter factory to rewrite itself against its preceding filter chain, or to resolve references to other filters. It replaces `ReferringFilterFactory` and `CustomAnalyzerProvider.checkAndApplySynonymFilter`, and by default returns `this`. * `getSynonymFilter()` defines whether or not a filter should be applied when building a synonym list `Analyzer`. By default it returns `true`. Fixes #33609	2018-09-19 15:52:14 +01:00
Christoph Büscher	546e7361ed	[Tests] Nudge wait time in RemoteClusterServiceTests (#33853 ) This test occasionally fails in `testCollectSearchShards` waiting on what seems to be a search request to a remote cluster for one second. Given that the test fails here very rarely I suspect maybe one second is very rarely not enough so we could fix it by increasing the max wait time slightly. Closes #33852	2018-09-19 15:58:35 +02:00
Yannick Welsch	6551b4f651	Zen2: Integrate publication pipeline into Coordinator (#33771 ) Replaces the mock integration of Publication in CoordinatorTests by the real thing.	2018-09-19 13:36:11 +02:00
Yannick Welsch	10009434bf	Merge remote-tracking branch 'elastic/master' into zen2	2018-09-19 11:18:01 +02:00
Simon Willnauer	0c77f45dc6	Move DocsStats into Engine (#33835 ) By moving DocStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibility out of IndexShard which has quiet some complexity already.	2018-09-19 11:03:11 +02:00
Vladimir Dolzhenko	a3e8b831ee	add elasticsearch-shard tool (#32281 ) Relates #31389	2018-09-19 10:28:22 +02:00
Simon Willnauer	251489d59a	Cut over to unwrap segment reader (#33843 ) The fix in #33757 introduces some workaround since FilterCodecReader didn't support unwrapping. This cuts over to a more elegant fix to access the readers segment infos.	2018-09-19 10:18:03 +02:00
Jim Ferenczi	61e1df0274	Use the global doc id to generate a random score (#33599 ) This commit changes the random_score function to use the global docID of the document rather than the segment docID to generate random scores. As a result documents that have the same segment docID within the shard will generate different scores.	2018-09-19 09:28:38 +02:00
Adrien Grand	c4261bab44	Add minimal sanity checks to custom/scripted similarities. (#33564 ) Add minimal sanity checks to custom/scripted similarities. Lucene 8 introduced more constraints on similarities, in particular: - scores must not be negative, - scores must not decrease when term freq increases, - scores must not increase when norm (interpreted as an unsigned long) increases. We can't check every single case, but could at least run some sanity checks. Relates #33309	2018-09-19 09:19:13 +02:00
Ignacio Vera	7f473b683d	Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. (#33196 ) * Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. A ConstantScore query will return the iterator of its inner query. However, when profiling, the constant score query is wrapped separately from its inner query, which distorts the times emitted by the profiler. Return the iterator directly in such a case. Closes #23430	2018-09-18 23:32:16 -07:00
Lee Hinman	c87cff22b4	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-18 13:57:41 -06:00
Zachary Tong	f4cbbcf98b	Add ES version 6.4.2 (#33831 ) Version and properties files	2018-09-18 15:25:20 -04:00
Armin Braun	c6462057a1	MINOR: Remove Some Dead Code in Scripting (#33800 ) * The is default check method is not used in ScriptType * The removed vars on ExpressionSearchScript are unused	2018-09-18 20:43:31 +02:00
Simon Willnauer	9026c3ee92	Ensure realtime `_get` and `_termvectors` don't run on the network thread (#33814 ) The change in #27500 introduces this regression that causes `_get` and `_term_vector` actions to run on the network thread if the realtime flag is set. This fixes the issue by delegating to the super method forking on the corresponding threadpool.	2018-09-18 19:53:42 +02:00
Simon Willnauer	98ccd94962	Factor out a ChannelActionListener (#33819 ) We use similar / same concepts in SerachTransportService and HandledTransportAction but both duplicate the efforts with slightly different implementation details. This streamlines sending responses / exceptions back to a channel in an ActionListener with appropriate logging.	2018-09-18 19:53:26 +02:00
Jim Ferenczi	241c74efb2	upgrade to a new snapshot of Lucene 8 (7d0a7782fa) (#33812 )	2018-09-18 18:16:40 +02:00
David Turner	421f58e172	Remove discovery-file plugin (#33257 ) In #33241 we moved the file-based discovery functionality to core Elasticsearch, but preserved the `discovery-file` plugin, and support for the existing location of the `unicast_hosts.txt` file, for BWC reasons. This commit completes the removal of this plugin.	2018-09-18 12:01:16 +01:00
Yannick Welsch	758b2f9111	Zen2: Add DisruptableMockTransport (#33713 ) Adds a mock transport implementation that allows to simulate network disruptions.	2018-09-18 11:48:24 +02:00
markharwood	2fa09f062e	New plugin - Annotated_text field type (#30364 ) New plugin for annotated_text field type. Largely a copy of `text` field type but adds ability to include markdown-like syntax in the text. The “AnnotatedText” class parses text+markup and converts into plain text and AnnotationTokens. The annotation token values are injected unchanged alongside the regular text tokens to provide a form of additional indexed overlay useful in positional searches and highlighting. Annotated_text fields do not support fielddata as we want to phase this out. Also includes a new "annotated" highlighter type that retains annotations and merges in search hits as additional annotation markup. Closes #29467	2018-09-18 10:25:27 +01:00
Armin Braun	87cedef3cf	NETWORKING:Def CName in Http Publish Addr to True (#33631 ) * Follow up to #32806 setting the setting to true for 7.x	2018-09-18 10:29:02 +02:00
Armin Braun	615f494c77	MINOR: Drop Redundant Ctx. Check in ScriptService (#33782 ) * MINOR: Drop Redundant Ctx. Check in ScriptService * This check is completely redundant, the expression script engine will throw anyway (and with a similar message) for those contexts that it cannot compile. Moreover, the update context is not the only context that is not suported by the expression engine at this point so handling the update context separately here makes no sense.	2018-09-18 07:25:22 +02:00
Or Bin	a5bad4d92c	Docs: Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' (#33744 ) Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' Closes #33728	2018-09-17 15:35:54 -04:00
Lee Hinman	7ff11b4ae1	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-17 10:41:10 -06:00
Vladimir Dolzhenko	4d0bea705c	Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641 ) Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641) Closes #33596	2018-09-17 17:56:47 +02:00
Armin Braun	a654f21599	TESTS: Fix Concurent Remote Connection Updates (#33707 ) * Same fix idea as in #10666a4 to prevent background threads trying to reconnect after the tests are done from throwing `ExecutionCancelledException` and breaking the test * Closes #30714	2018-09-17 16:38:44 +02:00
David Turner	c79fbea923	[Zen2] Implement basic cluster formation (#33668 ) This PR integrates the following pieces of machinery in the Coordinator: - discovery - pre-voting - randomised election scheduling - joining (of a new master) - publication of cluster state updates Together, these things are everything needed to form a cluster. We therefore also add the start of a test suite that allows us to assert higher-level properties of the interactions between all these pieces of machinery, with as little fake behaviour as possible. We assert one such property: "a cluster successfully forms".	2018-09-17 15:00:30 +02:00
Bukhtawar	14d57c1115	Skip rebalancing when cluster_concurrent_rebalance threshold reached (#33329 ) Allows to skip shard balancing when the cluster_concurrent_rebalance threshold is already reached, which cuts down the time spent in the rebalance method of BalancedShardsAllocator.	2018-09-17 13:13:44 +02:00
Adrien Grand	b06a082725	Improve reproducibility of BigArraysTests. Close #33750	2018-09-17 11:59:15 +02:00
Christoph Büscher	1f2a90cb39	Mute DateTimeUnitTests.testConversion	2018-09-17 11:16:50 +02:00
Yannick Welsch	01b3be917a	Merge remote-tracking branch 'elastic/master' into zen2	2018-09-17 09:59:37 +02:00
Martijn van Groningen	34379887b4	Make custom index metadata completely immutable (#33735 ) Currently `IndexMetadata#getCustomData(...)` wraps the custom metadata in an unmodifiable map, but in case there is no entry for the specified key then a NPE is thrown by Collections.unmodifiableMap(...). This is not ideal in case callers like to throw an exception with a specific message. (like in the case for ccr to indicate that the follow index was not created by the create_and_follow api and therefor incompatible as follow index) I think making `DiffableStringMap` itself immutable is better then just wrapping custom metadata with `Collections.unmodifiableMap(...)` in all methods that access it. Also removed the `equals()`, `hashcode()` and to `toString()` methods of `DiffableStringMap`, because `AbstractMap` already implements these methods.	2018-09-17 07:51:34 +02:00
Ryan Ernst	3046656ab1	Scripting: Rework joda time backcompat (#33486 ) This commit switches the joda time backcompat in scripting to use augmentation over ZonedDateTime. The augmentation methods provide compatibility with the missing methods between joda's DateTime and java's ZonedDateTime. Due to getDayOfWeek returning an enum in the java API, ZonedDateTime is wrapped so that the method can return int like the joda time does. The java time api version is renamed to getDayOfWeekEnum, which will be kept through 7.x for compatibility while users switch back to getDayOfWeek once joda compatibility is removed.	2018-09-16 19:18:00 -07:00
Ryan Ernst	e5d82c3dea	Test: Fix dv date bwc tests when no docs have a value (#32798 ) This commit adds a guard around the rare case that no documents in the 10 iterations actually have any values, thus making the warning check incorrect. closes #32779	2018-09-16 11:11:51 -07:00
Lee Hinman	e6cbaa5a78	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-14 16:27:37 -06:00
Jason Tedor	a0f0d7860e	Cleanup assertions in global checkpoint listeners (#33722 ) This commit is a cleanup of the assertions in global checkpoint listeners, simplifying them and adding some messages to them in case the assertions trip.	2018-09-14 14:45:58 -04:00
Christoph Büscher	bcbbbdf660	[Tests] Fix randomization in StringTermsIT (#33678 ) It looks like the COLLECT_SEGMENT_ORDS flag should be randomized.	2018-09-14 15:52:47 +02:00
Jason Tedor	39191331d1	Only notify ready global checkpoint listeners (#33690 ) When we add a global checkpoint listener, it is also carries along with it a value that it thinks is the current global checkpoint. This value can be above the actual global checkpoint on a shard if the listener knows the global checkpoint from another shard copy (e.g., the primary), and the current shard copy is lagging behind. Today we notify the listener whenever the global checkpoint advances, regardless if it goes above the current global checkpoint known to the listener. This commit reworks this implementation. Rather than thinking of the value associated with the listener as the current global checkpoint known to the listener, we think of it as the value that the listener is waiting for the global checkpoint to advance to (inclusive). Now instead of notifying all waiting listeners when the global checkpoint advances, we only notify those that are waiting for a value not larger than the actual global checkpoint that we advanced to.	2018-09-14 09:32:03 -04:00
Adrien Grand	4f68104865	Don't count hits via the collector if the hit count can be computed from index stats. (#33701 ) This is something that we were already doing when sorting by field, which is now also done when sorting by score. As-is this change will speed up top-k `term` queries. This could work for `match_all` queries as well when we implement the `setMinCompetitiveScore` API on their Scorer.	2018-09-14 14:59:16 +02:00
David Turner	31e8781eaa	Merge branch 'master' into zen2	2018-09-14 14:28:28 +02:00
Alexander Reelsen	faa3c16241	Core: Add DateFormatter interface for java time parsing (#33467 ) The existing approach used date formatters when a format based string like `date_time\|\|epoch_millis` was used, instead of the custom code. In order to properly solve this, a new interface called `DateFormatter` has been added, which now can be implemented for custom formatters. Currently there are two implementations, one using java time and one doing the epoch_millis formatter, which simply parses a number and then converts it to a date in UTC timezone. The DateFormatter interface now also has a method to retrieve the name of the formatter pattern, which is needed for mapping changes anyway. The existing `CompoundDateTimeFormatter` class has been removed, the name was not really nice anyway. One more minor change is the fact, that the new java time using FormatDateFormatter does not try to parse the date with its printer implementation first (which might be a strict one and fail), but a printer can now be specified in addition. This saves one potential failure/exception when parsing less strict dates. If only a printer is specified, the printer will also be used as a parser.	2018-09-14 13:55:16 +02:00
Igor Motov	b8fb83d7a4	Mute ClusterDisruptionIT#testSendingShardFailure Tracked by #33704	2018-09-14 14:24:06 +04:00
Armin Braun	0b4960ff6b	SCRIPTING: Move terms_set Context to its Own Class (#33602 ) * SCRIPTING: Move terms_set Context to its Own Class * Extracted TermsSetQueryScript * Kept mechanics close to what they were with SearchScript	2018-09-14 06:21:18 +02:00
Armin Braun	040695b64e	CORE: Disable Setting Type Validation (#33660 ) (#33669 ) * Reverts setting type validation introduced in #33503	2018-09-13 20:45:48 +02:00
Jason Tedor	e4eb631b8e	Revert "Use serializable exception in GCP listeners (#33657 )" This reverts commit `6dfe54c838`.	2018-09-13 13:55:19 -04:00
Nhat Nguyen	b3071133d4	TEST: decrease logging level in the flush test Relates #31629	2018-09-13 11:18:03 -04:00
Jason Tedor	d806a0e59d	Fix race in global checkpoint listeners test This race can occur if the latch from the listener notifies the test thread and the test thread races ahead before the scheduler thread has a chance to emit the log message. This commit fixes this test by not counting down the latch until after the log message we are going to assert on has been emitted.	2018-09-13 07:00:40 -04:00
Jason Tedor	6dfe54c838	Use serializable exception in GCP listeners (#33657 ) We used TimeoutException here but that's not serializable. This commit switches to a serializable exception so that we can test for the exception type on the remote side.	2018-09-13 06:35:36 -04:00
Colin Goodheart-Smithe	8e59de3eb2	Merge branch 'master' into index-lifecycle	2018-09-13 09:46:14 +01:00
Jim Ferenczi	6ca36bba15	Fix field mapping updates with similarity (#33634 ) This change fixes a bug introduced in 6.3 that prevents fields with an explicit similarity to be updated. It also adds a test that checks this case for similarities but also for analyzers since they could suffer from the same problem. Closes #33611	2018-09-13 09:21:27 +02:00
David Turner	5a3fd8e4e7	Use file-based discovery not MockUncasedHostsProvider (#33554 ) Today we use a special unicast hosts provider, the `MockUncasedHostsProvider`, in many integration tests, to deal with the dynamic nature of the allocation of ports to nodes. However #33241 allows us to use file-based discovery to achieve the same goal, so the special test-only `MockUncasedHostsProvider` is no longer required. This change removes `MockUncasedHostProvider` and replaces it with file-based discovery in tests based on `EsIntegTestCase`.	2018-09-13 07:37:15 +02:00
Nhat Nguyen	b097eff342	Resync fails to notify on unavaiable exceptions (#33615 ) We fail to notify the resync listener if the resync replication hits a shard unavailable exception. Moreover, we no longer need to swallow these unavailable exceptions. Relates #28571 Closes #33613	2018-09-12 21:27:59 -04:00
Jason Tedor	9b8fe85edb	Remove volatile from global checkpoint listeners (#33636 ) This field does not need to be volatile because all accesses are done under a lock. This commit removes the unnecessary volatile modifier from this field.	2018-09-12 14:38:24 -04:00
Jason Tedor	c023f67c5d	Add migration note for remote cluster settings (#33632 ) The remote cluster settings search.remote.* have been renamed to cluster.remote.* and are automatically upgraded in the cluster state on gateway recovery, and on put. This commit adds a note to the migration docs for these changes.	2018-09-12 13:37:11 -04:00
Simon Willnauer	c783488e97	Add `_source`-only snapshot repository (#32844 ) This change adds a `_source` only snapshot repository that allows to wrap any existing repository as a _backend_ to snapshot only the `_source` part including live docs markers. Snapshots taken with the `source` repository won't include any indices, doc-values or points. The snapshot will be reduced in size and functionality such that it requires full re-indexing after it's successfully restored. The restore process will copy the `_source` data locally starts a special shard and engine to allow `match_all` scrolls and searches. Any other query, or get call will fail with and unsupported operation exception. The restored index is also marked as read-only. This feature aims mainly for disaster recovery use-cases where snapshot size is a concern or where time to restore is less of an issue. NOTE: The snapshot produced by this repository is still a valid lucene index. This change doesn't allow for any longer retention policies which is out of scope for this change.	2018-09-12 17:47:10 +02:00
Jason Tedor	36ba3cda7e	Enable global checkpoint listeners to timeout (#33620 ) In cross-cluster replication, we will use global checkpoint listeners to long poll for updates to a shard. However, we do not want these polls to wait indefinitely as it could be difficult to discern if the listener is still waiting for updates versus something has gone horribly wrong and cross-cluster replication is stuck. Instead, we want these listeners to timeout after some period (for example, one minute) so that they are notified and we can update status on the following side that cross-cluster replication is still active. After this, we will immediately enter back into a poll mode. To do this, we need the ability to associate a timeout with a global checkpoint listener. This commit adds this capability.	2018-09-12 10:53:22 -04:00
Nhat Nguyen	d9bbb89b26	TEST: Adjust rollback condition when shard is empty If a shard is empty, it won't rollback its engine on promotion. This commit adjusts the expectation in the rollback test. Relates #33473	2018-09-12 08:26:02 -04:00
lipsill	c92ec1c5d7	Forbid negative `weight` in Function Score Query (#33390 ) This change forbids negative `weight` in Function Score query. Negative scores are forbidden in Lucene 8.	2018-09-12 09:16:40 +02:00
Jim Ferenczi	4561c5ee83	Clarify context suggestions filtering and boosting (#33601 ) This change clarifies the documentation of the context completion suggester regarding filtering and boosting with contexts. Unlike the suggester v1, filtering on multiple contexts works as a disjunction, a suggestion matches if it contains at least one of the provided context values and boosting selects the maximum score among the matching contexts. This commit also adapts an old test that was written for the v1 suggester and commented out for version 2 because the behavior changed.	2018-09-12 08:47:32 +02:00
Jason Tedor	c74c46edc3	Upgrade remote cluster settings (#33537 ) This commit adds settings upgraders for the search.remote.* settings that can be in the cluster state to automatically upgrade these settings to cluster.remote.*. Because of the infrastructure that we have here, these settings can be upgraded when recovering the cluster state, but also when a user tries to make a dynamic update for these settings.	2018-09-12 01:14:43 -04:00
Armin Braun	94cdf0ceba	NETWORKING: http.publish_host Should Contain CNAME (#32806 ) * NETWORKING: http.publish_host Should Contain CNAME * Closes #22029	2018-09-12 06:15:36 +02:00
Jason Tedor	9752540866	Add test coverage for global checkpoint listeners This commit adds test coverage for two cases not previously covered by the existing testing. Namely, we add coverage ensuring that the executor is used to notify listeners being added that are immediately notified because the shard is closed or because the global checkpoint is already beyond what the listener knows.	2018-09-11 23:19:27 -04:00
Nhat Nguyen	743327efc2	Reset replica engine to global checkpoint on promotion (#33473 ) When a replica starts following a newly promoted primary, it may have some operations which don't exist on the new primary. Thus we need to throw those operations to align a replica with the new primary. This can be done by first resetting an engine from the safe commit, then replaying the local translog up to the global checkpoint. Relates #32867	2018-09-11 22:09:37 -04:00
Nhat Nguyen	1e577d3ce8	Mute testIndexDeletionWhenNodeRejoins Tracked at #33613	2018-09-11 16:23:12 -04:00
Colin Goodheart-Smithe	624b84f897	Improves doc values format deprecation message (#33576 ) * Improves doc values format deprecation message This changes the deprecation message when doc values fields do not supply a format form logging a deprecation warning for each offending field individually to logging a single message which lists all offending fields Closes #33572 * Updates YAML test with new deprecation message Also adds a test to ensure multiple deprecation warnings are collated into one message * Condenses collection of fields without format check Moves the collection of fields that don't have a format to a separate loop and moves the logging of the deprecation warning to be next to it at the expesnse of looping through the field list twice * fixes typo * Fixes test	2018-09-11 14:32:43 +01:00
Alan Woodward	36bdad4895	Use IndexWriter.getFlushingBytes() rather than tracking it ourselves (#33582 ) Currently we keep track of how many bytes are currently being written to disk in an AtomicLong within InternalEngine, updating it on refresh. The IndexWriter has its own accounting for this, and exposes it via a getFlushingBytes method in the latest lucene 8 snapshot. This commit removes the InternalEngine tracking in favour of just using the IndexWriter method.	2018-09-11 13:38:44 +01:00
Jason Tedor	ad4b5e4270	Fix upgrading of list settings (#33589 ) Upgrading list settings is broken because of the conversion that we do to strings, and then when we try to put back the upgraded value we do not know that it is a representation of a list. This commit addresses this by adding special handling for list settings.	2018-09-11 08:35:42 -04:00
Simon Willnauer	517cfc3cc0	Add read-only Engine (#33563 ) This change adds an engine implementation that opens a reader on an existing index but doesn't permit any refreshes or modifications to the index. Relates to #32867 Relates to #32844	2018-09-11 14:05:14 +02:00
David Turner	1d18e2854c	Fix merge	2018-09-11 09:55:52 +02:00
David Turner	a2cd8f731e	Merge branch 'master' into zen2	2018-09-11 09:38:10 +02:00
Armin Braun	6075e159e5	Validate list values for settings (#33503 ) When we see a settings value, it could be a list. Yet this should only happen if the underlying setting type is a list setting type. This commit adds validation that when we get a setting value that is a list, that the setting that we are getting is a list setting. And similarly, if we get a value for a list setting, the underlying value should be a list.	2018-09-10 19:24:17 -04:00
Nhat Nguyen	624b6bb487	Copy and validatie soft-deletes setting on resize (#33517 ) This change copies and validates the soft-deletes setting during resize. If the source enables soft-deletes, the target must also enable it. Closes #33321	2018-09-10 17:38:58 -04:00
Colin Goodheart-Smithe	cdc4f57a77	Merge branch 'master' into index-lifecycle	2018-09-10 21:30:44 +01:00
Alan Woodward	39c3234c2f	Upgrade to latest Lucene snapshot (#33505 ) * LeafCollector.setScorer() now takes a Scorable * Scorers may not have null Weights * IndexWriter.getFlushingBytes() reports how much memory is being used by IW threads writing to disk	2018-09-10 20:51:55 +01:00
Armin Braun	9a2c77d1c3	MINOR: Remove Dead Code in SearchScript (#33569 ) * `lookup` is not used anywhere * `getLeafContext` is not used anywhere	2018-09-10 18:56:21 +02:00
Tanguy Leroux	079d130d8c	[Test] Remove duplicate method in TestShardRouting (#32815 )	2018-09-10 18:29:00 +02:00
David Turner	284c45a6ff	Strengthen FilterRoutingTests (#33149 ) Today the FilterRoutingTests take the belt-and-braces approach of excluding some node attribute values and including some others. This means that we don't really test that both inclusion and exclusion work correctly: as long as one of them works as expected then the test will pass. This change improves these tests by only using one approach at once, demonstrating that both do indeed work, and adds tests for various other scenarios too.	2018-09-10 11:23:05 +02:00
Nhat Nguyen	e6ca55bca6	Adjust bwc for stale primary recovery source (#33432 ) Relates #33432	2018-09-09 21:34:32 -04:00
Jason Tedor	6bb817004b	Add infrastructure to upgrade settings (#33536 ) In some cases we want to deprecate a setting, and then automatically upgrade uses of that setting to a replacement setting. This commit adds infrastructure for this so that we can upgrade settings when recovering the cluster state, as well as when such settings are dynamically applied on cluster update settings requests. This commit only focuses on cluster settings, index settings can build on this infrastructure in a follow-up.	2018-09-09 20:49:19 -04:00
Armin Braun	d4b212c4c9	CORE: Make Pattern Exclusion Work with Aliases (#33518 ) * CORE: Make Pattern Exclusion Work with Aliases * Adds the pattern exclusion logic to finding aliases * Closes #33395	2018-09-09 17:31:02 +02:00
S.Y. Wang	9073dbefd6	HLRC: Add put stored script support to high-level rest client (#31323 ) Relates to #27205	2018-09-09 13:47:47 +02:00
Nhat Nguyen	94e4cb64c2	Bootstrap a new history_uuid when force allocating a stale primary (#33432 ) This commit ensures that we bootstrap a new history_uuid when force allocating a stale primary. A stale primary should never be the source of an operation-based recovery to another shard which exists before the forced-allocation. Closes #26712	2018-09-08 19:29:31 -04:00
Armin Braun	f27c3dcf88	INGEST: Remove Outdated TODOs (#33458 ) * CompoundProcessor is in the ingest package now -> resolved * Java generics don't offer type checking so nothing can be done here -> remvoed TODO and test * #16019 was closed and not acted on -> todo can go away	2018-09-08 10:18:45 +02:00
Jason Tedor	9a404f3def	Include fallback settings when checking dependencies (#33522 ) Today when checking settings dependencies, we do not check if fallback settings are present. This means, for example, that if cluster.remote..seeds falls back to search.remote..seeds, and cluster.remote..skip_unavailable and search.remote..skip_unavailable depend on cluster.remote..seeds, and we have set search.remote..seeds and search.remote..skip_unavailable, then validation will fail because it is expected that cluster.ermote..seeds is set here. This commit addresses this by also checking fallback settings when validating dependencies. To do this, we adjust the settings exist method to also check for fallback settings, a case that it was not handling previously.	2018-09-07 20:09:53 -04:00
Nik Everett	190ea9a6de	Logging: Configure the node name when we have it (#32983 ) Change the logging infrastructure to handle when the node name isn't available in `elasticsearch.yml`. In that case the node name is not available until long after logging is configured. The biggest change is that the node name logging no longer fixed at pattern build time. Instead it is read from a `SetOnce` on every print. If it is unset it is printed as `unknown` so we have something that fits in the pattern. On normal startup we don't log anything until the node name is available so we never see the `unknown`s.	2018-09-07 14:31:23 -04:00
Nhat Nguyen	ab7e696108	TEST: Ensure merge triggered in _source retention test (#33487 ) We invoke force merge twice in the test to verify that recovery sources are pruned when the global checkpoint advanced. However, if the global checkpoint equals to the local checkpoint in the first force-merge, the second force-merge will be a noop because all deleted docs are expunged in the first merge already. We need to flush a new segment to make merge happen so we can verify that all recovery sources are pruned.	2018-09-07 12:58:00 -04:00
Simon Willnauer	c12d232215	Pass Directory instead of DirectoryService to Store (#33466 ) Instead of passing DirectoryService which causes yet another dependency on Store we can just pass in a Directory since we will just call `DirectoryService#newDirectory()` on it anyway.	2018-09-07 14:00:24 +02:00
Colin Goodheart-Smithe	017ffe5d12	Merge branch 'master' into index-lifecycle	2018-09-07 10:59:10 +01:00
Jim Ferenczi	79cd6385fe	Collapse package structure for metrics aggs (#33463 ) This change collapses all metrics aggregations classes into a single package `org.elasticsearch.aggregations.metrics`. It also restricts the visibility of some classes (aggregators and factories) that should not be used outside of the package. Relates #22868	2018-09-07 10:58:06 +02:00
Jim Ferenczi	34859414a0	Fix bwc serialization of total hits when track_total_hits is false	2018-09-07 10:30:53 +02:00
Nik Everett	0d45752e50	Fix IndexMetaData loads after rollover (#33394 ) When we rollover and index we write the conditions of the rollover that the old index met into the old index. Loading this index metadata requires a working `NamedXContentRegistry` that has been populated with parsers from the rollover infrastructure. We had a few loads that didn't use a working `NamedXContentRegistry` and so would fail if they ever encountered an index that had been rolled over. Here are the locations of the loads and how I fixed them: * IndexFolderUpgrader - removed entirely. It existed to support opening indices made in Elasticsearch 2.x. Since we only need this change as far back as 6.4.1 which will supports reading from indices created as far back as 5.0.0 we should be good here. * TransportNodesListGatewayStartedShards - wired the `NamedXContentRegistry` into place. * TransportNodesListShardStoreMetaData - wired the `NamedXContentRegistry` into place. * OldIndexUtils - removed entirely. It existed to support the zip based index backwards compatibility tests which we've since replaced with code that actually runs old versions of Elasticsearch. In addition to fixing the actual problem I added full cluster restart integration tests for rollover which would have caught this problem and I added an extra assertion to IndexMetaData's deserialization code which will trip if we try to deserialize and index's metadata without a fully formed `NamedXContentRegistry`. It won't catch if use the wrong `NamedXContentRegistry` but it is better than nothing. Closes #33316	2018-09-06 17:55:24 -04:00
Simon Willnauer	c6c456e8cb	Move up acquireSearcher logic to Engine (#33453 ) By moving the logic to acquire the searcher up to the engine it's simpler to build new engines that are for instance read-only.	2018-09-06 18:48:05 +02:00
Nhat Nguyen	8afe09a749	Pass TranslogRecoveryRunner to engine from outside (#33449 ) This commit allows us to use different TranslogRecoveryRunner when recovering an engine from its local translog. This change is a prerequisite for the commit-based rollback PR. Relates #32867	2018-09-06 11:59:16 -04:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Alan Woodward	e134f9b5f3	Fix generics in ScriptPlugin#getContexts() (#33426 ) Changes the return value from List<ScriptContext> to List<ScriptContext<?>> to remove raw-types warnings.	2018-09-06 09:04:22 +01:00
Alexander Reelsen	82fab40099	Core: Fix IndicesSegmentResponse.toXcontent() serialization (#33414 ) When index sorting is enabled, toXContent tried to serialize an SortField object, resulting in an exception, when using the _segments endpoint. Relates #29120	2018-09-06 09:56:20 +02:00
Daniel Mitterdorfer	5236f2b1af	Improve reproducability of RestControllerTests With this commit we use the classic parent circuit breaker which does not account for real memory usage. In those tests we want to have reproducible results and hence it makes sense to disable the real memory circuit breaker there.	2018-09-06 09:44:05 +02:00
Colin Goodheart-Smithe	b1257d873b	Merge branch 'master' into index-lifecycle	2018-09-06 08:17:40 +01:00
Martijn van Groningen	a721d09c81	[CCR] Added auto follow patterns feature (#33118 ) Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the the cluster state of remote clusters if there new leader indices that match with configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to #33007 Co-authored-by: Jason Tedor jason@tedor.me	2018-09-06 08:01:58 +02:00
Jason Tedor	d71ced1b00	Generalize search.remote settings to cluster.remote (#33413 ) With features like CCR building on the CCS infrastructure, the settings prefix search.remote makes less sense as the namespace for these remote cluster settings than does a more general namespace like cluster.remote. This commit replaces these settings with cluster.remote with a fallback to the deprecated settings search.remote.	2018-09-05 20:43:44 -04:00
Nhat Nguyen	39e3bd93c7	TEST: Create following engines in the main thread (#33391 ) There are two races in the testUpdateAndReadChangesConcurrently if the following engines are created in the worker threads. We fixed the translog issue in #33352, but there is still another race with createStore. This commit ensures that we create all engines in the main thread. Relates #33352 Closes #33344	2018-09-05 19:05:41 -04:00
Nhat Nguyen	41839cf9a8	Acquire seacher on closing engine should throw ACE (#33331 ) Closes #33330	2018-09-05 19:03:34 -04:00
Tim Brooks	b697f485bb	Introduce `TransportLogger` for common logging (#32725 ) Historically we have had a ESLoggingHandler in the netty module that logs low-level connection operations. This class just extends the netty logging handler with some (broken) message deserialization. This commit fixes this message serialization and moves the class to server. This new logger logs inbound and outbound messages. Eventually, we should move other event logging to this class (connect, close, flush). That way we will have consistent logging regards of which transport is loaded. Resolves #27306 on master. Older branches will need a different fix.	2018-09-05 16:12:37 -06:00
Tim Brooks	88c178dca6	Add sni name to SSLEngine in netty transport (#33144 ) This commit is related to #32517. It allows an "server_name" attribute on a DiscoveryNode to be propagated to the server using the TLS SNI extentsion. This functionality is only implemented for the netty security transport.	2018-09-05 16:12:10 -06:00
Armin Braun	ef1066d7f8	INGEST: Allow Repeated Invocation of Pipeline (#33419 ) * Allows repeated, non-recursive invocation of the same pipeline	2018-09-05 22:04:53 +02:00
Tal Levy	b5f7fb6882	Merge branch 'master' into index-lifecycle	2018-09-05 12:56:58 -07:00
Jim Ferenczi	50e07dd413	Add an index setting to control TieredMergePolicy#deletesPctAllowed (#32907 ) This change adds an expert index setting called `index.merge.policy.deletes_pct_allowed`. It controls the maximum percentage of deleted documents that is tolerated in the index. Lower values make the index more space efficient at the expense of increased CPU and I/O activity. Values must be between `20` and `50`. Default value is `33`.	2018-09-05 19:57:36 +02:00
Nik Everett	5c624bc55b	Logging: Further clean up logging ctors (#33378 ) Drops and unused logging constructor, simplifies a rarely used one, and removes `Settings` from a third. There is now only a single logging ctor that takes `Settings` and we'll remove that one in a follow up change.	2018-09-05 13:04:26 -04:00
Adrien Grand	46ac8d1a51	Make test less GC-intensive.	2018-09-05 18:59:43 +02:00
Christoph Büscher	eafc2a5470	Don't count metadata fields towards index.mapping.total_fields.limit (#33386 ) The maximum number of fields per index is limited to 1000 by default by the `index.mapping.total_fields.limit` setting to prevent accidental mapping explosions due to too many fields. Currently all metadata fields also count towards this limit, which can lead to some confusion when using lower limits. It is not obvious for users that they cannot actually add as many fields as are specified by the limit in this case. This change takes the number of metadata fields out of the field count that we check against the field limit. It also adds tests that check that we can add fields up to the specified limit, but throw an exception for any additional field added. Closes #24096	2018-09-05 18:27:21 +02:00
Jason Tedor	23934e39d2	Fix deprecated setting specializations (#33412 ) Deprecating a some setting specializations (e.g., list settings) does not cause deprecation warning headers and deprecation log messages to appear. This is due to a missed check for deprecation. This commit fixes this for all setting specializations, and ensures that this can not be missed again.	2018-09-05 11:01:58 -04:00
Adrien Grand	913d5fd820	Disable IndexRecoveryIT.testRerouteRecovery. Relates #32686.	2018-09-05 14:53:22 +02:00
Armin Braun	46774098d9	INGEST: Implement Drop Processor (#32278 ) * INGEST: Implement Drop Processor * Adjust Processor API * Implement Drop Processor * Closes #23726	2018-09-05 14:25:29 +02:00
Paul Sanwald	c303006e6b	Add interval response parameter to AutoDateInterval histogram (#33254 ) Adds the interval used to the aggregation response.	2018-09-05 07:35:59 -04:00
Armin Braun	4156cc3fae	MINOR+CORE: Remove Dead Methods ClusterService (#33346 ) * None of these methods are used anywhere	2018-09-05 12:08:28 +02:00
Colin Goodheart-Smithe	f00a28a909	Merge branch 'master' into index-lifecycle	2018-09-05 09:48:48 +01:00
Gordon Brown	cfd3fa72ed	Add user-defined cluster metadata (#33325 ) Adds a place for users to store cluster-wide data they wish to associate with the cluster via the Cluster Settings API. This is strictly for user-defined data, Elasticsearch makes no other other use of these settings.	2018-09-04 16:14:18 -06:00
Jim Ferenczi	dbc7102c86	Fix inner hits retrieval when stored fields are disabled (_none_) (#33018 ) Now that types are unique per mapping we can retrieve the document mapper without referencing the type. This fixes an NPE when stored fields are disabled. For 6x we'll need a different fix since mappings can still have multiple types. Relates #32941	2018-09-04 16:25:52 +02:00
Sohaib Iftikhar	761e8c461f	HLRC: Add delete by query API (#32782 ) Adds the delete-by-query API to the High Level REST Client.	2018-09-04 08:56:26 -04:00
Colin Goodheart-Smithe	92ab442aee	Merge branch 'master' into index-lifecycle	2018-09-04 10:34:49 +01:00
Julie Tibshirani	78df00ff24	Simplify the return type of FieldMapper#parse. (#32654 )	2018-09-04 01:15:19 +00:00
Jason Tedor	09bf4e5f00	Introduce private settings (#33327 ) This commit introduces the formal notion of a private setting. This enables us to register some settings that we had previously not registered as fully-fledged settings to avoid them being exposed via APIs such as the create index API. For example, we had hacks in the codebase to allow index.version.created to be passed around inside of settings objects, but was not registered as a setting so that if a user tried to use the setting on any API then they would get an exception. This prevented users from setting index.version.created on index creation, or updating it via the index settings API. By introducing private settings, we can continue to reject these attempts, yet now we can represent these settings as actual settings. In this change, we register index.version.created as an actual setting. We do not cutover all settings that we had been treating as private in this pull request, it is already quite large due to moving some tests around to account for the fact that some tests need to be able to set the index.version.created. This can be done in a follow-up change.	2018-09-03 19:17:57 -04:00
Armin Braun	1f046617bf	TESTS: Fix Race Condition in Temp Path Creation (#33352 ) * TESTS: Fix Race Condition in Temp Path Creation * Calling `createTempDir` concurrently here in the `Follower`s causes collisions at times which lead to `createEngine` throwing because of unexpected files in the newly created temp dir * Fixed by creating all temp dirs in the main test thread * closes #33344	2018-09-03 19:55:59 +02:00
Nhat Nguyen	24d60c7f4b	Fix from_range in search_after in changes snapshot (#33335 ) We can have multiple documents in Lucene with the same seq_no for parent-child documents (or without rollback). In this case, the usage "lastSeenSeqNo + 1" is an off-by-one error as it may miss some documents. This error merely affects the `skippedOperations` contract. See: https://github.com/elastic/elasticsearch/pull/33222#discussion_r213842257 Closes #33318	2018-09-03 11:58:49 -04:00
Armin Braun	42424aff21	TESTS+DISTR.: Fix testIndexCheckOnStartup Flake (#33349 ) * Ignore all `RuntimeException` since random file corruption triggers other RTE in addition to the randomly caught one * closes #33345	2018-09-03 17:06:12 +02:00
tony-dillon	a9d2b1dde8	Null completion field should not throw IAE (#33268 ) Ignore null value on the completion field Closes #33200	2018-09-03 16:49:53 +02:00
Colin Goodheart-Smithe	0bf36253a9	Adds code to help with IndicesRequestCacheIT failures (#33313 ) * Adds code to help with IndicesRequestCacheIT failures Relates to #32827 * Adds comment * Fixes test failure	2018-09-03 14:54:17 +01:00
Alexander Reelsen	246a7df8c2	Core: Fix epoch millis java time formatter (#33302 ) The existing implemention could not deal with negative numbers as well as +- 999 milliseconds around the epoch. This commit uses Instant.ofEpochMilli() and parses the input to a number instead of using a date formatter.	2018-09-03 13:13:19 +02:00
Colin Goodheart-Smithe	e2c1beb1be	Merge branch 'master' into index-lifecycle	2018-09-03 10:01:16 +01:00
Jim Ferenczi	9310d2eaf3	[CI] Mute IndexShardTests#testIndexCheckOnStartup fails #33345	2018-09-03 10:27:42 +02:00
Jim Ferenczi	2fa75b4438	[CI] Mute LuceneChangesSnapshotTests#testUpdateAndReadChangesConcurrently	2018-09-03 10:14:00 +02:00
Jim Ferenczi	713c07e14d	Add early termination support to BucketCollector (#33279 ) This commit adds the support to early terminate the collection of a leaf in the aggregation framework. This change introduces a MultiBucketCollector which handles CollectionTerminatedException exactly like the Lucene MultiCollector. Any aggregator can now throw a CollectionTerminatedException without stopping the collection of a sibling aggregator. This is useful for aggregators that can infer their result without visiting all documents (e.g.: a min/max aggregation on a match_all query).	2018-09-03 09:34:35 +02:00
Nik Everett	f8b7a4dbc8	Logging: Drop Settings from some logging ctors (#33332 ) Drops `Settings` from some logging ctors now that they are no longer needed. This should allow us to stop passing `Settings` around to quite as many places.	2018-09-02 16:51:26 -04:00
Jason Tedor	ea4eef8641	Merge branch 'master' into ccr * master: HLREST: add update by query API (#32760)	2018-09-02 16:07:50 -04:00
Sohaib Iftikhar	389bf67275	HLREST: add update by query API (#32760 ) Adds update by query to the high level rest client.	2018-09-02 15:15:00 -04:00
Nhat Nguyen	3197a6bbdd	Merge branch 'master' into ccr * master: HLRC: ML Flush job (#33187) HLRC: Adding ML Job stats (#33183) LLREST: Drop deprecated methods (#33223) Mute testSyncerOnClosingShard [DOCS] Moves machine learning APIs to docs folder (#31118)	2018-09-02 09:30:51 -04:00
Nhat Nguyen	ce635f5f15	Mute testSyncerOnClosingShard Tracked at #33330	2018-09-01 09:53:31 -04:00
Nhat Nguyen	b93507608a	Merge branch 'master' into ccr * master: Mute test watcher usage stats output [Rollup] Fix FullClusterRestart test Adjust soft-deletes version after backport into 6.5 completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194) Fix AwaitsFix issue number Mute SmokeTestWatcherWithSecurityIT testsi drop `index.shard.check_on_startup: fix` (#32279) tracked at [DOCS] Moves ml folder from x-pack/docs to docs (#33248) [DOCS] Move rollup APIs to docs (#31450) [DOCS] Rename X-Pack Commands section (#33005) TEST: Disable soft-deletes in ParentChildTestCase Fixes SecurityIntegTestCase so it always adds at least one alias (#33296) Fix pom for build-tools (#33300) Lazy evaluate java9home (#33301) SQL: test coverage for JdbcResultSet (#32813) Work around to be able to generate eclipse projects (#33295) Highlight that index_phrases only works if no slop is used (#33303) Different handling for security specific errors in the CLI. Fix for https://github.com/elastic/elasticsearch/issues/33230 (#33255) [ML] Refactor delimited file structure detection (#33233) SQL: Support multi-index format as table identifier (#33278) MINOR: Remove Dead Code from PathTrie (#33280) Enable forbiddenapis server java9 (#33245)	2018-08-31 19:03:04 -04:00
Nhat Nguyen	08b9247ce2	Adjust soft-deletes version after backport into 6.5 Relates #33222	2018-08-31 16:50:08 -04:00
Vladimir Dolzhenko	00b272af32	completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194 ) Relates to #32279	2018-08-31 22:08:28 +02:00
Vladimir Dolzhenko	3d82a30fad	drop `index.shard.check_on_startup: fix` (#32279 ) drop `index.shard.check_on_startup: fix` Relates #31389	2018-08-31 21:29:06 +02:00
Colin Goodheart-Smithe	3eef74d5d5	Merge branch 'master' into index-lifecycle	2018-08-31 14:45:22 +01:00
Armin Braun	c6cfa08a61	MINOR: Remove Dead Code from PathTrie (#33280 ) * The array size checks are redundant since the array sizes are checked earlier in those methods too * The removed methods are just not used anywhere	2018-08-31 08:40:27 +02:00
Alpar Torok	44ed5f6306	Enable forbiddenapis server java9 (#33245 )	2018-08-31 09:31:55 +03:00
Nhat Nguyen	ad4dd086d2	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand <jpountz@gmail.com> Co-authored-by: Boaz Leskes <b.leskes@gmail.com> Co-authored-by: Jason Tedor <jason@tedor.me> Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co> Co-authored-by: Simon Willnauer <simonw@apache.org>	2018-08-30 23:46:07 -04:00
Nhat Nguyen	547de71d59	Revert "Integrates soft-deletes into Elasticsearch (#33222 )" Revert to correct co-author tags. This reverts commit `6dd0aa54f6`.	2018-08-30 23:44:57 -04:00
Nhat Nguyen	d3f32273eb	Merge branch 'master' into ccr	2018-08-30 23:22:58 -04:00
Nhat Nguyen	6dd0aa54f6	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand jpountz@gmail.com Co-authored-by: Boaz Leskes b.leskes@gmail.com Co-authored-by: Jason Tedor jason@tedor.me Co-authored-by: Martijn van Groningen martijn.v.groningen@gmail.com Co-authored-by: Nhat Nguyen nhat.nguyen@elastic.co Co-authored-by: Simon Willnauer simonw@apache.org	2018-08-30 22:11:23 -04:00
Tal Levy	13a0d822d0	Merge branch 'master' into index-lifecycle	2018-08-30 15:04:17 -07:00
Lee Hinman	8a2d154bad	Update serialization versions for custom IndexMetaData backport	2018-08-30 15:56:53 -06:00
Igor Motov	001b78f704	Replace IndexMetaData.Custom with Map-based custom metadata (#32749 ) This PR removes the deprecated `Custom` class in `IndexMetaData`, in favor of a `Map<String, DiffableStringMap>` that is used to store custom index metadata. As part of this, there is now no way to set this metadata in a template or create index request (since it's only set by plugins, or dedicated REST endpoints). The `Map<String, DiffableStringMap>` is intended to be a namespaced `Map<String, String>` (`DiffableStringMap` implements `Map<String, String>`, so the signature is more like `Map<String, Map<String, String>>`). This is so we can do things like: ``` java Map<String, String> ccrMeta = indexMetaData.getCustom("ccr"); ``` And then have complete control over the metadata. This also means any plugin/feature that uses this has to manage its own BWC, as the map is just serialized as a map. It also means that if metadata is put in the map that isn't used (for instance, if a plugin were removed), it causes no failures the way an unregistered `Setting` would. The reason I use a custom `DiffableStringMap` here rather than a plain `Map<String, String>` is so the map can be diffed with previous cluster state updates for serialization. Supersedes #32683	2018-08-30 13:57:00 -06:00
Simon Willnauer	af2eaf2a6c	Remove usage of `index.shrink.source.` in 7.x (#33271 ) We cut over to `index.resize.source.` but still have these constants being public in `IndexMetaData`. Those Settings and constants are not needed in 7.x while we still need to keep the keys known to private settings since they might be part of the index settings of old indices. We can remove that in 8.0. Yet, we should remove the settings to make sure they are not used again.	2018-08-30 21:08:35 +02:00
Jim Ferenczi	d0630093cd	Fix serialization of empty field capabilities response (#33263 ) Fix serialization of empty field capabilities response When no response are required (no indices match the requested patterns) the empty response throws an NPE in the transport serialization (writeTo).	2018-08-30 18:07:58 +02:00
Jim Ferenczi	1404dd2a42	Fix nested _source retrieval with includes/excludes (#33180 ) If an exclude or an include clause removes an entry to a nested field in the original source at query time, the creation of nested hits fails with an NPE. This change fixes this exception and replaces the nested document source with an empty map. Closes #33163 Closes #33170	2018-08-30 15:15:50 +02:00
Nhat Nguyen	13261996ce	Add NoOps to Lucene for failed delete ops (#33217 ) Today we add a NoOp to Lucene and translog if we fail to process an indexing operation. However, we are only adding NoOps to translog for delete operations. In order to have a complete history in Lucene, we should add NoOps of failed delete operations to both Lucene and translog. Relates #29530	2018-08-30 07:55:13 -04:00
David Turner	47859e56ac	Move file-based discovery to core (#33241 ) Today we support a static list of seed hosts in core Elasticsearch, and allow a dynamic list of seed hosts to be provided via a file using the `discovery-file` plugin. In fact the ability to provide a dynamic list of seed hosts is increasingly useful, so this change moves this functionality to core Elasticsearch to avoid the need for a plugin. Furthermore, in order to start up nodes in integration tests we currently assign a known port to each node before startup, which unfortunately sometimes fails if another process grabs the selected port in the meantime. By moving the `discovery-file` functionality into the core product we can use it to avoid this race. This change also moves the expected path to the file from `$ES_PATH_CONF/discovery-file/unicast_hosts.txt` to `$ES_PATH_CONF/unicast_hosts.txt`. An example of this file is not included in distributions. For BWC purposes the plugin still exists, but does nothing more than create the example file in the old location, and issue a warning when it is used. We also continue to support the old location for the file, but warn about its deprecation. Relates #29244 Closes #33030	2018-08-30 06:43:04 +01:00
Armin Braun	cc4d7059bf	Ingest: Add conditional per processor (#32398 ) * Ingest: Add conditional per processor * closes #21248	2018-08-30 03:46:39 +02:00
Jason Tedor	0f22dbb1cc	Apply settings filter to get cluster settings API (#33247 ) Some settings have filters applied to them and we use this in logs and the get nodes info API. For consistency, we should apply this in the get cluster settings API too.	2018-08-29 15:56:13 -04:00
Nhat Nguyen	5632e31c74	Merge branch 'master' into ccr * master: Painless: Add Bindings (#33042) Update version after client credentials backport Fix forbidden apis on FIPS (#33202) Remote 6.x transport BWC Layer for `_shrink` (#33236) Test fix - Graph HLRC tests needed another field adding to randomisation exception list HLRC: Add ML Get Records API (#33085) [ML] Fix character set finder bug with unencodable charsets (#33234) TESTS: Fix overly long lines (#33240) Test fix - Graph HLRC test was missing field name to be excluded from randomisation logic Remove unsupported group_shard_failures parameter (#33208) Update BucketUtils#suggestShardSideQueueSize signature (#33210) Parse PEM Key files leniantly (#33173) INGEST: Add Pipeline Processor (#32473) Core: Add java time xcontent serializers (#33120) Consider multi release jars when running third party audit (#33206) Update MSI documentation (#31950) HLRC: create base timed request class (#33216) [DOCS] Fixes command page titles HLRC: Move ML protocol classes into client ml package (#33203) Scroll queries asking for rescore are considered invalid (#32918) Painless: Fix Semicolon Regression (#33212) ingest: minor - update test to include dissect (#33211) Switch remaining LLREST usage to new style Requests (#33171) HLREST: add reindex API (#32679)	2018-08-29 12:30:24 -04:00
Simon Willnauer	6a0d4b4a77	Remote 6.x transport BWC Layer for `_shrink` (#33236 ) The shrink action was renamed to `_resize` with the addition or split. This bwc layer is unnecessary on 7.x since 6.latest will always use the resize action.	2018-08-29 16:43:13 +02:00
Gordon Brown	454ce99b01	Merge branch 'master' into index-lifecycle	2018-08-29 08:28:23 -06:00
Luca Cavanna	49109187e2	Remove unsupported group_shard_failures parameter (#33208 ) We have had support for the `group_shard_failures` parameter in our code for a while, since we introduced failures grouping. When we introduced validation of parameters at REST, we seem to have forgotten to expose such parameter. Given that the parameter is effectively not supported for many months now, that no user has complained about that and that grouping is the expected behaviour, this commit removes support for the parameter.	2018-08-29 14:05:41 +02:00
Luca Cavanna	034fdbca28	Update BucketUtils#suggestShardSideQueueSize signature (#33210 ) `BucketUtils#suggestShardSideQueueSize` used to calculate the shard_size based on the number of shards. It returns now a different value only based on whether we are querying a single shard or multiple shards. This commit replaces the numberOfShards argument with a boolean that tells whether we are querying a single shard or not.	2018-08-29 13:51:54 +02:00
Armin Braun	f690b492e7	INGEST: Add Pipeline Processor (#32473 ) * INGEST: Add Pipeline Processor * Adds Processor capable of invoking other pipelines * Closes #31842	2018-08-29 11:03:10 +02:00
Alexander Reelsen	48b388ce82	Core: Add java time xcontent serializers (#33120 ) This ensures that the java time class exposed by painless have proper serialization/string representations. Closes #31853	2018-08-29 10:00:16 +02:00
Alpar Torok	f29f0af7bc	Consider multi release jars when running third party audit (#33206 ) Exclude classes meant for newer versions than what we are auditing against, those classes won't be found. There's no reason to exclude JDK classes from newer versions, with this PR, we will not extract them in the first place.	2018-08-29 09:53:04 +03:00
Mark Tozzi	84b61d0738	Scroll queries asking for rescore are considered invalid (#32918 ) This PR changes our behavior from silently ignoring rescore in a scroll query to instead report to the user that such a query is invalid. Closes #31775	2018-08-28 15:48:23 -04:00
Nhat Nguyen	c42dc77896	Merge branch 'master' into ccr * master: [Rollup] Better error message when trying to set non-rollup index (#32965) HLRC: Use Optional in validation logic (#33104) Remove unused User class from protocol (#33137) ingest: Introduce the dissect processor (#32884) [Docs] Add link to es-kotlin-wrapper-client (#32618) [Docs] Remove repeating words (#33087) Minor spelling and grammar fix (#32931) Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979) Watcher: Simplify finding next date in cron schedule (#33015) Run Third party audit with forbidden APIs CLI (part3/3) (#33052) Fix plugin build test on Windows (#33078) HLRC+MINOR: Remove Unused Private Method (#33165) Remove old unused test script files (#32970) Build analysis-icu client JAR (#33184) Ensure to generate identical NoOp for the same failure (#33141) ShardSearchFailure#readFrom to set index and shardId (#33161)	2018-08-28 13:56:38 -04:00
Sohaib Iftikhar	7f5e29ddb2	HLREST: add reindex API (#32679 ) Adds the reindex API to the high level REST client.	2018-08-28 13:02:23 -04:00
Nhat Nguyen	e39689a198	Send only ops after checkpoint in file-based recovery with soft-deletes (#33190 ) Today a file-based recovery will replay all existing translog operations from the primary on a replica so that that replica can have a full history in translog as the primary. However, with soft-deletes enabled, we should not do it because: 1. All operations before the local checkpoint of the safe commit exist in the commit already. 2. The number of operations before the local checkpoint may be considerable and requires a significant amount of time to replay on a replica. Relates #30522 Relates #29530	2018-08-28 12:32:09 -04:00
Nhat Nguyen	e2b931e80b	Use Lucene history in primary-replica resync (#33178 ) This commit makes primary-replica resyncer use Lucene as the source of history operation instead of translog if soft-deletes is enabled. With this change, we no longer expose translog snapshot directly in IndexShard. Relates #29530	2018-08-28 10:44:15 -04:00
Nhat Nguyen	d8a1b7cb17	Make soft-deletes settings final (#33172 ) For now, we do not support changing the soft-deletes setting even with closed indices. Therefore we should make it a final setting. Relates #29530	2018-08-28 08:48:42 -04:00
Jonathan Little	9d92a87ae6	Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979 )	2018-08-28 09:27:43 +01:00
Alpar Torok	2cc611604f	Run Third party audit with forbidden APIs CLI (part3/3) (#33052 ) The new implementation is functional equivalent with the old, ant based one. It parses task standard error to get the missing classes and violations in the same way. I considered re-using ForbiddenApisCliTask but Gradle makes it hard to build inheritance with tasks that have task actions , since the order of the task actions can't be controlled. This inheritance isn't dully desired either as the third party audit task is much more opinionated and we don't want to expose some of the configuration. We could probably extract a common base class without any task actions, but probably more trouble than it's worth. Closes #31715	2018-08-28 10:03:30 +03:00
Gordon Brown	50368656ee	Merge branch 'master' into index-lifecycle	2018-08-27 15:35:19 -06:00
Nhat Nguyen	014b3236dc	Ensure to generate identical NoOp for the same failure (#33141 ) We generate slightly different NoOps in InternalEngine and TransportShardBulkAction for the same failure. 1. InternalEngine uses Exception#getFailure to generate a message without the class name: newOp [NoOp{seqNo=1, primaryTerm=1, reason='Contexts are mandatory in context enabled completion field [suggest_context]'}]. 2. TransportShardBulkAction uses Exception#toString to generate a message with the class name: NoOp{seqNo=1, primaryTerm=1, reason='java.lang.IllegalArgumentException: Contexts are mandatory in context enabled completion field [suggest_context]'}. If a write operation fails while a replica is recovering, that replica will possibly receive two different NoOps: one from recovery and one from replication. These two different NoOps will trip TranslogWriter#assertNoSeqNumberConflict assertion. This commit ensures that we generate the same Noop for the same failure. Closes #32986	2018-08-27 15:59:42 -04:00
Luca Cavanna	ed0571e16c	ShardSearchFailure#readFrom to set index and shardId (#33161 ) As part of recent changes made to `ShardOperationFailedException` we introduced `index` and `shardId` members to the base class, but the subclasses are entirely responsible for the serialization of such fields. In the case of `ShardSearchFailure`, we have an additional `SearchShardTarget` instance member which also holds the index and the shardId, hence they get serialized as part of `SearchShardTarget` itself. When de-serializing a `ShardSearchFailure` though, we need to remember to also set the parent class `index` and `shardId` fields otherwise they get lost Relates to #32640	2018-08-27 20:31:27 +02:00
Jason Tedor	0e5d42ca38	Merge branch 'master' into ccr * master: Adjust BWC version on mapping version Token API supports the client_credentials grant (#33106) Build: forked compiler max memory matches jvmArgs (#33138) Introduce mapping version to index metadata (#33147) SQL: Enable aggregations to create a separate bucket for missing values (#32832) Fix grammar in contributing docs SECURITY: Fix Compile Error in ReservedRealmTests (#33166) APM server monitoring (#32515) Support only string `format` in date, root object & date range (#28117) [Rollup] Move toBuilders() methods out of rollup config objects (#32585) Fix forbiddenapis on java 11 (#33116) Apply publishing to genreate pom (#33094) Have circuit breaker succeed on unknown mem usage Do not lose default mapper on metadata updates (#33153) Fix a mappings update test (#33146) Reload Secure Settings REST specs & docs (#32990) Refactor CachingUsernamePassword realm (#32646)	2018-08-27 13:49:59 -04:00
Jason Tedor	318df2a107	Adjust BWC version on mapping version The introduction of mapping version on index metadata has been backported to 6.x. This commit adjusts the BWC version around mapping version to account for this backport.	2018-08-27 13:17:15 -04:00
Jason Tedor	2aef7e0900	Introduce mapping version to index metadata (#33147 ) This commit introduces mapping version to index metadata. This value is monotonically increasing and is updated on mapping updates. This will be useful in cross-cluster replication so that we can request mapping updates from the leader only when there is a mapping update as opposed to the strategy we employ today which is to request a mapping update any time there is an index metadata update. As index metadata updates can occur for many reasons other than mapping updates, this leads to some unnecessary requests and work in cross-cluster replication.	2018-08-27 12:21:11 -04:00
Tal Levy	5783545222	Merge branch 'master' into index-lifecycle	2018-08-27 08:19:05 -07:00
Mikita Karaliou	f1f6d4ed33	Support only string `format` in date, root object & date range (#28117 ) Limit date `format` attribute to String values only. Closes #23650	2018-08-27 12:24:51 +02:00
Daniel Mitterdorfer	06c0055c0f	Have circuit breaker succeed on unknown mem usage With this commit we implement a workaround for https://bugs.openjdk.java.net/browse/JDK-8207200 which is a race condition in the JVM that results in `IllegalArgumentException` to be thrown in rare cases when we determine memory usage via `MemoryMXBean`. As we do not want to fail requests in those cases we always return zero memory usage. Relates #31767 Relates #33125	2018-08-27 07:09:27 +02:00
Jason Tedor	143cd9bbaa	Do not lose default mapper on metadata updates (#33153 ) When applying index metadata updates we run through the mappings updating them if needed. Today if there is not an update to the default mapper, we can lose the default mapping. This means that, for example, if we apply a settings update to an index we will lose the default mapper. This happens because we were not guarding updating the default mapping with a check that the default mapping was updated in the metadata update. When there is no update in the metadata update, we need to continue to preserve the previous default mapping. This commit achieves this by moving the updating of the default mapping under the same guard that we use for updating the default mapping source. We add a test that fails before putting the update under a guard and now passes after moving the update under the guard.	2018-08-26 15:57:52 -04:00
Jason Tedor	f8b07a0d84	Fix a mappings update test (#33146 ) This commit fixes a mappings update test. The test is broken in the sense that it passes, but for the wrong reason. The test here is testing that if we make a mapping update but do not commit that mapping update then the mapper service still maintains the previous document mapper. This was not the case long, long ago when a mapping update would update the in-memory state before the cluster state update was committed. This test was passing, but it was passing because the mapping update was never even updated. It was never even updated because it was encountering a null pointer exception. Of course the in-memory state is not going to be updated in that case, we are simply going to end up with a failed cluster state update. Fixing that leads to another issue which is that the mapping source does not even parse so again we would, of course, end up with the in-memory state not being modified. We fix these issues, assert that the result cluster state task completed successfully, and finally that the in-memory state was not updated since we never committed the resulting cluster state.	2018-08-26 09:36:17 -04:00
Nhat Nguyen	75304f405b	Merge branch 'master' into ccr * master: Add proxy support to RemoteClusterConnection (#33062) TEST: Skip assertSeqNos for closed shards (#33130) TEST: resync operation on replica should acquire shard permit (#33103) Switch remaining x-pack tests to new style Requests (#33108) Switch remaining tests to new style Requests (#33109) Switch remaining ml tests to new style Requests (#33107) Build: Line up IDE detection logic Security index expands to a single replica (#33131) HLRC: request/response homogeneity and JavaDoc improvements (#33133) Checkstyle! [Test] Fix sporadic failure in MembershipActionTests Revert "Do NOT allow termvectors on nested fields (#32728)" [Rollup] Move toAggCap() methods out of rollup config objects (#32583) Fix race condition in scheduler engine test	2018-08-25 21:41:53 -04:00
Simon Willnauer	3376922e8b	Add proxy support to RemoteClusterConnection (#33062 ) This adds support for connecting to a remote cluster through a tcp proxy. A remote cluster can configured with an additional `search.remote.$clustername.proxy` setting. This proxy will be used to connect to remote nodes for every node connection established. We still try to sniff the remote clsuter and connect to nodes directly through the proxy which has to support some kind of routing to these nodes. Yet, this routing mechanism requires the handshake request to include some kind of information where to route to which is not yet implemented. The effort to use the hostname and an optional node attribute for routing is tracked in #32517 Closes #31840	2018-08-25 20:41:32 +02:00
Nhat Nguyen	9dad82ece8	TEST: Skip assertSeqNos for closed shards (#33130 ) If a shard was closed, we return null for SeqNoStats. Therefore the assertion assertSeqNos will hit NPE when it verifies a closed shard. This commit skips closed shards in assertSeqNos and enables this assertion in AbstractDisruptionTestCase.	2018-08-24 21:02:13 -04:00
Tal Levy	74312be0ea	Merge branch 'master' into index-lifecycle	2018-08-24 12:41:12 -07:00
Nik Everett	a023e64801	Checkstyle! Catching your unused imports since 2001.	2018-08-24 14:13:13 -04:00
Jim Ferenczi	70030c18f1	[Test] Fix sporadic failure in MembershipActionTests Rewrite test that require Version.V_5 constants.	2018-08-24 18:40:04 +02:00
Mayya Sharipova	6f1ee76443	Revert "Do NOT allow termvectors on nested fields (#32728 )" This reverts commit `fdff8f3db0`.	2018-08-24 10:12:16 -04:00
Jason Tedor	91a052b617	Merge branch 'master' into ccr * master: Add hook to skip asserting x-content equivalence (#33114) Muted testListenersThrowingExceptionsDoNotCauseOtherListenersToBeSkipped [Rollup] Move getMetadata() methods out of rollup config objects (#32579) Muted testEmptyAuthorizedIndicesSearchForAllDisallowNoIndices Update Google Cloud Storage Library for Java (#32940) Remove unsupported Version.V_5_* (#32937)	2018-08-24 06:55:10 -04:00
Jim Ferenczi	f4e9729d64	Remove unsupported Version.V_5_* (#32937 ) This change removes the es 5x version constants and their usages.	2018-08-24 09:51:21 +02:00
Martijn van Groningen	82592dda5a	Merge remote-tracking branch 'es/master' into ccr * es/master: (62 commits) [DOCS] Add docs for Application Privileges (#32635) Add versions 5.6.12 and 6.4.1 Do NOT allow termvectors on nested fields (#32728) [Rollup] Return empty response when aggs are missing (#32796) [TEST] Add some ACL yaml tests for Rollup (#33035) Move non duplicated actions back into xpack core (#32952) Test fix - GraphExploreResponseTests should not randomise array elements Closes #33086 Use `addIfAbsent` instead of checking if an element is contained TESTS: Fix Random Fail in MockTcpTransportTests (#33061) HLRC: Fix Compile Error From Missing Throws (#33083) [DOCS] Remove reload password from docs cf. #32889 HLRC: Add ML Get Buckets API (#33056) Watcher: Improve error messages for CronEvalTool (#32800) Search: Support of wildcard on docvalue_fields (#32980) Change query field expansion (#33020) INGEST: Cleanup Redundant Put Method (#33034) SQL: skip uppercasing/lowercasing function tests for AZ locales as well (#32910) Fix the default pom file name (#33063) Switch ml basic tests to new style Requests (#32483) Switch some watcher tests to new style Requests (#33044) ...	2018-08-24 12:22:11 +07:00
Gordon Brown	935b28087b	Merge branch 'master' into index-lifecycle	2018-08-23 14:56:25 -06:00
Michael Basnight	8f16696fe1	Add versions 5.6.12 and 6.4.1	2018-08-23 15:49:14 -05:00
Mayya Sharipova	fdff8f3db0	Do NOT allow termvectors on nested fields (#32728 ) Requesting _termvectors on a nested field or any sub-fields of a nested field returns empty results. Closes #21625	2018-08-23 16:46:47 -04:00
Gordon Brown	1f13c77b49	Merge branch 'master' into index-lifecycle	2018-08-23 11:52:59 -06:00
Yannick Welsch	a0d32f5947	Zen2: Add leader-side join handling logic (#33013 ) Adds the logic for handling joins by a prospective leader. Introduces the Coordinator class with the basic lifecycle modes (candidate, leader, follower) as well as a JoinHelper class that contains most of the plumbing for handling joins.	2018-08-23 19:18:52 +02:00
Simon Willnauer	f3cfd4504f	Use `addIfAbsent` instead of checking if an element is contained Relates to #32988	2018-08-23 13:43:23 +02:00
Ignacio Vera	d7219c05a2	Search: Support of wildcard on docvalue_fields (#32980 ) * Search: Support of wildcard on docvalue_fields For consistency with stored_fields, docvalue_fields should support the use of wildcards. Documentation of doc values fields is updated accordingly. See also: #26390 Closes #26299	2018-08-23 10:04:00 +02:00
Jim Ferenczi	ffe895e16e	Change query field expansion (#33020 ) This commit changes the query field expansion for query parsers to not rely on an hardcoded list of field types. Instead we rely on the type of exception that is thrown by MappedFieldType#termQuery to include/exclude an expanded field. Supersedes #31655 Closes #31798	2018-08-23 09:52:48 +02:00
Armin Braun	46247ff1f9	INGEST: Cleanup Redundant Put Method (#33034 )	2018-08-23 07:43:36 +02:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to Nan rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.Nan` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with lucene which set `maxScore` to `Float.Nan` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
Jason Tedor	67bfb765ee	Refactor Netty4Utils#maybeDie (#33021 ) In our Netty layer we have had to take extra precautions against Netty catching throwables which prevents them from reaching the uncaught exception handler. This code has taken on additional uses in NIO layer and now in the scheduler engine because there are other components in stack traces that could catch throwables and suppress them from reaching the uncaught exception handler. This commit is a simple cleanup of the iterative evolution of this code to refactor all uses into a single method in ExceptionsHelper.	2018-08-22 10:18:07 -04:00
Simon Willnauer	ead198bf2e	Add settings updater for 2 affix settings (#33050 ) Today we can only have non-affix settings updated and consumed _together_. Yet, there are use-cases where two affix settings depend on each other which makes using the hard without consuming updates together. Unfortunately, there is not straight forward way to have N settings updated together in a type-safe way having 2 still serves a large portion of use-cases.	2018-08-22 14:13:27 +02:00
Nhat Nguyen	262d3c0783	Allow engine to recover from translog upto a seqno (#33032 ) This change allows an engine to recover from its local translog up to the given seqno. The extended API can be used in these use cases: When a replica starts following a new primary, it resets its index to the safe commit, then replays its local translog up to the current global checkpoint (see #32867). When a replica starts a peer-recovery, it can initialize the start_sequence_number to the persisted global checkpoint instead of the local checkpoint of the safe commit. A replica will then replay its local translog up to that global checkpoint before accepting remote translog from the primary. This change will increase the chance of operation-based recovery. I will make this in a follow-up. Relates #32867	2018-08-22 07:57:44 -04:00
Simon Willnauer	ffb1a5d5b7	Expose `max_concurrent_shard_requests` in `_msearch` (#33016 ) Today `_msearch` doesn't allow modifying the `max_concurrent_shard_requests` per sub search request. This change adds support for setting this parameter on all sub-search requests in an `_msearch`. Relates to #31877	2018-08-22 08:45:08 +02:00
Julie Tibshirani	67b5a83a9a	Ensure that _exists queries on keyword fields use norms when they're available. (#33006 )	2018-08-21 16:33:42 -07:00
Jim Ferenczi	767c69593c	Fix quoted _exists_ query (#33019 ) This change in the `query_string` query fixes the detection of the special `_exists_` field when it is used with a quoted term. Closes #28922	2018-08-21 22:15:09 +02:00
Jim Ferenczi	8b43e21521	Fix multi fields empty query (#33017 ) This change fixes empty query removal when all fields remove the search term in `simple_query_string`, `multi_match` and `query_string`. Closes #33009	2018-08-21 22:12:53 +02:00
Igor Motov	3973bb4028	Fix north pole overflow error in GeoHashUtils.bbox() (#32891 ) Fixes an overflow error in GeoHashUtils.bbox() calculation of a bounding box for geohashes with maximum precision located next to the north pole.	2018-08-21 14:59:37 -04:00
Jason Tedor	bdfcc326d7	Enable avoiding mmap bootstrap check (#32421 ) The maximum map count boostrap check can be a hindrance to users that do not own the underlying platform on which they are executing Elasticsearch. This is because addressing it requires tuning the kernel and a platform provider might now allow this, especially on shared infrastructure. However, this bootstrap check is not needed if mmapfs is not in use. Today we do not have a way for the user to communicate that they are not going to use mmapfs. This commit therefore adds a setting that enables the user to disallow mmapfs. When mmapfs is disallowed, the maximum map count bootstrap check is not enforced. Additionally, we fallback to a different default index store and prevent the explicit use of mmapfs for an index.	2018-08-21 11:02:25 -04:00
Colin Goodheart-Smithe	10c60fae93	Merge branch 'master' into index-lifecycle	2018-08-21 11:54:06 +01:00
Simon Willnauer	92076497e5	Use a dedicated ConnectionManger for RemoteClusterConnection (#32988 ) This change introduces a dedicated ConnectionManager for every RemoteClusterConnection such that there is not state shared with the TransportService internal ConnectionManager. All connections to a remote cluster are isolated from the TransportService but still uses the TransportService and it's internal properties like the Transport, tracing and internal listener actions on disconnects etc. This allows a remote cluster connection to have a different lifecycle than a local cluster connection, also local discovery code doesn't get notified if there is a disconnect on from a remote cluster and each connection can use it's own dedicated connection profile which allows to have a reduced set of connections per cluster without conflicting with the local cluster. Closes #31835	2018-08-21 12:43:25 +02:00
Armin Braun	200078734c	INGEST: Simplify IngestService (#33008 ) * INGEST: Simplify IngestService * Follow up to #32617 * Flatten redundant inner classes of `IngestService`	2018-08-21 10:13:32 +02:00
David Turner	e4ef12798e	Add PeerFinder#onFoundPeersUpdated (#32939 ) Today the PeerFinder silently updates the set of found peers as new peers are discovered and old ones are disconnected, and elections are scheduled independently of these changes. In fact, it would be better if the election scheduler were only activated on discovery of a quorum of peers. This commit introduces the `onFoundPeersUpdated` method that allows this flow.	2018-08-21 08:04:30 +01:00
Armin Braun	8fc213f237	INGEST: Move all Pipeline State into IngestService (#32617 ) * INGEST: Move all Pipeline State into IngestService * Moves all pipeline state into the ingest service * Retains the existing pipeline store and pipeline execution service as inner classes to make the review easier, they should be flattened out in the next step * All tests for these classes were copied (and adapted) to the ingest service tests * This is a refactoring step to enable a clean implementation of a pipeline processor (See #32473)	2018-08-21 05:05:32 +02:00
Jason Tedor	ad0a965db9	Protect scheduler engine against throwing listeners (#32998 ) There are two problems with the scheduler engine today. Both relate to listeners that throw. The first problem is that any triggered listener that throws a plain old exception will cause no additional listeners to be triggered for the event, and will also cause the scheduler to never be invoked again. This leads to lost events and is bad. The second problem is that any triggered listener that throws an error of the fatal kind will not lead to that error because caught by the uncaught exception handler. This is because the triggered listener is executed as a future task under a scheduled thread pool executor. A throwable there goes caught by the JDK framework and set as the outcome on the future task. Since we never inspect these tasks for their outcomes, nor is there a good place to do this, we have to handle these errors ourselves. To do this, we catch them and dispatch them to the uncaught exception handler via a forked thread. This is similar to our handling in Netty.	2018-08-20 22:07:16 -04:00
Nhat Nguyen	77d7547be2	Fix compilation after merge from master	2018-08-20 16:33:33 -04:00
Jason Tedor	853eb1c51c	Merge branch 'master' into ccr * master: Generalize remote license checker (#32971) Trim translog when safe commit advanced (#32967) Fix an inaccuracy in the dynamic templates documentation. (#32890) Logging: Use settings when building daemon threads (#32751) All Translog inner closes should happen after tragedy exception is set (#32674) HLREST: AwaitsFix ML Test Pass DiscoveryNode to initiateChannel (#32958) Add mzn and dz to unsupported locales (#32957) Use settings from the context in BootstrapChecks (#32908) Update docs for node specifications (#30468) HLRC: Forbid all Elasticsearch logging infra (#32784) Only configure publishing if it's applied externally (#32351) Fixes libs:dissect when in eclipse Protect ScriptedMetricIT test cases against failures on 0-doc shards (#32959) (#32968) [Kerberos] Add documentation for Kerberos realm (#32662) Watcher: Properly find next valid date in cron expressions (#32734) Fix some small issues in the getting started docs (#30346) Set forbidden APIs target compatibility to compiler java version (#32935) Move connection listener to ConnectionManager (#32956)	2018-08-20 15:49:31 -04:00
Nhat Nguyen	40f1bb5e5e	Trim translog when safe commit advanced (#32967 ) Since #28140 when the global checkpoint is advanced, we try to move the safe commit forward, and clean up old index commits if possible. However, we forget to trim unreferenced translog. This change makes sure that we prune both old translog and index commits when the safe commit advanced. Relates #28140 Closes #32089	2018-08-20 15:13:19 -04:00
Nik Everett	462e91d362	Logging: Use settings when building daemon threads (#32751 ) Subclasses of `EsIntegTestCase` run multiple Elasticsearch nodes in the same JVM and when we log we look at the name of the thread to figure out the node name. This makes sure that all calls to `daemonThreadFactory` include the node name. Closes #32574 I'd like to follow this up with more drastic changes that make it impossible to do this incorrectly but that change is much larger than this and I'd like to get these log lines fixed up sooner rather than later.	2018-08-20 13:53:15 -04:00
Andrey Ershov	0749b18181	All Translog inner closes should happen after tragedy exception is set (#32674 ) All Translog inner closes should happen after tragedy exception is set (#32674) We faced with the nasty race condition. See #32526 InternalEngine.failOnTragic method has thrown AssertionError. If you carefully look at if branches in this method, you will spot that its only possible, if either Lucene IndexWriterhas closed from inside or Translog, has closed from inside, but tragedy exception is not set. For now, let us concentrate on the Translog class. We found out that there are two methods in Translog - namely rollGeneration and trimOperations that are closing Translog in case of Exception without tragedy exception being set. This commit fixes these 2 methods. To fix it, we pull tragedyException from TranslogWriter up-to Translog class, because in these 2 methods IndexWriter could be innocent, but still Translog needs to be closed. Also, tragedyException is wrapped with TragicExceptionHolder to reuse CAS/addSuppresed functionality in Translog and TranslogWriter. Also to protect us in the future and make sure close method is never called from inside Translog special assertion examining stack trace is added. Since we're still targeting Java 8 for runtime - no StackWalker API is used in the implementation. In the stack-trace checking method, we're considering inner caller not only Translog methods but Translog child classes methods as well. It does mean that Translog is meant for extending it, but it's needed to be able to test this method. Closes #32526	2018-08-20 19:22:10 +02:00
David Turner	cd6326b391	Introduce PreVoteCollector (#32847 ) An election requires a node to select a term that is higher than all previously-seen terms. If nodes are too enthusiastic about starting elections then they can effectively excludes itself from the cluster until the leader can bump to a still-higher term, and if this process repeats then a single faulty node can prevent the cluster from making useful progress. The solution is to start the election with a pre-voting round to ensure that there is at least a quorum of nodes who believe there to be no leader. This also fixes up some merge issues.	2018-08-20 17:48:05 +01:00
Tim Brooks	faa42de66d	Pass DiscoveryNode to initiateChannel (#32958 ) This is related to #32517. This commit passes the DiscoveryNode to the initiateChannel method for different Transport implementation. This will allow additional attributes (besides just the socket address) to be used when opening channels.	2018-08-20 08:54:55 -06:00
Colin Goodheart-Smithe	3736097e19	Merge branch 'master' into index-lifecycle	2018-08-20 10:06:02 +01:00
David Turner	f6891cd222	Fixup after merge	2018-08-20 08:58:03 +01:00
Jonathan Little	676091aafb	Protect ScriptedMetricIT test cases against failures on 0-doc shards (#32959 ) (#32968 ) Randomized test conditions that cause some shards to have no docs on them failed due to test asserts that relied on a lazy initialization side effect from the map script. After this fix: - Test cases with the relevant init script are protected - Test cases with the relevant combine or reduce scripts were already protected, because the combine and reduce scripts safely handle this case.	2018-08-20 08:55:43 +01:00
David Turner	f317562c82	Merge branch 'master' into zen2	2018-08-20 08:33:55 +01:00
Alpar Torok	4b34b3f4aa	Set forbidden APIs target compatibility to compiler java version (#32935 ) Set forbidden apis target compatibility to compiler version Fix outstanding deprecation	2018-08-20 09:27:02 +03:00
Tim Brooks	de92d2ef1f	Move connection listener to ConnectionManager (#32956 ) This is a followup to #31886. After that commit the TransportConnectionListener had to be propogated to both the Transport and the ConnectionManager. This commit moves that listener to completely live in the ConnectionManager. The request and response related methods are moved to a TransportMessageListener. That listener continues to live in the Transport class.	2018-08-18 10:09:24 -06:00
Jason Tedor	ac75968c0b	Merge remote-tracking branch 'elastic/master' into ccr * elastic/master: (46 commits) NETWORKING: Make RemoteClusterConn. Lazy Resolve DNS (#32764) [DOCS] Splits the users API documentation into multiple pages (#32825) [DOCS] Splits the token APIs into separate pages (#32865) [DOCS] Creates redirects for role management APIs page Bypassing failing test PainlessDomainSplitIT#testHRDSplit (#32966) TEST: Mute testRetentionPolicyChangeDuringRecovery [DOCS] Fixes more broken links to role management APIs [Docs] Tweaks and fixes to rollup docs [DOCS] Fixes links to role management APIs [ML][TEST] Fix BasicRenormalizationIT after adding multibucket feature [DOCS] Splits the roles API documentation into multiple pages (#32794) [TEST] Run pre 6.4 nodes in non-FIPS JVMs (#32901) Make Geo Context Mapping Parsing More Strict (#32821) [ML] fix updating opened jobs scheduled events (#31651) (#32881) Scripted metric aggregations: add deprecation warning and system property to control legacy params (#31597) Tests: Fix timezone conversion in DateTimeUnitTests Enable FIPS140LicenseBootstrapCheck (#32903) Fix InternalAutoDateHistogram reproducible failure (#32723) Remove assertion in testDocStats on deletedDocs counter (#32914) HLRC: Move ML request converters into their own class (#32906) ...	2018-08-18 09:48:55 -04:00
Armin Braun	f82bb64feb	NETWORKING: Make RemoteClusterConn. Lazy Resolve DNS (#32764 ) * Lazy resolve DNS (i.e. `String` to `DiscoveryNode`) to not run into indefinitely caching lookup issues (provided the JVM dns cache is configured correctly as explained in https://www.elastic.co/guide/en/elasticsearch/reference/6.3/networkaddress-cache-ttl.html) * Changed `InetAddress` type to `String` for that higher up the stack * Passed down `Supplier<DiscoveryNode>` instead of outright `DiscoveryNode` from `RemoteClusterAware#buildRemoteClustersSeeds` on to lazy resolve DNS when the `DiscoveryNode` is actually used (could've also passed down the value of `clusterName = REMOTE_CLUSTERS_SEEDS.getNamespace(concreteSetting)` together with the `List<String>` of hosts, but this route seemed to introduce less duplication and resulted in a significantly smaller changeset). * Closes #28858	2018-08-18 08:46:44 +02:00
Tal Levy	a26e108590	Merge branch 'master' into index-lifecycle	2018-08-17 13:57:28 -07:00
Nhat Nguyen	86ffce4bbc	TEST: Mute testRetentionPolicyChangeDuringRecovery Tracked at #32089	2018-08-17 14:12:45 -04:00
Igor Motov	da6b61e8ef	Make Geo Context Mapping Parsing More Strict (#32821 ) Currently, if geo context is represented by something other than geo_point or an object with lat and lon fields, the parsing of it as a geo context can result in ignoring the context altogether, returning confusing errors such as number_format_exception or trying to parse the number specifying as long-encoded hash code. It would also fail if the geo_point was stored. This commit makes the mapping parsing more strict and will fail during mapping update or index creation if the geo context doesn't point to a geo_point field. Supersedes #32412 Closes #32202	2018-08-17 08:13:16 -07:00
Jonathan Little	a08127c072	Scripted metric aggregations: add deprecation warning and system property to control legacy params (#31597 ) * Scripted metric aggregations: add deprecation warning and system property to control legacy params Scripted metric aggregation params._agg/_aggs are replaced by state/states context variables. By default the old params are still present, and a deprecation warning is emitted when Scripted Metric Aggregations are used. A new system property can be used to disable the legacy params. This functionality will be removed in a future revision. * Fix minor style issue and docs test failure * Disable deprecated params._agg/_aggs in tests and revise tests to use state/states instead * Add integration test covering deprecated scripted metrics aggs params._agg/_aggs access * Disable deprecated params._agg/_aggs in docs integration tests and revise stored scripts to use state/states instead * Revert unnecessary migrations doc change A relevant note should be added in the changes destined for 7.0; this PR is going to be backported to 6.x. * Replace deprecated _agg param bwc integration test with a couple of unit tests * Fix compatibility test after merge * Rename backwards compatibility system property per code review feedback * Tweak deprecation warning text per review feedback	2018-08-17 13:11:18 +01:00
Alexander Reelsen	0d92f377fd	Tests: Fix timezone conversion in DateTimeUnitTests This fix prevernts trying to parse unknown timezone ids by converting the joda time zone via java.util.TimeZone to a java time based ZoneId. Closes #32927	2018-08-17 14:09:01 +02:00
Paul Sanwald	ca54aacbb5	Fix InternalAutoDateHistogram reproducible failure (#32723 ) Update test logic to correctly bucket intervals.	2018-08-17 07:03:25 -04:00
Andrey Ershov	2fa028cfa1	Remove assertion in testDocStats on deletedDocs counter (#32914 ) testDocStats test is flaky and sometimes it's failing on jenkins and failure is not reproducible locally. The reason for this failure is in timing. If the number of deleted documents is greater than 33% of inserted documents, Lucene will schedule segments to merge if TieredMergePolicy is used (it's not the case for LogMergePolicy, but ES is only using TieredMergePolicy). If this merge is performed before stats are retrieved - we will get 0 for "deleted" counter. So basically this counter could be either 0 or numOfDeletedDocs at this point, but this is the too loose assertion and we decided to remove it at all. Closes #32766	2018-08-17 12:36:45 +02:00
JB Nizet	dd5a5aab88	Fix allowed value for HighlighterBuilder encoder in javadocs (#32780 ) Relates to #32745	2018-08-17 10:59:26 +02:00
Julie Tibshirani	cbf160a4e6	For filters aggs, make sure that rewrites preserve other_bucket. (#32921 )	2018-08-16 17:36:58 -07:00
Yannick Welsch	a3bb85eeaf	Zen2: Extract JoinTaskExecutor (#32911 ) Moves JoinTaskExecutor out of ZenDiscovery so that it can be reused for Zen2. Also ensures that tasks to JoinTaskExecutor have a proper identity, so that multiple tasks for the same node can coexist.	2018-08-16 22:19:17 +02:00
Tal Levy	c9de707f58	Merge branch 'master' into index-lifecycle	2018-08-16 08:41:57 -07:00
Jim Ferenczi	3dd1677cdc	[Test] Fix DuelScrollIT#testDuelIndexOrderQueryThenFetch This commit disables the automatic `refresh_interval` in order to ensure that index readers cannot differ between the normal and scroll search. This issue is related to the 7.5 Lucene upgrade which contains a change that makes single segment merge more likely to occur (max deletes percentage). Closes #32682	2018-08-16 15:33:17 +02:00
Jason Tedor	f8c7414ee8	Remove passphrase support from reload settings API (#32889 ) We do not support passphrases on the secure settings storage (the keystore). Yet, we added support for this in the API layer. This commit removes this support so that we are not limited in our future options, or have to make a breaking change.	2018-08-16 07:24:05 -04:00
Adrien Grand	e35be01901	AwaitFix AckIT. Relates #32767	2018-08-16 12:31:58 +02:00
Colin Goodheart-Smithe	d80457ee2a	Mutes test in DuelScrollIT Due to https://github.com/elastic/elasticsearch/issues/32682	2018-08-16 11:08:00 +01:00
Jay Modi	1a45b27d8b	Move CharArrays to core lib (#32851 ) This change cleans up some methods in the CharArrays class from x-pack, which includes the unification of char[] to utf8 and utf8 to char[] conversions that intentionally do not use strings. There was previously an implementation in x-pack and in the reloading of secure settings. The method from the reloading of secure settings was adopted as it handled more scenarios related to the backing byte and char buffers that were used to perform the conversions. The cleaned up class is moved into libs/core to allow it to be used by requests that will be migrated to the high level rest client. Relates #32332	2018-08-15 15:26:00 -06:00
Jason Tedor	4475f88c95	Merge branch 'master' into ccr * master: Fix global checkpoint listeners test HLRC: adding machine learning open job (#32860) [ML] Add log structure finder functionality (#32788) INGEST: Add Configuration Except. Data to Metdata (#32322)	2018-08-15 16:07:28 -04:00
Tal Levy	ec93756600	Merge branch 'master' into index-lifecycle	2018-08-15 12:56:01 -07:00
Jason Tedor	364ccc36d6	Fix global checkpoint listeners test This commit fixes a global checkpoint listeners test wherein we were expecting an executor to have been used even if there were no listeners. This is silliness, so this commit adjusts the assertion to verify that the executor never fires if there are no listeners, and fires exactly once if there is one or more listeners.	2018-08-15 15:53:15 -04:00
David Turner	6d9e7c5cec	Introduce ElectionScheduler (#32846 ) The ElectionScheduler runs while there is no known elected master and is responsible for scheduling elections randomly, backing off on failure, to balance the desire to elect a master quickly with the desire to avoid more than one node starting an election at once.	2018-08-15 20:48:16 +01:00
Armin Braun	986c55b830	INGEST: Add Configuration Except. Data to Metdata (#32322 ) * closes #27728	2018-08-15 19:02:19 +02:00
Jason Tedor	aa147cca44	Merge remote-tracking branch 'elastic/master' into ccr * elastic/master: Revert "cluster formation DSL - Gradle integration - part 2 (#32028)" (#32876) cluster formation DSL - Gradle integration - part 2 (#32028) Introduce global checkpoint listeners (#32696) Move connection profile into connection manager (#32858) [ML] Temporarily disabling rolling-upgrade tests Use generic AcknowledgedResponse instead of extended classes (#32859) [ML] Removing old per-partition normalization code (#32816) Use JDK 10 for 6.4 BWC builds (#32866) Removed flaky test. Looks like randomisation makes these assertions unreliable. [test] mute IndexShardTests.testDocStats Introduce the dissect library (#32297) Security: remove password hash bootstrap check (#32440) Move validation to server for put user requests (#32471) [ML] Add high level REST client docs for ML put job endpoint (#32843) Test: Fix forbidden uses in test framework (#32824) Painless: Change fqn_only to no_import (#32817) [test] mute testSearchWithSignificantTermsAgg Watcher: Remove unused hipchat render method (#32211) Watcher: Remove extraneous auth classes (#32300) Watcher: migrate PagerDuty v1 events API to v2 API (#32285)	2018-08-15 12:30:35 -04:00
Jason Tedor	068d03f56b	Introduce global checkpoint listeners (#32696 ) This commit introduces the ability for global checkpoint listeners to be registered at the shard level. These listeners are notified when the global checkpoint is updated, and also when the shard closes. To encapsulate these listeners, we introduce a shard-level component that handles synchronization of notification and modifications to the collection of listeners.	2018-08-15 12:04:24 -04:00
Tim Brooks	2464b68613	Move connection profile into connection manager (#32858 ) This is related to #31835. It moves the default connection profile into the ConnectionManager class. The will allow us to have different connection managers with different profiles.	2018-08-15 09:08:33 -06:00
Lee Hinman	48281ac5bc	Use generic AcknowledgedResponse instead of extended classes (#32859 ) This removes custom Response classes that extend `AcknowledgedResponse` and do nothing, these classes are not needed and we can directly use the non-abstract super-class instead. While this appears to be a large PR, no code has actually changed, only class names have been changed and entire classes removed.	2018-08-15 08:06:14 -06:00
Tal Levy	92ecd1d271	Merge branch 'master' into index-lifecycle	2018-08-15 06:11:25 -07:00
Andy Bristol	a1cff86012	[test] mute IndexShardTests.testDocStats For #32766	2018-08-14 18:21:59 -07:00
Nhat Nguyen	6556186d9a	Merge branch 'master' into ccr	2018-08-14 12:11:35 -04:00
Armin Braun	27e64e7251	MINOR: Remove `IndexTemplateFilter` (#32841 ) * This isn't used anywhere anymore ever since `00c123b59f8ba11eb260e6b70acf7be80bccc949` and `dc166c5dc6bcf4abb7f25c6f4143f07d8176333d`	2018-08-14 16:01:33 +02:00
Colin Goodheart-Smithe	a84b3239c3	Merge branch 'master' into index-lifecycle client/rest-high-level/src/main/java/org/elasticsearch/client/RequestCon verters.java /Users/colings86/dev/work/git/elasticsearch/.git/worktrees/elasticsearch -ilm/MERGE_HEAD client/rest-high-level/src/main/java/org/elasticsearch/client/LicenseCli ent.java client/rest-high-level/src/main/java/org/elasticsearch/client/RequestCon verters.java client/rest-high-level/src/test/java/org/elasticsearch/client/SearchIT.j ava client/rest-high-level/src/test/java/org/elasticsearch/client/documentat ion/LicensingDocumentationIT.java docs/java-rest/high-level/licensing/delete-license.asciidoc server/src/main/java/org/elasticsearch/action/bulk/BulkPrimaryExecutionC ontext.java server/src/main/java/org/elasticsearch/action/bulk/TransportBulkAction.j ava server/src/main/java/org/elasticsearch/common/Rounding.java server/src/main/java/org/elasticsearch/common/rounding/Rounding.java server/src/main/java/org/elasticsearch/search/aggregations/bucket/signif icant/ParsedSignificantTerms.java server/src/test/java/org/elasticsearch/action/IndicesRequestIT.java server/src/test/java/org/elasticsearch/action/bulk/TransportBulkActionIn gestTests.java server/src/test/java/org/elasticsearch/action/bulk/TransportShardBulkAct ionTests.java server/src/test/java/org/elasticsearch/common/RoundingTests.java server/src/test/java/org/elasticsearch/common/rounding/DateTimeUnitTests .java server/src/test/java/org/elasticsearch/common/rounding/RoundingDuelTests .java server/src/test/java/org/elasticsearch/search/aggregations/bucket/signif icant/SignificantLongTermsTests.java server/src/test/java/org/elasticsearch/search/aggregations/bucket/signif icant/SignificantStringTermsTests.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/DeleteLicense Action.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/DeleteLicense Request.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/DeleteLicense RequestBuilder.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/DeleteLicense Response.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/LicenseServic e.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/LicensingClie nt.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/RestDeleteLic enseAction.java x-pack/plugin/core/src/main/java/org/elasticsearch/license/TransportDele teLicenseAction.java x-pack/plugin/core/src/test/java/org/elasticsearch/license/LicensesManag erServiceTests.java x-pack/plugin/core/src/test/java/org/elasticsearch/license/LicensesTrans portTests.java x-pack/protocol/src/main/java/org/elasticsearch/protocol/xpack/license/D eleteLicenseRequest.java x-pack/protocol/src/main/java/org/elasticsearch/protocol/xpack/license/D eleteLicenseResponse.java x-pack/protocol/src/test/java/org/elasticsearch/protocol/xpack/license/D eleteLicenseResponseTests.java	2018-08-14 13:07:26 +01:00
Alexander Reelsen	87481a0e34	Core: Add java time version of rounding classes (#32641 ) This commit adds a java time version of the existing rounding classes, which features the same test suite and a small test class to check if serialization works as expected.	2018-08-14 13:52:55 +02:00
markharwood	e5ab09f708	Aggregations/HL Rest client fix: missing scores (#32774 ) Significance score doubles were being parsed as long. Existing tests did not catch this because SignificantLongTermsTests and SignificantStringTermsTests did not set the score. Fixed these and also added integration test. Thanks for the report/fix, Blakko Closes #32770	2018-08-14 11:14:47 +01:00
Armin Braun	124c1f1358	INGEST: Create Index Before Pipeline Execute (#32786 ) * INGEST: Create Index Before Pipeline Execute * Ensures that indices are created before the default pipeline setting is read to correcly handle the case of an index template containing a default pipeline (without the fix the first document does not get the pipeline applied as explained in #32758) * closes #32758	2018-08-14 11:27:08 +02:00
Yannick Welsch	a8bfa466b2	Fix NOOP bulk updates (#32819 ) #31821 introduced an unreleased bug where NOOP updates were incorrectly mutating the bulk shard request, inserting null item to be replicated, which would result in NullPointerExceptions when serializing the request to be shipped to the replicas. Closes #32808	2018-08-14 08:20:35 +02:00
Tal Levy	e78f537e58	Merge branch 'master' into index-lifecycle	2018-08-13 16:31:40 -07:00
Tim Brooks	10fddb62ee	Remove client connections from TcpTransport (#31886 ) This is related to #31835. This commit adds a connection manager that manages client connections to other nodes. This means that the TcpTransport no longer maintains a map of nodes that it is connected to.	2018-08-13 16:44:09 -06:00
Nhat Nguyen	8a003e1281	Increase logging testRetentionPolicyChangeDuringRecovery Relates #32089	2018-08-13 16:29:34 -04:00
Armin Braun	d412230cda	SCRIPTING: Support BucketAggScript return null (#32811 ) * As explained in #32790, `BucketAggregationScript` must support `null` as a return value * Closes #32790	2018-08-13 20:08:26 +02:00
Tal Levy	a771478940	Merge branch 'master' into index-lifecycle	2018-08-13 09:14:00 -07:00
Yannick Welsch	e122505a91	Zen2: Deterministic MasterService (#32493 ) Increases testability of MasterService and the discovery layer. Changes: - Async publish method - Moved a few interfaces/classes top-level to simplify imports - Deterministic MasterService implementation for tests	2018-08-13 18:03:08 +02:00
Nhat Nguyen	cb2273b02a	Mute IndicesRequestIT#testBulk Tracked at #32808	2018-08-13 10:10:33 -04:00
Ryan Ernst	cb1d467124	Cat apis: Fix index creation time to use strict date format (#32510 ) With the move to java time, the default formatter used by toString on ZonedDateTime uses optional components for least significant portions of the date. This commit changes the cat indices api to use a strict date time format, which will always output milliseconds, even if they are zero. closes #32466	2018-08-10 13:15:00 -07:00
Tal Levy	93637e2135	Merge branch 'master' into index-lifecycle	2018-08-10 10:23:14 -07:00
Christoph Büscher	22f7b03430	Fix test reproducability in AbstractBuilderTestCase setup (#32403 ) Currently AbstractBuilderTestCase generates certain random values in its `beforeTest()` method annotated with @Before only the first time that a test method in the suite is run while initializing the serviceHolder that we use for the rest of the test. This changes the values of subsequent random values and has the effect that when running single methods from a test suite with "-Dtests.method=*", the random values it sees are different from when the same test method is run as part of the whole test suite. This makes it hard to use the reproduction lines logged on failure. This change runs the inialization of the serviceHolder and the randomization connected to it using the test runners master seed, so reproduction by running just one method is possible again. Closes #32400	2018-08-10 15:13:44 +02:00
Alexander Reelsen	f236bb3ff6	Tests: Muted ScriptDocValuesDatesTests.testJodaTimeBwc Relates #32779	2018-08-10 14:38:23 +02:00
Boaz Leskes	f58ed21720	Refactor TransportShardBulkAction to better support retries (#31821 ) Processing bulk request goes item by item. Sometimes during processing, we need to stop execution and wait for a new mapping update to be processed by the node. This is currently achieved by throwing a `RetryOnPrimaryException`, which is caught higher up. When the exception is caught, we wait for the next cluster state to arrive and process the request again. Sadly this is a problem because all operations that were already done until the mapping change was required are applied again and get new sequence numbers. This in turn means that the previously issued sequence numbers are never replicated to the replicas. That causes the local checkpoint of those shards to be stuck and with it all the seq# based infrastructure. This commit refactors how we deal with retries with the goal of removing `RetryOnPrimaryException` and `RetryOnReplicaException` (not done yet). It achieves so by introducing a class `BulkPrimaryExecutionContext` that is used the capture the execution state and allows continuing from where the execution stopped. The class also formalizes the steps each item has to go through: 1) A translation phase for updates 2) Execution phase (always index/delete) 3) Waiting for a mapping update to come in, if needed 4) Requires a retry (for updates and cases where the mapping are still not available after the put mapping call returns) 5) A finalization phase which allows updates to the index/delete result to an update result.	2018-08-10 10:15:01 +02:00
Alexander Reelsen	798fb546cb	Core: Create java time based DateMathParser (#32131 ) This adds a java time based date math parser class in order, which will replace the joda date based one in the future. For now the class also returns the date in milliseconds since the epoch.	2018-08-10 09:38:18 +02:00
Tal Levy	c7a2c357a3	Merge branch 'master' into index-lifecycle	2018-08-09 18:00:40 -07:00
lipsill	be54ba39c4	Add expected mapping type to `MapperException` (#31564 ) Currently if a document cannot be indexed because it violates the defined mapping for the index, a MapperException is thrown. In some cases it is useful to expose the expected field type in the exception itself, so that the user can react based on the error message. This change adds the expected data type to the MapperException. Closes #31502	2018-08-09 23:10:51 +02:00
Nik Everett	294ab7ee96	Core: Remove some logging constructors (#32513 ) Remove a few of the logger constructors that aren't widely used or aren't used at all and deprecate a few more logger constructors in favor of log4j2's `LogManager`.	2018-08-09 16:11:48 -04:00
Nicholas Knize	e162127ff3	Upgrade to Lucene-7.5.0-snapshot-13b9e28f9d The main feature is the inclusion of bkd backed geo_shape with INTERSECT, DISJOINT, WITHIN bounding box and polygon query support.	2018-08-09 11:15:02 -05:00
Armin Braun	79375d35bb	Scripting: Replace Update Context (#32096 ) * SCRIPTING: Move Update Scripts to their own context * Added system property for backwards compatibility of change to `ctx.params`	2018-08-09 14:32:36 +02:00
Colin Goodheart-Smithe	0fe21136db	Merge branch 'master' into index-lifecycle	2018-08-09 12:47:26 +01:00
Alexander Reelsen	823d40e19b	Core: Fix Java Time DateFormatter printers (#32592 ) A bug in the test suite prevented to properly check that all date formatters printed the date the same way like joda time does. This fixes the test and thus also a fair share of formats, that now use the strict parser for printing.	2018-08-09 10:01:40 +02:00
Lee Hinman	7af28c48c3	Switch WritePipelineResponse to AcknowledgedResponse (#32722 ) We previously discussed moving the classes extending `AcknowledgedResponse` to simply use `AcknowledgedResponse`, making the class non-abstract. This moves the first class to do this, removing `WritePipelineResponse` in the process. If we like the way this looks, I will switch the remaining classes over to using `AcknowledgedResponse`.	2018-08-08 16:21:58 -06:00
David Turner	433b7b8427	Remove FutureExecutor interface (#32713 ) Instead of using a separate interface for scheduling tasks in the future, we can simply use ThreadPool#schedule with a suitable implementation. This removes the unnecessary interface and its usages, and migrates the tests as appropriate.	2018-08-08 21:44:44 +01:00
Suresh N S	7fdf898518	Whitelisting / from Circuit Breaker Exception (#32325 ) (#32666 ) When Circuit Breaker has tripped, certain diagnostic requests like "_cluster/health" succeed where as request to / fails with 503 Service Unavailable. This behavior is observed because of this commit `f32b700` where certain API paths are whitelisted from Circuit Breaking exception, but / is not whitelisted. Added / to circuit breaker whitelist so that it can be used for diagnostic purposes	2018-08-08 08:24:53 -06:00
Tal Levy	2d925c9a9a	Merge branch 'master' into index-lifecycle	2018-08-08 07:21:01 -07:00
Colin Goodheart-Smithe	781e6ad551	Fixes suggestion generics (#32706 ) * Fixes suggestion generics This solves a compile problem in Eclipse where Eclipse could not resolve the generics for the options field in `PhraseSuggestion.Entry`. But I think this is also a good change in general because `PhraseSuggestion.Entry` is now declaring the specific `Option` implementation it requires rather than `Suggest.Entry.Option` which is more general and could lead to weird bugs. `CompletionSuggestion.Entry` and `TermSuggestion.Entry` already declare the more specific class they use so I think this was an oversight in `PhaseSuggestion.Entry` * iter	2018-08-08 12:46:38 +01:00
Luca Cavanna	3e437438d5	Prevent cause from being null in ShardOperationFailedException (#32640 ) `ShardOperationFailedException` and corresponding implementors seem to suggest that the cause may be null, case that is also handled in a few places. Yet, it does not seem to be possible in practice for the cause to be null, hence we can clean that up and enforce the cause to be a non null value. This is best done by making `ShardOperationFailedException` an abstract class rather than an interface, which holds the basic member instance that all the subclasses have in common and can also enforce that cause, status and reason are non null.	2018-08-08 09:59:22 +02:00
Luca Cavanna	5c2ef5e869	Preserve index_uuid when creating QueryShardException (#32677 ) As part of #32608 we made sure that the fully qualified index name is taken from the query shard context whenever creating a new `QueryShardException`. That change introduced a regression as instead of setting the entire `Index` object to the exception, which holds index name and index uuid, we ended up setting only the index name (including cluster alias). With this commit we make sure that the index uuid does not get lost and we try to lower the chances that a similar bug makes it in another time. That's done by making `QueryShardContext` return the fully qualified `Index` (which also holds the uuid) rather than only the fully qualified index name.	2018-08-08 09:57:11 +02:00
Julie Tibshirani	d7183f8f3d	Make sure that field collapsing supports field aliases. (#32648 )	2018-08-07 16:20:09 -07:00
Andy Bristol	8bfb0f3f8d	serialize suggestion responses as named writeables (#30284 ) Suggestion responses were previously serialized as streamables which made writing suggesters in plugins with custom suggestion response types impossible. This commit makes them serialized as named writeables and provides a facility for registering a reader for suggestion responses when registering a suggester. This also makes Suggestion responses abstract, requiring a suggester implementation to provide its own types. Suggesters which do not need anything additional to what is defined in Suggest.Suggestion should provide a minimal subclass. The existing plugin suggester integration tests are removed and replaced with an equivalent implementation as an example plugin.	2018-08-07 13:31:00 -07:00
Jason Tedor	dcc816427e	Expose whether or not the global checkpoint updated (#32659 ) It will be useful for future efforts to know if the global checkpoint was updated. To this end, we need to expose whether or not the global checkpoint was updated when the state of the replication tracker updates. For this, we add to the tracker a callback that is invoked whenever the global checkpoint is updated. For primaries this will be invoked when the computed global checkpoint is updated based on state changes to the tracker. For replicas this will be invoked when the local knowledge of the global checkpoint is advanced from the primary.	2018-08-07 15:10:09 -04:00
Tim Brooks	3d5e9114e3	Reduce connections used by MockNioTransport (#32620 ) The MockNioTransport (similar to the MockTcpTransport) is used for integ tests. The MockTcpTransport has always only opened a single for all of its work. The MockNioTransport has awlays opened the default number of connections (13). This means that every test where two transports connect requires 26 connections. This is more than is necessary. This commit modifies the MockNioTransport to only require 3 connections.	2018-08-07 12:52:28 -06:00
Yannick Welsch	22c367315a	[Zen2] Randomized testing of CoordinationState (#32242 ) Simulates a random run of a cluster with multiple CoordinationState instances (each representing one node), passing messages back and forth, and asserting that the overall system satisfies a given set of safety properties. Follow-up to #32171	2018-08-07 16:37:55 +02:00
Yannick Welsch	45066b5e89	Verify primary mode usage with assertions (#32667 ) Primary terms were introduced as part of the sequence-number effort (#10708) and added in ES 5.0. Subsequent work introduced the replication tracker which lets the primary own its replication group (#25692) to coordinate recovery and replication. The replication tracker explicitly exposes whether it is operating in primary mode or replica mode, independent of the ShardRouting object that's associated with a shard. During a primary relocation, for example, the primary mode is transferred between the primary relocation source and the primary relocation target. After transferring this so-called primary context, the old primary becomes a replication target and the new primary the replication source, reflected in the replication tracker on both nodes. With the most recent PR in this area (#32442), we finally have a clean transition between a shard that's operating as a primary and issuing sequence numbers and a shard that's serving as a replication target. The transition from one state to the other is enforced through the operation-permit system, where we block permit acquisition during such changes and perform the transition under this operation block, ensuring that there are no operations in progress while the transition is being performed. This finally allows us to turn the best-effort checks that were put in place to prevent shards from being used in the wrong way (i.e. primary as replica, or replica as primary) into hard assertions, making it easier to catch any bugs in this area.	2018-08-07 15:02:37 +02:00
Paul Sanwald	3ce984d746	mute test while I work on #32215	2018-08-07 08:56:00 -04:00
Yannick Welsch	785b6e824c	Zen2: Cluster state publication pipeline (#32584 ) Implements the state machine on the master to publish a cluster state. Relates to #32006	2018-08-07 14:51:46 +02:00
David Turner	f44ba04aee	[Zen2] Add UnicastConfiguredHostsResolver (#32642 ) The `PeerFinder`, introduced in #32246, obtains the collection of seed addresses configured by the user from a `ConfiguredHostsResolver`. In reality this collection comes from the `UnicastHostsProvider` via a slightly complicated threading model that performs the resolution of hostnames to addresses using a dedicated `ExecutorService`. This commit introduces an adapter to allow the `PeerFinder` to obtain its seed addresses in this manner.	2018-08-07 13:34:53 +01:00
David Turner	289e34aeed	[Zen2] Add HandshakingTransportAddressConnector (#32643 ) The `PeerFinder`, introduced in #32246, needs to be able to identify, and connect to, a remote master node using only its `TransportAddress`. This can be done by opening a single-channel connection to the address, performing a handshake, and only then forming a full-blown connection to the node. This change implements this logic.	2018-08-07 13:34:07 +01:00
Colin Goodheart-Smithe	b9c04adb29	Merge branch 'master' into index-lifecycle	2018-08-07 12:35:32 +01:00
Andrey Ershov	6449d9bc14	Include translog path in error message when translog is corrupted (#32251 ) Currently, when TranslogCorruptedException is thrown most of the times it does not contain information about the translog location on the file system. There is the translog recovery tool that accepts the translog path as an argument and users are constantly puzzled where to get the path. This pull request adds "source" information to every TranslogCorruptedException thrown. The source could be local file, remote translog source (used for recovery), assertion (translog entry is constructed to perform some assertion) or translog constructed inside the test. Closes #24929	2018-08-07 13:03:43 +02:00
Parth Verma	6fe6247dc8	Ignore script fields when size is 0 (#31917 ) This change adds a check so that when parsing the search source, script fields are ignored when the requested search result size is 0. This helps with e.g. clients like Kibana that sends a list of script fields that they may need for convenience, but they don't require any hits. Before this change, user sometimes ran into confusing behaviour, e.g. the script compilation limit to breaking although no hits were requested. Closes #31824	2018-08-07 10:56:44 +02:00
Armin Braun	f57cb10d2c	Tests: Fix Typo Causing Flaky Settings Test (#32665 ) * We were comparing the wrong timeout value in the `randomValueOtherThan` call here, leading to no mutation happening for a certain seed * closes #32639	2018-08-07 10:30:45 +02:00
Jason Tedor	3fb0923182	Fix content type detection with leading whitespace (#32632 ) Today content type detection on an input stream works by peeking up to twenty bytes into the stream. If the stream is headed by more whitespace than twenty bytes, we might fail to detect the content type. We should be ignoring this whitespace before attempting to detect the content type. This commit does that by ignoring all leading whitespace in an input stream before attempting to guess the content type.	2018-08-06 18:07:46 -04:00
Yannick Welsch	014b2772db	[TEST] Fix testReplicaTermIncrementWithConcurrentPrimaryPromotion The assertion in the test was not broad enough. If the timing is very unlucky, the shard is already promoted to primary before the indexOnReplica even gets to execute. Closes #32645	2018-08-06 18:38:01 +02:00
Nhat Nguyen	c394eb9ae9	CCR: Expose the operation primary term Relates #32442	2018-08-06 10:55:37 -04:00
Lee Hinman	0a9c3ae8bc	Remove UpdateSettingsTestHelper class (#32557 ) * Remove UpdateSettingsTestHelper class By making the `settings()` method public on `UpdateSettingsRequest` (I think it should have been in the first place) we can get rid of this class entirely. Mock response objects are now constructed by parsing JSON without making the constructor public. Relates to #29823	2018-08-06 08:53:44 -06:00
Lee Hinman	7ea7dd8018	Remove RolloverIndexTestHelper (#32559 ) * Remove RolloverIndexTestHelper This removes the `RolloverIndexTestHelper` class in favor of making a couple of getters publically accessible as well as custom building a response object using JSON parsing. Relates to #29823	2018-08-06 08:53:28 -06:00
Lee Hinman	aed466d5b6	Remove ILM constructor hacks (#32597 ) This commit removes the hacks associated with mocking Response objects. Rather than parse a wrapped byte array, the constructors for `IndicesAliasesResponse` and `ResizeResponse` are made public Relates to #29823	2018-08-06 08:53:12 -06:00
Nhat Nguyen	5881322b3f	Merge branch 'master' into ccr * master: Cross-cluster search: preserve cluster alias in shard failures (#32608) Handle AlreadyClosedException when bumping primary term [TEST] Allow to run in FIPS JVM (#32607) [Test] Add ckb to the list of unsupported languages (#32611) SCRIPTING: Move Aggregation Scripts to their own context (#32068) Painless: Use LocalMethod Map For Lookup at Runtime (#32599) [TEST] Enhance failure message when bulk updates have failures [ML] Add ML result classes to protocol library (#32587) Suppress LicensingDocumentationIT.testPutLicense in release builds (#32613) [Rollup] Update wire version check after backport Suppress Wildfly test in FIPS JVMs (#32543) [Rollup] Improve ID scheme for rollup documents (#32558) ingest: doc: move Dot Expander Processor doc to correct position (#31743) [ML] Add some ML config classes to protocol library (#32502) [TEST]Split transport verification mode none tests (#32488) Core: Move helper date formatters over to java time (#32504) [Rollup] Remove builders from DateHistogramGroupConfig (#32555) [TEST} unmutes SearchAsyncActionTests and adds debugging info [ML] Add Detector config classes to protocol library (#32495) [Rollup] Remove builders from MetricConfig (#32536) Tests: Add rolling upgrade tests for watcher (#32428) Fix race between replica reset and primary promotion (#32442)	2018-08-06 10:27:18 -04:00
David Turner	2176184db1	[Zen2] Introduce gossip-like discovery of master nodes (#32246 ) This commit introduces the `PeerFinder` which can be used to collect the identities of the master-eligible nodes in a masterless cluster, based on the `UnicastHostsProvider`, the nodes in the `ClusterState`, and nodes that other nodes have discovered.	2018-08-06 15:26:31 +01:00
Armin Braun	0a67cb4133	LOGGING: Upgrade to Log4J 2.11.1 (#32616 ) * LOGGING: Upgrade to Log4J 2.11.1 * Upgrade to `2.11.1` to fix memory leaks in slow logger when logging large requests * This was caused by a bug in Log4J https://issues.apache.org/jira/browse/LOG4J2-2269 and is fixed in `2.11.1` via https://git-wip-us.apache.org/repos/asf?p=logging-log4j2.git;h=9496c0c * Fixes #32537 * Fixes #27300	2018-08-06 14:56:21 +02:00
Luca Cavanna	826399f9fc	Cross-cluster search: preserve cluster alias in shard failures (#32608 ) When some remote clusters return shard failures as part of a cross-cluster search request, the cluster alias currently gets lost. As a result, if the shard failures are all caused by the same error, and against indices belonging to different clusters, but with the same index name, only one failure gets returned as part of the search response, meaning that failures are grouped by index name, ignoring the cluster alias. With this commit we make sure that `ShardSearchFailure` returns the cluster alias as part of the index name. Also, we set the fully qualfied index name when creating a `QueryShardException`. That way shard failures are grouped by cluster:index. Such fixes should cover at least most of the cases where either 1) the shard target is set but we don't have the index in the cause (we were previously reading it only from the cause that did not have the cluster alias) 2) the shard target is missing but if the cause is a `QueryShardException` the cluster alias does not get lost. We also prevent NPE in case the failure cause is not set and test such scenario.	2018-08-06 11:48:50 +02:00
Yannick Welsch	3cf08326ab	Handle AlreadyClosedException when bumping primary term If the shard is already closed while bumping the primary term, this can result in an AlreadyClosedException to be thrown. As we use asyncBlockOperations, the exception will be thrown on a thread from the generic thread pool and end up in the uncaught exception handler, failing our tests. Relates to #32442	2018-08-06 08:34:38 +02:00
Armin Braun	6fa7016bbf	SCRIPTING: Move Aggregation Scripts to their own context (#32068 ) * SCRIPTING: Move Aggregation Scripts to their own context	2018-08-04 10:37:07 +02:00
Lee Hinman	1e4751ec47	[TEST] Enhance failure message when bulk updates have failures	2018-08-03 15:27:10 -06:00
Alexander Reelsen	018e77cac6	Core: Move helper date formatters over to java time (#32504 ) Some classes use internal date formatters, which now can be moved over to java time using the DateFormatters class. The same applies for a few test cases.	2018-08-03 13:21:14 +02:00
Colin Goodheart-Smithe	d05f39de8b	[TEST} unmutes SearchAsyncActionTests and adds debugging info This unmutes the testFanOutAndCollect()` method and add a check to make sure we aren't accidentally running something twice causing a search phase to still be running after we have counted down the latch Relates to #29242	2018-08-03 11:52:46 +01:00
Yannick Welsch	0d60e8a029	Fix race between replica reset and primary promotion (#32442 ) We've recently seen a number of test failures that tripped an assertion in IndexShard (see issues linked below), leading to the discovery of a race between resetting a replica when it learns about a higher term and when the same replica is promoted to primary. This commit fixes the race by distinguishing between a cluster state primary term (called pendingPrimaryTerm) and a shard-level operation term. The former is set during the cluster state update or when a replica learns about a new primary. The latter is only incremented under the operation block, which can happen in a delayed fashion. It also solves the issue where a replica that's still adjusting to the new term receives a cluster state update that promotes it to primary, which can happen in the situation of multiple nodes being shut down in short succession. In that case, the cluster state update thread would call `asyncBlockOperations` in `updateShardState`, which in turn would throw an exception as blocking permits is not allowed while an ongoing block is in place, subsequently failing the shard. This commit therefore extends the IndexShardOperationPermits to allow it to queue multiple blocks (which will all take precedence over operations acquiring permits). Finally, it also moves the primary activation of the replication tracker under the operation block, so that the actual transition to primary only happens under the operation block. Relates to #32431, #32304 and #32118	2018-08-03 09:33:08 +02:00
Nhat Nguyen	6eeb628d6d	Merge branch 'master' into ccr * master: HLRC: Move commercial clients from XPackClient (#32596) Add cluster UUID to Cluster Stats API response (#32206) Security: move User to protocol project (#32367) [TEST] Test for shard failures, add debug to testProfileMatchesRegular Minor fix for javadoc (applicable for java 11). (#32573) Painless: Move Some Lookup Logic to PainlessLookup (#32565) TEST: Avoid merges in testSeqNoAndCheckpoints [Rollup] Remove builders from HistoGroupConfig (#32533) Mutes failing SQL string function tests due to #32589 fixed elements in array of produced terms (#32519) INGEST: Enable default pipelines (#32286) Remove cluster state initial customs (#32501) Mutes LicensingDocumentationIT due to #32580 [ML] Remove multiple_bucket_spans (#32496) [ML] Rename JobProvider to JobResultsProvider (#32551) Correct minor typo in explain.asciidoc for HLRC Build: Add elastic maven to repos used by BuildPlugin (#32549) Clarify the error message when a pipeline agg is used in the 'order' parameter. (#32522) Revert "[test] turn on host io cache for opensuse (#32053)" Enable packaging tests on suse boxes [ML] Improve error when no available field exists for rule scope (#32550) [ML] Improve error for functions with limited rule condition support (#32548) Painless: Clean Up PainlessField (#32525) Add @AwaitsFix for #32554 Remove broken @link in Javadoc Scripting: Conditionally use java time api in scripting (#31441) [ML] Fix thread leak when waiting for job flush (#32196) (#32541) Add AwaitsFix to failing test - see #32546 Core: Minor size reduction for AbstractComponent (#32509) SQL: Added support for string manipulating functions with more than one parameter (#32356) [DOCS] Reloadable Secure Settings (#31713) Watcher: Reenable HttpSecretsIntegrationTests#testWebhookAction test (#32456) [Rollup] Remove builders from TermsGroupConfig (#32507) Use hostname instead of IP with SPNEGO test (#32514) Switch x-pack rolling restart to new style Requests (#32339) NETWORKING: Fix Netty Leaks by upgrading to 4.1.28 (#32511) [DOCS] Small fixes in rule configuration page (#32516) Painless: Clean up PainlessMethod (#32476) Build: Remove shadowing from benchmarks (#32475) Docs: Add all JDKs to CONTRIBUTING.md Add licensing enforcement for FIPS mode (#32437) SQL: Add test for handling of partial results (#32474) Mute testFilterCacheStats [ML][DOCS] Fix typo applied_to => applies_to Scripting: Fix painless compiler loader to know about context classes (#32385)	2018-08-02 23:14:37 -04:00
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
Zachary Tong	080b9f58ea	[TEST] Test for shard failures, add debug to testProfileMatchesRegular Unmuting the test and adding some more debug output. Was not able to reproduce the prior failure, but it seems possible that the failure (mismatched counts) could be caused by partial search results during the test. The assertions check for shard failures first, because if one of the two searches is partial the rest of the test will fail. Next, instead of just checking respective hit counts, we emit the difference in hits to help identify what went wrong. Closes #32492	2018-08-02 17:18:29 -04:00
Nhat Nguyen	9f96073e64	TEST: Avoid merges in testRecoveryWithOutOfOrderDelete Since LUCENE-8263, testRecoveryWithOutOfOrderDelete may trigger merges because of the deletes. In the test, we try to retain index#0 but reclaim delete#1. However, if a merge is triggered, we will remove both index#0 and delete#1. This commit disables merges in this test. Another option is to index more documents in the segment_2 to reduce the deletion ratio.	2018-08-02 15:57:29 -04:00
Nhat Nguyen	2c35db8043	TEST: Avoid merges in testSeqNoAndCheckpoints Since LUCENE-8263, testSeqNoAndCheckpoints might trigger merges because of the updates and deletes in the test. Our merge scheduler will trigger a flush if there is no pending merge. Those extra flushes will change the last committed segmentInfos in the engine and fail the test. This commit uses LogMergePolicy for the engine in the test to avoid merges. Closes #32430	2018-08-02 13:46:23 -04:00
Armin Braun	be31cc642b	INGEST: Enable default pipelines (#32286 ) * INGEST: Enable default pipelines * Add `default_pipeline` index setting * `_none` is interpreted as no pipeline * closes #21101	2018-08-02 17:11:12 +02:00
Yannick Welsch	db6e8c736d	Remove cluster state initial customs (#32501 ) This infrastructure was introduced in #26144 and made obsolete in #30743	2018-08-02 15:49:59 +02:00
Julie Tibshirani	5efc2ec9f7	Clarify the error message when a pipeline agg is used in the 'order' parameter. (#32522 )	2018-08-01 12:02:07 -07:00
Ryan Ernst	478f6d6cf1	Scripting: Conditionally use java time api in scripting (#31441 ) This commit adds a boolean system property, `es.scripting.use_java_time`, which controls the concrete return type used by doc values within scripts. The return type of accessing doc values for a date field is changed to Object, essentially duck typing the type to allow co-existence during the transition from joda time to java time.	2018-08-01 08:58:49 -07:00
Nik Everett	e7ead17893	Core: Minor size reduction for AbstractComponent (#32509 ) This removes a constructor from `AbstractComponent` and `AbstractLifecycleComponent` that we weren't using and it switches the logger creation away from one of the `Settings` flavored methods which are no longer needed.	2018-08-01 09:17:48 -04:00
Yannick Welsch	d80b639c18	Merge remote-tracking branch 'elastic/master' into zen2	2018-08-01 11:07:44 +02:00
Nhat Nguyen	67d53e5093	Mute testFilterCacheStats Tracked at #32506	2018-07-31 12:45:30 -04:00
Nhat Nguyen	036cb3f864	Merge branch 'master' into ccr * master: Logging: Make node name consistent in logger (#31588) Mute SSLTrustRestrictionsTests on JDK 11 Increase max chunk size to 256Mb for repo-azure (#32101) Docs: Fix README upgrade mention (#32313) Changed ReindexRequest to use Writeable.Reader (#32401) Mute KerberosAuthenticationIT Fix AutoIntervalDateHistogram.testReduce random failures (#32301) fix no=>not typo (#32463) Mute QueryProfilerIT#testProfileMatchesRegular() HLRC: Add delete watch action (#32337) High-level client: fix clusterAlias parsing in SearchHit (#32465) Fix calculation of orientation of polygons (#27967) [Kerberos] Add missing javadocs (#32469) [Kerberos] Remove Kerberos bootstrap checks (#32451) Make get all app privs requires "*" permission (#32460) Switch security to new style Requests (#32290) Switch security spi example to new style Requests (#32341) Painless: Add PainlessConstructor (#32447) update rollover to leverage write-alias semantics (#32216) Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819) INGEST: Clean up Java8 Stream Usage (#32059) Ensure KeyStoreWrapper decryption exceptions are handled (#32464)	2018-07-31 10:56:10 -04:00
Nik Everett	22459576d7	Logging: Make node name consistent in logger (#31588 ) First, some background: we have 15 different methods to get a logger in Elasticsearch but they can be broken down into three broad categories based on what information is provided when building the logger. Just a class like: ``` private static final Logger logger = ESLoggerFactory.getLogger(ActionModule.class); ``` or: ``` protected final Logger logger = Loggers.getLogger(getClass()); ``` The class and settings: ``` this.logger = Loggers.getLogger(getClass(), settings); ``` Or more information like: ``` Loggers.getLogger("index.store.deletes", settings, shardId) ``` The goal of the "class and settings" variant is to attach the node name to the logger. Because we don't always have the settings available, we often use the "just a class" variant and get loggers without node names attached. There isn't any real consistency here. Some loggers get the node name because it is convenient and some do not. This change makes the node name available to all loggers all the time. Almost. There are some caveats are testing that I'll get to. But in production code the node name is node available to all loggers. This means we can stop using the "class and settings" variants to fetch loggers which was the real goal here, but a pleasant side effect is that the ndoe name is now consitent on every log line and optional by editing the logging pattern. This is all powered by setting the node name statically on a logging formatter very early in initialization. Now to tests: tests can't set the node name statically because subclasses of `ESIntegTestCase` run many nodes in the same jvm, even in the same class loader. Also, lots of tests don't run with a real node so they don't have a node name at all. To support multiple nodes in the same JVM tests suss out the node name from the thread name which works surprisingly well and easy to test in a nice way. For those threads that are not part of an `ESIntegTestCase` node we stick whatever useful information we can get form the thread name in the place of the node name. This allows us to keep the logger format consistent.	2018-07-31 10:54:24 -04:00
Sohaib Iftikhar	4fa92cbf49	Changed ReindexRequest to use Writeable.Reader (#32401 ) -- This is a pre-stage for adding the reindex API to the REST high-level-client -- Follows the pattern set in #26315	2018-07-31 10:11:17 -04:00
Paul Sanwald	6f93911955	Fix AutoIntervalDateHistogram.testReduce random failures (#32301 ) 1. Refactor the test to use the same roundings as the implementation. 2. Refactor the test verification logic to use `innerIntervals` when rounding.	2018-07-31 08:52:16 -04:00
Daniel Mitterdorfer	9703d06321	Mute QueryProfilerIT#testProfileMatchesRegular() Relates #32492	2018-07-31 13:29:21 +02:00
Luca Cavanna	a3b272966d	High-level client: fix clusterAlias parsing in SearchHit (#32465 ) When using cross-cluster search through the high-level REST client, the cluster alias from each search hit was not parsed correctly. It would be part of the index field initially, but overridden just a few lines later once setting the shard target (in case we have enough info to build it from the response). In any case, getClusterAlias returns `null` which is a bug. With this change we rather parse back clusterAliases from the index name, set its corresponding field and properly handle the two possible cases depending on whether we can or cannot build the shard target object.	2018-07-31 09:41:51 +02:00
David Turner	8b57e2e5ba	Fix calculation of orientation of polygons (#27967 ) The method for working out whether a polygon is clockwise or anticlockwise is mostly correct but doesn't work in some rare cases such as the included test case. This commit fixes that.	2018-07-31 08:25:21 +01:00
Tal Levy	1e0fcebfe1	update rollover to leverage write-alias semantics (#32216 ) Rollover should not swap aliases when `is_write_index` is set to `true`. Instead, both the new and old indices should have the rollover alias, with the newly created index as the new write index Updates Rollover to leverage the ability to preserve aliases and swap which is the write index. Historically, Rollover would swap which index had the designated alias for writing documents against. This required users to keep a separate read-alias that enabled reading against both rolled over and newly created indices, whiles the write-alias was being re-assigned at every rollover. With the ability for aliases to designate a write index, Rollover can be a bit more flexible with its use of aliases. Updates include: - Rollover validates that the target alias has a write index (the index that is being rolled over). This means that the restriction that aliases only point to one index is no longer necessary. - Rollover explicitly (and atomically) swaps which index is the write-index by explicitly assigning the existing index to have `is_write_index: false` and have the newly created index have its rollover alias as `is_write_index: true`. This is only done when `is_write_index: true` on the write index. Default behavior of removing the alias from the rolled over index stays when `is_write_index` is not explicitly set Relevant things that are staying the same: - Rollover is rejected if there exist any templates that match the newly-created index and configure the rollover-alias - I think this existed to prevent the situation where an alias pointed to two indices for a short while. Although this can technically be relaxed, the specific cases that are safe are really particular and difficult to reason, so leaving the broad restriction sounds good	2018-07-30 14:32:55 -07:00
Armin Braun	cf7489899a	INGEST: Clean up Java8 Stream Usage (#32059 ) * GrokProcessor: Rationalize the loop over the map to save allocations and indirection * IngestDocument: Rationalize way we append to `List`	2018-07-30 21:25:30 +02:00
Ioannis Kakavas	c2e3bebab9	Ensure KeyStoreWrapper decryption exceptions are handled (#32464 ) * Ensure decryption related exceptions are handled This commit ensures that all possible Exceptions in KeyStoreWrapper#decrypt() are handled. More specifically, in the case that a wrong password is used for secure settings, calling readX on the DataInputStream that wraps the CipherInputStream can throw an IOException. It also adds a test for loading a KeyStoreWrapper with a wrong password. Resolves #32411	2018-07-30 22:15:59 +03:00
Nhat Nguyen	1fdc3f08be	Do not expose hard-deleted docs in Lucene history (#32333 ) Today when reading operation history in Lucene, we read all documents. However, if indexing a document is aborted, IndexWriter will hard-delete it; we, therefore, need to exclude that document from Lucene history. This commit makes sure that we exclude aborted documents by using the hard liveDocs of a SegmentReader if there are deletes. Closes #32269	2018-07-30 14:30:47 -04:00
Nhat Nguyen	2245812ef7	Merge branch 'master' into ccr * master: Tests: Fix convert error tests to use fixed value (#32415) IndicesClusterStateService should replace an init. replica with an init. primary with the same aId (#32374) REST high-level client: parse back _ignored meta field (#32362) [CI] Mute DocumentSubsetReaderTests testSearch	2018-07-30 14:02:58 -04:00
Boaz Leskes	0cae19c8d7	IndicesClusterStateService should replace an init. replica with an init. primary with the same aId (#32374 ) In rare cases it is possible that a nodes gets an instruction to replace a replica shard that's in `POST_RECOVERY` with a new initializing primary with the same allocation id. This can happen by batching cluster states that include the starting of the replica, with closing of the indices, opening it up again and allocating the primary shard to the node in question. The node should then clean it's initializing replica and replace it with a new initializing primary. I'm not sure whether the test I added really adds enough value as existing tests found this. The main reason I added is to allow for simpler reproduction and to double check I fixed it. I'm open to discuss if we should keep. Closes #32308	2018-07-30 16:24:41 +03:00
Luca Cavanna	9a4d0069f6	REST high-level client: parse back _ignored meta field (#32362 ) `GetResult` and `SearchHit` have been adjusted to parse back the `_ignored` meta field whenever it gets printed out. Expanded the existing tests to make sure this is covered. Fixed also a small problem around highlighted fields in `SearchHitTests`.	2018-07-30 13:43:40 +02:00
Nhat Nguyen	d2a88f5c62	Merge branch 'master' into ccr * master: TEST: testDocStats should always use forceMerge (#32450) TEST: Avoid deletion in FlushIT AwaitsFix IndexShardTests#testDocStats Painless: Add method type to method. (#32441)	2018-07-28 07:50:39 -04:00
Nhat Nguyen	5b1ad8099b	TEST: testDocStats should always use forceMerge (#32450 ) Due to the recent change in LUCENE-8263, we need to adjust the deletion ration to between 10% to 33% to preserve the current behavior of the test. However, we may need another refinement if soft-deletes is enabled as the actual deletes are different because of delete tombstones. This commit prefers to always execute forceMerge instead of adjusting the deletion ratio so that this test can focus on testing docStats. Closes #32449	2018-07-28 07:41:30 -04:00
Nhat Nguyen	a538b76f6f	TEST: avoid merge in testSegmentMemoryTrackedInBreaker This commit indexes an extra document to avoid triggering merges. Relates LUCENE-8263	2018-07-27 23:44:35 -04:00
Nhat Nguyen	6e98615cc1	TEST: Avoid deletion in FlushIT Due to the recent change in LUCENE-8263, a merge can be triggered if the deletion ration is higher than 33%. An in-progress merge can prevent a synced-flush from issuing. This commit avoids deletes by using different docIds. Closes #32436	2018-07-27 23:14:24 -04:00
Nhat Nguyen	139631c77d	AwaitsFix IndexShardTests#testDocStats Relates #32449	2018-07-27 20:48:23 -04:00
Nhat Nguyen	2f756b00f6	Merge branch 'master' into ccr * master: Remove reference to non-existent store type (#32418) [TEST] Mute failing FlushIT test Fix ordering of bootstrap checks in docs (#32417) [TEST] Mute failing InternalEngineTests#testSeqNoAndCheckpoints [TEST] Mute failing testConvertLongHexError bump lucene version after backport Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390) [Kerberos] Avoid vagrant update on precommit (#32416) TESTS: Move netty leak detection to paranoid level (#32354) [DOCS] Fixes formatting of scope object in job resource Copy missing segment attributes in getSegmentInfo (#32396) AbstractQueryTestCase should run without type less often (#28936) INGEST: Fix Deprecation Warning in Script Proc. (#32407) Switch x-pack/plugin to new style Requests (#32327) Docs: Correcting a typo in tophits (#32359) Build: Stop double generating buildSrc pom (#32408) TEST: Avoid triggering merges in FlushIT Fix missing JavaDoc for @throws in several places in KerberosTicketValidator. Switch x-pack full restart to new style Requests (#32294) Release requests in cors handler (#32364) Painless: Clean Up PainlessClass Variables (#32380) Docs: Fix callouts in put license HL REST docs (#32363) [ML] Consistent pattern for strict/lenient parser names (#32399) Update update-settings.asciidoc (#31378) Remove some dead code (#31993) Introduce index store plugins (#32375) Rank-Eval: Reduce scope of an unchecked supression Make sure _forcemerge respects `max_num_segments`. (#32291) TESTS: Fix Buf Leaks in HttpReadWriteHandlerTests (#32377) Only enforce password hashing check if FIPS enabled (#32383)	2018-07-27 16:24:03 -04:00
javanna	dcb5d24639	[TEST] Mute failing FlushIT test See #32436	2018-07-27 17:10:29 +02:00
javanna	7aa5365497	[TEST] Mute failing InternalEngineTests#testSeqNoAndCheckpoints	2018-07-27 14:41:32 +02:00
Nhat Nguyen	8474f8a01c	Validate source of an index in LuceneChangesSnapshot (#32288 ) Today it's possible to encounter an Index operation in Lucene whose _source is disabled, and _recovery_source was pruned by the MergePolicy. If it's the case, we create a Translog#Index without source and let the caller validate it later. However, this approach is challenging for the caller. Deletes and No-Ops don't allow invoking "source()" method. The caller has to make sure to call "source()" only on index operations. The current implementation in CCR does not follow this and fail to replica deletes or no-ops. Moreover, it's easier to reason if a Translog#Index always has the source.	2018-07-27 08:16:52 -04:00
Jim Ferenczi	5decb23687	bump lucene version after backport	2018-07-27 10:50:22 +02:00
Jim Ferenczi	53ff06e621	Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390 ) The main highlight is the removal of the reclaim_deletes_weight in the TieredMergePolicy. The es setting index.merge.policy.reclaim_deletes_weight is deprecated in this commit and the value is ignored. The new merge policy setting setDeletesPctAllowed should be added in a follow up.	2018-07-27 08:28:51 +02:00
Nhat Nguyen	90c58872ff	Only enable soft-deletes in 6.5 or later	2018-07-26 21:43:25 -04:00
Jim Ferenczi	860f92fcdd	Copy missing segment attributes in getSegmentInfo (#32396 ) The index sort and the attributes map of a segment are not copied on committed segments that are not loaded by the internal or external searcher.	2018-07-26 20:29:27 +02:00
Jim Ferenczi	8e5f281b27	AbstractQueryTestCase should run without type less often (#28936 ) This commit changes the randomization to always create an index with a type. It also adds a way to create a query shard context that maps to an index with no type registered in order to explicitely test cases where there is no type.	2018-07-26 20:29:05 +02:00
Armin Braun	57876bfeb9	INGEST: Fix Deprecation Warning in Script Proc. (#32407 ) * Using short script form normalized to a map that used 'inline' instead of 'source' so a short form processor definition like: ``` { "script": "ctx.foo= 'bar'" } ``` would always warn about the following deprecation: ``` #! Deprecation: Deprecated field [inline] used, expected [source] ```	2018-07-26 19:55:28 +02:00
Nhat Nguyen	0ed3458534	TEST: Avoid triggering merges in FlushIT In testSyncedFlushSkipOutOfSyncReplicas, we reindex the extra documents to all shards including the out-of-sync replica. However, reindexing to that replica can trigger merges (due to the new deletes) which cause the synced-flush failed. This test starts failing after we aggressively trigger merges segments with a large number of deletes in LUCENE-8263.	2018-07-26 12:38:36 -04:00

... 9 10 11 12 13 ...

2090 Commits