OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-10 15:05:33 +00:00

Author	SHA1	Message	Date
Jim Ferenczi	544de13d8e	Disallow negative query boost (#34486 ) This change disallows negative query boosts. Negative scores are not allowed in Lucene 8 so it is easier to just disallow negative boosts entirely. We should also deprecate negative boosts in 6x in order to ensure that users are aware when they'll upgrade to ES 7. Relates #33309	2018-10-16 11:31:53 +01:00
Jason Tedor	4b2052c683	Introduce index settings version (#34429 ) This commit introduces settings version to index metadata. This value is monotonically increasing and is updated on settings updates. This will be useful in cross-cluster replication so that we can request settings updates from the leader only when there is a settings update.	2018-10-16 06:22:20 -04:00
Daniel Mitterdorfer	92b2e1a209	Remove lenient boolean handling With this commit we remove some leftovers from #26389 which cleaned up lenient boolean handling. Relates #26389 Relates #22298 Relates #34467	2018-10-16 06:30:00 +02:00
Jason Tedor	55dee53046	Do not update number of replicas on no indices (#34481 ) Today when submitting an update settings request to update the number of replicas with a wildcard that does not match any indices and allow no indices is set to true, the request ends up being interpreted as updating the number of replicas for all indices. That is, consider the following sequence: PUT /test-index { "settings": { "index.number_of_replicas": 0 } } PUT /non-existent-*/_settings?expand_wildcards=open&allow_no_indices=true { "settings": { "index.number_of_replicas": 1 } } GET /test-index/_settings The latter will show that the number of replicas on test-index is now one. This is surprising, and should be considered a bug. The underlying problem here is treating no indices in the underlying methods used to update the routing table and the metadata as meaning all indices. This commit takes away this assumption. Tests that relied on this behavior have been changed to no longer rely on this. A test for this situation is added in UpdateNumberOfReplicasIT.	2018-10-15 19:49:58 -04:00
Nik Everett	23ece922c9	Core: Remove two methods from AbstractComponent (#34336 ) This removes another two methods from `AbstractComponent`. One isn't used at all and another is only used in a single class in watcher. I've moved the method that watcher uses into the single class that uses it.	2018-10-15 16:05:14 -04:00
Nik Everett	a6d1cc6ca9	Revert "Search: Fix spelling mistake in Javadoc (#34480 )" This reverts commit 4e1d7baed0a1e2fa1fa17fe4479045d811f1e02e.	2018-10-15 15:42:11 -04:00
fonxian	4e1d7baed0	Search: Fix spelling mistake in Javadoc (#34480 ) "iff" -> "if".	2018-10-15 15:38:37 -04:00
Ryan Ernst	26f1d7fc94	Tests: Handle epoch date formatters edge cases (#34437 ) This commit handles cases testing withLocale and withZone when the zone and locale in question is the same as the special base case. This can happen sometimes since the locale and zoneids are randomized.	2018-10-15 12:18:18 -07:00
Jim Ferenczi	67577fca56	Fix handling of empty keyword in terms aggregation (#34457 ) Empty values on keyword fields are filtered by the `map` execution mode of the `terms` aggregation. This commit restores them as valid buckets. Closes #34434	2018-10-15 19:33:52 +01:00
Armin Braun	ebca27371c	SCRIPTING: Move Aggregation Script Context to its own class (#33820 ) * SCRIPTING: Move Aggregation Script Context to its own class	2018-10-15 17:28:05 +01:00
Colin Goodheart-Smithe	0b42eda0e3	Merge branch 'master' into index-lifecycle	2018-10-15 16:03:37 +01:00
David Turner	9bb620eece	Mute PartitionedRoutingIT#testShrinking on Windows	2018-10-15 13:18:00 +01:00
Ryan Ernst	72d818c304	Tests: Fix DateFormatter equals tests with locale (#34435 ) This commit removes randomization of locale for DateFormatter equals tests, instead using explicit locales. The test framework already randomizes locales, so the random choice of the second locale can sometimes be equal to the already chosen locale. Randomization also does not provide any extra protection, as the equality of DateFormatter does not implement equality of the locales itself. closes #34337	2018-10-14 23:54:49 +01:00
Yannick Welsch	5fbead00a3	Zen2: Add infrastructure for integration tests (#34365 ) Adds the infrastructure to run integration tests against Zen2.	2018-10-14 20:55:04 +01:00
David Turner	8b9fa55c93	Add storage-layer disruptions to CoordinatorTests (#34347 ) Today we assume the storage layer operates perfectly in CoordinatorTests, which means we are not testing that the system's invariants are preserved if the storage layer fails for some reason. This change injects (rare) storage-layer failures during the safety phase to cover these cases.	2018-10-13 14:24:15 +01:00
David Turner	d98199df14	Extend duration of fixLag() (#34364 ) Today, fixLag() waits for a new cluster state to be committed. However, it does not account for the fact that a term bump may occur, requiring a new election to take place after the cluster state is committed. This change fixes this.	2018-10-11 23:24:08 +01:00
David Turner	a32e303b0c	Account for election duration (#34362 ) Today we may schedule two elections very close together, which can cause the first election to fail even if there are no other nodes. This change adds a delay in between subsequent elections on the same node, effectively allowing time for each election to complete before scheduling the next one.	2018-10-11 15:31:08 +01:00
Jay Modi	6d99d7dafc	ListenableFuture should preserve ThreadContext (#34394 ) ListenableFuture may run a listener on the same thread that called the addListener method or it may execute on another thread after the future has completed. Whenever the ListenableFuture stores the listener for execution later, it should preserve the thread context which is what this change does.	2018-10-11 15:24:38 +01:00
Nhat Nguyen	33791ac27c	CCR: Following primary should process operations once (#34288 ) Today we rewrite the operations from the leader with the term of the following primary because the follower should own its history. The problem is that a newly promoted primary may re-assign its term to operations which were replicated to replicas before by the previous primary. If this happens, some operations with the same seq_no may be assigned different terms. This is not good for the future optimistic locking using a combination of seqno and term. This change ensures that the primary of a follower only processes an operation if that operation was not processed before. The skipped operations are guaranteed to be delivered to replicas via either primary-replica resync or peer-recovery. However, the primary must not acknowledge until the global checkpoint is at least the highest seqno of all skipped ops (i.e., they all have been processed on every replica). Relates #31751 Relates #31113	2018-10-10 15:39:57 -04:00
Simon Willnauer	34b935ae57	Improve `getRestHandlerWrapper` JavaDocs (#34376 ) Questions on how to work with `ActionPlugin#getRestHandlerWrapper()` come up in discuss forums all the time. This change adds an example to the javadoc how this method should/could be used.	2018-10-10 17:28:07 +01:00
David Turner	52a3a19551	Add low-level bootstrap implementation (#34345 ) Today we inject the initial configuration of the cluster (i.e. the set of voting nodes) at startup. In reality we must support injecting the initial configuration after startup too. This commit adds low-level support for doing so as safely as possible.	2018-10-08 15:56:48 +01:00
Yannick Welsch	49cbcaff4f	Allow excluding folder names when scanning for dangling indices (#34349 ) ES is scanning for dangling indices on every cluster state update. For this, it lists the subfolders of the indices directory to determine which extra index directories exist on the node where there's no corresponding index in the cluster state. These are potential targets for dangling index import. On certain machine types, and with large number of indices, this subfolder listing can be horribly slow. This means that every cluster state update will be slowed down by potentially hundreds of milliseconds. One of the reasons for this poor performance is that Files.isDirectory() is a relatively expensive call on some OS and JDK versions. There is no need though to do all these isDirectory calls for folders which we know we are going to discard anyhow in the next step of the dangling indices logic. This commit allows adding an exclusion predicate to the availableIndexFolders methods which can dramatically speed up this method when scanning for dangling indices.	2018-10-08 15:35:50 +02:00
David Turner	ac99d1d66d	Fix bugs in fixLag() (#34346 ) The hack to work around lag detection had some issues: - it always called runFor(), even if no lag was detected - it looked at the last-accepted state not the last-applied state, so missed some lag situations. This fixes these issues.	2018-10-08 11:33:25 +01:00
Nik Everett	06993e0c35	Logging: Make ESLoggerFactory package private (#34199 ) Since all calls to `ESLoggerFactory` outside of the logging package were deprecated, it seemed like it'd simplify things to migrate all of the deprecated calls and declare `ESLoggerFactory` to be package private. This does that.	2018-10-06 09:54:08 -04:00
David Turner	03da4f6c51	Gather votes from all nodes (#34335 ) Today we accept that some nodes may vote for the wrong master in an election. This is mostly fine because they do end up joining the correct master in the end, but the lack of a vote from every follower may prevent a future desirable reconfiguration from taking place. The solution is to hold another election in a yet-higher term in order to collect a complete set of votes. Elections are somewhat disruptive so we should think carefully about when this election should take place. One option is to wait as late as possible (on the grounds that it might not ever be necessary). This unfortunately makes it harder to predict how an apparently-smoothly-running cluster will react to nodes leaving and joining. Instead we prefer to perform the election as soon as possible in the leader's term, adding "votes from all followers" to the invariants that we expect to hold in a stable cluster. The start of a leader's term is already a somewhat disrupted time for the cluster, so performing another election at this point does not materially change the cluster's behaviour. This change implements the logic needed to trigger a new election in order to satisfy this extra stabilisation condition.	2018-10-06 07:22:04 +01:00
Daniel Mitterdorfer	7d826916b9	Adjust size of BigArrays in circuit breaker test With this commit we restore the previous behavior in `BigArraysTests#testMaxSizeExceededOnResize` but lower the sizes that are tested to the range between 256 bytes to 16 kB so the test does not produce a whole lot of garbage. The previous attempt to reduce the amount of garbage produced by that test was to properly size the array initially but it failed to account for object alignment which lead to test failures in some cases. While it would be possible to account for object alignment, we would need to open up BigArrays or directly use the underlying Lucene API which would require us to allocate an array upfront only to find its size (incl. object alignment). Instead we have fixed this issue by conservatively sizing the array initially (so the initial allocation will never trip the circuit breaker) and reduce garbage by reducing the circuit breaker's upper bound as described previously. Closes #33750 Relates #34325	2018-10-05 15:39:08 +02:00
Jim Ferenczi	5c7b52e930	Adapt bwc version after backport Relates #33587	2018-10-05 13:07:39 +02:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
David Turner	29d7d1d503	Minor housekeeping of tests (#34315 ) From experience with #34257, here are a few things that help with analysing logs from test runs. Also we prevent trying to stabilise a cluster with raised delay variability, because lowering the delay variability requires time to allow all the extra-varied-scheduled tasks to work their way out of the system.	2018-10-05 07:57:03 +01:00
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is a beneficial for it is going to also be working once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (also it was renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
Nik Everett	09aaed4fe4	Tasks: Document that status is not semvered (#34270 ) The `status` part of the tasks API reflects the internal status of a running task. In general, we do not make backwards breaking changes to the `status` but because it is internal we reserve the right to do so. I suspect we will very rarely excercise that right but it is important that we have it so we're not boxed into any particular implementation for a request. In some sense this is policy making by documentation change. In another it is clarification of the way we've always thought of this field. I also reflect the documentation change into the Javadoc in a few places. There I acknowledge Kibana's "special relationship" with Elasticsearch. Kibana parses `_reindex`'s `status` field and, because we're friends with those folks, we should talk to them before we make backwards breaking changes to it. We want to be friends with everyone but there is only so much time in the day and we don't want to make backwards breaking fields to `status` at all anyway. So we hope that breaking changes documentation should be enough for other folks. Relates to #34245.	2018-10-04 14:42:37 -04:00
Yannick Welsch	b32abcbd00	Zen2: Add Cluster State Applier (#34257 ) Adds the cluster state applier to Coordinator, and adds tests for cluster state acking.	2018-10-04 20:33:28 +02:00
Vladimir Dolzhenko	dcfe64e0e4	[CI] Fix bogus ScheduleWithFixedDelayTests.testRunnableRunsAtMostOnceAfterCancellation Closes #34004	2018-10-04 16:31:56 +02:00
Armin Braun	3ccfc3de58	SCRIPTING: Terms set query expression (#33856 ) * SCRIPTING: Add Expr. Compile for TermSetQuery Ctx. * Follow up to #33602 adding the ability to compile TermsSetQuery scripts with the expressions engine in the same way we support SearchScript in Expressions * Duplicated the code here for now to make the change less complex, the only difference to SearchScript is that `_score` and `_value` are not handled for TermsSetQuery * remove redundant check	2018-10-04 16:03:57 +02:00
Nik Everett	ab8a5563f2	Logging: Drop remaining Settings log ctor (#34149 ) Drops the last logging constructor that takes `Settings` because it is no longer needed. Watcher goes through a lot of effort to pass `Settings` to `Logger` constructors and dropping `Settings` from all of those calls allowed us to remove quite a bit of log-based ceremony from watcher.	2018-10-04 09:18:04 -04:00
David Turner	c6b0f08472	Add safety phase to CoordinatorTests (#34241 ) Today's CoordinatorTests have a limited amount of randomisation in how things are scheduled. However, to be fully confident in Zen2's liveness we require the system to stabilise after any permitted sequence of events. We can achieve this by running the system in a much more random fashion for a while, with much larger variation in when things are scheduled (simulating GC pressure and network disruption) and then continuing to assert that the system stabilises as we expect. When running randomly, we do not expect to make significant progress and merely verify that no safety property is violated. This change introduces the runRandomly() test method which implements this idea. It also fixes a handful of liveness bugs that this first version of runRandomly() exposed.	2018-10-04 07:40:26 +01:00
Jim Ferenczi	e8b986cc37	Fix sporadic failure in NestedObjectMapperTests Relates #34225	2018-10-04 07:40:46 +02:00
Nhat Nguyen	6dd716b0c4	Replace version with reader cache key in IndicesRequestCache (#34189 ) Today we use the version of a DirectoryReader as a component of the key of IndicesRequestCache. This usage is perfectly fine since the version is advanced every time a new change is made into IndexWriter. In other words, two DirectoryReaders with the same version should have the same content. However, this invariant is only guaranteed in the context of a single IndexWriter because the version is reset to the committed version value when IndexWriter is re-opened. Since #33473, each IndexShard may have more than one IndexWriter, and using the version of a DirectoryReader as a part of the cache key can cause IndicesRequestCache to return stale cached values. For example, in #27650, we rollback the engine (i.e., re-open IndexWriter), index new documents, refresh, then make a count request, but the search layer mistakenly returns the count of the DirectoryReader of the previous IndexWriter because the current DirectoryReader has the same version of the old DirectoryReader even their documents are different. This is possible because these two readers come from different IndexWriters. This commit replaces the the version with the reader cache key of IndexReader as a component of the cache key of IndicesRequestCache. Closes #27650 Relates #33473	2018-10-03 21:03:24 -04:00
David Turner	cbe1cf98c6	Merge branch 'master' into zen2	2018-10-03 22:12:56 +01:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Jim Ferenczi	ee21067a41	Add early termination support for min/max aggregations (#33375 ) This commit adds the support to early terminate the collection of a leaf in the min/max aggregator. If the query matches all documents the min and max value for a numeric field can be retrieved efficiently in the points reader. This change applies this optimization when possible.	2018-10-03 18:33:39 +02:00
Lee Hinman	90c55f5e36	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-03 09:11:28 -06:00
albendz	f09190c14d	Require combine and reduce scripts in scripted metrics aggregation (#33452 ) * Make text message not required in constructor for slack * Remove unnecessary comments in test file * Throw exception when reduce or combine is not provided; update tests * Update integration tests for scripted metrics to always include reduce and combine * Remove some old changes from previous branches * Rearrange script presence checks to be earlier in build * Change null check order in script builder for aggregated metrics; correct test scripts in IT * Add breaking change details to PR	2018-10-03 15:22:01 +01:00
Jim Ferenczi	41528c0813	Adapt bwc version after backport (bis) Relates #34225	2018-10-03 14:24:01 +02:00
Jim Ferenczi	1aa8e72be7	Adapt bwc version after backport Relates #34225	2018-10-03 12:24:07 +02:00
Jim Ferenczi	5a3e031831	Preserve the order of nested documents in the Lucene index (#34225 ) Today we reverse the initial order of the nested documents when we index them in order to ensure that parents documents appear after their children. This means that a query will always match nested documents in the reverse order of their offsets in the source document. Reversing all documents is not needed so this change ensures that parents documents appear after their children without modifying the initial order in each nested level. This allows to match children in the order of their appearance in the source document which is a requirement to efficiently implement #33587. Old indices created before this change will continue to reverse the order of nested documents to ensure backwark compatibility.	2018-10-03 11:55:30 +02:00
Colin Goodheart-Smithe	2d64e3db9a	Adds trace logging to IndicesRequestCache (#34180 ) * Adds trace logging to IndicesRequestCache This change adds trace level logging to `IndicesrrequestCache` witht eh primary aim of helping to identify the cause of teh failures in https://github.com/elastic/elasticsearch/issues/32827. The cache will log at trace level when a cache hit or miss occurs including the reader version and the cache key. Note that this change adds a `cacheKeyRenderer` whcih supplies a human readable String of the cache key since the actual cache key itself is a `BytesReference` containing the wire protocol serialised form of the request. Logging is also added for the case where a search timeout occurs and fr that reason the cache entry is invalidated. * Adds comment to remaind us to remove cacheKeyRenderer	2018-10-03 08:58:33 +01:00
David Turner	a9eae1d068	Merge branch 'master' into zen2	2018-10-03 08:36:34 +01:00
Gordon Brown	fb907706ec	Merge branch 'master' into index-lifecycle	2018-10-02 13:43:46 -06:00
Dimitrios Liappis	f12e0a8398	Add ES version 6.4.3 (#34239 ) Version bump	2018-10-02 21:15:58 +03:00

... 2 3 4 5 6 ...

1696 Commits