OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	11dc9fe249	Mark searcher as accessed in acquireSearcher (#41335 ) This fixes an issue where every N seconds a slow search request is triggered since the searcher access time is not set unless the shard is idle. This change moves to a more pro-active approach setting the searcher as accessed all the time.	2019-04-18 19:14:50 +02:00
Adrien Grand	a699cb76a5	Fix javadoc tag. (#41330 ) s/returns/return/	2019-04-18 14:41:09 +02:00
Armin Braun	389a13b68e	Mute BulkProcessorRetryIT#testBulkRejectionLoadWithBackoff (#41325 ) (#41331 ) * For #41324	2019-04-18 11:55:28 +02:00
Alpar Torok	a4a4259cac	Mute failing test Tracking #41326	2019-04-18 09:26:20 +03:00
Armin Braun	c77e10b16b	Handle Bulk Requests on Write Threadpool (#40866 ) (#41315 ) * Bulk requests can be thousands of items large and take more than O(10ms) time to handle => we should not handle them on the transport threadpool to not block select loops * relates #39128 * relates #39658	2019-04-18 07:10:23 +02:00
David Turner	946baf87d3	Assert TransportReplicationActions acquire permits (#41271 ) Today we do not distinguish "no operations in flight" from "operations are blocked", since both return `0` from `IndexShard#getActiveOperationsCount()`. We therefore cannot assert that every `TransportReplicationAction` performs its actions under permit(s). This commit fixes this by returning `IndexShard#OPERATIONS_BLOCKED` if operations are blocked, allowing these two cases to be distinguished.	2019-04-17 23:05:03 +02:00
Zachary Tong	7e62ff2823	[Rollup] Validate timezones based on rules not string comparision (#36237 ) The date_histogram internally converts obsolete timezones (such as "Canada/Mountain") into their modern equivalent ("America/Edmonton"). But rollup just stored the TZ as provided by the user. When checking the TZ for query validation we used a string comparison, which would fail due to the date_histo's upgrading behavior. Instead, we should convert both to a TimeZone object and check if their rules are compatible.	2019-04-17 13:46:44 -04:00
Christoph Büscher	4d964194db	Fix error applying `ignore_malformed` to boolean values (#41261 ) The `ignore_malformed` option currently works on numeric fields only when the bad value isn't a string value but not if it is a boolean. In this case we get a parsing error from the xContent parser which we need to catch in addition to the field mapper. Closes #11498	2019-04-17 18:44:57 +02:00
David Turner	2670ed2f8f	Assert the stability of custom search preferences (#41150 ) Today the `?preference=custom_string_value` search preference will only change its choice of a shard copy if something changes the `IndexShardRoutingTable` for that specific shard. Users can use this behaviour to route searches to a consistent set of shard copies, which means they can reliably hit copies with hot caches, and use the other copies only for redundancy in case of failure. However we do not assert this property anywhere, so we might break it in future. This commit adds a test that shows that searches are routed consistently even if other indices are created/rebalanced/deleted. Relates https://discuss.elastic.co/t/176598, #41115, #26791	2019-04-17 17:47:44 +02:00
Nhat Nguyen	2ee87c99d9	Fix bwc version of sanity check of read only engine Relates #41041	2019-04-17 10:25:47 -04:00
Nhat Nguyen	aa0c957a4a	Do not trim unsafe commits when open readonly engine (#41041 ) Today we always trim unsafe commits (whose max_seq_no >= global checkpoint) before starting a read-write or read-only engine. This is mandatory for read-write engines because they must start with the safe commit. This is also fine for read-only engines since most of the cases we should have exactly one commit after closing an index (trimming is a noop). However, this is dangerous for following indices which might have more than one commits when they are being closed. With this change, we move the trimming logic to the ctor of InternalEngine so we won't trim anything if we are going to open a read-only engine.	2019-04-17 10:16:12 -04:00
Adrien Grand	f7e590ce0d	ProfileScorer should propagate `setMinCompetitiveScore`. (#40958 ) (#41302 ) Currently enabling profiling disables top-hits optimizations, which is unfortunate: it would be nice to be able to notice the difference in method counts and timings depending on whether total hit counts are requested.	2019-04-17 16:11:14 +02:00
Adrien Grand	9fd5237fd4	Clean up Node#close. (#39317 ) (#41301 ) `Node#close` is pretty hard to rely on today: - it might swallow exceptions - it waits for 10 seconds for threads to terminate but doesn't signal anything if threads are still not terminated after 10 seconds This commit makes `IOException`s propagated and splits `Node#close` into `Node#close` and `Node#awaitClose` so that the decision what to do if a node takes too long to close can be done on top of `Node#close`. It also adds synchronization to lifecycle transitions to make them atomic. I don't think it is a source of problems today, but it makes things easier to reason about.	2019-04-17 16:10:53 +02:00
Jason Tedor	6566979c18	Always check for archiving broken index settings (#41209 ) Today we check if an index has broken settings when checking if an index needs to be upgraded. However, it can be the case that an index setting became broken even if an index is already upgraded to the current version if the user removed a plugin (or downgraded from the default distribution to the non-default distribution) while on the same version of Elasticsearch. In this case, some registered settings would go missing and the index would now be broken. Yet, we miss this check and instead of archiving the settings, the index becomes unassigned due to the missing settings. This commit addresses this by checking for broken settings whether or not the index is upgraded.	2019-04-17 07:00:23 -04:00
Christoph Büscher	badb7a22e0	Some cleanups in NoisyChannelSpellChecker (#40949 ) One of the two #getCorrections methods is only used in tests, so we can move it and any of the required helper methods to that test. Also reducing the visibility of several methods to package private since the class isn't used elsewhere outside the package.	2019-04-17 10:22:12 +02:00
David Turner	bfa06d963e	Do not create missing directories in readonly repo (#41249 ) Today we erroneously look for a node setting called `readonly` when deciding whether or not to create a missing directory in a filesystem repository. This change fixes this by using the repository setting instead. Closes #41009 Relates #26909	2019-04-17 09:43:14 +02:00
Yogesh Gaikwad	6a552c05fe	Use alias name from rollover request to query indices stats (#40774 ) (#41284 ) In `TransportRolloverAction` before doing rollover we resolve source index name (write index) from the alias in the rollover request. Before evaluating the conditions and executing rollover action, we retrieve stats, but to do so we used the source index name resolved from the alias instead of alias from the index. This fails when the user is assigned a role with index privilege on the alias instead of the concrete index. This commit fixes this by using the alias from the request. After this change, verified that when we retrieve all the stats (including write + read indexes) we are considering only source index. Closes #40771	2019-04-17 14:15:05 +10:00
Jim Ferenczi	043c1f5d42	Unified highlighter should respect no_match_size with number_of_fragments set to 0 (#41069 ) The unified highlighter returns the first sentence of the text when number_of_fragments is set to 0 (full highlighting). This is a legacy of the removed postings highlighter that was based on sentence break only. This commit changes this behavior in order to respect the provided no_match_size value when number_of_fragments is set to 0. This means that the behavior will be consistent for any value of the number_of_fragments option. Closes #41066	2019-04-16 19:25:25 +02:00
Armin Braun	c4e84e2b34	Add Bulk Delete Api to BlobStore (#40322 ) (#41253 ) * Adds Bulk delete API to blob container * Implement bulk delete API for S3 * Adjust S3Fixture to accept both path styles for bulk deletes since the S3 SDK uses both during our ITs * Closes #40250	2019-04-16 17:19:05 +02:00
Jim Ferenczi	c22a2cea12	BlendedTermQuery should ignore fields that don't exists in the index (#41125 ) Today the blended term query detects if a term exists in a field by looking at the term statistics in the index. However the value to indicate that a term has no occurence in a field have changed in Lucene. A non-existing term now returns a doc and total term frequency of 0. Because of this disrepancy the blended term query picks 0 as the minimum frequency for a term even if other fields have documents for this terms. This confuses the term queries that the blending creates since some of them contain a custom state that indicates a frequency of 0 even though the term has some occurence in the field. For these terms an exception is thrown because the term query always checks that the term state's frequency is greater than 0 if there are documents associate to it. This change fixes this bug by ignoring terms with a doc freq of 0 when the blended term query picks the minimum term frequency among the requested fields. Closes #41118	2019-04-16 16:25:42 +02:00
David Turner	8577bbd73b	Inline TransportReplAct#createReplicatedOperation (#41197 ) `TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation` exists so it can be overridden in tests. This commit re-works these tests to use a real `ReplicationOperation` and inlines the now-unnecessary method. Relates #40706.	2019-04-16 13:36:29 +01:00
David Turner	10e58210a0	Validate cluster UUID when joining Zen1 cluster (#41063 ) Today we fail to join a Zen2 cluster if the cluster UUID does not match our own, but we do not perform the same validation when joining a Zen1 cluster. This means that a Zen2 node will pass join validation and be added to a Zen1 cluster but will reject all cluster states from the master. Relates #37775	2019-04-16 12:49:47 +01:00
Nhat Nguyen	8ee84f2268	Correct flush parameters in engine test Since #40213, we forbid a combination of flush parameters: force=true and wait_if_ongoing=false. Closes #41236	2019-04-16 05:04:31 -04:00
Christoph Büscher	f8161ffa88	Fix some `range` query edge cases (#41160 ) Currently we throw an error when a range querys minimum value exceeds the maximum value due to the fact that they are neighbouring values and both upper and lower value are excluded from the interval. Since this is a condition that the user usually doesn't specify conciously (at least in the case of float and double values its difficult to see which values are adjacent) we should ignore those "wrong" intervals and create a MatchNoDocsQuery in those cases. We should still throw errors with an actionable message if the user specifies the query interval in a way that min value > max value. This PR adds those checks and tests for those cases. Closes #40937	2019-04-16 10:56:13 +02:00
Tim Brooks	ad3b7abaa3	Deprecate old transport settings (#41229 ) This is related to #36652. We intend to remove a number of old transport settings in 8.0. This commit deprecates those settings for 7.x.	2019-04-15 21:43:09 -06:00
Tim Brooks	56c00eecbc	Remove string usages of old transport settings (#41207 ) This is related to #36652. We intend to deprecate a number of transport settings in 7.x and remove them in 8.0. This commit removes the string usages of these settings.	2019-04-15 16:54:24 -06:00
Zachary Tong	f19b052e03	Better error messages when pipelines reference incompatible aggs (#40068 ) Pipelines require single-valued agg or a numeric to be returned. If they don't get that, they throw an exception. Unfortunately, this exception text is very confusing to users because it usually arises from pathing "through" multiple terms aggs. The final target is a numeric, but it's the intermediary aggs that cause the problem. This commit adds the current agg name to the exception message so the user knows which "level" is the issue.	2019-04-15 10:35:53 -04:00
Jim Ferenczi	d30fec4914	Full text queries should not always ignore unmapped fields (#41062 ) Full text queries ignore unmapped fields since https://github.com/elastic/elasticsearch/issues/41022 even if all fields in the query are unmapped. This change makes sure that we ignore unmapped fields only if they are mixed with mapped fields and returns a MatchNoDocsQuery otherwise. Closes #41022	2019-04-15 12:16:50 +02:00
Christoph Büscher	2980a6c70f	Clarify some ToXContent implementations behaviour (#41000 ) This change adds either ToXContentObject or ToXContentFragment to classes directly implementing ToXContent currently. This helps in reasoning about whether those implementations output full xcontent object or just fragments. Relates to #16347	2019-04-15 09:42:08 +02:00
Yogesh Gaikwad	e7375368d6	Remove nested loop in IndicesStatsResponse (#40988 ) (#41138 ) This commit removes nested loop in `getIndices`.	2019-04-13 04:36:29 +10:00
Ignacio Vera	8af930c468	Improve error message when polygons contains twice the same point in no-consecutive position (#41051 ) (#41133 ) When a polygon contains a self-intersection due to have twice the same point in no-consecutive position, the polygon builder tries to split the polygon. During the split one of the polygons become invalid as it is not closed and an error is thrown which is not related to the real issue. We detect this situation now and throw a more meaningful error.	2019-04-12 09:16:33 +02:00
Nhat Nguyen	e9999dfa1d	Init global checkpoint after copy commit in peer recovery (#40823 ) Today a new replica of a closed index does not have a safe commit invariant when its engine is opened because we won't initialize the global checkpoint on a recovering replica until the finalize step. With this change, we can achieve that property by creating a new translog with the global checkpoint from the primary at the end of phase 1.	2019-04-11 22:18:31 -04:00
Antonio Matarrese	79c7a57737	Use the breadth first collection mode for significant terms aggs. (#29042 ) This helps avoid memory issues when computing deep sub-aggregations. Because it should be rare to use sub-aggregations with significant terms, we opted to always choose breadth first as opposed to exposing a `collect_mode` option. Closes #28652.	2019-04-11 15:56:02 -07:00
Nhat Nguyen	0f496842fd	Fix msu assertion in restore shard history test Since #40249, we always reinitialize max_seq_no_of_updates to max_seq_no when a promoting primary restores history regardless of whether it did rollback previously or not. Closes #40929	2019-04-11 18:44:13 -04:00
Ryan Ernst	5cdd87deb7	Remove settings members from Node (#40811 ) This commit removes the settings member variable from Node. This member made it confusing which settings should actually be looked at. Now all settings are accessed through the final environment.	2019-04-11 13:59:54 -07:00
David Turner	b522de975d	Move primary term from replicas proxy to repl op (#41119 ) A small refactoring that removes the primaryTerm field from ReplicasProxy and instead passes it directly in to the methods that need it. Relates #40706.	2019-04-11 21:19:27 +01:00
Armin Braun	233df6b73b	Make Transport Shard Bulk Action Async (#39793 ) (#41112 ) This is a dependency of #39504 Motivation: By refactoring `TransportShardBulkAction#shardOperationOnPrimary` to async, we enable using `DeterministicTaskQueue` based tests to run indexing operations. This was previously impossible since we were blocking on the `write` thread until the `update` thread finished the mapping update. With this change, the mapping update will trigger a new task in the `write` queue instead. This change significantly enhances the amount of coverage we get from `SnapshotResiliencyTests` (and other potential future tests) when it comes to tracking down concurrency issues with distributed state machines. The logical change is effectively all in `TransportShardBulkAction`, the rest of the changes is then simply mechanically moving the caller code and tests to being async and passing the `ActionListener` down. Since the move to async would've added more parameters to the `private static` steps in this logic, I decided to inline and dry up (between delete and update) the logic as much as I could instead of passing the listener + wait-consumer down through all of them.	2019-04-11 16:01:52 +02:00
Jason Tedor	24446ceae0	Add packaging to cluster stats response (#41048 ) This commit adds a packaging_types field to the cluster stats response that outlines the build flavors and types present in a cluster.	2019-04-10 13:47:19 -04:00
Zachary Tong	e611334b2b	Add 7.0.1 version constant	2019-04-10 11:32:53 -04:00
Dimitrios Liappis	799541e068	Mute DateTimeUnitTests.testConversion (#40738 ) Due to #39617 Backport of #40086	2019-04-10 16:37:16 +03:00
Jim Ferenczi	4263a28039	Fix rewrite of inner queries in DisMaxQueryBuilder (#40956 ) This commit implements missing rewrite for the DisMaxQueryBuilder. Closes #40953	2019-04-10 11:38:16 +02:00
Jason Tedor	3aae98f922	Add debug logging for leases sync on recovery test This commit adds some debug logging for a retention leases sync on recovery test.	2019-04-09 22:59:22 -04:00
Julie Tibshirani	d38214060e	Mute ClusterDisruptionIT#testCannotJoinIfMasterLostDataFolder. Tracked in #41047.	2019-04-09 17:36:21 -07:00
Julie Tibshirani	a417905098	Mute RareClusterStateIT#testDelayedMappingPropagationOnPrimary as we await a fix. Tracked in #41030.	2019-04-09 13:41:53 -07:00
Julie Tibshirani	a0fc2461d7	Mute DedicatedClusterSnapshotRestoreIT#testSnapshotWithStuckNode as we await a fix.	2019-04-09 12:03:33 -07:00
Mark Vieira	1287c7d91f	[Backport] Replace usages RandomizedTestingTask with built-in Gradle Test (#40978 ) (#40993 ) * Replace usages RandomizedTestingTask with built-in Gradle Test (#40978) This commit replaces the existing RandomizedTestingTask and supporting code with Gradle's built-in JUnit support via the Test task type. Additionally, the previous workaround to disable all tasks named "test" and create new unit testing tasks named "unitTest" has been removed such that the "test" task now runs unit tests as per the normal Gradle Java plugin conventions. (cherry picked from commit 323f312bbc829a63056a79ebe45adced5099f6e6) * Fix forking JVM runner * Don't bump shadow plugin version	2019-04-09 11:52:50 -07:00
Jason Tedor	321f93c4f9	Wait for all listeners in checkpoint listeners test It could be that we try to shutdown the executor pool before all the listeners have been invoked. It can happen that one was not invoked if it timed out and was in the process of being notified that it timed out on the executor. If we do this shutdown then, a listener will be met with rejected execution exception. To address this, we first wait until all listeners have been notified (or timed out) before proceeding with shutting down the executor. Relates #40970	2019-04-09 14:27:09 -04:00
Henning Andersen	c5a77e5d8c	Node repurpose tool docs (#40525 ) Added documentation for node repurpose tool and included documentation on how to repurpose nodes safely. Adjusted order of tools in `elasticsearch-node` tool since the repurpose tool is most likely to be used. Co-Authored-By: David Turner <david.turner@elastic.co>	2019-04-09 15:07:37 +02:00
David Turner	08ecdfe20e	Short-circuit rebalancing when disabled (#40966 ) Today if `cluster.routing.rebalance.enable: none` then rebalancing is disabled, but we still execute `balanceByWeights()` and perform some rather expensive calculations before discovering that we cannot rebalance any shards. In a large cluster this can make cluster state updates occur rather slowly. With this change we check earlier whether rebalancing is globally disabled and, if so, avoid the rebalancing process entirely. Relates #40942 which was reverted because of egregiously faulty tests.	2019-04-09 07:59:52 +01:00
Nhat Nguyen	69421612e5	Mute testRecoverMissingAnalyzer Tracked at #40867	2019-04-08 22:47:20 -04:00

1 2 3 4 5 ...

2866 Commits