OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Turner	935f70c05e	Handle serialization exceptions during publication (#41781 ) Today if an exception is thrown when serializing a cluster state during publication then the master enters a poisoned state where it cannot publish any more cluster states, but nor does it stand down as master, yielding repeated exceptions of the following form: ``` failed to commit cluster state version [12345] org.elasticsearch.cluster.coordination.FailedToCommitClusterStateException: publishing failed at org.elasticsearch.cluster.coordination.Coordinator.publish(Coordinator.java:1045) ~[elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.service.MasterService.publish(MasterService.java:252) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.service.MasterService.runTasks(MasterService.java:238) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.service.MasterService$Batcher.run(MasterService.java:142) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.service.TaskBatcher.runIfNotProcessed(TaskBatcher.java:150) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.service.TaskBatcher$BatchedTask.run(TaskBatcher.java:188) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:681) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedEsThreadPoolExecutor.java:252) [elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:215) [elasticsearch-7.0.0.jar:7.0.0] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_144] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_144] at java.lang.Thread.run(Thread.java:748) [?:1.8.0_144] Caused by: org.elasticsearch.cluster.coordination.CoordinationStateRejectedException: cannot start publishing next value before accepting previous one at org.elasticsearch.cluster.coordination.CoordinationState.handleClientValue(CoordinationState.java:280) ~[elasticsearch-7.0.0.jar:7.0.0] at org.elasticsearch.cluster.coordination.Coordinator.publish(Coordinator.java:1030) ~[elasticsearch-7.0.0.jar:7.0.0] ... 11 more ``` This is because it already created the publication request using `CoordinationState#handleClientValue()` but then it fails before accepting it. This commit addresses this by performing the serialization before calling `handleClientValue()`. Relates #41090, which was the source of such a serialization exception.	2019-05-07 17:53:12 +01:00
Alan Woodward	4cca1e8fff	Correct spelling of MockLogAppender.PatternSeenEventExpectation (#41893 ) The class was called PatternSeenEventExcpectation. This commit is a straight class rename to correct the spelling.	2019-05-07 17:28:51 +01:00
Ryan Ernst	e9e4bae683	Fix fractional seconds for strict_date_optional_time (#41871 ) The fractional seconds portion of strict_date_optional_time was accidentally copied from the printer, which always prints at least 3 fractional digits. This commit fixes the formatter to allow 1 or 2 fractional seconds. closes #41633	2019-05-07 09:09:30 -07:00
Henning Andersen	f068a22f5f	SeqNo CAS linearizability (#38561 ) Add a test that stresses concurrent writes using ifSeqno/ifPrimaryTerm to do CAS style updates. Use linearizability checker to verify linearizability. Linearizability of successful CAS'es is guaranteed. Changed linearizability checker to allow collecting history concurrently. Changed unresponsive network simulation to wake up immediately when network disruption is cleared to ensure tests proceed in a timely manner (and this also seems more likely to provoke issues).	2019-05-07 14:04:38 +02:00
Jim Ferenczi	70bf432fa8	Fix full text queries test that start with now (#41854 ) Full text queries that start with now are not cacheable if they target a date field. However we assume in the query builder tests that all queries are cacheable and this assumption fails when the randomly generated query string starts with "now". This has failed only twice in several years since the probability that a random string starts with "now" is low, but this commit ensures that isCacheable is correctly checked for full text queries that fall into this edge case. Closes #41847	2019-05-06 19:08:30 +02:00
Przemyslaw Gomulka	79b7ce8697	Fix javadoc in WrapperQueryBuilder backport(41641) #41849 missing brackets in javadoc backports #41641	2019-05-06 17:55:11 +02:00
Henning Andersen	227d5e15fb	ReadOnlyEngine assertion fix (#41842 ) Fixed the assertion that maxSeqNo == globalCheckpoint to actually check against the global checkpoint.	2019-05-06 16:11:38 +02:00
Hicham Mallah	4a88da70c5	Add index name to cluster block exception (#41489 ) Updates the error message to reveal the index name that is causing it. Closes #40870	2019-05-04 19:11:59 -04:00
Nhat Nguyen	c7924014fa	Verify consistency of version and source in disruption tests (#41614 ) (#41661 ) With this change, we will verify the consistency of version and source (besides id, seq_no, and term) of live documents between shard copies at the end of disruption tests.	2019-05-03 18:47:14 -04:00
Nhat Nguyen	e61469aae6	Noop peer recoveries on closed index (#41400 ) If users close an index to change some non-dynamic index settings, then the current implementation forces replicas of that closed index to copy over segment files from the primary. With this change, we make peer recoveries of closed index skip both phases. Relates #33888 Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2019-05-03 12:07:37 -04:00
Issam EL-ATIF	23706d4cdf	Update error message for allowed characters in aggregation names (#41573 ) Exception message thrown when specifying illegal characters did not accurately describe the allowed characters. This updates the error message to reflect reality (any character except [, ] and >)	2019-05-03 11:55:09 -04:00
Jason Tedor	03c959f188	Upgrade keystore on package install (#41755 ) When Elasticsearch is run from a package installation, the running process does not have permissions to write to the keystore. This is because of the root:root ownership of /etc/elasticsearch. This is why we create the keystore if it does not exist during package installation. If the keystore needs to be upgraded, that is currently done by the running Elasticsearch process. Yet, as just mentioned, the Elasticsearch process would not have permissions to do that during runtime. Instead, this needs to be done during package upgrade. This commit adds an upgrade command to the keystore CLI for this purpose, and that is invoked during package upgrade if the keystore already exists. This ensures that we are always on the latest keystore format before the Elasticsearch process is invoked, and therefore no upgrade would be needed then. While this bug has always existed, we have not heard of reports of it in practice. Yet, this bug becomes a lot more likely with a recent change to the format of the keystore to remove the distinction between file and string entries.	2019-05-03 10:34:30 -04:00
David Turner	873d0020a5	Reject null customs at build time (#41782 ) Today you can add a null `Custom` to the cluster state or its metadata, but attempting to publish such a cluster state will fail. Unfortunately, the publication-time failure gives very little information about the source of the problem. This change causes the failure to manifest earlier and adds information about which `Custom` was null in order to simplify the investigation. Relates #41090.	2019-05-03 14:52:32 +02:00
Jack Conradson	025619bbf1	Improve error message for ln/log with negative results in function score This changes the error message for a negative result in a function score when using the ln modifier to suggest using ln1p or ln2p when a negative result occurs in a function score and for the log modifier to suggest using log1p or log2p. This relates to #41509	2019-05-02 16:31:25 -07:00
Jason Tedor	d0f071236a	Simplify filtering addresses on interfaces (#41758 ) This commit is a refactoring of how we filter addresses on interfaces. In particular, we refactor all of these methods into a common private method. We also change the order of logic to first check if an address matches our filter and then check if the interface is up. This is to possibly avoid problems we are seeing where devices are flapping up and down while we are checking for loopback addresses. We do not expect the loopback device to flap up and down so by reversing the logic here we avoid that problem on CI machines. Finally, we expand the error message when this does occur so that we know which device is flapping.	2019-05-02 16:36:27 -04:00
Colin Goodheart-Smithe	ab9154005b	Adds version 6.7.3	2019-05-02 17:36:23 +01:00
Tim Brooks	b4bcbf9f64	Support http read timeouts for transport-nio (#41466 ) This is related to #27260. Currently there is a setting http.read_timeout that allows users to define a read timeout for the http transport. This commit implements support for this functionality with the transport-nio plugin. The behavior here is that a repeating task will be scheduled for the interval defined. If there have been no requests received since the last run and there are no inflight requests, the channel will be closed.	2019-05-02 09:48:52 -06:00
David Turner	b189596631	Add details to BulkShardRequest#getDescription() (#41711 ) Today a bulk shard request appears as follows in the detailed task list: requests[42], index[my_index] This change adds the shard index and refresh policy too: requests[42], index[my_index][2], refresh[IMMEDIATE]	2019-05-02 08:29:25 +02:00
Andy Bristol	b9e44288d3	mute NodeTests#testCloseOnInterruptibleTask For #41448	2019-05-01 13:24:22 -07:00
Jason Tedor	39b0b5809d	Fix minimum compatible version after 6.8 This commit fixes the minimum compatible version after the introduction of 6.8.	2019-05-01 16:21:13 -04:00
Jay Modi	7f7eb7b679	Add version 7.0.2 to 7.x branch (#41715 )	2019-05-01 15:23:53 -04:00
Jason Tedor	f08ac103ee	Add 6.8 version constant This commit adds the 6.8 version constant to the 7.x branch.	2019-05-01 13:38:58 -04:00
Jason Tedor	7f3ab4524f	Bump 7.x branch to version 7.2.0 This commit adds the 7.2.0 version constant to the 7.x branch, and bumps BWC logic accordingly.	2019-05-01 13:38:57 -04:00
Henning Andersen	c6abe74dd6	Close and acquire commit during reset engine fix (#41584 ) (#41709 ) If closing a shard while resetting engine, IndexEventListener.afterIndexShardClosed would be called while there is still an active IndexWriter on the shard. For integration tests, this leads to an exception during check index called from MockFSIndexStore.Listener. Fixed. Relates to #38561	2019-05-01 15:22:24 +02:00
Jason Tedor	26c72c96bd	Fix imports in KeyStoreWrapperTests This commit addresses a checkstyle violation in KeyStoreWrapperTests, removing a leftover import.	2019-05-01 07:21:23 -04:00
Jason Tedor	0b46a62f6b	Drop distinction in entries for keystore (#41701 ) Today we allow adding entries from a file or from a string, yet we internally maintain this distinction such that if you try to add a value from a file for a setting that expects a string or add a value from a string for a setting that expects a file, you will have a bad time. This causes a pain for operators such that for each setting they need to know this difference. Yet, we do not need to maintain this distinction internally as they are bytes after all. This commit removes that distinction and includes logic to upgrade legacy keystores.	2019-05-01 07:02:04 -04:00
Nhat Nguyen	887f3f2c83	Simplify initialization of max_seq_no of updates (#41161 ) Today we choose to initialize max_seq_no_of_updates on primaries only so we can deal with a situation where a primary is on an old node (before 6.5) which does not have MSU while replicas are on new nodes (6.5+). However, this strategy is quite complex and can lead to bugs (for example #40249) since we have to assign a correct value (not too low) to MSU in all possible situations (before recovering from translog, restoring history on promotion, and handing off relocation). Fortunately, we don't have to deal with this BWC in 7.0+ since all nodes in the cluster should have MSU. This change simplifies the initialization of MSU by always assigning it a correct value in the constructor of Engine regardless of whether it's a replica or primary. Relates #33842	2019-04-30 15:14:52 -04:00
Igor Motov	10ab838106	Geo: Add GeoJson parser to libs/geo classes (#41575 ) (#41657 ) Adds GeoJson parser for Geometry classes defined in libs/geo. Relates #40908 and #29872	2019-04-29 19:43:31 -04:00
Alan Woodward	a01f451ef7	Limit complexity of IntervalQueryBuilderTests#testRandomSource() (#41538 ) IntervalsSources can throw IllegalArgumentExceptions if they would produce too many disjunctions. To mitigate against this when building random sources, we limit the depth of the randomly generated source to four nested sources Fixes #41402	2019-04-29 13:31:19 +01:00
Dan Hermann	b23709b178	Applies the same naming restrictions to repositories as to snapshots except that leading underscores and uppercase characters are permitted. (#41585 ) Fixes #40817.	2019-04-29 07:31:01 -05:00
Armin Braun	6e51b6f96d	Add Repository Consistency Assertion to SnapshotResiliencyTests (#41631 ) * Add Repository Consistency Assertion to SnapshotResiliencyTests (#40857) * Add Repository Consistency Assertion to SnapshotResiliencyTests * Add some quick validation on not leaving behind any dangling metadata or dangling indices to the snapshot resiliency tests * Added todo about expanding this assertion further * Fix SnapshotResiliencyTest Repo Consistency Check (#41332) * Fix SnapshotResiliencyTest Repo Consistency Check * Due to the random creation of an empty `extra0` file by the Lucene mockFS we see broken tests because we use the existence of an index folder in assertions and the index deletion doesn't go through if there are extra files in an index folder * Fixed by removing the `extra0` file and resulting empty directory trees before asserting repo consistency * Closes #41326 * Reenable SnapshotResiliency Test (#41437) This was fixed in https://github.com/elastic/elasticsearch/pull/41332 but I forgot to reenable the test. * fix compile on java8	2019-04-29 12:01:58 +02:00
Nhat Nguyen	615a0211f0	Recovery should not indefinitely retry on mapping error (#41099 ) A stuck peer recovery in #40913 reveals that we indefinitely retry on new cluster states if indexing translog operations hits a mapper exception. We should not wait and retry if the mapping on the target is as recent as the mapping that the primary used to index the replaying operations. Relates #40913	2019-04-27 10:55:08 -04:00
Michael Morello	75283294f5	Fix multi-node parsing in voting config exclusions REST API (#41588 ) Fixes an issue where multiple nodes were not properly parsed in the voting config exclusions REST API. Closes #41587	2019-04-27 12:20:03 +02:00
Nick Knize	113b24be4b	Refactor GeoHashUtils (#40869 ) This commit refactors GeoHashUtils class into a new Geohash utility class located in the ES geo library. The intent is to not only better control what geo methods are whitelisted for painless scripting but to clean up the geo utility API in general.	2019-04-26 10:06:36 -05:00
Armin Braun	aad33121d8	Async Snapshot Repository Deletes (#40144 ) (#41571 ) Motivated by slow snapshot deletes reported in e.g. #39656 and the fact that these likely are a contributing factor to repositories accumulating stale files over time when deletes fail to finish in time and are interrupted before they can complete. * Makes snapshot deletion async and parallelizes some steps of the delete process that can be safely run concurrently via the snapshot thread pool * I did not take the biggest potential speedup step here and parallelize the shard file deletion because that's probably better handled by moving to bulk deletes where possible (and can still be parallelized via the snapshot pool where it isn't). Also, I wanted to keep the size of the PR manageable. * See https://github.com/elastic/elasticsearch/pull/39656#issuecomment-470492106 * Also, as a side effect this gives the `SnapshotResiliencyTests` a little more coverage for master failover scenarios (since parallel access to a blob store repository during deletes is now possible since a delete isn't a single task anymore). * By adding a `ThreadPool` reference to the repository this also lays the groundwork to parallelizing shard snapshot uploads to improve the situation reported in #39657	2019-04-26 15:36:09 +02:00
Armin Braun	7824f60a34	Simplify Snapshot Resiliency Test (#40930 ) (#41565 ) * Thanks to #39793 dynamic mapping updates don't contain blocking operations anymore so we don't have to manually put the mapping in this test and can keep it a little simpler	2019-04-26 10:59:09 +02:00
Christoph Büscher	078936b8f5	Remove search analyzers from DocumentFieldMappers (#41484 ) These references seem to be unused except for tests and should be removed to keep the places we store analyzers limited.	2019-04-26 09:48:48 +02:00
Armin Braun	6a24fd3f26	Add Restore Operation to SnapshotResiliencyTests (#40634 ) (#41546 ) * Add Restore Operation to SnapshotResiliencyTests * Expand the successful snapshot test case to also include restoring the snapshot * Add indexing of documents as well to be able to meaningfully verify the restore * This is part of the larger effort to test eventually consistent blob stores in #39504	2019-04-26 09:04:34 +02:00
Christoph Büscher	52495843cc	[Docs] Fix common word repetitions (#39703 )	2019-04-25 20:47:47 +02:00
Armin Braun	23b3741618	Remove Exists Check from S3 Repository Deletes (#40931 ) (#41534 ) * The check doesn't add much if anything practically, since the S3 repository is eventually consistent and we only log the non-existence of a blob anyway * We don't do the check on writes for this very reason and documented it as such * Removing the check saves one API call per single delete speeding up the deletion process and lowering costs	2019-04-25 18:25:03 +02:00
Jim Ferenczi	6184efaff6	Handle unmapped fields in _field_caps API (#34071 ) (#41426 ) Today the `_field_caps` API returns the list of indices where a field is present only if this field has different types within the requested indices. However if the request is an index pattern (or an alias, or both...) there is no way to infer the indices if the response contains only fields that have the same type in all indices. This commit changes the response to always return the list of indices in the response. It also adds a way to retrieve unmapped field in a specific section per field called `unmapped`. This section is created for each field that is present in some indices but not all if the parameter `include_unmapped` is set to true in the request (defaults to false).	2019-04-25 18:13:48 +02:00
Armin Braun	40aef2b8aa	Introduce Delegating ActionListener Wrappers (#40129 ) (#41527 ) * Introduce Delegating ActionListener Wrappers * Dry up use cases of ActionListener that simply pass through the response or exception to another listener	2019-04-25 16:05:04 +02:00
Ignacio Vera	d119abdf96	Improve accuracy for Geo Centroid Aggregation (#41514 ) keeps the partial results as doubles and uses Kahan summation to help reduce floating point errors.	2019-04-25 15:25:48 +02:00
Armin Braun	cd830b53e2	Name Snapshot Data Blobs by UUID (#40652 ) (#41523 ) * Name Snapshot Data Blobs by UUID * There is no functional reason why we need incremental naming for these files but * As explained in #38941 it is a possible source of corrupting the repository * It wastes API calls for the list operation * Is just needless complication * Since we store the exact names of the data blobs in all the metadata anyway, we can make this change without any BwC considerations * Even on the worst case scenario of a downgrade the functionality would continue working since the incremental names wouldn't conflict with the uuids and the number parsing for finding the next incremental name suppresses the exception when encountering a non-numeric value after the double underscore prefix	2019-04-25 13:18:03 +02:00
Luca Cavanna	8a0e5f7b87	Deprecate support for first line empty in msearch API (#41442 ) In order to support empty action metadata in the first msearch item, we need to remove support for prepending msearch request body with an empty line, which prevents us from parsing the empty line as action metadata for the first search item. Relates to #41011	2019-04-25 12:45:18 +02:00
Przemyslaw Gomulka	906f88029b	Remove the test which is testing java and joda api backport(#41493 ) #41518 The test is testing the java time API and fails in case it hits daylight saving time changes. Java time has the right implementation and we don't need to test this. more details on how the test was affected by the DST change on this comment closes #39617 backport(#41493)	2019-04-25 12:21:01 +02:00
Armin Braun	7c819fd2aa	Fix BulkRejectionIT (#41446 ) (#41500 ) * Due to #40866 one of the two parallel bulk requests can randomly be rejected outright when the write queue is full already, we can catch this situation and ignore it since we can still have the rejection for the dynamic mapping update for the other request and it's somewhat rare to run into this anyway * Closes #41363	2019-04-24 20:46:21 +02:00
Zachary Tong	ec5dd0594f	Disallow null/empty or duplicate composite sources (#41359 ) Adds some validation to prevent duplicate source names from being used in the composite agg. Also refactored to use a ConstructingObjectParser and removed the private ctor and setter for sources, making it mandatory.	2019-04-24 13:23:31 -04:00
Armin Braun	1db9166ea0	Fix Broken Index Shard Snapshot File Preventing Snapshot Creation (#41310 ) (#41473 ) * The problem here is that if we run into a corrupted index-N file, instead of generating a new index-(N+1) file, we instead set the newest index generation to -1 and thus tried to create `index-0` * If `index-0` is corrupt, this prevents us from ever creating a new snapshot using the broken shard, because we are unable to create `index-0` since it already exists * Fixed by still using the index generation for naming the next index file, even if it was a broken index file * Added test that makes sure restoring as well as snapshotting on top of the broken shard index file work as expected * closes #41304	2019-04-24 18:39:17 +02:00
Armin Braun	381b8e2ece	Fix BulkProcessor Retry ITs (#41338 ) (#41472 ) * The test fails for the retry backoff enabled case because the retry handler in the bulk processor hasn't been adjusted to account for #40866 which now might lead to an outright rejection of the request instead of its items individually * Fixed by adding retry functionality to the top level request as well * Also fixed the duplicate test for the HLRC that wasn't handling the non-backoff case yet the same way the non-client IT did * closes #41324	2019-04-24 13:46:32 +02:00
Jason Tedor	65af47eb31	Introduce aliases version (#41397 ) This commit introduces aliases versions to index metadata. This will be useful in CCR when we replicate aliases.	2019-04-23 12:19:11 -04:00
David Roberts	7e2aec022d	[TEST] Mute BulkRejectionIT.testBulkRejectionAfterDynamicMappingUpdate Due to https://github.com/elastic/elasticsearch/issues/41363	2019-04-23 15:58:38 +01:00
David Roberts	d8a2970fa4	[TEST] Mute RemoteClusterServiceTests.testCollectNodes Due to https://github.com/elastic/elasticsearch/issues/41067	2019-04-23 15:13:01 +01:00
David Turner	0bb15d3dac	Allow ops to be blocked after primary promotion (#41360 ) Today we assert that there are no operations in flight in this test. However we will sometimes be in a situation where the operations are blocked, and we distinguish these cases since #41271 causing the assertion to fail. This commit addresses this by allowing operations to be blocked sometimes after a primary promotion. Fixes #41333.	2019-04-19 07:48:43 +01:00
Jim Ferenczi	8f73e1e883	Fix unmapped field handling in the composite aggregation (#41280 ) The `composite` aggregation maps unknown fields as numerics, this means that any `after` value that is set on a query with an unmapped field on some indices will fail if the provided value is not numeric. This commit changes the default value source to use keyword instead in order to be able to parse any type of after values.	2019-04-18 23:08:13 +02:00
Jim Ferenczi	754037b71e	Unified highlighter should ignore terms that targets the _id field (#41275 ) The `_id` field uses a binary encoding to index terms that is not compatible with the utf8 automaton that the unified highlighter creates to reanalyze the input. For this reason this commit ignores terms that target the `_id` field when `require_field_match` is set to false. Closes #37525	2019-04-18 22:31:23 +02:00
Jim Ferenczi	068f8ba223	more_like_this query to throw an error if the like fields is not provided (#40632 ) With the removal of the `_all` field the `mlt` query cannot infer a field name to use to analyze the provided (un)like text if the `fields` parameter is not explicitly set in the query and the `index.query.default_field` is not changed in the index settings (by default it is set to ``). For this reason the like text is ignored and queries are only built from the provided document ids. This change fixes this bug by throwing an error if the fields option is not set and the `index.query.default_field` is equal to ``. The error is thrown only if like or unlike texts are provided in the query.	2019-04-18 22:30:22 +02:00
Simon Willnauer	11dc9fe249	Mark searcher as accessed in acquireSearcher (#41335 ) This fixes an issue where every N seconds a slow search request is triggered since the searcher access time is not set unless the shard is idle. This change moves to a more pro-active approach setting the searcher as accessed all the time.	2019-04-18 19:14:50 +02:00
Adrien Grand	a699cb76a5	Fix javadoc tag. (#41330 ) s/returns/return/	2019-04-18 14:41:09 +02:00
Armin Braun	389a13b68e	Mute BulkProcessorRetryIT#testBulkRejectionLoadWithBackoff (#41325 ) (#41331 ) * For #41324	2019-04-18 11:55:28 +02:00
Alpar Torok	a4a4259cac	Mute failing test Tracking #41326	2019-04-18 09:26:20 +03:00
Armin Braun	c77e10b16b	Handle Bulk Requests on Write Threadpool (#40866 ) (#41315 ) * Bulk requests can be thousands of items large and take more than O(10ms) time to handle => we should not handle them on the transport threadpool to not block select loops * relates #39128 * relates #39658	2019-04-18 07:10:23 +02:00
David Turner	946baf87d3	Assert TransportReplicationActions acquire permits (#41271 ) Today we do not distinguish "no operations in flight" from "operations are blocked", since both return `0` from `IndexShard#getActiveOperationsCount()`. We therefore cannot assert that every `TransportReplicationAction` performs its actions under permit(s). This commit fixes this by returning `IndexShard#OPERATIONS_BLOCKED` if operations are blocked, allowing these two cases to be distinguished.	2019-04-17 23:05:03 +02:00
Zachary Tong	7e62ff2823	[Rollup] Validate timezones based on rules not string comparision (#36237 ) The date_histogram internally converts obsolete timezones (such as "Canada/Mountain") into their modern equivalent ("America/Edmonton"). But rollup just stored the TZ as provided by the user. When checking the TZ for query validation we used a string comparison, which would fail due to the date_histo's upgrading behavior. Instead, we should convert both to a TimeZone object and check if their rules are compatible.	2019-04-17 13:46:44 -04:00
Christoph Büscher	4d964194db	Fix error applying `ignore_malformed` to boolean values (#41261 ) The `ignore_malformed` option currently works on numeric fields only when the bad value isn't a string value but not if it is a boolean. In this case we get a parsing error from the xContent parser which we need to catch in addition to the field mapper. Closes #11498	2019-04-17 18:44:57 +02:00
David Turner	2670ed2f8f	Assert the stability of custom search preferences (#41150 ) Today the `?preference=custom_string_value` search preference will only change its choice of a shard copy if something changes the `IndexShardRoutingTable` for that specific shard. Users can use this behaviour to route searches to a consistent set of shard copies, which means they can reliably hit copies with hot caches, and use the other copies only for redundancy in case of failure. However we do not assert this property anywhere, so we might break it in future. This commit adds a test that shows that searches are routed consistently even if other indices are created/rebalanced/deleted. Relates https://discuss.elastic.co/t/176598, #41115, #26791	2019-04-17 17:47:44 +02:00
Nhat Nguyen	2ee87c99d9	Fix bwc version of sanity check of read only engine Relates #41041	2019-04-17 10:25:47 -04:00
Nhat Nguyen	aa0c957a4a	Do not trim unsafe commits when open readonly engine (#41041 ) Today we always trim unsafe commits (whose max_seq_no >= global checkpoint) before starting a read-write or read-only engine. This is mandatory for read-write engines because they must start with the safe commit. This is also fine for read-only engines since in most cases we should have exactly one commit after closing an index (trimming is a noop). However, this is dangerous for following indices which might have more than one commit when they are being closed. With this change, we move the trimming logic to the ctor of InternalEngine so we won't trim anything if we are going to open a read-only engine.	2019-04-17 10:16:12 -04:00
Adrien Grand	f7e590ce0d	ProfileScorer should propagate `setMinCompetitiveScore`. (#40958 ) (#41302 ) Currently enabling profiling disables top-hits optimizations, which is unfortunate: it would be nice to be able to notice the difference in method counts and timings depending on whether total hit counts are requested.	2019-04-17 16:11:14 +02:00
Adrien Grand	9fd5237fd4	Clean up Node#close. (#39317 ) (#41301 ) `Node#close` is pretty hard to rely on today: - it might swallow exceptions - it waits for 10 seconds for threads to terminate but doesn't signal anything if threads are still not terminated after 10 seconds This commit makes `IOException`s propagated and splits `Node#close` into `Node#close` and `Node#awaitClose` so that the decision what to do if a node takes too long to close can be done on top of `Node#close`. It also adds synchronization to lifecycle transitions to make them atomic. I don't think it is a source of problems today, but it makes things easier to reason about.	2019-04-17 16:10:53 +02:00
Jason Tedor	6566979c18	Always check for archiving broken index settings (#41209 ) Today we check if an index has broken settings when checking if an index needs to be upgraded. However, it can be the case that an index setting became broken even if an index is already upgraded to the current version if the user removed a plugin (or downgraded from the default distribution to the non-default distribution) while on the same version of Elasticsearch. In this case, some registered settings would go missing and the index would now be broken. Yet, we miss this check and instead of archiving the settings, the index becomes unassigned due to the missing settings. This commit addresses this by checking for broken settings whether or not the index is upgraded.	2019-04-17 07:00:23 -04:00
Christoph Büscher	badb7a22e0	Some cleanups in NoisyChannelSpellChecker (#40949 ) One of the two #getCorrections methods is only used in tests, so we can move it and any of the required helper methods to that test. Also reducing the visibility of several methods to package private since the class isn't used elsewhere outside the package.	2019-04-17 10:22:12 +02:00
David Turner	bfa06d963e	Do not create missing directories in readonly repo (#41249 ) Today we erroneously look for a node setting called `readonly` when deciding whether or not to create a missing directory in a filesystem repository. This change fixes this by using the repository setting instead. Closes #41009 Relates #26909	2019-04-17 09:43:14 +02:00
Yogesh Gaikwad	6a552c05fe	Use alias name from rollover request to query indices stats (#40774 ) (#41284 ) In `TransportRolloverAction`, before doing a rollover we resolve the source index name (the write index) from the alias in the rollover request. Before evaluating the conditions and executing the rollover action, we retrieve stats, but to do so we used the source index name resolved from the alias instead of the alias itself. This fails when the user is assigned a role with index privilege on the alias instead of the concrete index. This commit fixes this by using the alias from the request. After this change, verified that when we retrieve all the stats (including write + read indexes) we consider only the source index. Closes #40771	2019-04-17 14:15:05 +10:00
Jim Ferenczi	043c1f5d42	Unified highlighter should respect no_match_size with number_of_fragments set to 0 (#41069 ) The unified highlighter returns the first sentence of the text when number_of_fragments is set to 0 (full highlighting). This is a legacy of the removed postings highlighter that was based on sentence break only. This commit changes this behavior in order to respect the provided no_match_size value when number_of_fragments is set to 0. This means that the behavior will be consistent for any value of the number_of_fragments option. Closes #41066	2019-04-16 19:25:25 +02:00
Armin Braun	c4e84e2b34	Add Bulk Delete Api to BlobStore (#40322 ) (#41253 ) * Adds Bulk delete API to blob container * Implement bulk delete API for S3 * Adjust S3Fixture to accept both path styles for bulk deletes since the S3 SDK uses both during our ITs * Closes #40250	2019-04-16 17:19:05 +02:00
Jim Ferenczi	c22a2cea12	BlendedTermQuery should ignore fields that don't exist in the index (#41125 ) Today the blended term query detects if a term exists in a field by looking at the term statistics in the index. However, the value that indicates that a term has no occurrence in a field has changed in Lucene. A non-existing term now returns a doc and total term frequency of 0. Because of this discrepancy the blended term query picks 0 as the minimum frequency for a term even if other fields have documents for this term. This confuses the term queries that the blending creates since some of them contain a custom state that indicates a frequency of 0 even though the term has some occurrence in the field. For these terms an exception is thrown because the term query always checks that the term state's frequency is greater than 0 if there are documents associated with it. This change fixes this bug by ignoring terms with a doc freq of 0 when the blended term query picks the minimum term frequency among the requested fields. Closes #41118	2019-04-16 16:25:42 +02:00
David Turner	8577bbd73b	Inline TransportReplAct#createReplicatedOperation (#41197 ) `TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation` exists so it can be overridden in tests. This commit re-works these tests to use a real `ReplicationOperation` and inlines the now-unnecessary method. Relates #40706.	2019-04-16 13:36:29 +01:00
David Turner	10e58210a0	Validate cluster UUID when joining Zen1 cluster (#41063 ) Today we fail to join a Zen2 cluster if the cluster UUID does not match our own, but we do not perform the same validation when joining a Zen1 cluster. This means that a Zen2 node will pass join validation and be added to a Zen1 cluster but will reject all cluster states from the master. Relates #37775	2019-04-16 12:49:47 +01:00
Nhat Nguyen	8ee84f2268	Correct flush parameters in engine test Since #40213, we forbid a combination of flush parameters: force=true and wait_if_ongoing=false. Closes #41236	2019-04-16 05:04:31 -04:00
Christoph Büscher	f8161ffa88	Fix some `range` query edge cases (#41160 ) Currently we throw an error when a range query's minimum value exceeds the maximum value due to the fact that they are neighbouring values and both upper and lower value are excluded from the interval. Since this is a condition that the user usually doesn't specify consciously (at least in the case of float and double values it's difficult to see which values are adjacent) we should ignore those "wrong" intervals and create a MatchNoDocsQuery in those cases. We should still throw errors with an actionable message if the user specifies the query interval in a way that min value > max value. This PR adds those checks and tests for those cases. Closes #40937	2019-04-16 10:56:13 +02:00
Tim Brooks	ad3b7abaa3	Deprecate old transport settings (#41229 ) This is related to #36652. We intend to remove a number of old transport settings in 8.0. This commit deprecates those settings for 7.x.	2019-04-15 21:43:09 -06:00
Tim Brooks	56c00eecbc	Remove string usages of old transport settings (#41207 ) This is related to #36652. We intend to deprecate a number of transport settings in 7.x and remove them in 8.0. This commit removes the string usages of these settings.	2019-04-15 16:54:24 -06:00
Zachary Tong	f19b052e03	Better error messages when pipelines reference incompatible aggs (#40068 ) Pipelines require single-valued agg or a numeric to be returned. If they don't get that, they throw an exception. Unfortunately, this exception text is very confusing to users because it usually arises from pathing "through" multiple terms aggs. The final target is a numeric, but it's the intermediary aggs that cause the problem. This commit adds the current agg name to the exception message so the user knows which "level" is the issue.	2019-04-15 10:35:53 -04:00
Jim Ferenczi	d30fec4914	Full text queries should not always ignore unmapped fields (#41062 ) Full text queries ignore unmapped fields since https://github.com/elastic/elasticsearch/issues/41022 even if all fields in the query are unmapped. This change makes sure that we ignore unmapped fields only if they are mixed with mapped fields and returns a MatchNoDocsQuery otherwise. Closes #41022	2019-04-15 12:16:50 +02:00
Christoph Büscher	2980a6c70f	Clarify some ToXContent implementations behaviour (#41000 ) This change adds either ToXContentObject or ToXContentFragment to classes directly implementing ToXContent currently. This helps in reasoning about whether those implementations output full xcontent object or just fragments. Relates to #16347	2019-04-15 09:42:08 +02:00
Yogesh Gaikwad	e7375368d6	Remove nested loop in IndicesStatsResponse (#40988 ) (#41138 ) This commit removes nested loop in `getIndices`.	2019-04-13 04:36:29 +10:00
Ignacio Vera	8af930c468	Improve error message when a polygon contains the same point twice in non-consecutive positions (#41051 ) (#41133 ) When a polygon contains a self-intersection because the same point appears twice in non-consecutive positions, the polygon builder tries to split the polygon. During the split one of the polygons becomes invalid as it is not closed, and an error is thrown which is not related to the real issue. We detect this situation now and throw a more meaningful error.	2019-04-12 09:16:33 +02:00
Nhat Nguyen	e9999dfa1d	Init global checkpoint after copy commit in peer recovery (#40823 ) Today a new replica of a closed index does not have a safe commit invariant when its engine is opened because we won't initialize the global checkpoint on a recovering replica until the finalize step. With this change, we can achieve that property by creating a new translog with the global checkpoint from the primary at the end of phase 1.	2019-04-11 22:18:31 -04:00
Antonio Matarrese	79c7a57737	Use the breadth first collection mode for significant terms aggs. (#29042 ) This helps avoid memory issues when computing deep sub-aggregations. Because it should be rare to use sub-aggregations with significant terms, we opted to always choose breadth first as opposed to exposing a `collect_mode` option. Closes #28652.	2019-04-11 15:56:02 -07:00
Nhat Nguyen	0f496842fd	Fix msu assertion in restore shard history test Since #40249, we always reinitialize max_seq_no_of_updates to max_seq_no when a promoting primary restores history regardless of whether it did rollback previously or not. Closes #40929	2019-04-11 18:44:13 -04:00
Ryan Ernst	5cdd87deb7	Remove settings members from Node (#40811 ) This commit removes the settings member variable from Node. This member made it confusing which settings should actually be looked at. Now all settings are accessed through the final environment.	2019-04-11 13:59:54 -07:00
David Turner	b522de975d	Move primary term from replicas proxy to repl op (#41119 ) A small refactoring that removes the primaryTerm field from ReplicasProxy and instead passes it directly in to the methods that need it. Relates #40706.	2019-04-11 21:19:27 +01:00
Armin Braun	233df6b73b	Make Transport Shard Bulk Action Async (#39793 ) (#41112 ) This is a dependency of #39504 Motivation: By refactoring `TransportShardBulkAction#shardOperationOnPrimary` to async, we enable using `DeterministicTaskQueue` based tests to run indexing operations. This was previously impossible since we were blocking on the `write` thread until the `update` thread finished the mapping update. With this change, the mapping update will trigger a new task in the `write` queue instead. This change significantly enhances the amount of coverage we get from `SnapshotResiliencyTests` (and other potential future tests) when it comes to tracking down concurrency issues with distributed state machines. The logical change is effectively all in `TransportShardBulkAction`, the rest of the changes is then simply mechanically moving the caller code and tests to being async and passing the `ActionListener` down. Since the move to async would've added more parameters to the `private static` steps in this logic, I decided to inline and dry up (between delete and update) the logic as much as I could instead of passing the listener + wait-consumer down through all of them.	2019-04-11 16:01:52 +02:00
Jason Tedor	24446ceae0	Add packaging to cluster stats response (#41048 ) This commit adds a packaging_types field to the cluster stats response that outlines the build flavors and types present in a cluster.	2019-04-10 13:47:19 -04:00
Zachary Tong	e611334b2b	Add 7.0.1 version constant	2019-04-10 11:32:53 -04:00
Dimitrios Liappis	799541e068	Mute DateTimeUnitTests.testConversion (#40738 ) Due to #39617 Backport of #40086	2019-04-10 16:37:16 +03:00
Jim Ferenczi	4263a28039	Fix rewrite of inner queries in DisMaxQueryBuilder (#40956 ) This commit implements missing rewrite for the DisMaxQueryBuilder. Closes #40953	2019-04-10 11:38:16 +02:00
Jason Tedor	3aae98f922	Add debug logging for leases sync on recovery test This commit adds some debug logging for a retention leases sync on recovery test.	2019-04-09 22:59:22 -04:00
Julie Tibshirani	d38214060e	Mute ClusterDisruptionIT#testCannotJoinIfMasterLostDataFolder. Tracked in #41047.	2019-04-09 17:36:21 -07:00

2973 Commits