OpenSearch

Commit Graph

Author	SHA1	Message	Date
Igor Motov	260c11d89e	Add an additional cancellation check to the fetch phase (#62577 ) (#62587 ) In #62357 we introduced an additional optimization that allows us to skip the most of the fetch phase early if no results are found. This change caused some cancellation test failures that were relying on definitive cancellation during the fetch phase. This commit adds an additional quick cancellation check at the very beginning of the fetch phase to make cancellation process more deterministic. Fixes #62530	2020-09-18 10:00:36 -04:00
James Rodewig	9d6f94ffa3	[DOCS] EQL: Disallow chained comparisons (#62570 ) (#62625 )	2020-09-18 08:47:27 -04:00
Ignacio Vera	18a52f7477	Use BitArray instead of FixedBitSet for collecting ordinals in Cardinality Aggregator (#62600 ) (#62619 ) Changes the way we collecting ordinals in the Cardinality aggregation from Lucene FixedBitSet to BitArray. The benefit is that BitArray is tracked by our Circuit breakers so it is safer.	2020-09-18 14:16:31 +02:00
Tanguy Leroux	9f5e95505b	Also abort ongoing file restores when snapshot restore is aborted (#62441 ) (#62607 ) Today when a snapshot restore is aborted (for example when the index is explicitly deleted) while the restoration of the files from the repository has already started the file restores are not interrupted. It means that Elasticsearch will continue to read the files from the repository and will continue to write them to disk until all files are restored; the store will then be closed and files will be deleted from disk at some point but this can take a while. This will also take some slots in the SNAPSHOT thread pool too. The Recovery API won't show any files actively being recovered, the only notable indicator would be the active threads in the SNAPSHOT thread pool. This commit adds a check before reading a file to restore and before writing bytes on disk so that a closing store can be detected more quickly and the file recovery process aborted. This way the file restores just stops and for most of the repository implementations it means that no more bytes are read (see #62370 for S3), finishing threads in the SNAPSHOT thread pool more quickly too.	2020-09-18 14:04:58 +02:00
Nhat Nguyen	8bea6b3711	Increase keep alive of point in time in async search tests (#62593 ) Async search tests can take more than one minute due to the excessive trace logs. And the point in time in the tests can be expired the midway. Closes #62451	2020-09-18 08:01:57 -04:00
Martijn van Groningen	4c949b0869	Adjust allowed warnings in data stream yaml test. (#62610 )	2020-09-18 12:44:57 +02:00
Armin Braun	73d19271a9	Fix Races in testQueuedSnapshotOperationsAndBrokenRepoOnMasterFailOverMultipleRepos (#62431 ) (#62614 ) This test (in-part) verifies that snapshot creation is not retried on master fail-over once a snaphot has been started already. Unless we wait for the snapshot creation to show up in the cluster state before failing the master node though, we could run into a race where the snapshot wasn't yet in the cluster state and a retry goes through successfully.	2020-09-18 12:20:23 +02:00
Przemyslaw Gomulka	d87268a264	Round up parsers should be based on a list of parsers backport(#62290 ) (#62604 ) a dateformatter can be created with a list of parsers which are iterated during parsing and the first one that passes will return a parsed date. DateMathParser should do the same, when created based on a list of non-rounding parsers it should also iterate over all of them - it is at the moment only taking first element closing #62207	2020-09-18 12:03:20 +02:00
Martijn van Groningen	6e071580ff	Unmute rollover docs test (#62603 ) (#62605 ) The underlying issue was fixed a while ago in Lucene: https://issues.apache.org/jira/browse/LUCENE-9517 and went away when lucene snapshot version was upgraded. Also the name of the index to rollover had to be slightly changed, so that it doesn't collide with data stream template's namespace. (a regular index can't be created in the namespace that is managed by a template that creates data streams) Closes #62043	2020-09-18 11:11:13 +02:00
Marios Trivyzas	b072de4ce0	EQL: Disallow chained comparisons (#62567 ) (#62601 ) Expressions like `1 = 2 = 3 = 4` or `1 < 2 = 3 >= 4` were treated with leftmost priority: ((1 = 2) = 3) = 4 which can lead to confusing results. Since such expressions don't make so much change for EQL filters we disallow them in the parser to prevent unexpected results from their bad usage. Major DBs like PostgreSQL and Oracle also disallow them in their SQL syntax. (counter example would be MySQL which interprets them as we did before with leftmost priority). Fixes: #61654 (cherry picked from commit 8f94981bb093f104228d267b532e0a3d5b7f6a38)	2020-09-18 10:48:14 +02:00
Costin Leau	81f2f84177	EQL: Allow requests with size 0 (#62537 ) The purpose for this change is to allow validation of queries without having to actually execute them. The optimizer already picks up this case. Fix #62494 (cherry picked from commit 675889559b2f96a0c1faa6fc84fd537148ba2cce)	2020-09-18 11:24:39 +03:00
Martijn van Groningen	5190b0961d	adjust skip reason	2020-09-18 10:10:00 +02:00
Adrien Grand	4de8579455	Upgrade to lucene-8.7.0-snapshot-830bd186a8d. (#62596 )	2020-09-18 09:51:34 +02:00
David Turner	0a3f2c453f	Hide c.a.s.s.i.UseArnRegionResolver noise (#62522 ) A recent AWS SDK upgrade has introduced a new source of spurious `WARN` logs when the security manager prevents access to the user's home directory and therefore to `$HOME/.aws/config`. This is the behaviour we want, and it's harmless and handled by the SDK as if the config doesn't exist, so this log message is unnecessary noise. This commit suppresses this noisy logging by default. Relates #20313, #56346, #53962 Closes #62493	2020-09-18 08:30:39 +01:00
David Turner	06d5d360f9	Tidy up fillInStackTrace implementations (#62555 ) Removes the unnecessary `synchronized` introduced in #62433 and adjusts the others to return `this` not `null` as required by the parent method's Javadocs.	2020-09-18 08:29:48 +01:00
Martijn van Groningen	c83d8ce78e	Adjust skip version data stream test (#62597 ) after #62527 was backported.	2020-09-18 09:28:16 +02:00
David Turner	7324ee1044	Remove unused upgrade actions (#62552 ) These actions were almost completely removed in #40075 but a couple of classes were left in place. This commit completes their removal.	2020-09-18 08:16:13 +01:00
Ignacio Vera	6a3d731be1	Only call reduce on a single InternalAggregation when needed (#62525 ) (#62594 ) Adds a new abstract method in InternalAggregation that flags the framework if it needs to reduce on a single InternalAggregation.	2020-09-18 08:43:58 +02:00
Jake Landis	5b7246157f	[7.x] Fix projects that failed to build within Intellij (#62258 ) (#62408 ) This commit address some build failures from the perspective of Intellij. These changes include: * changing an order of a dependency definition that seems to can cause Intellij build to fail. * introduction of an abstract class out of the test source set (seems to be an issue sharing classes cross projects with non-standard source sets. * a couple of missing dependency definitions (not sure how the command line worked prior to this)	2020-09-17 17:45:12 -05:00
William Brafford	b764f8977e	Copy Key Certs for javaRestTest (#62584 )	2020-09-17 17:45:42 -04:00
Tim Vernum	ab427534f7	[DOCS] Add warning about derived API keys to docs (#62351 )	2020-09-17 13:46:25 -07:00
Dimitris Athanasiou	7118ff7976	[7.x][ML] Remove model snapshot legacy doc ids (#62434 ) (#62569 ) Removes methods that were no longer used regarding version 5.4 doc ids of ModelState. Also adds clean up of 5.4 model state and quantile docs in the daily maintenance. Backport of #62434	2020-09-17 23:43:28 +03:00
Nhat Nguyen	0127b71901	Adjust keep alive assertion in ShardSearchRequest (#62582 ) Relates #62184	2020-09-17 16:09:54 -04:00
Ryan Ernst	ede62d722f	Skip release build tests for external test modules (#62579 ) The tests don't make sense for release builds. closes #62435	2020-09-17 13:08:17 -07:00
Lee Hinman	9bb7ce0b22	[7.x] Allocate new indices on "hot" or "content" tier depending on data stream inclusion (#62338 ) (#62557 ) Backports the following commits to 7.x: Allocate new indices on "hot" or "content" tier depending on data stream inclusion (#62338)	2020-09-17 13:29:23 -06:00
Ryan Ernst	1b0e6caae6	Upgrade the bundled JDK to JDK 15 (#62580 ) This commit upgrades the bundled JDK to JDK 15.	2020-09-17 11:55:35 -07:00
William Brafford	5a0dca2491	Deprecate xpack.eql.enabled setting and make it a no-op (#61375 ) (#62491 ) * Deprecate xpack.eql.enabled and make it a no-op * Remove uses of xpack.eql.enabled	2020-09-17 14:17:27 -04:00
Martijn van Groningen	5f643433c6	Prohibit the usage of create index api in namespaces managed by data stream templates (#62574 ) Backport of #62527 to 7.x branch. This commit adds validation that prohibits the creation of regular indices in the namespace of templates with data streams enabled. It shouldn't be possible to create ordinary indices when the name of the index matches with a composable index template that enables data streams. Auto creation has logic that creates data streams instead of regular indices. However validation logic for the create index api was missing.	2020-09-17 20:10:42 +02:00
Jim Ferenczi	df93b31b15	Faster sequential access for stored fields (#62509 ) (#62573 ) Faster sequential access for stored fields Spinoff of #61806 Today retrieving stored fields at search time is optimized for random access. So we make no effort to keep state in order to not decompress the same data multiple times because two documents might be in the same compressed block. This strategy is acceptable when retrieving a top N sorted by score since there is no guarantee that documents will be on the same block. However, we have some use cases where the document to retrieve might be completely sequential: Scrolls or normal search sorted by document id. Queries on Runtime fields that extract from _source. This commit exposes a sequential stored fields reader in the custom leaf reader that we use at search time. That allows to leverage the merge instances of stored fields readers that are optimized for sequential access. This change focuses on the fetch phase for now and leverages the merge instances for stored fields only if all documents to retrieve are adjacent. Applying the same logic in the source lookup of runtime fields should be trivial but will be done in a follow up. The speedup on queries sorted by doc id is significant. I played with the scroll task of the http_logs rally track on my laptop and had the following result: \| Metric \| Task \| Baseline \| Contender \| Diff \| Unit \| \|--------------------------------------------------------------:\|-------:\|------------:\|------------:\|---------:\|--------:\| \| Total Young Gen GC \| \| 0.199 \| 0.231 \| 0.032 \| s \| \| Total Old Gen GC \| \| 0 \| 0 \| 0 \| s \| \| Store size \| \| 17.9704 \| 17.9704 \| 0 \| GB \| \| Translog size \| \| 2.04891e-06 \| 2.04891e-06 \| 0 \| GB \| \| Heap used for segments \| \| 0.820332 \| 0.820332 \| 0 \| MB \| \| Heap used for doc values \| \| 0.113979 \| 0.113979 \| 0 \| MB \| \| Heap used for terms \| \| 0.37973 \| 0.37973 \| 0 \| MB \| \| Heap used for norms \| \| 0.03302 \| 0.03302 \| 0 \| MB \| \| Heap used for points \| \| 0 \| 0 \| 0 \| MB \| \| Heap used for stored fields \| \| 0.293602 \| 0.293602 \| 0 \| MB \| \| Segment count \| \| 541 \| 541 \| 0 \| \| \| Min Throughput \| scroll \| 12.7872 \| 12.8747 \| 0.08758 \| pages/s \| \| Median Throughput \| scroll \| 12.9679 \| 13.0556 \| 0.08776 \| pages/s \| \| Max Throughput \| scroll \| 13.4001 \| 13.5705 \| 0.17046 \| pages/s \| \| 50th percentile latency \| scroll \| 524.966 \| 251.396 \| -273.57 \| ms \| \| 90th percentile latency \| scroll \| 577.593 \| 271.066 \| -306.527 \| ms \| \| 100th percentile latency \| scroll \| 664.73 \| 272.734 \| -391.997 \| ms \| \| 50th percentile service time \| scroll \| 522.387 \| 248.776 \| -273.612 \| ms \| \| 90th percentile service time \| scroll \| 573.118 \| 267.79 \| -305.328 \| ms \| \| 100th percentile service time \| scroll \| 660.642 \| 268.963 \| -391.678 \| ms \| \| error rate \| scroll \| 0 \| 0 \| 0 \| % \| Closes #62024	2020-09-17 19:58:18 +02:00
James Rodewig	c4d80ab7be	[DOCS] Clarify `http.max_content_length` def (#62562 ) (#62571 )	2020-09-17 13:13:37 -04:00
Andrei Dan	3753682877	Fix AllocationRoutedStep equals and hashcode (#62548 ) (#62559 ) (cherry picked from commit 79039e16305c7fb71ee012e693219a0d2b77e97b) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-17 17:40:21 +01:00
Alan Woodward	5421a743a7	Move SearchLookup into FetchContext (#62549 ) FetchSubPhase#getProcessor currently takes a SearchLookup parameter. This however is only needed by a couple of subphases, and will almost certainly change in future as we want to simplify how fetch phases retrieve values for individual hits. To future-proof against further signature changes, this commit moves the SearchLookup reference into FetchContext instead.	2020-09-17 17:39:02 +01:00
Alan Woodward	e3e3aef3d8	Load version metadata even when stored fields are disabled (#62533 ) Currently we throw an error if stored fields are disabled, but hit version metadata is requested on a search. This doesn't make much sense, as the version information is stored in docvalues and so has no connection with stored fields. This commit removes the link between the two, allowing version metadata to be loaded even when stored fields are disabled in a request. Fixes #62456	2020-09-17 17:39:02 +01:00
Adam Locke	0c92ed4a97	Removing erroneous screen capture. (#62561 ) (#62565 )	2020-09-17 12:37:32 -04:00
Dimitris Athanasiou	f5c28e2054	[7.x][ML] Do not start data frame analytics when too many docs are analyzed (#62547 ) (#62558 ) The data frame structure in c++ has a limit on 2^32 documents. This commit adds a check that the number of documents involved in the analysis are less than that and fails to start otherwise. That saves the cost of reindexing when it is unnecessary. Backport of #62547	2020-09-17 19:06:38 +03:00
Mark Vieira	7d36393b09	Disable composePull task on idp-fixture project due to error (#62510 )	2020-09-17 08:55:47 -07:00
Alan Woodward	91e2330529	Warn on badly-formed null values for date and IP field mappers (#62487 ) In #57666 we changed when null_value was parsed for ip and date fields. Previously, the null value was stored as a string, and parsed into a date or InetAddress whenever a document containing a null value was encountered. Now, the values are parsed when the mappings are built, which means that bad values are detected up front; if you try and add a mapping with a badly-parsed ip or date for a null_value, the mapping will be rejected. This causes problems for upgrades in the case when you have a badly-formed null_value in a pre-7.9 cluster. This commit fixes the upgrade case by changing the logic to only logging a warning on the badly formed value, replicating the earlier behaviour. Fixes #62363	2020-09-17 16:38:08 +01:00
Nik Everett	4d272a2a00	Runtime fields: fix a test name (#62498 ) This fixes the name of a test method so we actually run it. I broke it a few commits ago without realizing it.	2020-09-17 11:17:44 -04:00
Tanguy Leroux	e6777810ba	Fix S3BlobContainerRetriesTests (#62464 ) (#62551 ) The AssertingInputStream in S3BlobContainerRetriesTests verifies that InputStream are either fully consumed or aborted, but the eof flag is only set when the underlying stream returns it. When buffered read are executed and when the exact number of remaining bytes are read, the eof flag is not set to true. Instead the test should rely on the total number of bytes read to know if the stream has been fully consumed. Close #62390	2020-09-17 17:12:34 +02:00
Ignacio Vera	901000891a	Fix test error in InternalCardinalityTests#testEqualsAndHashcode (#62542 ) (#62554 ) Make sure the the new HLL++ is different to the original one	2020-09-17 17:09:13 +02:00
Lee Hinman	3081b3827b	[7.x] Add host.ip and observer.ip fields to the synthetics-- mappings (#62412 ) (#62553 ) We need to ensure these are mapped as 'ip' instead of a keyword, even if they do end up not being used. Relates to #62193	2020-09-17 09:01:53 -06:00
Lee Hinman	a636d106bf	[7.x] Remove data_frozen node role (tier) and frozen ILM phase (#62403 ) (#62465 ) Backports the following commits to 7.x: Remove data_frozen node role (tier) and frozen ILM phase (#62403)	2020-09-17 08:58:07 -06:00
Andrei Dan	fe1194d58f	[7.x] ILM migrate data between tiers (#61377 ) (#62536 ) This adds ILM support for automatically migrating the managed indices between data tiers. This proposal makes use of a MigrateAction that is injected (similar to how the Unfollow action is injected) in phases that don't define index allocation rules using the AllocateAction or don't explicitly define the MigrateAction itself (regardless if it's enabled or disabled). (cherry picked from commit c1746afffd61048d0c12d3a77e6d8191a804ed49) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-17 15:08:31 +01:00
Adam Locke	db9dd9f7e1	[DOCS] Updating CCR setup to be more tutorial focused (#62256 ) (#62499 ) * Applying some initial changes. * Updating intro and screenshots. * Removing unnecessary links, streamlining content, and adding GIF. * Adding what's next section. * Removing what's next. * Minor edits. * Apply suggestions from code review Co-authored-by: debadair <debadair@elastic.co> * Incorporating review feedback. * Moving CCR user privileges to another page, plus more edits. * Apply suggestions from code review Co-authored-by: debadair <debadair@elastic.co> * Incorporating more review feedback. * Adding TESTSETUP to fix build errors. * Update docs/reference/ccr/getting-started.asciidoc Co-authored-by: debadair <debadair@elastic.co> * Swapping GIF for mp4 hosted on web team CMS. * Removing GIF in favor of mp4. Co-authored-by: debadair <debadair@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: debadair <debadair@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-09-17 09:50:21 -04:00
Luca Cavanna	3cf559bf9c	Minor cleanup of runtime fields classes (#62531 ) This commit addresses some compiler warnings in the runtime fields classes	2020-09-17 15:47:05 +02:00
James Rodewig	567510da40	[DOCS] Fix link to Filebeat docs (#62519 ) (#62544 ) Co-authored-by: DeDe Morton <dede.morton@elastic.co>	2020-09-17 09:37:33 -04:00
James Rodewig	cd953272cd	[DOCS] EQL: Remove support for single quote strings (#62479 ) (#62543 )	2020-09-17 09:34:40 -04:00
David Kyle	417ce9396d	[ML] Add datafeed run time fields integration test (#62535 ) (#62538 )	2020-09-17 13:41:07 +01:00
Fernando Briano	d3bdff6bbf	Adds quotes to timestamp values in runtime_fields/40_date YAML test (#62526 )	2020-09-17 12:09:10 +01:00
Christoph Büscher	aba86d7d29	Fix condition in ILM step that cannot be met (#62377 ) (#62528 ) ReplaceDataStreamBackingIndexStep#performAction seems to perform an equality check on an original Index and the write indexes names, but because this compares an Index instance to a String, the condition can never be met. This PR changes this comparison.	2020-09-17 12:38:05 +02:00

... 6 7 8 9 10 ...

54054 Commits All Branches Search

54054 Commits

All Branches