OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jay Modi	1a13a0b10f	Watcher understands hidden expand wildcard value (#65372 ) Watcher has a search template that stores indices options to be used as part of a search during watch execution, but this was not updated to be aware of hidden indices and the `hidden` expand_wildcards option. This change makes use of the `IndicesOptions#toXContent` method in Watcher, which already handles the new value. Additionally, the XContent parsing is moved to the IndicesOptions class so that we will be less likely to miss updating this in the future. Closes #65148 Backport of #65332	2020-11-23 09:17:49 -07:00
Andrei Stefan	866a6afcdf	Extend the interval date comparison (#65348 ) (#65358 ) (cherry picked from commit acfb463892fdaf3f0deb679122b5e402c7b56418)	2020-11-23 15:24:36 +02:00
Armin Braun	7fbdcb5e00	Fix SearchableSnapshotsIntegTests.testCreateAndRestoreSearchableSnapshot (#65343 ) (#65351 ) The recovery stats assertions in this test ran without any waiting for the recoveries to actually finish. The fact that they ran after the concurrent searches checks generally meant that they would pass (because of searches warming caches + general relative slowness of searches) but there are no hard guarantees this will work reliably as the pre-fetch threads which will update the recovery state might still be slow to do so randomly, causing the assertions to trip. closes #65302	2020-11-23 12:30:18 +01:00
Armin Braun	b0cea04f95	Fix Broken Error Handling in CacheFile#acquire (#65342 ) (#65347 ) If we fail to create the `FileChannelReference` (e.g. because the directory it should be created in was deleted in a test) we have to remove the listener from the `listeners` set to not trip internal consistency assertions. Relates #65302 (does not fix it though, but reduces noise from failures by removing secondary tripped assertions after the test fails)	2020-11-23 08:57:24 +01:00
Armin Braun	67b6317488	Mute JdbcCsvSpecIT#testCurrentDateFilter (#65341 ) Muting for https://github.com/elastic/elasticsearch/issues/65336	2020-11-22 22:37:21 +01:00
Nik Everett	56605e4d9a	Fixup reduceRandom tests (#65263 ) In aa1ea96b8698aa12bed1c4e8d704882a2a639791 I made all `testReduceRandom` tests for aggs mimic production more precisely. More precisely, they pick the correct "lead" result when performing partial reduction. This is great, but, sadly, some tests assumed that we always reduced against the "first" aggregator. This fixes those tests. Closes #65163	2020-11-20 13:10:34 -05:00
Jay Modi	893e1a5282	Fix date math hidden index resolution (#65278 ) This commit updates the IndexAbstractionResolver so that hidden indices are properly resolved when date math is in use and when we are checking if the index is visible. Closes #65157 Backport of #65236	2020-11-19 12:40:14 -07:00
Nhat Nguyen	3989243a52	Stop renew retention leases when follow task fails (#65168 ) If a shard follow-task hits a non-retryable error and stops, then we should also stop the retention-leases renewal process associated with that follow-task.	2020-11-18 15:53:55 -05:00
Jim Ferenczi	9f3e3e2162	Fix "resource not found" exception on existing EQL async search (#65167 ) This change fixes the initialization of the async results service for the EQL get async action. The boolean that differentiates EQL from normal _async_search request is set incorrectly, which results in errors (404) when extending the keep alive of a running EQL search. Fixes #65108	2020-11-18 09:10:31 +01:00
Costin Leau	f089547b20	EQL: Fix aggressive/incorrect until policy in sequences (#65156 ) The current until implementation in sequences is too optimistic, leading to an aggressive match that discards correct data and produces invalid results. This commit addresses this issue and also unifies the until usage inside TumblingWindow. Furthermore, it packs together the UntilGroup with SequenceGroup to minimize memory usage and improve clean-up. (cherry picked from commit de2724e92c732c66436939dbbedef93c9981b435) (cherry picked from commit a60757756aae5f5abb31176fee972a7cdeac3649)	2020-11-18 09:34:33 +02:00
Dimitris Athanasiou	197de8fe66	[7.10][ML] Increase timeout waiting for DFA jobs to finish in integ tests (#65126 ) (#65131 ) It appears that occasionally 30 seconds are not enough for CI workers to complete DFA jobs. In order to eliminate such failures we increase the time we wait for DFA jobs to complete in integration tests to 60 seconds. Fixes #64926 Backport of #65126	2020-11-17 16:46:17 +02:00
Costin Leau	74fde15833	EQL: Allow null tiebreakers inside ordinals/sequences (#65033 ) Align Ordinal comparator to consider nulls last (higher) in tiebreakers. Add unit tests to Ordinal comparisons and criterion extraction. Fix #64706 (cherry picked from commit 93dc883abd6b8855ff1618a574412b7f773b8ff5) (cherry picked from commit 936e5f1a2cc29c1d5662cb8aa90c629af563a987)	2020-11-16 16:52:55 +02:00
Przemysław Witek	de668ab84b	[7.10] [ML] Extract dependent variable's mapping correctly in case of a multi-field (#63813 ) (#64287 )	2020-11-16 10:34:58 +01:00
Costin Leau	9551cb3420	EQL: small improvements to the testing base class Extract request settings into dedicated methods for easier adjustments (cherry picked from commit 4f93591cc561c7f8ff7c2f070dd1180f209810b7) (cherry picked from commit ff7e8427345c304f5a37612c870b48555484b692)	2020-11-14 16:40:48 +02:00
Costin Leau	f7cc570c4f	EQL: Re-enable correctness tests (#65041 ) Enable previously disabled tests - only two types of queries remain disabled: one that does pattern matching and another one for case-insensitivity. Fix #63742 (cherry picked from commit 20210cc43b34438c40b8b5aebf0aa2b8161c4104) (cherry picked from commit 95d08f2c8d0aac52cc1ed470fa489c239ee25159)	2020-11-14 16:09:11 +02:00
Costin Leau	76e73fec79	EQL: Add option for returning results from the tail of the stream (#64869 ) (#65040 ) Introduce option for specifying whether the results are returned from the tail (end) of the stream or the head (beginning). Improve sequencing algorithm by significantly reducing the number of in-flight sequences for sparse datasets. Refactor the sequence class by eliminating some of the redundant code. Change matching behavior for tail sequences. Return results based on their first entry ordinal instead of insertion order (which was ordered on the last match ordinal). Randomize results position inside test suite. Close #58646 (cherry picked from commit e85d9d1bbee13ad408e789fd62efb30bc8d223f2) (cherry picked from commit 452c674a10cdc16dced3cde7babf5d5a9d64a6d9)	2020-11-14 13:44:17 +02:00
Alan Woodward	0e2a9b4ac7	Fix sparse vector test	2020-11-12 20:10:07 +00:00
Benjamin Trent	b888f36388	[ML] fix custom feature processor extraction bugs around boolean fields and custom one_hot feature output order (#64937 ) (#65009 ) This commit fixes two problems: - When extracting a doc value, we allow boolean scalars to be used as input - The output order of processed feature names is deterministic. Previously, custom one hot fields used to be non-deterministic and thus could cause weird bugs.	2020-11-12 11:15:57 -05:00
Tanguy Leroux	e40d7e02ea	Makes testCcrRepositoryFetchesSnapshotShardSizeFromIndexShardStoreStats more robust (#64976 ) (#64989 ) Today this test fails because the sizes of the snapshot shards are only kept for a very short period of time in the InternalSnapshotsInfoService and are not guaranteed to exist once the shards are correctly assigned. closes #64167	2020-11-12 15:38:38 +01:00
Dimitris Athanasiou	b5efaf6e3b	[7.10][ML] Protect against stack overflow while loading DFA data (#64947 ) (#64956 ) If we encounter an exception during extracting data in a data frame analytics job, we retry once. However, we were not catching exceptions thrown from processing the search response. This may result in an infinite loop that causes a stack overflow. This commit fixes this problem. Backport of #64947	2020-11-12 11:08:40 +02:00
Andras Palinkas	a7301065d7	SQL: Fix the return type in the sign function (#64845 ) (#64968 ) Fixes the inconsistency between the type of the object returned by the `SIGN()/SIGNUM()` SQL functions and the specified `DataType`. In the class Sign, the DataType is DataTypes.INTEGER. The source code is as follows: ``` public DataType dataType() { return DataTypes.INTEGER; } ``` But in the class MathProcessor, in the source code of SIGN(Object l), the parameter and return value types are the same. Therefore, when testing with double or float parameters, the declared type and the returned value disagree. The test can be run with curl as follows: ``` curl -XPOST 127.0.0.1:9200/_sql -d "{\"query\":\"select SIGN(1.0) \"}" \ -H 'Content-Type: application/json' ``` The result is: ``` {"columns":[{"name":"SIGN(1.0)","type":"integer"}],"rows":[[1.0]]} ``` The result value is `1.0`, but the type is `integer`. Signed-off-by: mantuliu <240951888@qq.com> Co-authored-by: Marios Trivyzas <matriv@gmail.com> (cherry picked from commits aa78301e71f, ced3c1281c7, 40e5b9b)	2020-11-11 17:07:17 -05:00
Daniel Mitterdorfer	b8c9780c23	Mute multiple tests in ClassificationIT (#64930 ) Relates #64926	2020-11-11 15:30:19 +01:00
Daniel Mitterdorfer	90af072277	Mute AsyncSearchActionIT.testRetryVersionConflict (#64919 ) Relates #63948	2020-11-11 14:01:33 +01:00
Nhat Nguyen	aa0e3f85e6	Increase timeout in testCleanUpShardFollowTasksForDeletedIndices (#64562 ) If the deleted index has N shards, then ShardFollowTaskCleaner can send N*(N-1)/2 requests to remove N shard-follow tasks. I think that's fine as the implementation is straightforward. The test failed when the deleted index has 8 shards. This commit increases the timeout in the test. Closes #64311	2020-11-10 11:51:41 -05:00
Benjamin Trent	f0ff673f82	[ML] Fix bug with data frame analytics classification test data sampling when using custom feature processors (#64727 ) (#64864 ) When using custom processors, the field names extracted from the documents are not the same as the feature names used for training. Consequently, it is possible for the stratified sampler to have an incorrect view of the feature rows. This can lead to the wrong column being read for the class label, and thus throw errors on training row extraction. This commit changes the training row feature names used by the stratified sampler so that it matches the names (and their order) that are sent to the analytics process.	2020-11-10 08:47:07 -05:00
Benjamin Trent	dafafd7ec6	[ML] fix edge case for data frame analytics where a field mapped as a keyword actually has boolean and string values in the _source (#64826 ) (#64862 ) It is possible that a value mapped as a `keyword` has any scalar value type. This includes any numerical value, String, or boolean. This commit allows `boolean` types to be considered as a part of the categorical feature collection when this is the case.	2020-11-10 08:46:52 -05:00
Ryan Ernst	6be4acd509	Use no-kpi subdomains for tests downloading artifacts (#64502 ) This commit converts build code that downloads distributions or other artifacts to use the new no-kpi subdomain, and removes the formerly used no-kpi header.	2020-11-07 11:30:52 -08:00
Lee Hinman	6dbfafcff2	[7.10] Fix SetSingleNodeAllocateStep for data tier deployments (#64679 ) (#64730 ) Backports the following commits to 7.10: Fix SetSingleNodeAllocateStep for data tier deployments (#64679)	2020-11-06 10:12:16 -07:00
Jim Ferenczi	9e4105ec37	Validate PIT on _msearch (#63167 ) This change ensures that we validate point in times provided by individual search requests in _msearch. Relates #63132	2020-11-05 15:38:28 +01:00
Dan Hermann	38ee2da564	Add configurable op_type for index watcher action (#64590 ) (#64647 )	2020-11-05 08:21:19 -06:00
Andrei Dan	a3d9408fda	Fix DataTiersUsageTransportActionTests testCalculateMAD (#64596 ) (#64628 ) Randomize the compression factor starting with 1 (to eliminate nearly 0 values) which will only use one centroid (and yield 0 for MAD as the approximate median is the same as the single centroid mean value) (cherry picked from commit 940e0f1fde0f40f99af117dd03ab0891c9eedae6) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-05 10:58:40 +00:00
Andras Palinkas	3d8e17f3bd	SQL: Fix incorrect parameter resolution (#63710 ) (#64615 ) Summary of the issue and the root cause: ``` (1) SELECT 100, 100 -> success (2) SELECT ?, ? (with params: 100, 100) -> success (3) SELECT 100, 100 FROM test -> Unknown output attribute exception for the second 100 (4) SELECT ?, ? FROM test (params: 100, 100) -> Unknown output attribute exception for the second ? (5) SELECT field1 as "x", field1 as "x" FROM test -> Unknown output attribute exception for the second "x" ``` There are two separate issues at play here: 1. Construction of `AttributeMap`s keeps only one of the `Attribute`s with the same name even if the `id`s are different (see the `AttributeMapTests` in this PR). This should be fixed no matter what, we should not overwrite attributes with one another during the construction of the `AttributeMap`. 2. The `id` on the `Alias`es is not the same in case the `Alias`es have the same `name` and same `child`. It was considered to simply fix the second issue by just reassigning the same `id`s to the `Alias`es with the same name and child, but it would not solve the `unknown output attribute exception` (see notes below). This PR covers the fix for the first issue. Relates to #56013	2020-11-04 20:38:00 -05:00
Jay Modi	4c3300bf57	Fix job scheduling for same scheduled time (#64598 ) The SchedulerEngine used by SLM uses a custom runnable that will schedule itself for its next execution if there is one to run. For the majority of jobs, this scheduling could be many hours or days away. Due to the scheduling so far in advance, there is a chance that time drifts on the machine or even that time varies core to core so there is no guarantee that the job actually runs on or after the scheduled time. This can cause some jobs to reschedule themselves for the same scheduled time even if they ran only a millisecond prior to the scheduled time, which causes unexpected actions to be taken such as what appears as duplicated snapshots. This change resolves this by checking the triggered time against the scheduled time and using the appropriate value to ensure that we do not have unexpected job runs. Relates #63754 Backport of #64501	2020-11-04 10:15:46 -07:00
Andrei Dan	2dbc444fe5	Tests: fix testMoveToStepRereadsPolicy flakiness (#64466 ) (#64593 ) testMoveToStepRereadsPolicy relied on an updated ILM policy that had a rollover condition that enabled the index to be rolled after one second. This changes the test to use a `max_doc`:1 condition so it's under the test's control to trigger the condition. (cherry picked from commit 73ab35a411bcdf5a92eb3d2b3bae5b1132a2bb56) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-04 15:04:01 +00:00
Andrei Dan	835ebcfff2	Convert flaky yml tests to EsRestTestCases (#63634 ) (#64581 ) The yml "Test Invalid Move To Step With Invalid Next Step" worked based on assuming the current step is a particular one. As we can't control the timing of ILM and we can't busy assert in yml test, this converts the test to a java test and makes use of `assertBusy`. This converts the explain lifecycle yml tests that depended on ILM having run at least once to a java integration test that makes use of `assertBusy`. (cherry picked from commit 6afd0422ed5ff0e3a2e5661f0e6d192bdad9af4f) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-04 12:39:26 +00:00
Mayya Sharipova	8c25130a80	Disable using unsigned_long in scripts (#64552 ) (#64557 ) Backport for #64523 Relates to #64361	2020-11-03 16:38:50 -05:00
Dimitris Athanasiou	e3fee3e6df	[ML] Ignore _doc_count field in data frame analytics (#64541 ) This field is added from version 7.11 onwards. We are adding it to the list of ignored fields for data frame analytics in 7.10 to avoid failing to start an outlier detection job in a mixed cluster environment. Relates #64503	2020-11-03 19:53:17 +02:00
Ignacio Vera	4851bc7bae	Upgrade to Lucene-8.7.0 (#64532 ) (#64537 )	2020-11-03 16:57:04 +01:00
David Roberts	e0d0ac86dd	[ML] Increase forecast test timeout (#64471 ) ForecastIT.testOverflowToDisk has been observed to fail a few times in FIPS JVMs because it takes longer than the permitted 30 seconds. This PR bumps the timeout up to 60 seconds. Fixes #63793	2020-11-02 14:58:18 +00:00
Tanguy Leroux	2d0bddf428	Fix CcrRepositoryIT.testCcrRepositoryFetchesSnapshotShardSizeEtc (#64228 ) (#64479 ) This test failed sometimes for various reasons: an empty bulk request that can't be validated, a background force-merge that completes after the store stats were collected and finally an assertBusy() that waits 10 seconds while we usually wait 60s on the follower cluster in CCR tests. Closes #64167	2020-11-02 15:47:07 +01:00
Marios Trivyzas	0a9481fcaf	EQL: [Tests] enable server side debugging (#64308 ) (#64449 ) Register a new task `runEqlCorrectnessNode` which enables developers to start an ES node in debug mode, properly restore the correctness data and then run queries against it. Assert the index is restored correctly and use new snapshot. (cherry picked from commit fc8c6dd56d602b4a62ee1ff484f00caab92dc6e2)	2020-10-31 11:55:39 +01:00
Tal Levy	ff829bf197	Removes yaml circuit-breaker tests for geoshape geogrid aggs (#64420 ) These tests were added to do a proper end-to-end test of the memory usage of the geotile_grid and geohash_grid aggregations on `geo_shape` fields. Although this was aspirational, the truth is that the test environment does not run these aggregations in isolation. This means that the memory overhead is variable and too flaky to rely on over time. The unit tests for circuit-breaking remain. Closes #63158.	2020-10-30 08:07:12 -07:00
Jason Tedor	1126ba4df8	Serialize can contain data with roles (#64324 ) This commit internalizes whether or not a role represents the ability to contain data. In the future, this will let us remove the compatibility role notion.	2020-10-29 20:44:39 -04:00
Armin Braun	fada4a1c78	Fix CachedBlobContainerIndexInputTests (#64239 ) (#64348 ) Closing the input stream happens on a separate thread now that the `CacheFile` is implemented in a lock-free fashion. Closes #64215	2020-10-29 15:11:39 +01:00
James Rodewig	a2b18e9ab9	[DOCS] Fix case for 'Boolean' (#64299 ) (#64342 )	2020-10-29 10:05:57 -04:00
Andrei Stefan	a6d8319231	* Wrap a verification_exception in case there is no valid index available (#64267 ) Wrap a verification_exception in case there is no valid index available in an index_not_found_exception providing also the original index pattern that may be lost in the chain of filters involving the Security one. (cherry picked from commit 9c9da2f2f9a4ad12704f7d3a273f067e96cd2054)	2020-10-29 10:14:50 +02:00
Armin Braun	6bd8f079a7	Enhance CacheFile#invariant Assertion (#64272 ) (#64280 ) Follow up to #64180 tightening the assertion further.	2020-10-28 13:43:39 +01:00
Armin Braun	2983584ef6	Fix #invariant Assertion in CacheFile (#64180 ) (#64264 ) Fix #invariant Assertion in CacheFile closes #64141	2020-10-28 10:22:47 +01:00
jimczi	2492f48375	Fix test compilation	2020-10-28 08:58:09 +01:00
Jim Ferenczi	dcc433c971	Fix UOE when fetching flattened field (#64241 ) The new fields option allows fetching the value of all fields in the mapping. However, internal fields that are used by some field mappers are also shown when concrete fields are retrieved through a pattern (`` or `foo`). We have a [long term plan](https://github.com/elastic/elasticsearch/issues/63446) to hide these fields in field_caps and from pattern resolution so this change is just a hot fix to ensure that they don't break the retrieval in the meantime. The `flattened._keyed` field will show up as an empty field when using a pattern that matches the flattened field. Relates #63446	2020-10-28 08:49:03 +01:00
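
The `SIGN()` fix in a7301065d7 describes a mismatch between a declared `integer` column type and a returned value of `1.0`. The following is a minimal sketch of that idea, not the actual `MathProcessor` code: the class `SignSketch` and its `sign` method are hypothetical names introduced here for illustration. It shows one way to return an `Integer` whatever numeric type comes in, so the value agrees with the declared type.

```java
public final class SignSketch {
    private SignSketch() {}

    // Hypothetical helper (not the Elasticsearch implementation):
    // returns the sign of any Number as an Integer, so the runtime
    // value matches a column declared as INTEGER.
    public static Integer sign(Number value) {
        if (value == null) {
            return null; // propagate SQL NULL unchanged
        }
        // Math.signum yields -1.0, 0.0 or 1.0 (NaN stays NaN); the
        // cast narrows that to -1, 0 or 1 (NaN becomes 0).
        return (int) Math.signum(value.doubleValue());
    }

    public static void main(String[] args) {
        System.out.println(sign(1.0));  // prints 1
        System.out.println(sign(-3));   // prints -1
        System.out.println(sign(0.0f)); // prints 0
    }
}
```

With a helper of this shape, `SIGN(1.0)` would serialize as `1` rather than `1.0`, which is the consistency the commit message asks for.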

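The scheduling fix in 4c3300bf57 says a job that fires slightly before its scheduled time can reschedule itself for that same time, and that the remedy is to compare the triggered time with the scheduled time. A rough sketch of that comparison follows, assuming a hypothetical fixed-interval schedule. The class `RescheduleSketch` and its methods are illustrative names, not the `SchedulerEngine` code.

```java
public final class RescheduleSketch {
    private RescheduleSketch() {}

    // Hypothetical fixed-interval schedule: fires at every multiple of
    // intervalMillis. Returns the first fire time strictly after timeMillis.
    public static long nextFireTimeAfter(long timeMillis, long intervalMillis) {
        return (Math.floorDiv(timeMillis, intervalMillis) + 1) * intervalMillis;
    }

    // Naive rescheduling: uses only the time the job actually triggered.
    // If the job ran early, this yields the same slot again.
    public static long naiveNext(long triggeredMillis, long intervalMillis) {
        return nextFireTimeAfter(triggeredMillis, intervalMillis);
    }

    // Guarded rescheduling: uses the later of the triggered and scheduled
    // times, so an early trigger cannot select the slot that just ran.
    public static long guardedNext(long triggeredMillis, long scheduledMillis, long intervalMillis) {
        long reference = Math.max(triggeredMillis, scheduledMillis);
        return nextFireTimeAfter(reference, intervalMillis);
    }

    public static void main(String[] args) {
        long interval = 1000L;
        long scheduled = 5000L;
        long triggered = 4999L; // ran one millisecond early
        System.out.println(naiveNext(triggered, interval));              // prints 5000
        System.out.println(guardedNext(triggered, scheduled, interval)); // prints 6000
    }
}
```

In this sketch the naive version reschedules the early run for 5000 again, which is the duplicate-run symptom the commit describes, while the guarded version moves on to 6000.
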
5783 Commits