OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Kyle	51f8b0b8d2	Mute MonitoringWithWatcherRestIT.testThatLocalExporterAddsWatche (#64143 )	2020-10-26 10:19:48 +00:00
Armin Braun	8161513cb6	Fix Expected Exception Check in BlobstoreCacheService (#63474 ) (#64135 ) The `NodeNotConnectedException` exception can be nested as well in the fairly unlikely case of the disconnect occurring between the connected check and actually sending the request in the transport service. Closes #63233	2020-10-26 10:43:29 +01:00
Armin Braun	17843a40ef	Fix SearchableSnapshotDirectoryTests.testClearCache (#64100 ) (#64132 ) There is a small chance that the file deletion will run on the searchable snapshot thread pool and not on the test thread now that the cache is non-blocking in which case we fail the assertion unless we wait for that thread.	2020-10-26 10:27:08 +01:00
David Roberts	adc5509eda	[ML] Support the unsigned_long type in data frame analytics (#64072 ) Adds support for the unsigned_long type to data frame analytics. This type is handled in the same way as the long type. Values sent to the ML native processes are converted to floats and hence will lose accuracy when outside the range where a float can uniquely represent long values. Backport of #64066	2020-10-26 09:05:49 +00:00
Armin Braun	bd07e44c9a	Make Searchable Snapshot's CacheFile Lock less (#63911 ) (#64125 ) Replacing the mechanism for eviction and listener references via a read-write lock by a reference counting implementation. This fixes a bug that caused test failure #63586 in which concurrently trying to acquire or release an eviction listener while doing a file operation would sometimes lead to throwing an exception since the `tryLock` call on the read lock would fail in this case. Also this removes the possibility of blocking cluster state updates as a result of them waiting on the write-lock which might take a long time if a slow read operation executes concurrently. Closes #63586	2020-10-26 09:30:22 +01:00
David Roberts	cb0c538b35	[ML] Fix rare ML daily maintenance test race condition (#64043 ) Depending on thread scheduling the ML daily maintenance tests could do one more iteration than expected, causing rare failures. Fixes #64036	2020-10-22 13:03:02 +01:00
Rory Hunter	bfd2cbed86	Remove deprecation indexing code from 7.10 (#63942 ) The deprecation indexing code was writing to a regular data stream, and it is not yet possible to hide a data stream or prefix it with a period. This functionality will be re-added once it is possible to mark a data stream as hidden, and also to not rely on the standard logs template since that can be disabled.	2020-10-21 16:28:21 +01:00
James Rodewig	7551c4dc7f	[DOCS] Avoid trailing newline in apikey base64 encoding (#63720 ) (#64002 ) Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com> Co-authored-by: Jurgen Braam <g.j.m.braam@gmail.com>	2020-10-21 09:44:48 -04:00
Yang Wang	c1071b1eb4	Update invalidate api key doc for the new ids field (#63860 ) (#63994 ) Follow-up for #63224 and adds the missing doc update for the new ids field.	2020-10-22 00:19:41 +11:00
Yang Wang	428fd7218c	Use asterisk instead of empty string to clear all cached entries (#63907 ) (#63989 ) The officially supported way to clear all entries from a cache is to use a wildcard of either * or _all. Though an empty string has the same effect, it was never intended. Therefore the tests should not use an empty string, and this PR changes them to use *.	2020-10-21 23:14:01 +11:00
Hendrik Muhs	f2517678aa	[7.10][Transform] add support for unsigned_long data type (#63957 ) add support for unsigned_long, which required a change in writing out integer results properly, because coerce is not supported for unsigned_long fixes #63871 backport #63940	2020-10-20 21:05:46 +02:00
Ignacio Vera	d0f5066310	Upgrade to lucene-8.7.0-snapshot-72d8528c3a6 (#63912 ) (#63928 ) (#63933 )	2020-10-20 15:08:06 +02:00
Benjamin Trent	eff7f06ca6	[ML] fix inference binary classification prediction label and feature importance (#63688 ) (#63930 ) When calculating feature importance, the leaf values directly correlate with the value of the importance. Consequently, positive leaf values -> positive feature importance; negative leaf values -> negative feature importance. It follows that for binary classification, this is done such that the importance relates to the leaf values, which relate directly to the "probability of class 1". So, the feature importance calculated is always for the importance as it relates to class 1. The inverse is the importance as it relates to class 0.	2020-10-20 08:50:15 -04:00
Mayya Sharipova	1287df4074	Fix max/min aggs for unsigned_long (#63904 ) Max and min aggs were producing wrong results for an unsigned_long field if the field was indexed. If the field is indexed, for max/min aggs we use values from indexed Points instead of field data; these values are derived using the method pointReaderIfPossible. Previously, UnsignedLongFieldType#pointReaderIfPossible produced incorrect values, as it failed to shift them back to the original values. This patch fixes pointReaderIfPossible to produce the correct original values. Relates to #60050	2020-10-19 15:59:55 -04:00
Adam Locke	c28c3422bb	[DOCS] [7.10] Combining important config settings into a single page (#63849 ) (#63883 ) * [DOCS] Combining important config settings into a single page (#63849) * Combining important config settings into a single page. * Updating ids for two pages causing link errors and implementing redirects. * Updating links to use IDs instead of xrefs.	2020-10-19 12:59:44 -04:00
Julie Tibshirani	f122b88bc5	Remove dependency from version plugin.	2020-10-18 14:09:32 -07:00
Lisa Cawley	e6959d0d20	[7.10][DOCS] Add grant API key API and grant_api_key privilege (#63853 )	2020-10-16 15:41:46 -07:00
Andrei Stefan	5f3c79d64b	Remove filter from QL's field_caps requests (#63840 ) (#63845 ) (cherry picked from commit f009e6341d0fc0471f212d5a41c91e7aab77e006)	2020-10-17 01:36:26 +03:00
James Rodewig	179c6d4014	[DOCS] Fix role template snippet (#63774 ) (#63779 )	2020-10-16 08:24:29 -04:00
Enrico Zimuel	e724be5f41	Specify security.get_role_mapping and security.get_role name as list (#53207 ) (#63785 ) This commit updates the rest API specs for - security.get_role_mapping to specify name as a list type. - security.get_role to specify name as a list type. Co-authored-by: Russ Cam <russ.cam@elastic.co>	2020-10-16 14:22:37 +02:00
Ioannis Kakavas	364511395d	[7.10] Move RestRequestFilter to core (#63507 ) Move RestRequestFilter to core so that Rest requests outside xpack can use it to filter fields and expand its usage. Backport of #63507	2020-10-16 13:57:52 +03:00
Jim Ferenczi	1d78bd0f72	Async search should retry updates on version conflict (#63652 ) * Async search should retry updates on version conflict The _async_search APIs can throw version conflict exception when the internal response is updated concurrently. That can happen if the final response is written while the user extends the expiration time. That scenario should be rare but it happened in Kibana for several users so this change ensures that updates are retried at least 5 times. That should resolve the transient errors for Kibana. This change also preserves the version conflict exception in case the retry didn't work instead of returning a confusing 404. This commit also ensures that we don't delete the response if the search was cancelled internally and not deleted explicitly by the user. Closes #63213	2020-10-16 08:49:02 +02:00
Albert Zaharovits	f4e1e6893d	Add view_index_metadata over metricbeat-* for monitoring agent (#63750 ) The `remote_monitoring_agent` reserved role is extended to grant more privileges over the metricbeat-* index pattern. In addition to the index and create_index index privileges that it granted already, it now also grants the view_index_metadata privilege. Closes #63203	2020-10-16 02:13:55 +03:00
Jay Modi	ebdaeb2f9a	Ensure cancelled jobs do not continue to run (#63771 ) This commit ensures that jobs within the SchedulerEngine do not continue to run after they are cancelled. There was no synchronization between the cancel method of an ActiveSchedule and the run method, so an actively running schedule would go ahead and reschedule itself even if the cancel method had been called. This commit adds synchronization between cancelling and the scheduling of the next run to ensure that the job is cancelled. In real life scenarios this could manifest as a job running multiple times for SLM. This could happen if a job had been triggered and was cancelled prior to completing its run such as if the node was no longer the master node or if SLM was stopping/stopped. Closes #63754 Backport of #63762	2020-10-15 14:01:14 -06:00
Jay Modi	4d6daa6e40	Handle missing logstash index exceptions (#63753 ) This commit updates the APIs in the logstash plugin to handle IndexNotFoundExceptions that are returned by client calls. Until we have the creation of this index in place, we need to handle this case and not let the exception propagate out of the API. Backport of #63698	2020-10-15 09:45:02 -06:00
Marios Trivyzas	1dbd3a90ae	EQL: [Tests] Use snapshot from 7.10 To be able to run the tests from 7.10 onwards use a snapshot created with 7.10. Follows: #63735	2020-10-15 17:28:52 +02:00
Bogdan Pintea	35f5c49e54	SQL: integer parameter validation in string functions (#63338 ) (#63728 ) * SQL: integer parameter validation in string functions (#58923) In the insert, locate and substring functions, when the argument `start` or `length` is greater than Integer.MAX_INT or less than Integer.MIN_INT + 1 (note that 1 is subtracted from `start`), it causes overflow and leads to unexpected results. * Add range checks for BinaryStringNumericProcessors - Add range checks for Left, Right, Repeat. - Minor refactorings on initial PR changes. Co-authored-by: yinanwu <yinanwu@tencent.com> (cherry picked from commit bf6dc58b93529f977d035a846d083b1c31867694) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-10-15 16:00:48 +02:00
Marios Trivyzas	095f979060	EQL: [Tests] Add correctness integration tests (#63644 ) (#63735 ) Add a new gradle module under eql/qa which runs and validates a set of queries over a 4m event dataset (restored from a snapshot residing in a gcs bucket). The results are provided by running the exact set of queries with Python EQL against the same dataset. Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com> (cherry picked from commit 1cf789e5fcfb0f364f665bfaac021e24a4c2f556) Co-authored-by: Mark Vieira <portugee@gmail.com>	2020-10-15 15:28:26 +02:00
Costin Leau	06eae58d40	EQL: Fix translation of bool fields (#63694 ) This commit fixes two issues in dealing with bool fields in EQL: - avoid simplifications of field == true expressions - adding comparison to clauses on fields missing logic (where bool) Fix #63693 (cherry picked from commit d10a5d0e842bbd4e0031834de948ceb24da3872b) (cherry picked from commit 0227da3a275c7f22ff524d99d53e1a79146f9e28)	2020-10-15 14:38:31 +03:00
Hendrik Muhs	65ef399563	[Transform] increase timeout in testStopWaitForCheckpoint (#63657 ) increase the overall timeout by increasing the wait time after every retry. fixes #63365	2020-10-15 08:51:33 +02:00
Albert Zaharovits	2b7fbe9957	Add the missing apikey.* fields to the logfile audit layout for docker builds (#63609 ) The layout pattern for the security audit for docker builds was missing the apiKey.* fields.	2020-10-14 13:58:41 +03:00
Tanguy Leroux	57b5715bf7	Add CCR repository test for snapshot shard size (#63649 ) Following #61906 this commit adds two new integration tests that verify the sizes of snapshotted shards for CCR repositories. Backport of #63590	2020-10-14 12:51:42 +02:00
Ryland Herrick	7e8769a666	EQL: make allow_no_indices true by default (#63573 ) (#63645 ) * Allow all indices options variants Irrespective of allow_no_indices value, throw VerificationException when there is no index validated Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>	2020-10-14 03:41:04 +03:00
Lee Hinman	7371e51583	[7.10] Add DiscoveryNodeRole compatibility role for bwc tier serialization (#63581 ) (#63613 ) Backports the following commits to 7.10: Add DiscoveryNodeRole compatibility role for bwc tier serialization (#63581)	2020-10-13 09:17:15 -06:00
Andras Palinkas	f9c8ae5f77	SQL: Improve parser error message for `ESCAPE` (#63616 ) Mentions the list of wildchars in case a wildchar is used as an `ESCAPE` character. Relates #63428 (cherry picked from commit 74cbcf871e9593b3640e382ae6845168fd14966b)	2020-10-13 10:47:06 -04:00
Andras Palinkas	4aabc050a0	SQL: Escaped wildcard (*) not accepted in LIKE (#63428 ) For a query like `SELECT name FROM test WHERE name LIKE ''%c'` ES SQL generates an error. `*` is not a special character in a `LIKE` construct and is not expected to need escaping, so the previous query should work as is. In the LIKE pattern any `*` character was treated as an invalid character and the usage of `%` or `_` was suggested instead. But `*` is a valid, acceptable non-wildcard on the right side of the `LIKE` operator. Fix: #55108 (cherry picked from commit 190d9fe3deb31aed0d8f312007360625d4fff217)	2020-10-13 10:47:03 -04:00
Przemysław Witek	acbd48f834	[ML] Allow setting num_top_classes to a special value -1 (#63587 ) (#63602 )	2020-10-13 13:57:50 +02:00
David Roberts	3f210e2620	[ML] Load data streams plugin for ML internal cluster tests (#63560 ) Now that deprecation logs get indexed to a data stream, if we do not load the data stream plugin in our tests and any test generates a deprecation log message then millions of exceptions get logged, slowing down the tests to the extent that they can fail. This change loads the data streams plugin during the ML internal cluster tests. (It should already be present in external cluster tests.) Fixes #63548	2020-10-12 17:46:50 +01:00
Dimitris Athanasiou	e1c418aac7	[7.10][ML] Validate dest pipeline exists on transform update (#63494 ) (#63549 ) Adds validation that the dest pipeline exists when a transform is updated. Refactors the pipeline check into the `SourceDestValidator`. Fixes #59587 Backport of #63494	2020-10-12 15:41:35 +03:00
Tomas Della Vedova	8b07750a8b	Updated version string in yaml test (#63410 ) (#63533 )	2020-10-12 11:19:36 +02:00
Julie Tibshirani	ae2fc4118d	Add factory methods for common value fetchers. (#63438 ) This PR adds factory methods for the most common implementations: * `SourceValueFetcher.identity` to pass through the source value untouched. * `SourceValueFetcher.toString` to simply convert the source value to a string.	2020-10-08 12:14:53 -07:00
Julie Tibshirani	a506705569	Small fixes to flattened field value fetching. (#63443 ) * Remove FlatObjectFieldTypeTests, as it's redundant. * Do not apply null_value when fetching root-level values. * Remove a TODO in favor of opening an issue.	2020-10-08 11:52:54 -07:00
Benjamin Trent	a9be4181c6	[ML] fix grabbing the doc value limit setting in _explain (#63402 ) (#63471 ) Getting the doc value settings shouldn't use the API callers headers. We only use this value internally.	2020-10-08 08:53:26 -04:00
Mayya Sharipova	e022b78198	Upgrade to lucene-8.7.0-snapshot-5c4168d (#63466 ) This disables sort optim on _doc, which may still be unstable. Backport for #63444	2020-10-08 08:20:43 -04:00
Costin Leau	2ab5f226c4	EQL: Avoid filtering on tiebreakers (#63415 ) Do not filter by tiebreaker while searching sequence matches as it's not monotonic and thus can filter out valid data. Add handling for data 'near' the boundary that has the same timestamp but different tie-breaker and thus can be just outside the window. Fix #62781 Relates #63215 (cherry picked from commit 36f834600d4d9ded0fb7b1440274b2e597733770) (cherry picked from commit 72a2ce825f3bfd13f87423ba7f3c739ea64c57f6)	2020-10-08 13:50:41 +03:00
David Roberts	a9d541561f	[ML] Unmute DeleteExpiredDataIT.testDeleteExpiredDataNoThrottle (#63408 ) This test appears to work again following the Lucene bug fix that was integrated in #63395	2020-10-08 09:11:29 +01:00
Przemysław Witek	bd761cce1d	[ML] Validate that AucRoc has the data necessary to be calculated (#63302 ) (#63454 )	2020-10-08 09:52:15 +02:00
Adam Locke	f4530580e7	[DOCS] Updating password usage for create user API (#63292 ) (#63440 ) * Updating password usage. * Add asterisk * Add explanation of asterisk for settings that are only sometimes required. * Clarify change password guidance	2020-10-07 16:21:22 -04:00
Luca Cavanna	659988a77f	Remove runtime fields (#63418 ) We are not going to release runtime fields with 7.10, hence we are removing them from the 7.10 branch.	2020-10-07 20:39:41 +02:00
Mayya Sharipova	e236ea43e9	Upgrade to lucene-8.7.0-snapshot-e914862 (#63401 ) Backport for: #63395	2020-10-07 09:45:14 -04:00
Alan Woodward	88b45dfa61	Convert TextFieldMapper to parametrized form (#63269 ) (#63392 ) As a result of this, we can remove a chunk of code from TypeParsers as well. Tests for search/index mode analyzers have moved into their own file. This commit also rationalises the serialization checks for parameters into a single SerializerCheck interface that takes the values includeDefaults, isConfigured and the value itself. Relates to #62988	2020-10-07 13:26:25 +01:00
David Kyle	e4f37d96f4	Unmute notifications mapping upgrade test (#63388 ) fixed by #63063	2020-10-07 12:51:18 +01:00
Hendrik Muhs	d45f7de3fb	[Transform] Add test logging regarding conflict on start (#63383 ) add extra logging for investigation of #63365	2020-10-07 10:17:31 +02:00
Tim Vernum	c30c5555c5	Mute DeleteExpiredDataIT deleteExpired NoThrottle (#63381 ) Mutes test method DeleteExpiredDataIT.testDeleteExpiredDataNoThrottle Relates: #63379 Backport of: #63380	2020-10-07 17:43:52 +11:00
Tim Vernum	eeb45b4a74	[Backport 7.x] Add example settings to sample security realm (#63301 ) This change adds configurable settings to the `CustomRealm` in the QA project as the correct declaration and use of settings can be a source of confusion in custom realms. The "username" "password" and "roles" are now all configurable, which demonstrates the use of a simple string setting ("username") a secure setting ("password") and a more complex list setting ("roles"). Backport of: #62287	2020-10-07 17:40:24 +11:00
Stuart Tettemer	8a61b95a0f	Scripting: JSON parsing and writing in watcher (#63278 ) (#63377 ) Co-authored-by: Honza Král Co-authored-by: Jack Conradson Backport of: f43e52d	2020-10-06 23:39:40 -05:00
Stuart Tettemer	7f4f70f557	Scripting: Augment String with Hash support in Watcher (#63346 ) (#63375 ) Strings in the watcher context may use the `.sha1()` and `.sha256()` augmentation added for ingest. Ref: #59633, #59671 Fixes: #61244 Backport of: 380ee6f	2020-10-06 22:10:27 -05:00
Gordon Brown	15edc39d9b	Update logstash_admin role for system indices (#63368 ) This PR updates the `logstash_admin` role to include the recently-added Logstash Pipeline Management APIs, as well as access to the `.logstash*` index pattern. Co-authored-by: William Brafford <williamrandolphbrafford@gmail.com>	2020-10-06 20:43:36 -06:00
Mayya Sharipova	f2ba62b894	Upgrade to lucene-8.7.0-snapshot-66c49a35402 (#63372 ) This includes fixing a bug in doc iteration during sort optimization Backport for #63349	2020-10-06 22:38:58 -04:00
Julie Tibshirani	f17ca18dfa	Make array value parsing flag more robust. (#63371 ) When constructing a value fetcher, the 'parsesArrayValue' flag must match `FieldMapper#parsesArrayValue`. However there is nothing in code or tests to help enforce this. This PR reworks the value fetcher constructors so that `parsesArrayValue` is 'false' by default. Just as for `FieldMapper#parsesArrayValue`, field types must explicitly set it to true and ensure the behavior is covered by tests. Follow-up to #62974.	2020-10-06 17:49:25 -07:00
Gordon Brown	5c8b0662df	Deprecate REST access to System Indices (#63274 ) (Original #60945 ) This PR adds deprecation warnings when accessing System Indices via the REST layer. At this time, these warnings are only enabled for Snapshot builds by default, to allow projects external to Elasticsearch additional time to adjust their access patterns. Deprecation warnings will be triggered by all REST requests which access registered System Indices, except for purpose-specific APIs which access System Indices as an implementation detail and a few specific APIs which will continue to allow access to system indices by default: - `GET _cluster/health` - `GET {index}/_recovery` - `GET _cluster/allocation/explain` - `GET _cluster/state` - `POST _cluster/reroute` - `GET {index}/_stats` - `GET {index}/_segments` - `GET {index}/_shard_stores` - `GET _cat/[indices,aliases,health,recovery,shards,segments]` Deprecation warnings for accessing system indices take the form: ``` this request accesses system indices: [.some_system_index], but in a future major version, direct access to system indices will be prevented by default ```	2020-10-06 13:41:40 -06:00
Adam Locke	4f314eeb9c	Updating certificate location instructions. (#63334 ) (#63340 )	2020-10-06 12:51:22 -04:00
Tanguy Leroux	87076c32e2	Determine shard size before allocating shards recovering from snapshots (#61906 ) (#63337 ) Determines the shard size of shards before allocating shards that are recovering from snapshots. It ensures during shard allocation that the target node that is selected as recovery target will have enough free disk space for the recovery event. This applies to regular restores, CCR bootstrap from remote, as well as mounting searchable snapshots. The InternalSnapshotInfoService is responsible for fetching snapshot shard sizes from repositories. It provides a getShardSize() method to other components of the system that can be used to retrieve the latest known shard size. If the latest snapshot shard size retrieval failed, getShardSize() returns ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE. While we'd like a better way to handle such failures, returning this value allows us to keep the existing behavior for now. Note that this PR does not address an issue (which we already have today) where a replica is being allocated without knowing how much disk space is being used by the primary. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-10-06 18:37:05 +02:00
Igor Motov	2405162c39	Mute RegressionIT.testAliasFields test (#63339 ) It fails quite frequently in 7.x. Relates to #63268	2020-10-06 12:18:12 -04:00
David Kyle	ea32b4ab82	[ML] Audit message when nightly maintenance times out (#63252 ) (#63330 ) During deletion of old ML data, set the delete by query timeout to 8 hours and audit a job message when the nightly maintenance task times out.	2020-10-06 16:19:37 +01:00
Hendrik Muhs	058c55da6a	[Transform] disallow field and script being empty for group sources (#63313 ) fail validation earlier when field and script are both missing in a group source	2020-10-06 16:59:02 +02:00
Yang Wang	abf9b885b4	Bulk invalidate API keys using a list of IDs (#63224 ) (#63320 ) Add a new ids field to the API of invalidating API keys so that it supports bulk invalidation with a list of IDs. Note the existing id field is kept as is and it is an error if both id and ids are specified.	2020-10-07 00:49:21 +11:00
Yang Wang	bbfa2f1303	Fix test failure due to missing client action	2020-10-07 00:45:30 +11:00
Benjamin Trent	a72d7cc76a	[ML] prefer secondary auth headers on data frame analytics _explain (#63281 ) (#63323 ) We should prefer secondary auth headers when calling _explain	2020-10-06 09:15:29 -04:00
Luca Cavanna	ca68298e89	Remove MapperService argument from IndexFieldData.Builder#build (#63197 ) (#63311 ) MapperService carries a lot of weight and is only used to determine if loading of field data for the id field is enabled, which can be done in a different way.	2020-10-06 15:04:23 +02:00
Yang Wang	7969fbb4ab	Cache API key doc to reduce traffic to the security index (#59376 ) (#63319 ) Getting the API key document from the security index is the most time consuming part of the API Key authentication flow (>60% if index is local and >90% if index is remote). This traffic is now avoided by caching added with this PR. Additionally, we add a cache invalidator registry so that clearing of different caches will be managed in a single place (requires follow-up PRs).	2020-10-06 23:49:23 +11:00
Armin Braun	2aa80f9ee3	Dry up Searchable Snapshots ITs (#63190 ) (#63321 ) Just a few spots where we can dry up these tests using the snapshot test infrastructure in core that I found while studying the existing searchable snapshot tests.	2020-10-06 14:41:11 +02:00
Mayya Sharipova	bea0ead08a	Fix fields retrieval on unsigned_long field (#63310 ) This fixes fields retrieval on unsigned_long field 1) For docvalue_fields a custom UnsignedLongLeafFieldData::getLeafValueFetcher is implemented that correctly retrieves doc values. 2) For stored fields, an error was fixed in how UnsignedLongFieldMapper stored values. Before they were incorrectly stored in the shifted format, now they are stored as original values in String format. Relates to #60050 Backport for #63119	2020-10-06 06:37:31 -04:00
David Kyle	8f4ef40f78	[ML] Auditor ensures template is installed before writes (#63286 ) The ML auditors should not write if the latest template is not present. Instead a PUT template request is made and the writes queued up	2020-10-06 11:20:37 +01:00
Nhat Nguyen	25fbc01459	Retry CCR shard follow task when no seed node left (#63225 ) If the connection between clusters is disconnected or the leader cluster is offline, then CCR shard-follow tasks can stop with "no seed node left". CCR should retry on this error.	2020-10-05 21:56:56 -04:00
Armin Braun	e91936512a	Refactor SnapshotsInProgress State Transitions (#60517 ) (#63266 ) The copy constructors previously used were hard to read and the exact state changes were not obvious at all. Refactored those into a number of named constructors instead, added additional assertions and moved the snapshot abort logic into `SnapshotsInProgress`.	2020-10-06 00:03:42 +02:00
Armin Braun	860791260d	Implement Shard Snapshot Clone Logic (#62771 ) (#63260 ) First part of the snapshot clone logic that implements the snapshot clone functionality on the repository level.	2020-10-05 22:55:52 +02:00
Costin Leau	d027e24b31	EQL: Remove match functions (#63275 ) Since match (for matching regex) is not currently in use remove it for now. Close #63263 (cherry picked from commit 6abd531cf457f3c5686f59709647bed3276e3c6b)	2020-10-05 23:30:41 +03:00
Costin Leau	6856306dcf	EQL: Remove wildcard functionality from : (#63276 ) Restrict : operator to only case insensitive matching on strings Close #63262 (cherry picked from commit bc02e77150cdd85594dfac4f03d8aeb85aaddbb3)	2020-10-05 23:30:41 +03:00
Andrei Stefan	76bba601ab	Remove case_sensitive request option (#63218 ) (#63244 ) Make EQL case sensitive by default and adapt some of the string functions. Remove the case sensitive option from the Between string function. Add case_insensitive option to term and wildcard queries usage. (cherry picked from commit 7550e0664c8c2f1f13519036c759b1e76345551f)	2020-10-05 22:04:42 +03:00
Nhat Nguyen	1a6837883a	Upgrade to Lucene-8.7.0-snapshot-77396dbf339 (#63222 ) Includes LUCENE-9554, which exposes the pendingNumDocs from IndexWriter.	2020-10-05 14:39:30 -04:00
Armin Braun	cf75abb021	Optimize XContentParserUtils.ensureExpectedToken (#62691 ) (#63253 ) We only ever use this with `XContentParser`, so there is no need to make it inline worse by forcing the lambda and hence a dynamic callsite here. => Extracted the exception formatting code path that is likely very cold to a separate method and removed the lambda usage in hot loops by simplifying the signature here.	2020-10-05 19:08:32 +02:00
Armin Braun	de6eeecbd3	Dry up Snapshot Integ Tests some More (#62856 ) (#63248 ) * Just some obvious drying up of these super complex tests. * Mainly just shortening the diff of #61839 here by moving test utilities to the abstract test case. Also, making use of the now available functionality to simplify existing tests and improve logging in them.	2020-10-05 18:33:59 +02:00
Armin Braun	509fa46c9e	Fix Broken Exception Handling in Snapshot Cleanup Tool (#63243 ) In the latest version of the GCS SDK the `404` exception is wrapped in an `IOException` making it not pass to the unwrapping added in the previous fix #63168. We can't be handling `IOException` differently here now that GCS uses it for `404`s so I adjusted the exception unwrapping accordingly. Closes #63091	2020-10-05 16:50:47 +02:00
David Roberts	1b32daf37b	Mute FullClusterRestartIT.testWatcherWithApiKey (#63241 ) Due to https://github.com/elastic/elasticsearch/issues/63088	2020-10-05 15:03:42 +01:00
Rene Groeschke	f58ebe58ee	Use services for archive and file operations in tasks (#62968 ) (#63201 ) Referencing a project instance during task execution is discouraged by Gradle and should be avoided. E.g. it is incompatible with Gradle's incubating configuration cache. Instead there are services available to handle archive and filesystem operations in task actions. Brings us one step closer to #57918	2020-10-05 15:52:15 +02:00
Benjamin Trent	1e63313c19	[ML] adds feature_importance_baseline object to model metadata (#63172 ) (#63237 ) this adds the new field `feature_importance_baseline` and allows it to be optionally included in the model's metadata. Related to: https://github.com/elastic/ml-cpp/pull/1522	2020-10-05 09:33:38 -04:00
Marios Trivyzas	19650e860a	EQL: [Test] Add a test for `identifier` as eventType (#63227 ) (#63235 ) Add a unit test to verify that an identifier surrounded with backquotes is not a valid syntax for eventType value, as eventType is semantically a string literal and not a field identifier. Follows: #63169 (cherry picked from commit ff12c1340b3890ac52251f31259fa9a719d9eacc)	2020-10-05 15:23:08 +02:00
Costin Leau	1047d67199	Revert "EQL: Avoid filtering on tiebreakers (#63215 )" This reverts commit `efd2243886`.	2020-10-05 15:55:59 +03:00
David Roberts	ccaec70a84	[ML] Muting mappings upgrade test for .ml-stats (#63234 ) Due to https://github.com/elastic/elasticsearch/issues/61908	2020-10-05 13:22:13 +01:00
Costin Leau	8c4503bcc3	EQL: Change default indices options (#63192 ) Ignore by default unavailable indices (same as ES) and verify that allowNoIndices is set to false since at least one index is required for validating the query. Fix #62986 (cherry picked from commit fd75ac27223cd1b699b8d9c311dc401a39f9e0c8)	2020-10-05 14:21:56 +03:00
Costin Leau	b67d2274ae	QL: Optimize regexs without patterns as equality (#63216 ) If a QL regex doesn't contain any pattern, convert it to Equals. Close #63196 (cherry picked from commit e22a843124290aaacd0e80d7ae9b883e5ec2431e)	2020-10-05 14:21:42 +03:00
Costin Leau	efd2243886	EQL: Avoid filtering on tiebreakers (#63215 ) Do not filter by tiebreaker while searching sequence matches as it's not monotonic and thus can filter out valid data. Fix #62781 (cherry picked from commit 4d62198df70f3b70f8b6e7730e888057652c18a8)	2020-10-05 14:21:30 +03:00
Costin Leau	4f593bdd69	EQL: Make queries using Point-In-Time rely on index filtering (#63161 ) Point-In-Time queries cannot be run on individual indices but on all. Thus all PIT queries move their index from the request level to a filter so this condition is fulfilled while keeping the query scoped accordingly. Fix #63132 (cherry picked from commit c8eb4f724d5dcc0fcc172c6219ecfbc1dc1fbbae)	2020-10-05 14:21:09 +03:00
Alan Woodward	01950bc80f	Move FieldMapper#valueFetcher to MappedFieldType (#62974 ) (#63220 ) For runtime fields, we will want to do all search-time interaction with a field definition via a MappedFieldType, rather than a FieldMapper, to avoid interfering with the logic of document parsing. Currently, fetching values for runtime scripts and for building top hits responses need to call a method on FieldMapper. This commit moves this method to MappedFieldType, incidentally simplifying the current call sites and freeing us up to implement runtime fields as pure MappedFieldType objects.	2020-10-04 14:54:59 +01:00
Jason Tedor	1c136bb7fc	Add tier preference when mounting (#63204 ) This commit adds a tier preference when mounting a searchable snapshot. This sets a preference that a searchable snapshot is mounted to a node with the cold role if one exists, then the warm role, then the hot role, assuming that no other allocation rules are in place. This means that by default, searchable snapshots are mounted to a node with the cold role. Note that depending on how we implement frozen functionality of searchable snapshots (not pre-cached/not fully-cached), we might need to adjust this to prefer frozen if mounting a not pre-cached/fully-cached searchable snapshot versus mounting a pre-cached/fully-cached searchable snapshot. This is a later concern since neither this nor the frozen role are implemented currently.	2020-10-03 07:33:36 -04:00
Nhat Nguyen	4ef8673fdd	Fix testRestartAfterCompletion (#63211 ) We need to complete the search before closing the iterator, which internally closes the point in time; otherwise, the search will fail with a missing context error. Closes #62451	2020-10-02 18:14:42 -04:00
Martijn van Groningen	0b6e2b8f16	Fix enrich policy test bug. Backport #63182 to 7.x branch. The `randomEnrichPolicy(...)` helper method stores the policy and creates the source indices. If a source index already exists, because it was created for a random policy created earlier then skipping the source index fails, but that is ignored and the test continues. However if the policy has a match field that doesn't exist in the previous random policy then the mapping is never updated and the put policy api fails with the fact that the match field can't be found. This pr fixes that by executing a put mapping call in the event that the source index already exists. Closes #63126	2020-10-02 19:34:39 +02:00
Benjamin Trent	752ee0288e	[7.x] [ML] optimize delete expired snapshots (#63134 ) (#63200 ) * [ML] optimize delete expired snapshots (#63134) When deleting expired snapshots, we do an individual delete action per snapshot per job. We should instead gather the expired snapshots and delete them in a single call. This commit achieves this and a side-effect is there is less audit log spam on nightly cleanup closes https://github.com/elastic/elasticsearch/issues/62875	2020-10-02 13:24:36 -04:00
Marios Trivyzas	3cac996373	EQL: Fix syntax for event type (#63169 ) (#63194 ) Event type is actually a string value for event.category which can contain any kind of characters, or start with a digit, which currently is not supported, so we introduce the possibility to be able to use the usual syntax of " and """ for strings and raw strings. Make the grammar a bit cleaner by using the identifier only where it's actually an identifier in terms of query semantics. Fixes: #62933 (cherry picked from commit 306e1d76da3db652db57f11f847705b3995609ff)	2020-10-02 17:28:13 +02:00
markharwood	bfb3071539	Wildcard field - add normalisation of ngram tokens to reduce disk space. (#63120 ) (#63193 ) Adds normalisation of ngram tokens to reduce disk space. All punctuation becomes / char and for A-Z0-9 chars turn even codepoints to prior odd e.g. aab becomes aaa Closes #62817	2020-10-02 16:24:27 +01:00
Przemysław Witek	5370f270d7	[7.x] [ML] Ensure data frame analytics jobs don't run on a node that's too new (#62749 ) (#63175 )	2020-10-02 17:19:58 +02:00
Marios Trivyzas	9cf0722fe6	SQL: Fix exception when using CAST on inexact field (#62943 ) (#63187 ) Currently, CAST will use the first keyword subfield of a text field for an expression in WHERE clause that gets translated to a painless script which will lead to an exception thrown: ``` "root_cause": [ { "type": "script_exception", "reason": "runtime error", "script_stack": [ "org.elasticsearch.index.mapper.TextFieldMapper$TextFieldType.fielddataBuilder(TextFieldMapper.java:759)", "org.elasticsearch.index.fielddata.IndexFieldDataService.getForField(IndexFieldDataService.java:116)", "org.elasticsearch.index.query.QueryShardContext.lambda$lookup$0(QueryShardContext.java:308)", "org.elasticsearch.search.lookup.LeafDocLookup$1.run(LeafDocLookup.java:101)", "org.elasticsearch.search.lookup.LeafDocLookup$1.run(LeafDocLookup.java:98)", "java.security.AccessController.doPrivileged(Native Method)", "org.elasticsearch.search.lookup.LeafDocLookup.get(LeafDocLookup.java:98)", "org.elasticsearch.search.lookup.LeafDocLookup.get(LeafDocLookup.java:41)", "org.elasticsearch.xpack.sql.expression.function.scalar.whitelist.InternalSqlScriptUtils.docValue(InternalSqlScriptUtils.java:79)", "InternalSqlScriptUtils.cast(InternalSqlScriptUtils.docValue(doc,params.v0),params.v1)", " ^---- HERE" ], "script": "InternalSqlScriptUtils.cast(InternalSqlScriptUtils.docValue(doc,params.v0),params.v1)", "lang": "painless" } ], ``` Instead of allowing a painless translation using the first underlying keyword silently, which can be confusing, we detect such usage and throw an error early. Relates to #60178 (cherry picked from commit 7402e8267ba564e52dc672c25b262824b6048b40)	2020-10-02 16:42:59 +02:00
Joe Gallo	d172a18c95	Tidy up some ILM and SLM packages (#63146 ) Very minor refactoring, just moving some ILM and SLM classes around to decrease the total number of packages.	2020-10-02 09:30:24 -04:00
Martijn van Groningen	300e525138	Fix querying a data stream name in _index field. (#63178 ) Backport #63170 to 7.x branch. The _index field is a special field that allows using queries against the name of an index or alias. Data stream names were not included, this pr fixes that by changing SearchIndexNameMatcher (which is used via IndexFieldMapper) to also include data streams.	2020-10-02 15:29:20 +02:00
Armin Braun	1663dc7cf8	Fix GCS Repo Cleanup Tool Exception Handling (#63168 ) We recently upgraded the SDK which resulted in the storage exception to be wrapped now so we must unwrap to check for whether it's a 404 or not. Closes #63091	2020-10-02 15:26:39 +02:00
Marios Trivyzas	7d74fb8577	EQL: Replace ?"..." with """...""" for unescaped strings (#62539 ) (#63174 ) Use triple double quotes enclosing a string literal to interpret it as unescaped, in order to use `?` for marking query params and avoid user confusion. `?` also usually implies regex expressions. Any character inside the `"""` beginning-closing markings is considered raw and the only thing that is not permitted is the `"""` sequence itself. If a user wants to use that, they need to resort to the normal `"` string literal and use proper escaping. Relates to #61659 (cherry picked from commit d87c2ca2eacab5552bca1e520d33cf71da40bcfd)	2020-10-02 14:58:50 +02:00
Benjamin Trent	cfcf973259	[7.x] [ML] renames /inference apis to /trained_models (#63097 ) (#63136 ) * [ML] renames /inference apis to /trained_models (#63097) This commit renames all `inference` CRUD APIs to `trained_models`. This aligns with internal terminology, documentation, and use-cases.	2020-10-02 07:34:28 -04:00
Benjamin Trent	535f8a434b	Revert "[ML] adding `baseline` field to total_feature_importance objects (#63098 ) (#63125 )" (#63144 ) This reverts commit `95242eccee`.	2020-10-02 07:03:15 -04:00
Luca Cavanna	a42a516b67	Shorten runtime field type class names (#63123 ) In the codebase there is the non-written convention that classes that extend `MappedFieldType` are generally called `*FieldType`. With this commit we adopt the same convention for runtime field types which allows us to shorten their names by removing the `Mapped` portion which is implicit.	2020-10-02 11:25:25 +02:00
Ioannis Kakavas	e91f66e22f	Ensure domain_name setting for AD realm is present (#61983 ) (#63159 ) We would only check for a null value and not for an empty string so that meant that we were not actually enforcing this mandatory setting. This commit ensures we check for both and fail accordingly if necessary, on startup	2020-10-02 12:16:08 +03:00
David Kyle	279f951700	[ML] Set parent task Id on ml expired data removers (#62854 ) (#62966 ) Setting the parent task Id (of the delete expired data action) on the ML expired data removers makes it easier to track and cancel long running tasks	2020-10-02 10:14:10 +01:00
Christoph Büscher	4c7c540ca1	Update version field yml test skip version (#63139 )	2020-10-02 10:01:27 +02:00
Costin Leau	614f4c13a5	EQL: Introduce case-insensitive equality (#63121 ) Introduce : operator for doing case insensitive string comparisons. Recognizes "*" for wildcard matches in string literals. Restricted only to string types. Relates #62941 (cherry picked from commit 201e577e65f26a9b958a6197fe6c7268da39de29)	2020-10-02 00:23:08 +03:00
Igor Motov	fc13b72cea	Extract histogramFieldDocValues into an utility class (#63100 ) (#63148 ) This function will be needed in the upcoming rate aggs tests.	2020-10-01 15:44:37 -04:00
Marios Trivyzas	3ad4b00c7e	EQL: Clean grammar from `fork` (#63094 ) (#63138 ) Since `fork` is not used, is undocumented in Python EQL and there is no plan at the moment to implement it in the future, removing it from the grammar. User will get parsing exceptions instead of higher level messages about unsupported features which can lead to wrong expectations. (cherry picked from commit f6a0f8f01c1b1893bab86629d1de73e9f9dae8dc)	2020-10-01 21:14:41 +02:00
Lee Hinman	f0f0da2188	[7.x] Add telemetry for data tiers (#63031 ) (#63140 ) Backports the following commits to 7.x: Add telemetry for data tiers (#63031)	2020-10-01 12:37:32 -06:00
Benjamin Trent	95242eccee	[ML] adding `baseline` field to total_feature_importance objects (#63098 ) (#63125 ) This adds a new `baseline` field to the feature importance values. This field contains the baseline importance for a given feature and class.	2020-10-01 09:48:07 -04:00
Dimitris Athanasiou	46c3973400	[7.x][ML] Remove direct access to system index from filter_crud REST test (#63111 ) (#63115 ) This test accesses system indices for 2 reasons. First, it creates a filter that has a different type. This was done to assert that filter is not returned from the APIs. However, now that access to the `.ml-meta` index is restricted, it is not really a concern. Second, it creates a `.ml-meta` index without mappings to test the get API does not fail due to lack of mappings on a sorted field, namely the `filter_id`. Once again, this test is less useful once system indices have restricted access. Relates #62501 Backport of #63111	2020-10-01 15:15:34 +03:00
Costin Leau	c2992ea287	EQL: Fix NPE from incorrect use of ids search (#63032 ) This fixes a bug introduced when moving from mget to ids query. While mget returns all the ids given, id query is a search query and thus by default returns only 10 documents. The fix correctly sets the expected size so all the information is returned inside the response. Fix #63030 (cherry picked from commit 09ba85548a0142a1fe8376efea9cc4e7764a207c)	2020-10-01 13:49:58 +03:00
Hendrik Muhs	e001b4c021	[Transform] fix time rounding in TransformContinuousIT (#63113 ) fix a time rounding problem in the test, due to rounding down to epoch seconds instead of epoch millis fixes #62951	2020-10-01 11:43:50 +02:00
Ignacio Vera	ba5574935e	Remove dependency of Geometry queries with mapped type names (#63077 ) (#63110 ) It extracts the query capabilities from AbstractGeometryFieldType into two new interfaces, GeoshapeQueryable and ShapeQueryable. Those interfaces are implemented by the final mappers.	2020-10-01 10:49:12 +02:00
Howard	8c6e197f51	Remove allocation id from engine (#62680 ) We no longer need the allocation id in Engine.	2020-09-30 15:28:27 -04:00
Marios Trivyzas	f69d268500	SQL: Allow skip of bwc tests on `check` task (#62936 ) (#63089 ) Bwc tests can consume much time to build and to run so it's nice to be able to skip them when running the `check` task on the SQL module. Introduce a new task `checkNoBwc` so one can use: ``` ./gradlew -p x-pack/plugin/sql checkNoBwc ``` to skip them. (cherry picked from commit a52e1846f338f6869273181c6f248579581fa68c)	2020-09-30 20:03:19 +02:00
Marios Trivyzas	0ebaf8a3ec	EQL: Allow escaped backquote in identifiers (#62932 ) (#63082 ) Previously, backquote could not be used inside an escaped identifier, e.g.: ``` `my`identifier` = "some_value" ``` was not allowed. Introduce escaping of the backquote by using a double backquote: ``` `my``identifier` = "some_value" ``` (cherry picked from commit 49514121486f42a58674b3e5901de4021fda5c15)	2020-09-30 19:10:09 +02:00
Alan Woodward	675d18f6ea	Convert dense/sparse vector field mappers to Parametrized form (#62992 ) Also adds a proper MapperTestCase test for dense vectors. Relates to #62988	2020-09-30 16:55:28 +01:00
Dimitris Athanasiou	e09074d382	[7.x][ML] Fix online updates with custom rules referencing filters (#63057 ) (#63064 ) When an opened anomaly detection job is updated with a detection rule that references a filter, apart from updating the c++ process with the rule, we also need to update it with the referenced filter. This commit fixes a bug which led to the job not applying such updates on-the-fly. Fixes #62948 Backport of #63057	2020-09-30 16:01:06 +03:00
Costin Leau	a6b903b783	EQL: Remove unused classes from response API (#62134 ) Remove Count class and related artifacts since that functionality is not (yet) available. Update parser name for better error reporting. Fix #62131 (cherry picked from commit 060f500346788c4c5d0b3b9c045facec5d677d3d)	2020-09-30 15:45:30 +03:00
Mayya Sharipova	f221349593	Fix UnsignedLongTests test failure (#63056 ) Test testSortDifferentFormatsShouldFail was occasionally failing for 2 reasons: 1) Documents on "idx2" were not available for search before a search request started 2) Running a test multiple times was causing occasional ResourceAlreadyExistsException for idx2, as idx2 was not deleted for a test. This patch makes the following fixes: 1) Sets up immediate refresh policy for docs in the index "idx2" 2) Creates an index idx2 only once per cluster Closes: #62997	2020-09-30 08:41:31 -04:00
Yang Wang	e31bef4032	Fix API key role descriptors rewrite bug for upgraded clusters (#62917 ) (#63042 ) This PR ensures that API key role descriptors are always rewritten to a target node compatible format before a request is sent.	2020-09-30 22:16:39 +10:00
Benjamin Trent	0860746bf2	[ML] changing ngram loop order for minor performance improvement (#63033 ) (#63059 ) This is a very minor optimization but trivial to implement, so might as well. ``` Benchmark (nGramStrs) Mode Cnt Score Error Units NGramProcessorBenchmark.ngramInnerLoop 1,2,3 avgt 20 4415092.443 ± 31302.115 ns/op NGramProcessorBenchmark.ngramOuterLoop 1,2,3 avgt 20 4235550.340 ± 103393.465 ns/op ``` This measurement is in nanoseconds, consequently, the overall performance of inference is dominated by other factors (i.e. map#put). But, this optimization adds up over time and is simple.	2020-09-30 07:51:31 -04:00
Benjamin Trent	b7c47b1717	[ML] Add data frame analytics bwc testing (#63012 ) This commit adds bwc testing for data frame analytics. The bwc tests only go back to the 7.9.0. Meaning, initially only rolling upgrades from 7.9.x -> 7.10.0 are tested. Since the feature was experimental in < 7.9.0, this is acceptable.	2020-09-30 07:13:40 -04:00
Przemysław Witek	4366d58564	[7.x] [ML] Implement AucRoc metric for classification (#60502 ) (#63051 )	2020-09-30 12:55:52 +02:00
Dimitris Athanasiou	179fe9cc0e	[7.x][ML] Delete dest index and reindex if incompatible (#62960 ) (#63050 ) Data frame analytics results format changed in version `7.10.0`. If existing jobs that were not completed are restarted, it is possible the destination index had already been created. That index's mappings are not suitable for the new results format. This commit checks the version of the destination index and deletes it when the version is outdated. The job will then continue by recreating the destination index and reindexing. Backport of #62960	2020-09-30 12:57:48 +03:00
Hendrik Muhs	df93f46888	[Transform] fix issue in TransformIndexerStateTests.testStopAtCheckpoint (#63006 ) fix a test issue by improving counting the number of times the deferred listener is called fixes #62996	2020-09-30 08:54:45 +02:00
David Roberts	05427c2bb2	[ML] Add timeouts to named pipe connections (#63022 ) This PR adds timeouts to the named pipe connections of the autodetect, normalize and data_frame_analyzer processes. This argument requires the changes of elastic/ml-cpp#1514 in order to work, so that PR will be merged before this one. (The controller process already had a different mechanism, tied to the ES JVM lifetime.) Backport of #62993	2020-09-29 18:04:02 +01:00
Costin Leau	3bee28056f	EQL: Fix bug in sequences with any pattern (#63007 ) Fix query creation inside sequences with any queries due to lacking a clause to combine, which led to an invalid request being created. Fix #62967 (cherry picked from commit ff59d8823919a6e70928816e5c3687308ebde33f)	2020-09-29 18:19:25 +03:00
Benjamin Trent	0b3af242d4	[ML] fixing classification feature importance parsing (#63003 ) (#63015 ) Classification feature importance supports various types in the class name: - string - boolean - numerical The xcontent parsing on the server side and the HLRC side should support and test these types.	2020-09-29 10:54:35 -04:00
Yang Wang	068f605040	Use compilation as validation for painless role template (#62845 ) (#63010 ) * Use compilation as validation for painless role template (#62845) Role template validation now performs only compilation if the script is painless. It no longer attempts to execute the script with empty input which is problematic. The compilation process will catch things like invalid syntax, undefined variables, which still provide certain level of protection against ill-defined role templates. Behaviour for Mustache script is unchanged. * Checkstyle	2020-09-30 00:37:41 +10:00
Alan Woodward	de08ba58bf	Convert percolator, murmur3 and histogram mappers to parametrized form (#63004 ) Relates to #62988	2020-09-29 14:42:26 +01:00
Dimitris Athanasiou	facf9ede0a	[ML] Fix binary classification importance in LegacyFeatureImportanceTests (#63000 ) Fixes #62991	2020-09-29 15:53:34 +03:00
Benjamin Trent	2b9032a07d	[7.x] [ML] fixing testTwoJobsWithSameRandomizeSeedUseSameTrainingSet tests (#62976 ) (#62999 ) * [ML] fixing testTwoJobsWithSameRandomizeSeedUseSameTrainingSet tests (#62976) This fixes the two test failures. The shard failure seems to be due to the .ml-stats index being in the middle of being created.	2020-09-29 08:12:20 -04:00
Hendrik Muhs	154a0c00b7	[Transform] add debug logging to investigate #62951 (#62990 )	2020-09-29 12:06:35 +02:00
Mayya Sharipova	ca42726a99	Ensure consistent ordering of hits in test (#62977 ) 50_script_values/Script query fails sometimes as resulting hits will be ordered differently from expected. This patch ensures consistent ordering of hits. Closes #62975	2020-09-29 06:00:34 -04:00
Armin Braun	678688dc84	Avoid Redundantly Loading Monitoring Templates on CS Applier Thread (#62913 ) (#62979 ) This refactors the loading of monitoring templates slightly so that they aren't loaded over and over again (from disk) on CS updates. This isn't an important optimization in production for obvious reasons since it only affects the install stage, but this turned out to cause some slow CS applies in tests. Relates #62853	2020-09-29 11:45:22 +02:00
David Kyle	f23603dafd	[ML][Transform] Filter null objects from field caps request (#62945 ) (#62971 ) If the transform grouping is a script then exclude the field from the source index mappings fields caps request. A null object caused an NPE in the serialisation of FieldCapabilitiesIndexRequest.	2020-09-29 09:07:01 +01:00
Dimitris Athanasiou	7f6c1ff5b4	[7.x][ML] Remove top level importance from classification inference results (#62486 ) (#62964 ) As we have decided top level importance for classification is not useful, it has been removed from the results from the training job. This commit also removes them from inference. Backport of #62486	2020-09-29 10:58:48 +03:00
Mayya Sharipova	4c8c3c8df6	Upgrade lucene to lucene-8.7.0-snapshot-3b59906 (#62978 ) Backport for #62970	2020-09-28 16:52:31 -04:00
Benjamin Trent	a054e62bc4	[ML] allow datafeeds to run if there are any concrete indices (#62827 ) (#62965 ) This commit allows a datafeed to be assigned to a node if only one index pattern has concrete indices.	2020-09-28 12:58:07 -04:00
Hendrik Muhs	be5edcfb26	[Transform] fix possible NPE if transform task has no node assigned (#62946 ) ignore transform tasks that do not have a node assigned when collecting nodes to forward the request for _stop, _stats and _update fixes #62847	2020-09-28 15:25:38 +02:00

6568 Commits