OpenSearch

Commit Graph

Author	SHA1	Message	Date
Andras Palinkas	a7301065d7	SQL: Fix the return type in the sign function (#64845 ) (#64968 ) Fixes the inconsistency between the type of the object returned by the `SIGN()/SIGNUM()` SQL functions and the specified `DataType`. In the Class Sign, DataType is DataTypes.INTEGER. The source code is as follows： ``` public DataType dataType() { return DataTypes.INTEGER; } ``` But In the Class MathProcessor, the source code of SIGN((Object l), Parameter and return value types are the same. Therefore, when using double or float parameters to test, there is a little problem, the test method is like the following curl : ``` curl -XPOST 127.0.0.1:9200/_sql -d "{\"query\":\"select SIGN(1.0) \"}" \ -H 'Content-Type: application/json' ``` The result is: ``` {"columns":[{"name":"SIGN(1.0)","type":"integer"}],"rows":[[1.0]]} ``` The result value is `1.0`, but the type is `integer`. Signed-off-by: mantuliu <240951888@qq.com> Co-authored-by: Marios Trivyzas <matriv@gmail.com> (cherry picked from commits aa78301e71f, ced3c1281c7, 40e5b9b)	2020-11-11 17:07:17 -05:00
Daniel Mitterdorfer	b8c9780c23	Mute multiple tests in ClassificationIT (#64930 ) Relates #64926	2020-11-11 15:30:19 +01:00
Daniel Mitterdorfer	90af072277	Mute AsyncSearchActionIT.testRetryVersionConflict (#64919 ) Relates #63948	2020-11-11 14:01:33 +01:00
Nhat Nguyen	aa0e3f85e6	Increase timeout in testCleanUpShardFollowTasksForDeletedIndices (#64562 ) If the deleted index has N shards, then ShardFollowTaskCleaner can send N*(N-1)/2 requests to remove N shard-follow tasks. I think that's fine as the implementation is straightforward. The test failed when the deleted index has 8 shards. This commit increases the timeout in the test. Closes #64311	2020-11-10 11:51:41 -05:00
Benjamin Trent	f0ff673f82	[ML] Fix bug with data frame analytics classification test data sampling when using custom feature processors (#64727 ) (#64864 ) When using custom processors, the field names extracted from the documents are not the same as the feature names used for training. Consequently, it is possible for the stratified sampler to have an incorrect view of the feature rows. This can lead to the wrong column being read for the class label, and thus throw errors on training row extraction. This commit changes the training row feature names used by the stratified sampler so that it matches the names (and their order) that are sent to the analytics process.	2020-11-10 08:47:07 -05:00
Benjamin Trent	dafafd7ec6	[ML] fix edge case for data frame analytics where a field mapped as a keyword actually has boolean and string values in the _source (#64826 ) (#64862 ) It is possible that a value mapped as a `keyword` has any scalar value type. This includes any numerical value, String, or boolean. This commit allows `boolean` types to be considered as a part of the categorical feature collection when this is the case.	2020-11-10 08:46:52 -05:00
Ryan Ernst	6be4acd509	Use no-kpi subdomains for tests downloading artifacts (#64502 ) This commit converts build code that downloads distributions or other artifacts to use the new no-kpi subdomain, and removes the formerly used no-kpi header.	2020-11-07 11:30:52 -08:00
Lee Hinman	6dbfafcff2	[7.10] Fix SetSingleNodeAllocateStep for data tier deployments (#64679 ) (#64730 ) Backports the following commits to 7.10: Fix SetSingleNodeAllocateStep for data tier deployments (#64679)	2020-11-06 10:12:16 -07:00
Jim Ferenczi	9e4105ec37	Validate PIT on _msearch (#63167 ) This change ensures that we validate point in times provided by individual search requests in _msearch. Relates #63132	2020-11-05 15:38:28 +01:00
Dan Hermann	38ee2da564	Add configurable op_type for index watcher action (#64590 ) (#64647 )	2020-11-05 08:21:19 -06:00
Andrei Dan	a3d9408fda	Fix DataTiersUsageTransportActionTests testCalculateMAD (#64596 ) (#64628 ) Random the compression factor starting with 1 (to elimitinate nearly 0 values) which will only use one centroid (and yield 0 for MAD as the aproximate median is the same as the single centroid mean value) (cherry picked from commit 940e0f1fde0f40f99af117dd03ab0891c9eedae6) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-05 10:58:40 +00:00
Andras Palinkas	3d8e17f3bd	SQL: Fix incorrect parameter resolution (#63710 ) (#64615 ) Summary of the issue and the root cause: ``` (1) SELECT 100, 100 -> success (2) SELECT ?, ? (with params: 100, 100) -> success (3) SELECT 100, 100 FROM test -> Unknown output attribute exception for the second 100 (4) SELECT ?, ? FROM test (params: 100, 100) -> Unknown output attribute exception for the second ? (5) SELECT field1 as "x", field1 as "x" FROM test -> Unknown output attribute exception for the second "x" ``` There are two separate issues at play here: 1. Construction of `AttributeMap`s keeps only one of the `Attribute`s with the same name even if the `id`s are different (see the `AttributeMapTests` in this PR). This should be fixed no matter what, we should not overwrite attributes with one another during the construction of the `AttributeMap`. 2. The `id` on the `Alias`es is not the same in case the `Alias`es have the same `name` and same `child` It was considered to simpy fix the second issue by just reassigning the same `id`s to the `Alias`es with the same name and child, but it would not solve the `unknown output attribute exception` (see notes below). This PR covers the fix for the first issue. Relates to #56013	2020-11-04 20:38:00 -05:00
Jay Modi	4c3300bf57	Fix job scheduling for same scheduled time (#64598 ) The SchedulerEngine used by SLM uses a custom runnable that will schedule itself for its next execution if there is one to run. For the majority of jobs, this scheduling could be many hours or days away. Due to the scheduling so far in advance, there is a chance that time drifts on the machine or even that time varies core to core so there is no guarantee that the job actually runs on or after the scheduled time. This can cause some jobs to reschedule themselves for the same scheduled time even if they ran only a millisecond prior to the scheduled time, which causes unexpected actions to be taken such as what appears as duplicated snapshots. This change resolves this by checking the triggered time against the scheduled time and using the appropriate value to ensure that we do not have unexpected job runs. Relates #63754 Backport of #64501	2020-11-04 10:15:46 -07:00
Andrei Dan	2dbc444fe5	Tests: fix testMoveToStepRereadsPolicy flakiness (#64466 ) (#64593 ) testMoveToStepRereadsPolicy relied on an updated ILM policy that had a rollover condition that enabled the index to be rolled after one second. This changes the test to use a `max_doc`:1 condition so it's under the test's control to trigger the condition. (cherry picked from commit 73ab35a411bcdf5a92eb3d2b3bae5b1132a2bb56) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-04 15:04:01 +00:00
Andrei Dan	835ebcfff2	Convert flaky yml tests to EsRestTestCases (#63634 ) (#64581 ) The yml "Test Invalid Move To Step With Invalid Next Step" worked based on assuming the current step is a particular one. As we can't control the timing of ILM and we can't busy assert in yml test, this converts the test to a java test and makes use of `assertBusy` This converts the explain lifecycle yml tests that depende on ILM having run at least once to a java integration test that makes use of `assertBusy`. (cherry picked from commit 6afd0422ed5ff0e3a2e5661f0e6d192bdad9af4f) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-11-04 12:39:26 +00:00
Mayya Sharipova	8c25130a80	Disable using unsigned_long in scripts (#64552 ) (#64557 ) Backport for #64523 Relates to #64361	2020-11-03 16:38:50 -05:00
Dimitris Athanasiou	e3fee3e6df	[ML] Ignore _doc_count field in data frame analytics (#64541 ) This field is added from version 7.11 onwards. We are adding it to the list of ignored fields for data frame analytics in 7.10 to avoid failing to start an outlier detection job in a mixed cluster environment. Relates #64503	2020-11-03 19:53:17 +02:00
Ignacio Vera	4851bc7bae	Upgrade to Lucene-8.7.0 (#64532 ) (#64537 )	2020-11-03 16:57:04 +01:00
David Roberts	e0d0ac86dd	[ML] Increase forecast test timeout (#64471 ) ForecastIT.testOverflowToDisk has been observed to fail a few times in FIPS JVMs because it takes longer than the permitted 30 seconds. This PR bumps the timeout up to 60 seconds. Fixes #63793	2020-11-02 14:58:18 +00:00
Tanguy Leroux	2d0bddf428	Fix CcrRepositoryIT.testCcrRepositoryFetchesSnapshotShardSizeEtc (#64228 ) (#64479 ) This test failed sometimes for various reasons: an empty bulk request that can't be validated, a background force-merge that completes after the store stats were collected and finally an assertBusy() that waits 10 seconds while we usually wait 60s on the follower cluster in CCR tests. Closes #64167	2020-11-02 15:47:07 +01:00
Marios Trivyzas	0a9481fcaf	EQL: [Tests] enable server side debugging (#64308 ) (#64449 ) Register a new task `runEqlCorrectnessNode` which enables developers to start an ES node in debug mode, properly restore the correctness data and then run queries against it. Assert the index is restored correctly and use new snapshot. (cherry picked from commit fc8c6dd56d602b4a62ee1ff484f00caab92dc6e2)	2020-10-31 11:55:39 +01:00
Tal Levy	ff829bf197	Removes yaml circuit-breaker tests for geoshape geogrid aggs (#64420 ) These tests were added to do a proper end-to-end test of the memory usage of the geotile_grid and geohash_grid aggregations on `geo_shape` fields. Although this was asperational, the truth is — the test environment does not run these aggregations in isolation. This means that the memory overhead is variable and too flaky to rely on over time. The unit tests for circuit-breaking remain. Closes #63158.	2020-10-30 08:07:12 -07:00
Jason Tedor	1126ba4df8	Serialize can contain data with roles (#64324 ) This commit internalizes whether or not a role represents the ability to contain data. In the future, this will let us remove the compatibility role notion.	2020-10-29 20:44:39 -04:00
Armin Braun	fada4a1c78	Fix CachedBlobContainerIndexInputTests (#64239 ) (#64348 ) Closing the input stream happens on a separate thread now that the `CacheFile` is implemented in a lock-free fashion. Closes #64215	2020-10-29 15:11:39 +01:00
James Rodewig	a2b18e9ab9	[DOCS] Fix case for 'Boolean' (#64299 ) (#64342 )	2020-10-29 10:05:57 -04:00
Andrei Stefan	a6d8319231	* Wrap a verification_exception in case there is no valid index available (#64267 ) Wrap a verification_exception in case there is no valid index available in an index_not_found_exception providing also the original index pattern that may be lost in the chain of filters involving the Security one. (cherry picked from commit 9c9da2f2f9a4ad12704f7d3a273f067e96cd2054)	2020-10-29 10:14:50 +02:00
Armin Braun	6bd8f079a7	Enhance CacheFile#invariant Assertion (#64272 ) (#64280 ) Follow up to #64180 tightening the assertion further.	2020-10-28 13:43:39 +01:00
Armin Braun	2983584ef6	Fix #invariant Assertion in CacheFile (#64180 ) (#64264 ) Fix #invariant Assertion in CacheFile closes #64141	2020-10-28 10:22:47 +01:00
jimczi	2492f48375	Fix test compilation	2020-10-28 08:58:09 +01:00
Jim Ferenczi	dcc433c971	Fix UOE when fetching flattened field (#64241 ) The new fields option allows to fetch the value of all fields in the mapping. However, internal fields that are used by some field mappers are also shown when concrete fields retrieved through a pattern (`` or `foo`). We have a [long term plan](https://github.com/elastic/elasticsearch/issues/63446) to hide these fields in field_caps and from pattern resolution so this change is just a hot fix to ensure that they don't break the retrieval in the meantime. The `flattened._keyed field will show up as an empty field when using a pattern that match the flattened field. Relates #63446	2020-10-28 08:49:03 +01:00
Jason Tedor	b46b6d5977	Fix compilation in DataTierTests.java This commit fixes a compilation issue in DataTierTests.java that was introduced due to language-level differences between 7.10/7.x and master.	2020-10-27 13:04:55 -04:00
Jason Tedor	04a9845a49	Adjust defaults for tiered data roles (#64015 ) This commit adjusts the defaults for the tiered data roles so that they are enabled by default, or if the node has the legacy data role. This ensures that the default experience is that the tiered data roles are enabled. To fully specifiy the behavior for the tiered data roles then: - starting a new node with the defaults: enabled - starting a new node with node.roles configured: enabled if and only if the tiered data roles are explicitly configured, independently of the node having the data role - starting a new node with node.data enabled: enabled unless the tiered data roles are explicitly disabled - starting a new node with node.data disabled: disabled unless the tiered data roles are explicitly enabled	2020-10-27 12:48:31 -04:00
Costin Leau	6ca0b6ae6d	EQL: Improve request logging (#64206 ) Add logging to multi-search queries Log response count (cherry picked from commit ee9b9d58f68e2d545d5d841e2f683ec4e96f79e6) (cherry picked from commit 02a4c6b83475cebe715311eeba123ad6fc8d6ba1)	2020-10-27 17:23:43 +02:00
Costin Leau	2363c4be4b	EQL: Polish testing infra (#64205 ) Add tie-breaker inside request creation Add configurable timeout (cherry picked from commit ff281d7b6fd7b4cd2f08bac49aa0b354b6812940) (cherry picked from commit 34bd76fc2987b1ad0b6275ac4358e362a0ba7fb0)	2020-10-27 17:23:43 +02:00
Henning Andersen	0cba23e08f	XPack Usage should run on MANAGEMENT threads (#64160 ) XPack usage starts out on management threads, but depending on the implementation of the usage plugin, they could end up running on transport threads instead. Fixed to always reschedule on a management thread.	2020-10-27 16:03:26 +01:00
Nhat Nguyen	566d1fd459	Return the same point in time in search response (#64188 ) With this change, we will always return the same point in time in a search response as its input until we implement the retry mechanism for the point in times.	2020-10-27 10:17:44 -04:00
Armin Braun	6f1c8136a6	Fix CachedBlobContainerIndexInputTests Shutdown (#64181 ) (#64211 ) Same problem as in #64100 we have to safely wait for all operations to go through to not leak file handles potentially in this test.	2020-10-27 14:22:43 +01:00
Armin Braun	93b52df8c1	Fix SearchableSnapshotDirectoryTests Leaking Listeners (#64150 ) (#64156 ) We need to make sure we don't shut down the pool while ref count release operations are enqueued but not yet executing.	2020-10-26 13:46:14 +01:00
David Kyle	51f8b0b8d2	Mute MonitoringWithWatcherRestIT.testThatLocalExporterAddsWatche (#64143 )	2020-10-26 10:19:48 +00:00
Armin Braun	8161513cb6	Fix Expected Exception Check in BlobstoreCacheService (#63474 ) (#64135 ) The `NodeNotConnectedException` exception can be nested as well in the fairly unlikley case of the disconnect occuring between the connected check and actually sending the request in the transport service. Closes #63233	2020-10-26 10:43:29 +01:00
Armin Braun	17843a40ef	Fix SearchableSnapshotDirectoryTests.testClearCache (#64100 ) (#64132 ) There is a small chance that the file deletion will run on the searchable snapshot thread pool and not on the test thread now that the cache is non-blocking in which case we fail the assertion unless we wait for that thread.	2020-10-26 10:27:08 +01:00
David Roberts	adc5509eda	[ML] Support the unsigned_long type in data frame analytics (#64072 ) Adds support for the unsigned_long type to data frame analytics. This type is handled in the same way as the long type. Values sent to the ML native processes are converted to floats and hence will lose accuracy when outside the range where a float can uniquely represent long values. Backport of #64066	2020-10-26 09:05:49 +00:00
Armin Braun	bd07e44c9a	Make Searchable Snapshot's CacheFile Lock less (#63911 ) (#64125 ) Replacing the mechanism for eviction and listener references via a read-write lock by a reference counting implementation. This fixes a bug that caused test failure #63586 in which concurrently trying to acquire or release an eviction listener while doing a file operation would sometimes lead to throwing an exception since the `tryLock` call on the read lock would fail in this case. Also this removes the possibility of blocking cluster state updates as a result of them waiting on the write-lock which might take a long time if a slow read operation executes concurrently. Closes #63586	2020-10-26 09:30:22 +01:00
David Roberts	cb0c538b35	[ML] Fix rare ML daily maintenance test race condition (#64043 ) Depending on thread scheduling the ML daily maintenance tests could do one more iteration than expected, causing rare failures. Fixes #64036	2020-10-22 13:03:02 +01:00
Rory Hunter	bfd2cbed86	Remove deprecation indexing code from 7.10 (#63942 ) The deprecation indexing code was writing to a regular data stream, and it is not yet possible to hide a data stream or prefix it with a period. This functionality we be re-added once it is possible to mark a data stream as hidden, and also to not rely on the standard logs template since that can be disabled.	2020-10-21 16:28:21 +01:00
Yang Wang	428fd7218c	Use asterisk instead of empty string to clear all cached entries (#63907 ) (#63989 ) The officially supported way to clearing all entries from a cache is to use wildcard of either * or _all. Though empty string has the same effect, it was never intended. Therefore the tests should not use empty string and this PR changes them to use *.	2020-10-21 23:14:01 +11:00
Hendrik Muhs	f2517678aa	[7.10][Transform] add support for unsigned_long data type (#63957 ) add support for unsigned_long, which required a change in writing out integer results properly, because coerce is not supported for unsigned_long fixes #63871 backport #63940	2020-10-20 21:05:46 +02:00
Ignacio Vera	d0f5066310	Upgrade to lucene-8.7.0-snapshot-72d8528c3a6 (#63912 ) (#63928 ) (#63933 )	2020-10-20 15:08:06 +02:00
Benjamin Trent	eff7f06ca6	[ML] fix inference binary classification predication label and feature importance (#63688 ) (#63930 ) When calculating feature importance, the leaf values directly correlate the value of the importance. Consequently, positive leaf values -> positive feature importance negative leaf values -> negative feature importance. It follows that for binary classification, this is done such that the importance relates to the leaf values, which relate directly to the "probability of class 1". So, the feature importance calculated is always for the importance as it relates to class 1. The inverse is the importance as it relates to class 0.	2020-10-20 08:50:15 -04:00
Mayya Sharipova	1287df4074	Fix max/min aggs for unsigned_long (#63904 ) Max and min aggs were producing wrong results for unsigned_long field if field was indexed. If field is indexed for max/min aggs instead of field data, we use values from indexed Points, values of which are derived using method pointReaderIfPossible. Before UnsignedLongFieldType#pointReaderIfPossible was incorrectly producing values, as it failed to shift them back to original values. This patch fixes method pointReaderIfPossible to produce correct original values. Relates to #60050	2020-10-19 15:59:55 -04:00
Julie Tibshirani	f122b88bc5	Remove dependency from version plugin.	2020-10-18 14:09:32 -07:00
Lisa Cawley	e6959d0d20	[7.10][DOCS] Add grant API key API and grant_api_key privilege (#63853 )	2020-10-16 15:41:46 -07:00
Andrei Stefan	5f3c79d64b	Remove filter from QL's field_caps requests (#63840 ) (#63845 ) (cherry picked from commit f009e6341d0fc0471f212d5a41c91e7aab77e006)	2020-10-17 01:36:26 +03:00
Enrico Zimuel	e724be5f41	Specify security.get_role_mapping and security.get_role name as list (#53207 ) (#63785 ) This commit updates the rest API specs for - security.get_role_mapping to specify name as a list type. - security.get_role to specify name as a list type. Co-authored-by: Russ Cam <russ.cam@elastic.co>	2020-10-16 14:22:37 +02:00
Ioannis Kakavas	364511395d	[7.10] Move RestRequestFilter to core (#63507 ) Move RestRequestFilter to core so that Rest requests outside xpack can use it to filter fields and expand its usage. Backport of #63507	2020-10-16 13:57:52 +03:00
Jim Ferenczi	1d78bd0f72	Async search should retry updates on version conflict (#63652 ) * Async search should retry updates on version conflict The _async_search APIs can throw version conflict exception when the internal response is updated concurrently. That can happen if the final response is written while the user extends the expiration time. That scenario should be rare but it happened in Kibana for several users so this change ensures that updates are retried at least 5 times. That should resolve the transient errors for Kibana. This change also preserves the version conflict exception in case the retry didn't work instead of returning a confusing 404. This commit also ensures that we don't delete the response if the search was cancelled internally and not deleted explicitly by the user. Closes #63213	2020-10-16 08:49:02 +02:00
Albert Zaharovits	f4e1e6893d	Add view_index_metadata over metricbeat-* for monitoring agent (#63750 ) The `remote_monitoring_agent` reserved role is extended to grant more privileges over the metricbeat-* index pattern. In addition to the index and create_index index privileges that it granted already, it now also grants the view_index_metadata privilege. Closes #63203	2020-10-16 02:13:55 +03:00
Jay Modi	ebdaeb2f9a	Ensure cancelled jobs do not continue to run (#63771 ) This commit ensures that jobs within the SchedulerEngine do not continue to run after they are cancelled. There was no synchronization between the cancel method of an ActiveSchedule and the run method, so an actively running schedule would go ahead and reschedule itself even if the cancel method had been called. This commit adds synchronization between cancelling and the scheduling of the next run to ensure that the job is cancelled. In real life scenarios this could manifest as a job running multiple times for SLM. This could happen if a job had been triggered and was cancelled prior to completing its run such as if the node was no longer the master node or if SLM was stopping/stopped. Closes #63754 Backport of #63762	2020-10-15 14:01:14 -06:00
Jay Modi	4d6daa6e40	Handle missing logstash index exceptions (#63753 ) This commit updates the APIs in the logstash plugin to handle IndexNotFoundExceptions that are returned by client calls. Until we have the creation of this index in place, we need to handle this case and not let the exception propagate out of the API. Backport of #63698	2020-10-15 09:45:02 -06:00
Marios Trivyzas	1dbd3a90ae	EQL: [Tests] Use snapshot from 7.10 To be able to run the tests from 7.10 onwards use a snapshot created with 7.10. Follows: #63735	2020-10-15 17:28:52 +02:00
Bogdan Pintea	35f5c49e54	SQL: integer parameter validation in string functions (#63338 ) (#63728 ) * SQL: integer parameter validation in string functions (#58923) In insert, locate, substring function, when argument `start` or `length` is greater than Integer.MAX_INT OR less then Integer.MIN_INT + 1 (note that `start` need to minus 1), it causes overflow and leads to unexpected results. * Add range checks for BinaryStringNumericProcessors - Add range checks for Left, Right, Repeat. - Minor refactorings on initial PR changes. Co-authored-by: yinanwu <yinanwu@tencent.com> (cherry picked from commit bf6dc58b93529f977d035a846d083b1c31867694) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-10-15 16:00:48 +02:00
Marios Trivyzas	095f979060	EQL: [Tests] Add correctness integration tests (#63644 ) (#63735 ) Add a new gradle module under eql/qa which runs and validates a set of queries over a 4m event dataset (restored from a snapshot residing in a gcs bucket). The results are providing by running the exact set of queries with Python EQL against the same dataset. Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com> (cherry picked from commit 1cf789e5fcfb0f364f665bfaac021e24a4c2f556) Co-authored-by: Mark Vieira <portugee@gmail.com>	2020-10-15 15:28:26 +02:00
Costin Leau	06eae58d40	EQL: Fix translation of bool fields (#63694 ) This commit fixes two issues in dealing with bool fields in EQL: - avoid simplifications of field == true expressions - adding comparison to clauses on fields missing logic (where bool) Fix #63693 (cherry picked from commit d10a5d0e842bbd4e0031834de948ceb24da3872b) (cherry picked from commit 0227da3a275c7f22ff524d99d53e1a79146f9e28)	2020-10-15 14:38:31 +03:00
Hendrik Muhs	65ef399563	[Transform] increase timeout in testStopWaitForCheckpoint (#63657 ) increase the overall timeout by increasing the wait time after every retry. fixes #63365	2020-10-15 08:51:33 +02:00
Albert Zaharovits	2b7fbe9957	Add the missing apikey.* fields to the logfile audit layout for docker builds (#63609 ) The layout pattern for the security audit for docker builds was missing the apiKey.* fields.	2020-10-14 13:58:41 +03:00
Tanguy Leroux	57b5715bf7	Add CCR repository test for snapshot shard size (#63649 ) Following #61906 this commit adds two new integration tests that verifies the sizes of snapshotted shards for CCR repositories. Backport of #63590	2020-10-14 12:51:42 +02:00
Ryland Herrick	7e8769a666	EQL: make allow_no_indices true by default (#63573 ) (#63645 ) * Allow all indices options variants Irrespective of allow_no_indices value, throw VerificationException when there is no index validated Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>	2020-10-14 03:41:04 +03:00
Lee Hinman	7371e51583	[7.10] Add DiscoveryNodeRole compatibility role for bwc tier serialization (#63581 ) (#63613 ) Backports the following commits to 7.10: Add DiscoveryNodeRole compatibility role for bwc tier serialization (#63581)	2020-10-13 09:17:15 -06:00
Andras Palinkas	f9c8ae5f77	SQL: Improve parser error message for `ESCAPE` (#63616 ) Mentions the list of wildchars in case a wildchar is used as an `ESCAPE` character. Relates #63428 (cherry picked from commit 74cbcf871e9593b3640e382ae6845168fd14966b)	2020-10-13 10:47:06 -04:00
Andras Palinkas	4aabc050a0	SQL: Escaped wildcard () not accepted in LIKE (#63428 ) For a query like `SELECT name FROM test WHERE name LIKE ''%c'` ES SQL generates an error. `` is not a special character in a `LIKE` construct and it's expected to not needing to be escaped, so the previous query should work as is. In the LIKE pattern any `` character was treated as invalid character and the usage of `%` or `_` was suggested instead. But `*` is a valid, acceptable non-wildcard on the right side of the `LIKE` operator. Fix: #55108 (cherry picked from commit 190d9fe3deb31aed0d8f312007360625d4fff217)	2020-10-13 10:47:03 -04:00
Przemysław Witek	acbd48f834	[ML] Allow setting num_top_classes to a special value -1 (#63587 ) (#63602 )	2020-10-13 13:57:50 +02:00
David Roberts	3f210e2620	[ML] Load data streams plugin for ML internal cluster tests (#63560 ) Now that deprecation logs get indexed to a data stream, if we do not load the data stream plugin in our tests and any test generates a deprecation log message then millions of exceptions get logged, slowing down the tests to the extent that they can fail. This change loads the data streams plugin during the ML internal cluster tests. (It should already be present in external cluster tests.) Fixes #63548	2020-10-12 17:46:50 +01:00
Dimitris Athanasiou	e1c418aac7	[7.10][ML] Validate dest pipeline exists on transform update (#63494 ) (#63549 ) Adds validation that the dest pipeline exists when a transform is updated. Refactors the pipeline check into the `SourceDestValidator`. Fixes #59587 Backport of #63494	2020-10-12 15:41:35 +03:00
Tomas Della Vedova	8b07750a8b	Updated version string in yaml test (#63410 ) (#63533 )	2020-10-12 11:19:36 +02:00
Julie Tibshirani	ae2fc4118d	Add factory methods for common value fetchers. (#63438 ) This PR adds factory methods for the most common implementations: * `SourceValueFetcher.identity` to pass through the source value untouched. * `SourceValueFetcher.toString` to simply convert the source value to a string.	2020-10-08 12:14:53 -07:00
Julie Tibshirani	a506705569	Small fixes to flattened field value fetching. (#63443 ) * Remove FlatObjectFieldTypeTests, as it's redundant. * Do not apply null_value when fetching root-level values. * Remove a TODO in favor of opening an issue.	2020-10-08 11:52:54 -07:00
Benjamin Trent	a9be4181c6	[ML] fix grabbing the doc value limit setting in _explain (#63402 ) (#63471 ) Getting the doc value settings shouldn't use the API callers headers. We only use this value internally.	2020-10-08 08:53:26 -04:00
Mayya Sharipova	e022b78198	Upgrade to lucene-8.7.0-snapshot-5c4168d (#63466 ) This disables sort optim on _doc, which may still be unstable. Backport for #63444	2020-10-08 08:20:43 -04:00
Costin Leau	2ab5f226c4	EQL: Avoid filtering on tiebreakers (#63415 ) Do not filter by tiebreaker while searching sequence matches as it's not monotonic and thus can filter out valid data. Add handling for data 'near' the boundary that has the same timestamp but different tie-breaker and thus can be just outside the window. Fix #62781 Relates #63215 (cherry picked from commit 36f834600d4d9ded0fb7b1440274b2e597733770) (cherry picked from commit 72a2ce825f3bfd13f87423ba7f3c739ea64c57f6)	2020-10-08 13:50:41 +03:00
David Roberts	a9d541561f	[ML] Unmute DeleteExpiredDataIT.testDeleteExpiredDataNoThrottle (#63408 ) This test appears to work again following the Lucene bug fix that was integrated in #63395	2020-10-08 09:11:29 +01:00
Przemysław Witek	bd761cce1d	[ML] Validate that AucRoc has the data necessary to be calculated (#63302 ) (#63454 )	2020-10-08 09:52:15 +02:00
Luca Cavanna	659988a77f	Remove runtime fields (#63418 ) We are not going to release runtime fields with 7.10, hence we are removing them from the 7.10 branch.	2020-10-07 20:39:41 +02:00
Mayya Sharipova	e236ea43e9	Upgrade to lucene-8.7.0-snapshot-e914862 (#63401 ) Backport for: #63395	2020-10-07 09:45:14 -04:00
Alan Woodward	88b45dfa61	Convert TextFieldMapper to parametrized form (#63269 ) (#63392 ) As a result of this, we can remove a chunk of code from TypeParsers as well. Tests for search/index mode analyzers have moved into their own file. This commit also rationalises the serialization checks for parameters into a single SerializerCheck interface that takes the values includeDefaults, isConfigured and the value itself. Relates to #62988	2020-10-07 13:26:25 +01:00
Hendrik Muhs	d45f7de3fb	[Transform] Add test logging regarding conflict on start (#63383 ) add extra logging for investigation of #63365	2020-10-07 10:17:31 +02:00
Tim Vernum	c30c5555c5	Mute DeleteExpiredDataIT deleteExpired NoThrottle (#63381 ) Mutes test method DeleteExpiredDataIT.testDeleteExpiredDataNoThrottle Relates: #63379 Backport of: #63380	2020-10-07 17:43:52 +11:00
Stuart Tettemer	8a61b95a0f	Scripting: JSON parsing and writing in watcher (#63278 ) (#63377 ) Co-authored-by: Honza Král Co-authored-by: Jack Conradson Backport of: f43e52d	2020-10-06 23:39:40 -05:00
Stuart Tettemer	7f4f70f557	Scripting: Augment String with Hash support in Watcher (#63346 ) (#63375 ) Strings in the watcher context may use the `.sha1()` and `.sha256()` augmentation added for ingest. Ref: #59633, #59671 Fixes: #61244 Backport of: 380ee6f	2020-10-06 22:10:27 -05:00
Gordon Brown	15edc39d9b	Update logstash_admin role for system indices (#63368 ) This PR updates the `logstash_admin` role to include the recently-added Logstash Pipeline Management APIs, as well as access to the `.logstash*` index pattern. Co-authored-by: William Brafford <williamrandolphbrafford@gmail.com>	2020-10-06 20:43:36 -06:00
Mayya Sharipova	f2ba62b894	Upgrade to lucene- 8.7.0-snapshot-66c49a35402 (#63372 ) This includes fixing a bug in doc iteration during sort optimization Backport for #63349	2020-10-06 22:38:58 -04:00
Julie Tibshirani	f17ca18dfa	Make array value parsing flag more robust. (#63371 ) When constructing a value fetcher, the 'parsesArrayValue' flag must match `FieldMapper#parsesArrayValue`. However there is nothing in code or tests to help enforce this. This PR reworks the value fetcher constructors so that `parsesArrayValue` is 'false' by default. Just as for `FieldMapper#parsesArrayValue`, field types must explicitly set it to true and ensure the behavior is covered by tests. Follow-up to #62974.	2020-10-06 17:49:25 -07:00
Gordon Brown	5c8b0662df	Deprecate REST access to System Indices (#63274 ) (Original #60945 ) This PR adds deprecation warnings when accessing System Indices via the REST layer. At this time, these warnings are only enabled for Snapshot builds by default, to allow projects external to Elasticsearch additional time to adjust their access patterns. Deprecation warnings will be triggered by all REST requests which access registered System Indices, except for purpose-specific APIs which access System Indices as an implementation detail a few specific APIs which will continue to allow access to system indices by default: - `GET _cluster/health` - `GET {index}/_recovery` - `GET _cluster/allocation/explain` - `GET _cluster/state` - `POST _cluster/reroute` - `GET {index}/_stats` - `GET {index}/_segments` - `GET {index}/_shard_stores` - `GET _cat/[indices,aliases,health,recovery,shards,segments]` Deprecation warnings for accessing system indices take the form: ``` this request accesses system indices: [.some_system_index], but in a future major version, direct access to system indices will be prevented by default ```	2020-10-06 13:41:40 -06:00
Tanguy Leroux	87076c32e2	Determine shard size before allocating shards recovering from snapshots (#61906 ) (#63337 ) Determines the shard size of shards before allocating shards that are recovering from snapshots. It ensures during shard allocation that the target node that is selected as recovery target will have enough free disk space for the recovery event. This applies to regular restores, CCR bootstrap from remote, as well as mounting searchable snapshots. The InternalSnapshotInfoService is responsible for fetching snapshot shard sizes from repositories. It provides a getShardSize() method to other components of the system that can be used to retrieve the latest known shard size. If the latest snapshot shard size retrieval failed, the getShardSize() returns ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE. While we'd like a better way to handle such failures, returning this value allows to keep the existing behavior for now. Note that this PR does not address an issues (we already have today) where a replica is being allocated without knowing how much disk space is being used by the primary. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-10-06 18:37:05 +02:00
Igor Motov	2405162c39	Mute RegressionIT.testAliasFields test (#63339 ) It fails quite frequently in 7.x. Relates to #63268	2020-10-06 12:18:12 -04:00
David Kyle	ea32b4ab82	[ML] Audit message when nightly maintenance times out (#63252 ) (#63330 ) During deletion of old ml data set the delete by query timeout to 8 hours and audit a job message when the nightly maintenance task times out.	2020-10-06 16:19:37 +01:00
Hendrik Muhs	058c55da6a	[Transform] disallow field and script being empty for group sources (#63313 ) fail validation earlier when field and script are both missing in a group source	2020-10-06 16:59:02 +02:00
Yang Wang	abf9b885b4	Bulk invalidate API keys using a list of IDs (#63224 ) (#63320 ) Add a new ids field to the API of invalidating API keys so that it supports bulk invalidation with a list of IDs. Note the existing id field is kept as is and it is an error if both id and ids are specified.	2020-10-07 00:49:21 +11:00
Yang Wang	bbfa2f1303	Fix test failure due to missing client action	2020-10-07 00:45:30 +11:00
Benjamin Trent	a72d7cc76a	[ML] prefer secondary auth headers on data frame analytics _explain (#63281 ) (#63323 ) We should prefer secondary auth headers when calling _explain	2020-10-06 09:15:29 -04:00
Luca Cavanna	ca68298e89	Remove MapperService argument from IndexFieldData.Builder#build (#63197 ) (#63311 ) MapperService carries a lot of weight and is only used to determine if loading of field data for the id field is enabled, which can be done in a different way.	2020-10-06 15:04:23 +02:00

1 2 3 4 5 ...

5813 Commits