OpenSearch

Commit Graph

Author	SHA1	Message	Date
Igor Motov	51c6f69e02	[7.x] Add support for filters to T-Test aggregation (#54980 ) (#55066 ) Adds support for filters to T-Test aggregation. The filters can be used to select populations based on some criteria and use values from the same or different fields. Closes #53692	2020-04-13 12:28:58 -04:00
Jake Landis	a2fafa6af4	[7.x] Lazy test cluster module and plugins (#54852 ) (#55087 ) This change converts the module and plugin parameters for testClusters to be lazy. Meaning that the values are not resolved until they are actually used. This removes the requirement to use project.afterEvaluate to be able to resolve the bundle artifact. Note - this does not completely remove the need for afterEvaluate since it is still needed for the custom resource extension.	2020-04-13 10:53:35 -05:00
Igor Motov	6861295706	Further improve InternalTTestTests (#55081 ) A small follow-up to #54910. Now that we can generated consistent set of internal aggs to reduce, we no longer need to keep agg parameters as class variables. Related to #54910	2020-04-13 10:26:23 -04:00
Benjamin Trent	c5c7ee9d73	[7.x] [ML] Start gathering and storing inference stats (#53429 ) (#54738 ) * [ML] Start gathering and storing inference stats (#53429) This PR enables stats on inference to be gathered and stored in the `.ml-stats-` indices. Each node + model_id will have its own running stats document and these will later be summed together when returning _stats to the user. `.ml-stats-` is ILM managed (when possible). So, at any point the underlying index could change. This means that a stats document that is read in and then later updated will actually be a new doc in a new index. This complicates matters as this means that having a running knowledge of seq_no and primary_term is complicated and almost impossible. This is because we don't know the latest index name. We should also strive for throughput, as this code sits in the middle of an ingest pipeline (or even a query).	2020-04-13 08:15:46 -04:00
Andrei Dan	c0406f78b7	ILM add cluster update timeout on step retry (#54878 ) (#55022 ) This commits adds a timeout when moving ILM back on to a failed step. In case the master is struggling with processing the cluster update requests these ones will expire (as we'll send them again anyway on the next ILM loop run) ILM more descriptive source messages for cluster updates Use the configured ILM step master timeout setting (cherry picked from commit ff6c5ed16616eadfcddd9c95317d370f0d126583) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-11 10:13:31 +01:00
Andrei Dan	b8df265b42	[7.x] ILM use Priority.IMMEDIATE for stop ILM cluster update (#54909 ) (#55018 ) * ILM use Priority.IMMEDIATE for stop ILM cluster update (#54909) This changes the priority of the cluster state update that stops ILM altogether to `IMMEDIATE`. We've chosen to change this as it can be useful to temporarily stop ILM if a cluster is overwhelmed, but a `NORMAL` priority can see the "stop ILM update" not make it up the tasks queue. On the same note, we're keeping the `start ILM` cluster update priority to `NORMAL` on purpose such that we only start `ILM` if the cluster can handle it. (cherry picked from commit d67df3a7cd2a8619c2c9efac4dde3ba83271f2fa) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-11 10:12:35 +01:00
Albert Zaharovits	f22004a262	Preserve parent task id for data frame analytics (#55046 ) This change makes sure that all internal client requests spawned by the data frame analytics persistent task executor and that use the end user security credentials, have the parent task id assigned. The objective here is to permit auditing (as well as tracking for debugging purposes) of all the end-user requests executed on its behalf by persistent tasks. Because data frame analytics taks already implements graceful shutdown of child tasks, this change does not interfere with it by opting out of the persistent task cancellation of child tasks. Relates #54943 #52314	2020-04-10 22:27:21 +03:00
Mark Vieira	5d4ddf9146	Fixes for IntelliJ IDEA 2020.1 support (#55077 )	2020-04-10 11:57:48 -07:00
Nik Everett	c00811f3a3	Make some agg tests easier to read (#54954 ) (#55079 ) We added a fancy method to provide random realistic test data to the reduction tests in #54910. This uses that to remove some of the more esoteric machinations in the agg tests. This will marginally increase the coverage of the serialiation tests and, more importantly, remove some mysterious value generation code that only really made sense for random reduction tests but was used all over the place. It doesn't, on the other hand, make the tests shorter. Just hopefully more clear. I only cleaned up a few tests this way. If we like this it'd probably be worth grabbing others.	2020-04-10 14:15:30 -04:00
Luca Cavanna	93c39ad4e7	Async search: create internal index only before storing initial response (#54619 ) We currently create the .async-search index if necessary before performing any action (index, update or delete). Truth is that this is needed only before storing the initial response. The other operations are either update or delete, which will anyways not find the document to update/delete even if the index gets created when missing. This also caused `testCancellation` failures as we were trying to delete the document twice from the .async-search index, once from `TransportDeleteAsyncSearchAction` and once as a consequence of the search task being completed. The latter may be called after the test is completed, but before the cluster is shut down and causing problems to the after test checks, for instance if it happens after all the indices have been cleaned up. It is totally fine to try to delete a response that is no longer found, but not quite so if such call will also trigger an index creation. With this commit we remove all the calls to createIndexIfNecessary from the update/delete operation, and we leave one call only from storeInitialResponse which is where the index is expected to be created. Closes #54180	2020-04-10 18:24:05 +02:00
Ross Wolf	96a903b17f	EQL: Add string function (#54470 ) * EQL: Add string() function * EQL: Reorder queryfolder_tests * EQL: Add test queries * EQL: Fix InternalEqlScriptUtils.string and test case * EQL: Fix testStringFunctionWithText error message * EQL: Flatten ToStringFunctionPipe.equals * EQL: Reorder painless whitelist * EQL: Address feedback and remove string(null) handling * EQL: Move string(pid) test over * EQL: Rename source -> value	2020-04-10 09:48:29 -06:00
Przemysław Witek	17101d86d9	[7.x] Do not execute ML CRUD actions when upgrade mode is enabled (#54437 ) (#55049 )	2020-04-10 16:07:11 +02:00
Dimitrios Liappis	b062535e27	Mute testSearchableSnapshotAction in TimeSeriesLifecycleActions tests (#55055 ) Backport of #55052 Details in #55050	2020-04-10 16:03:09 +03:00
Jason Tedor	a370668fcc	Clean up even more instances of "metaData" We recently cleaned up the use of the word "metadata" across the codebase. Even more additional uses have trickled in, likely from in-progress work. This commit cleans up these last few additional instances. Relates #54519	2020-04-10 08:52:37 -04:00
Jason Tedor	9eeae59a83	Clarify available processors (#54907 ) The use of available processors, the terminology, and the settings around it have evolved over time. This commit cleans up some places in the codes and in the docs to adjust to the current terminology.	2020-04-10 08:48:27 -04:00
Costin Leau	a7e4f79e8f	EQL: Deprecate lenient sequence declaration (#55032 ) Deprecate alternative sequence parameter declaration (with then by) Disallow lack of time units inside maxspan Fix #55023 Relate #54680 (cherry picked from commit 201adafba9def1de4bf843760defb9def3394f63)	2020-04-10 10:30:07 +03:00
Marios Trivyzas	bf0cadb602	SQL: Implement DATETIME_PARSE function for parsing strings (#54960 ) (#55035 ) Implement DATETIME_PARSE(<datetime_str>, <pattern_str>) function which allows to parse a datetime string according to the specified pattern into a datetime object. The patterns allowed are those of java.time.format.DateTimeFormatter. Relates to #53714 (cherry picked from commit 3febcd8f3cdf9fdda4faf01f23a5f139f38b57e0)	2020-04-10 01:16:29 +02:00
Nhat Nguyen	c9f8fb2dd0	Clear recent errors when auto-follow successfully (#54997 ) Today, we do not clear the recent errors in AutoFollowCoordinator when we successfully auto-follow indices. This can lead to confusion for the operators.	2020-04-09 14:35:16 -04:00
Albert Zaharovits	f55a361b64	Preserve Task Id for ML Datafeed (#54943 ) This change preserves the task id for internal requests for the `StartDatafeedPersistentTask`. Task ids are a way to express a relationship between related internal requests. In this particular case, the task ids are used for debugging and (soon) security auditing, but not for task cancellation, because there is already a graceful-shutdown of child internal requests (given a task id) in place.	2020-04-09 13:22:29 +03:00
Hendrik Muhs	223fbb2ae7	[Transform] fix sporadic test failure due to unavailable notif… (#54939 ) move no initializing shards check before dumping audit messages fixes #54810	2020-04-09 08:04:42 +02:00
Andrei Stefan	85f129a50a	EQL: indexOf function implementation (#54543 ) (#54989 ) (cherry picked from commit a4b1d6e52d9ba22d541dd86d69861b1efee83604)	2020-04-09 02:41:01 +03:00
Mark Vieira	1552f2fa3e	Enable searchable snapshots for release tests (#54987 )	2020-04-08 14:41:03 -07:00
Mark Vieira	0fa8a14bcb	Mute SamlServiceProviderDocumentTests.testStreamRoundTripWithAllFields	2020-04-08 12:56:36 -07:00
Jay Modi	3600c9862f	Reintroduce system index APIs for Kibana (#54935 ) This change reintroduces the system index APIs for Kibana without the changes made for marking what system indices could be accessed using these APIs. In essence, this is a partial revert of #53912. The changes for marking what system indices should be allowed access will be handled in a separate change. The APIs introduced here are wrapped versions of the existing REST endpoints. A new setting is also introduced since the Kibana system indices' names are allowed to be changed by a user in case multiple instances of Kibana use the same instance of Elasticsearch. Relates #52385 Backport of #54858	2020-04-08 09:08:49 -06:00
Bogdan Pintea	8d6d7b88d8	SQL: drop BASE TABLE type in favour for just TABLE (#54836 ) (#54951 ) * Drop BASE TABLE type in favour for just TABLE This commit drops the table type 'BASE TABLE' and replaces all occurences with just 'TABLE', since his type is wider-used and friendlier to the client applications that query for certain table types in their discovery mode. The 'TABLE' type is also explicitely mentioned by the JDBC and ODBC standards and although other data source-specific types are permitted, older apps will not work well with them. * Refactor table type constants out of IndexType Move SQL_TABLE/_ALIAS out of IndexType, so that they can also be used in that Enum definition. (cherry picked from commit 70241b52697ac2cf71004040042123c1ec050299)	2020-04-08 16:02:12 +02:00
Marios Trivyzas	6afd60b082	SQL: Implement DATETIME_FORMAT function for date/time formatting (#54832 ) (#54942 ) Implement DATETIME_FORMAT(<date/datetime/time>, ) function which allows for formatting a timestamp to the specified format. The patterns allowed as those of java.time.format.DateTimeFormatter. Related to #53714 (cherry picked from commit 72be0b54a9299e87e785469cdc9aafac2a48c046)	2020-04-08 13:45:47 +02:00
David Turner	0d2195191d	Allocate searchable snapshots with the balancer (#54889 ) Today the shards of searchable snapshots are allocated with a naive `ExistingShardsAllocator` which selects the first valid node for each shard. Thanks to #54729 we can now allow these shards to fall through to the balanced shards allocator so that they are allocated in a more balanced fashion. Relates #50999	2020-04-08 10:02:42 +01:00
Ryan Ernst	37795d259a	Remove guava from transitive compile classpath (#54309 ) (#54695 ) Guava was removed from Elasticsearch many years ago, but remnants of it remain due to transitive dependencies. When a dependency pulls guava into the compile classpath, devs can inadvertently begin using methods from guava without realizing it. This commit moves guava to a runtime dependency in the modules that it is needed. Note that one special case is the html sanitizer in watcher. The third party dep uses guava in the PolicyFactory class signature. However, only calling a method on the PolicyFactory actually causes the class to be loaded, a reference alone does not trigger compilation to look at the class implementation. There we utilize a MethodHandle for invoking the relevant method at runtime, where guava will continue to exist.	2020-04-07 23:20:17 -07:00
Aleksandr Maus	d02f774cb6	EQL: implement cidrMatch function (#54186 ) (#54928 ) Related to https://github.com/elastic/elasticsearch/issues/54132	2020-04-07 22:07:28 -04:00
Tal Levy	254d1e3543	[7.x] Create new `geo` module and migrate geo_shape registration (#53562 ) (#54924 ) This commit introduces a new `geo` module that is intended to be contain all the geo-spatial-specific features in server. As a first step, the responsibility of registering the geo_shape field mapper is moved to this module. Co-authored-by: Nicholas Knize <nknize@gmail.com>	2020-04-07 16:30:58 -07:00
Aleksandr Maus	de381271f1	EQL: implement stringContains function (#54380 ) (#54923 )	2020-04-07 17:55:13 -04:00
Nik Everett	ce7ae4a7d1	Remove pipline aggs from agg result tree (backport of #54716 ) (#54920 ) This removes pipeline aggregators from the aggregation result tree except for a single field used for backwards compatibility with pre-7.8 versions of Elasticsearch. That field isn't populated unless we are serializing to pre-7.8 Elasticsearch. So, good news! We no longer build pipeline aggregators on the data node. Most of the time.	2020-04-07 17:22:23 -04:00
Nik Everett	100f7258c7	Improve agg reduce tests (#54910 ) (#54914 ) This allows subclasses of `InternalAggregationTestCase` to make a `List` of values to reduce so that it can make values that are realistic together. The first use of this is with `InternalTTest` which uses it to make results that don't cause their `sum` field to wrap. It'd likely be useful for a ton of other aggs but just one for now.	2020-04-07 17:22:04 -04:00
Aleksandr Maus	868798e4db	EQL: implement between function (#54277 ) (#54913 )	2020-04-07 16:52:30 -04:00
Costin Leau	8b1e87cb61	EQL: Change query folding spec from new lines to ; (#54882 ) The usage of blank lines as separator between tests can be tricky to deal with in case of merges where such lines can be added by accident. Further more counting non-consecutive lines is non-intuitive. The tests have been aligned to use ; at the end of the query and exceptions so that the presence or absence of empty lines is irrelevant. The parsing of the spec has been changed to perform validation to not allow invalid/incomplete specs to cause exceptions. (cherry picked from commit 192ad88d3a51e1e1f1f82830526518720ec88217)	2020-04-07 21:57:06 +03:00
Tanguy Leroux	b8d2b952b8	Only one of azure key or token can be specified in 3rd party tests (#54876 ) #54803 introduces more QA tests for Azure storage service, but they fail the build is one of the key or token is missing. It should i nstead work like repository-azure:qa tests.	2020-04-07 19:36:48 +02:00
Nik Everett	faa687c0ae	Fix InternalTTestTests `testReduceRandom` was bumping up against the serialization that I added in #54776. This makes it use random values that reduce in ways that don't cause the randomized serialization to fail.	2020-04-07 11:51:54 -04:00
Larry Gregory	8c8baa10f4	[Backport] Add reserved_ml_user and reserved_ml_admin kibana p… (#54837 ) * add reserved_ml_user and reserved_ml_admin kibana privileges * address feedback, update dataframe roles * fix checkstyle failure	2020-04-07 11:42:11 -04:00
Dimitris Athanasiou	9b4ac60b53	[7.x][ML] Cancel reindex task from correct thread context (#54874 ) (#54898 ) When a data frame analytics job is stopped, if the reindexing task was still in progress we cancel it. Cancelling it should be done from the same context as when we executed the reindexing task. That means from a thread context with ML origin. Backport of #54874	2020-04-07 18:11:58 +03:00
Andrei Dan	bbc57828c4	ILM fix retry delete action test (#54809 ) (#54895 ) Asserting on the failed_step field from the explainAPI can produce flakiness because the ILM state is moved back and forth between the (failing) step and the ERROR step (as the workflow is retry, fail then move to ERROR step, move back to the (failing) step, retry, fail, etc) and the failed_step information is only available whilst in the ERROR state. Unmute other tests as they were collateral failures A read-only index could not be deleted in the wipeCluster phase and caused these failures (cherry picked from commit 99a6d57aeb3cf11abc38b514f38a96bb1612e357) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-07 15:55:56 +01:00
Nik Everett	3c56e0de42	Fix scripted metric in ccs (backport of #54776 ) (#54888 ) `scripted_metric` did not work with cross cluster search because it assumed that you'd never perform a partial reduction, serialize the results, and then perform a final reduction. That serialized-after-partial-reduction step was broken. This is also required to support #54758.	2020-04-07 10:43:00 -04:00
Ignacio Vera	076c199484	Add new point field. (#53804 ) (#54879 ) This commit adds a new point field that is able to index arbitrary pair of values (x/y) in the cartesian space. It only supports filtering using shape queries at the moment.	2020-04-07 15:28:50 +02:00
Tanguy Leroux	4d36917e52	Merge feature/searchable-snapshots branch into 7.x (#54803 ) (#54825 ) This is a backport of #54803 for 7.x. This pull request cherry picks the squashed commit from #54803 with the additional commits: 6f50c92 which adjusts master code to 7.x a114549 to mute a failing ILM test (#54818) 48cbca1 and 50186b2 that cleans up and fixes the previous test aae12bb that adds a missing feature flag (#54861) 6f330e3 that adds missing serialization bits (#54864) bf72c02 that adjust the version in YAML tests a51955f that adds some plumbing for the transport client used in integration tests Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Yannick Welsch <yannick@welsch.lu> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> Co-authored-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-07 13:28:53 +02:00
David Roberts	8f2ddaee1a	[TEST] Allow kb or mb for data frame analytics memory estimate (#54869 ) This change is to support the switch from kb to mb being made in https://github.com/elastic/ml-cpp/pull/1126	2020-04-07 11:28:29 +01:00
David Roberts	df4ae79b41	[TEST] Unmute CategorizationIT.testNumMatchesAndCategoryPreference (#54868 ) Should work again now that https://github.com/elastic/ml-cpp/issues/1121 is resolved. Backport of #54768	2020-04-07 11:04:31 +01:00
Armin Braun	1039cae2cc	Fix Repository Consistency TODOs from SLM Tests (#54767 ) (#54860 ) These TODOs don't apply any longer with the repository generation now being tracked consistently so we can remove the workarounds.	2020-04-07 09:27:50 +02:00
Jim Ferenczi	c7ff67ddef	Preserve final response headers in asynchronous search (#54349 ) This change adds the response headers of the original search request in the stored response in order to be able to restore them when retrieving a result from the async-search index. It also ensures that response headers are preserved for users that retrieve a final response on a running search task. Partial response can eventually return response headers too but this change only ensures that they are present when the response if final. Relates #33936	2020-04-07 08:37:03 +02:00
Jim Ferenczi	d57a047ab7	Fix transport serialization of AsyncSearchUser (#54761 ) This change ensures that the AsyncSearchUser is correctly (de)serialized when an action executed by this user is sent to a remote node internally (via transport client).	2020-04-07 08:25:58 +02:00
Costin Leau	99846f47b7	QL: Introduce infrastructure for surrogate functions (#54795 ) Some functions act as shortcuts for more verbose declarations (sometimes with certain constraints). This PR removes the boilerplate around declaring such functions as well as a dedicated rule for the optimizer to perform the actual substitution. Fix #54334 (cherry picked from commit 3231d01b0c583deb89252fafe84db48878da3246)	2020-04-07 00:46:50 +03:00
Costin Leau	36121117f0	EQL: Sequence/Join parsing and model (#54227 ) Add parsing and (logical) domain model for sequence and join (cherry picked from commit 9e9632d41a39877256c68634ab18e441f4b67fe8)	2020-04-06 23:15:35 +03:00

1 2 3 4 5 ...

4560 Commits