OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Roberts	77cc6d5bad	[TEST] Work around _cat/indices bug with security enabled (#47160 ) When the ML native multi-node tests use _cat/indices/_all and the request goes to a non-master node, _all is translated to a list of concrete indices by the authz layer on the coordinating node before the request is forwarded to the master node. Then it is possible for the master node to return an index_not_found_exception if one of the concrete indices that was expanded on the coordinating node has been deleted in the meantime. (#47159 has been opened to track the underlying problem.) It has been observed that the index that gets deleted when the problem affects the ML native multi-node tests is always the ML notifications index. The tests that fail are only interested in the presence or absense of ML results indices. Therefore the workaround is to only _cat indices that match the ML results index pattern. Fixes #45652	2019-09-26 13:29:40 +01:00
Dimitris Athanasiou	0765bd4bf7	[7.x][ML] Ensure data frame analytics task is only marked completed once (#47119 ) (#47157 ) Closes #46907	2019-09-26 15:26:06 +03:00
Tanguy Leroux	95e2ca741e	Remove unused private methods and fields (#47154 ) This commit removes a bunch of unused private fields and unused private methods from the code base. Backport of (#47115)	2019-09-26 12:49:21 +02:00
Yogesh Gaikwad	9a64b7a888	[Backport] Validate `query` field when creating roles (#46275 ) (#47094 ) In the current implementation, the validation of the role query occurs at runtime when the query is being executed. This commit adds validation for the role query when creating a role but not for the template query as we do not have the runtime information required for evaluating the template query (eg. authenticated user's information). This is similar to the scripts that we store but do not evaluate or parse if they are valid queries or not. For validation, the query is evaluated (if not a template), parsed to build the QueryBuilder and verify if the query type is allowed. Closes #34252	2019-09-26 17:57:36 +10:00
Jim Ferenczi	04972baffa	Merge ShardSearchTransportRequest and ShardSearchLocalRequest (#46996 ) (#47081 ) This change merges the `ShardSearchTransportRequest` and `ShardSearchLocalRequest` into a single `ShardSearchRequest` that can be used to create a SearchContext. Relates #46523	2019-09-26 09:20:53 +02:00
Benjamin Trent	fcddaa90de	[7.x] [ML][Inference] adding tree model (#47044 ) (#47141 ) * [ML][Inference] adding tree model (#47044) * [ML][Inference] adding tree model * renaming features for updated schema * fixing 7.x compilation	2019-09-25 19:11:15 -04:00
Gordon Brown	7ac647c365	Add support for POST requests to SLM Execute API (#47061 ) This commit adds support for POST requests to the SLM `_execute` API, because POST is a more appropriate HTTP verb for this action as it is not idempotent. The docs are also changed to favor POST over PUT, although PUT is not removed or officially deprecated.	2019-09-25 16:15:10 -06:00
Andrei Dan	27520cac3b	ILM: parse origination date from index name (#46755 ) (#47124 ) * ILM: parse origination date from index name (#46755) Introduce the `index.lifecycle.parse_origination_date` setting that indicates if the origination date should be parsed from the index name. If set to true an index which doesn't match the expected format (namely `indexName-{dateFormat}-optional_digits` will fail before being created. The origination date will be parsed when initialising a lifecycle for an index and it will be set as the `index.lifecycle.origination_date` for that index. A user set value for `index.lifecycle.origination_date` will always override a possible parsable date from the index name. (cherry picked from commit c363d27f0210733dad0c307d54fa224a92ddb569) Signed-off-by: Andrei Dan <andrei.dan@elastic.co> * Drop usage of Map.of to be java 8 compliant	2019-09-25 21:44:16 +01:00
Lee Hinman	a267df30fa	Wait for snapshot completion in SLM snapshot invocation (#47051 ) * Wait for snapshot completion in SLM snapshot invocation This changes the snapshots internally invoked by SLM to wait for completion. This allows us to capture more snapshotting failure scenarios. For example, previously a snapshot would be created and then registered as a "success", however, the snapshot may have been aborted, or it may have had a subset of its shards fail. These cases are now handled by inspecting the response to the `CreateSnapshotRequest` and ensuring that there are no failures. If any failures are present, the history store now stores the action as a failure instead of a success. Relates to #38461 and #43663	2019-09-25 14:25:22 -06:00
Gordon Brown	a46eef9634	Change SLM stats format (#46991 ) Using arrays of objects with embedded IDs is preferred for new APIs over using entity IDs as JSON keys. This commit changes the SLM stats API to use the preferred format.	2019-09-25 11:32:08 -06:00
Yannick Welsch	9e17b78fee	Mute second test in monitoring/bulk/10_basic Relates #30101	2019-09-25 14:17:01 +02:00
Benjamin Trent	05fb7be571	[7.x] [ML][Inference] Feature pre-processing objects and functions (#46777 ) (#47040 ) * [ML][Inference] Feature pre-processing objects and functions (#46777) To support inference on pre-trained machine learning models, some basic feature encoding will be necessary. I am using a named object serialization approach so new encodings/pre-processing steps could be added in the future. This PR lays down the ground work for 3 basic encodings: * HotOne * Target Mean * Frequency More feature encodings or pre-processings could be added in the future: * Handling missing columns * Standardization * Label encoding * etc.... * fixing compilation for namedxcontent tests	2019-09-25 08:16:24 -04:00
Yannick Welsch	a4cecc54ab	Mute monitoring/bulk/20_privileges Relates #30101	2019-09-25 14:03:08 +02:00
Yannick Welsch	eb86d71edd	Mute MlJobIT.testDeleteJob Relates #45652	2019-09-25 12:53:09 +02:00
Yannick Welsch	7a5b5af171	Mute MlJobIT.testDeleteJobAsync Relates #45652	2019-09-25 12:53:05 +02:00
Ioannis Kakavas	23bceaadf8	Handle RelayState in preparing a SAMLAuthN Request (#46534 ) (#47092 ) This change allows for the caller of the `saml/prepare` API to pass a `relay_state` parameter that will then be part of the redirect URL in the response as the `RelayState` query parameter. The SAML IdP is required to reflect back the value of that relay state when sending a SAML Response. The caller of the APIs can then, when receiving the SAML Response, read and consume the value as it see fit.	2019-09-25 13:23:46 +03:00
Yogesh Gaikwad	6f453aa6b2	Validate index and cluster privilege names when creating a role (#46361 ) (#47063 ) This commit adds validation so a role cannot be created with invalid index or cluster privilege name. Closes #29703	2019-09-25 18:57:11 +10:00
Yannick Welsch	056ac32738	Mute JdbcCsvSpecIT.testAverageWithOneValueAndLimit Relates to #47080	2019-09-25 10:36:53 +02:00
Christoph Büscher	0c187e0a10	Add migration tool checks for `_field_names` disabling (#46972 ) This change adds a check to the migration tool that warns about the deprecated `enabled` setting for the `_field_names` field on 7.x indices and issues a warning for templates containing this setting, which has been removed with 8.0. Relates to #42854, #46681	2019-09-25 10:15:10 +02:00
Hendrik Muhs	7377ac4637	[Transform] Replace transforms with transform, index constants (#47023 ) - replace "transforms" with "transform" for consistency - use constants for internal index naming wherever possible and document required changes	2019-09-25 08:31:43 +02:00
Hendrik Muhs	e974f178b5	[Transform] rename data frame transform to transform for hlrc client (#46933 ) rename data frame transform to transform for hlrc	2019-09-25 08:31:43 +02:00
Benjamin Trent	00c1c0132b	[ML] fix two datafeed flush lockup bugs (#46982 ) (#47024 ) * [ML] fix two flush lockup bugs * Addressing PR comments * moving debug logging line so it is only written on success	2019-09-24 13:03:20 -04:00
Albert Zaharovits	3a82e0f7f4	Do not rewrite aliases on remove-index from aliases requests (#46989 ) (#47018 ) When we rewrite alias requests, after filtering down to only those that the user is authorized to see, it can be that there are no aliases remaining in the request. However, core Elasticsearch interprets this as _all so the user would see more than they are authorized for. To address this, we previously rewrote all such requests to have aliases `""`, `"-"`, which would be interpreted when aliases are resolved as nome. Yet, this is only needed for get aliases requests and we were applying it to all alias requests, including remove index requests. If such a request was sent to a coordinating node that is not the master node, the request would be rewritten to include `""` and `"-"`, and then the master would authorize the user for these. If the user had limited permissions, the request would fail, even if they were authorized on the index that the remove index action was over. This commit addresses this by rewriting for get aliases and remove aliases request types but not for the remove index. Co-authored-by: Albert Zaharovits <albert.zaharovits@elastic.co> Co-authored-by: Tim Vernum <tim@adjective.org>	2019-09-24 19:07:55 +03:00
Dimitris Athanasiou	64bf1b56fe	[7.x] SQL: Mute pivot testAverageWithOneValueAndOrder and testSumWithoutSubquery (#47030 ) (#47033 ) Relates #47002	2019-09-24 19:04:52 +03:00
Ioannis Kakavas	98e6bb4d01	Workaround JDK-8213202 in SSLClientAuthTests (#46995 ) This change works around JDK-8213202, which is a bug related to TLSv1.3 session resumption before JDK 11.0.3 that occurs when there are multiple concurrent sessions being established. Nodes connecting to each other will trigger this bug when client authentication is disabled, which is the case for SSLClientAuthTests. Backport of #46680	2019-09-24 12:47:56 +03:00
Lee Hinman	5ca37db60c	Mute SLMSnapshotBlockingIntegTests.testRetentionWhileSnapshotInProgress Relates to #46508	2019-09-23 17:08:09 -06:00
Julie Tibshirani	9124c94a6c	Add support for aliases in queries on _index. (#46944 ) Previously, queries on the _index field were not able to specify index aliases. This was a regression in functionality compared to the 'indices' query that was deprecated and removed in 6.0. Now queries on _index can specify an alias, which is resolved to the concrete index names when we check whether an index matches. To match a remote shard target, the pattern needs to be of the form 'cluster:index' to match the fully-qualified index name. Index aliases can be specified in the following query types: term, terms, prefix, and wildcard.	2019-09-23 13:21:37 -07:00
Jim Ferenczi	08f28e642b	Replace SearchContext with QueryShardContext in query builder tests (#46978 ) This commit replaces the SearchContext used in AbstractQueryTestCase with a QueryShardContext in order to reduce the visibility of search contexts. Relates #46523	2019-09-23 20:24:02 +02:00
Costin Leau	a610503783	SQL: Add PIVOT support (#46489 ) Add initial PIVOT support for transforming a regular table into a statistics table around an arbitrary pivoting column: SELECT * FROM (SELECT languages, country, salary, FROM mp) PIVOT (AVG(salary) FOR countries IN ('NL', 'DE', 'ES', 'RO', 'US')) In the current implementation PIVOT allows only one aggregation however this restriction is likely to be lifted in the future. Also not all aggregations are working, in particular MatrixStats are not yet supported. (cherry picked from commit d91263746a222915c570d4a662ec48c1d6b4f583)	2019-09-23 21:04:13 +03:00
Lisa Cawley	875d864be6	[DOCS] Update data frame transform URLs (#46940 ) (#46946 )	2019-09-20 15:57:43 -07:00
Hendrik Muhs	4a2cb05162	add message about transform disabled if license is missing (#46901 ) adds a message for transform about what happens if no license has been activated	2019-09-20 13:47:40 +02:00
Hendrik Muhs	abe889af75	[7.5][Transform] rename classes in transform plugin (#46867 ) rename classes and settings in transform plugin, provide BWC for old settings	2019-09-20 10:43:00 +02:00
Jason Tedor	bd77626177	Add the ability to require an ingest pipeline (#46847 ) This commit adds the ability to require an ingest pipeline on an index. Today we can have a default pipeline, but that could be overridden by a request pipeline parameter. This commit introduces a new index setting index.required_pipeline that acts similarly to index.default_pipeline, except that it can not be overridden by a request pipeline parameter. Additionally, a default pipeline and a request pipeline can not both be set. The required pipeline can be set to _none to ensure that no pipeline ever runs for index requests on that index.	2019-09-19 16:37:45 -04:00
Yannick Welsch	9638ca20b0	Allow dropping documents with auto-generated ID (#46773 ) When using auto-generated IDs + the ingest drop processor (which looks to be used by filebeat as well) + coordinating nodes that do not have the ingest processor functionality, this can lead to a NullPointerException. The issue is that markCurrentItemAsDropped() is creating an UpdateResponse with no id when the request contains auto-generated IDs. The response serialization is lenient for our REST/XContent format (i.e. we will send "id" : null) but the internal transport format (used for communication between nodes) assumes for this field to be non-null, which means that it can't be serialized between nodes. Bulk requests with ingest functionality are processed on the coordinating node if the node has the ingest capability, and only otherwise sent to a different node. This means that, in order to reproduce this, one needs two nodes, with the coordinating node not having the ingest functionality. Closes #46678	2019-09-19 16:46:33 +02:00
Armin Braun	6b09c2cdbb	Limit Netty Workers in NativeRealmIntegTestCase (#46816 ) (#46850 ) The fact that this test randomly uses a relatively large number of nodes and hence Netty worker threads created a problem with running out of direct memory on CI. Tests run with 512M heap (and hence 512M direct memory) by default. On a CI worker with 16 cores, this means Netty will by default set up 32 transport workers. If we get unlucky and a lot of them actually do work (and thus instantiate a `CopyBytesSocketChannel` which costs 1M per thread for the thread-local IO buffer) we would run out of memory. This specific failure was only seen with `NativeRealmIntegTests` so I only added the constraint on the Netty worker count here. We can add it to other tests (or `SecurityIntegTestCase`) if need be but for now it doesn't seem necessary so I opted for least impact. Closes #46803	2019-09-19 13:07:42 +02:00
Dimitris Athanasiou	02a5e153dc	[7.x][ML] Parse and index data frame analytics state (#46804 ) (#46820 ) This commit reuses the same state processor that is used for autodetect to parse state output from data frame analytics jobs. We then index the state document into the state index. Backport of #46804	2019-09-18 20:37:40 +03:00
Benjamin Trent	9cf9c64ec2	[7.x] [ML][Transforms] remove `force` flag from _start (#46414 ) (#46748 ) * [ML][Transforms] remove `force` flag from _start (#46414) * [ML][Transforms] remove `force` flag from _start * fixing expected error message * adjusting bwc version	2019-09-18 10:06:05 -04:00
Dimitris Athanasiou	cebe8da617	[7.x][ML] MlMemoryTracker should ignore analytics tasks without config (#46789 ) (#46811 ) It is possible for a running analytics job that its config is removed from the '.ml-config' index (perhaps the user deleted the entire index, etc.). In that case the task remains without a matching config. I have raised #46781 to discuss how to deal with this issue. This commit focuses on `MlMemoryTracker` and changes it so that when we get the configs for the running tasks we leniently ignore missing ones. This at least means memory tracking will keep working for other jobs if one or more are missing. In addition, this commit makes the cleanup code for native analytics tests more robust by explicitly stopping all jobs and force-stopping if an error occurs. This helps so that a single failing test does not cause other tests fail due to pending tasks. Backport of #46789	2019-09-18 16:35:25 +03:00
Alpar Torok	f3e67bdd17	Add resolution rule to allow resolving all deps (#46768 ) Since the `resolveAllDependencies` task resolves all the congfigurations it can find, this was not caught by our testing, but it's required to be configuraed specifically. We should probably cut-over to the new configurations at some point to avoid problems like this. Closes elastic/infra#14580	2019-09-18 11:09:43 +03:00
Lee Hinman	b85468d6ea	Add node setting for disabling SLM (#46794 ) (#46796 ) This adds the `xpack.slm.enabled` setting to allow disabling of SLM functionality as well as its HTTP API endpoints. Relates to #38461	2019-09-17 17:39:41 -06:00
Oliver Gupte	cbd58d3b78	Give kibana user privileges to create APM agent config index (#46765 ) (#46792 ) * Give kibana user reserved role privileges on .apm-* to create APM agent configuration index. * fixed test to include checking all .apm-* permissions * changed pattern from ".apm-*" to the more specific ".apm-agent-configuration"	2019-09-17 15:01:42 -07:00
Costin Leau	92e518e789	SQL: Properly handle indices with no/empty mapping (#46775 ) When encountering only indices with empty mapping, the IndexResolver throws an exception as it expects to find at least one entry. This commit fixes this case so that an empty mapping is returned. Fix #46757 (cherry picked from commit 5f4f5807acb93b5fab36718c092c328977a396b6)	2019-09-17 16:01:22 +03:00
Armin Braun	b0f09b279f	Make Snapshot Logic Write Metadata after Segments (#45689 ) (#46764 ) * Write metadata during snapshot finalization after segment files to prevent outdated metadata in case of dynamic mapping updates as explained in #41581 * Keep the old behavior of writing the metadata beforehand in the case of mixed version clusters for BwC reasons * Still overwrite the metadata in the end, so even a mixed version cluster is fixed by this change if a newer version master does the finalization * Fixes #41581	2019-09-17 13:09:39 +02:00
Tomas Della Vedova	e1cf103980	Fixes for API specification (#46522 ) (#46736 ) Follow-up of #42346	2019-09-17 11:49:24 +02:00
Costin Leau	683b5fdeca	SQL: Support queries with HAVING over SELECT (#46709 ) Handle queries with implicit GROUP BY where the aggregation is not in the projection/SELECT but inside the filter/HAVING such as: SELECT 1 FROM x HAVING COUNT(*) > 0 The engine now properly identifies the case and handles it accordingly. Fix #37051 (cherry picked from commit fa53ca05d8219c27079b50b4a5b7aeb220c7cde2)	2019-09-17 11:14:39 +03:00
Costin Leau	90f4c2379b	SQL: improve ResultSet behavior when no rows are available (#46753 ) Improve the defensive behavior of ResultSet when dealing with incorrect API usage. In particular handle the case of dealing with no row available (either because the cursor is before the first entry or after the last). Fix #46750 (cherry picked from commit 58fa38e4606625962e879265d35eacb0960c6cdb)	2019-09-17 11:14:38 +03:00
Przemysław Witek	e49be611ad	[7.x] Add audit messages for Data Frame Analytics (#46521 ) (#46738 )	2019-09-16 21:21:38 +02:00
Benjamin Trent	92acc732de	[ML][Transform] Use field caps for mapping deductino (#46703 ) (#46742 )	2019-09-16 10:05:55 -04:00
Andrei Stefan	40e9353947	SQL: use the correct data type for types conversion (#46574 ) (cherry picked from commit 3e25db2f302c3aafe27e4d8d4fb1743401d85e6d)	2019-09-16 15:36:17 +03:00
Hendrik Muhs	c8f52ec4ff	[Transform] Rename data frame plugin to transform: classes in xpack.core (#46644 ) (#46734 ) rename classes in xpack.core of transform plugin from "data frame transform" to "transform"	2019-09-16 13:39:22 +02:00

1 2 3 4 5 ...

3310 Commits