OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lisa Cawley	4ff78e8a00	[7.x][DOCS] Adds X-Pack usage API (#52592 )	2020-02-21 06:57:11 -08:00
Przemysław Witek	b84e8db7b5	[7.x] Rename .ml-state index to .ml-state-000001 to support rollover (#52510 ) (#52595 )	2020-02-21 08:55:59 +01:00
Benjamin Trent	2a5c181dda	[ML][Inference] don't return inflated definition when storing trained models (#52573 ) (#52580 ) When `PUT` is called to store a trained model, it is useful to return the newly create model config. But, it is NOT useful to return the inflated definition. These definitions can be large and returning the inflated definition causes undo work on the server and client side. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-20 19:47:29 -05:00
Russ Cam	62da077beb	Specify name on enrich.get_policy as list type (#50217 ) This commit updates the enrich.get_policy API to specify name as a list, in line with other URL parts that accept a comma-separated list of values. In addition, update the get enrich policy API docs to align the URL part name in the documentation with the name used in the REST API specs. (cherry picked from commit 94f6f946ef283dc93040e052b4676c5bc37f4bde)	2020-02-20 11:39:28 +10:00
Ryan Ernst	3c3a0b2f37	Mute additional failing top_metrics test (#52545 ) Most top_metrics tests were muted in #52468, but the scaled float can also fail. This commit mutes that test as well. relates #52418	2020-02-19 16:14:26 -08:00
David Kyle	7bbe5c8464	[Ml] Validate tree feature index is within range (#52514 ) This changes the tree validation code to ensure no node in the tree has a feature index that is beyond the bounds of the feature_names array. Specifically this handles the situation where the C++ emits a tree containing a single node and an empty feature_names list. This is valid tree used to centre the data in the ensemble but the validation code would reject this as feature_names is empty. This meant a broken workflow as you cannot GET the model and PUT it back	2020-02-19 14:41:43 +00:00
Przemysław Witek	7cd997df84	[ML] Make ml internal indices hidden (#52423 ) (#52509 )	2020-02-19 14:02:32 +01:00
Hendrik Muhs	4d006f09d2	[Transform] fix XPackRestIT continuous transform stats test failure do not match explicit number but only test existence for duration test (#52504) fixes #52429	2020-02-19 12:32:54 +01:00
Henning Andersen	84de601551	Mute failing top_metrics tests (#52468 ) These tests fails when the global template is added, which changes number_of_shards to 2. Relates #52409 and #52418	2020-02-18 13:29:28 +01:00
Przemysław Witek	6fa067a2a0	Relax assertions on memory_estimation.* fields (#52452 ) (#52458 )	2020-02-18 11:57:03 +01:00
Nik Everett	146def8caa	Implement top_metrics agg (#51155 ) (#52366 ) The `top_metrics` agg is kind of like `top_hits` but it only works on doc values so it should be faster. At this point it is fairly limited in that it only supports a single, numeric sort and a single, numeric metric. And it only fetches the "very topest" document worth of metric. We plan to support returning a configurable number of top metrics, requesting more than one metric and more than one sort. And, eventually, non-numeric sorts and metrics. The trick is doing those things fairly efficiently. Co-Authored by: Zachary Tong <zach@elastic.co>	2020-02-14 11:19:11 -05:00
Dimitris Athanasiou	ad56802ac6	[7.x][ML] Refactor ML mappings and templates into JSON resources (#51… (#52353 ) ML mappings and index templates have so far been created programmatically. While this had its merits due to static typing, there is consensus it would be clear to maintain those in json files. In addition, we are going to adding ILM policies to these indices and the component for a plugin to register ILM policies is `IndexTemplateRegistry`. It expects the templates to be in resource json files. For the above reasons this commit refactors ML mappings and index templates into json resource files that are registered via `MlIndexTemplateRegistry`. Backport of #51765	2020-02-14 17:16:06 +02:00
Hendrik Muhs	efd7542b2a	[7.x][Transform] provide exponential_avg* stats for batch transforms (#52041 ) (#52323 ) provide exponential_avg* stats for batch transforms, avoids confusion why those values are all 0 otherwise	2020-02-14 07:48:23 +01:00
Przemysław Witek	0da3af7581	[7.x] [ML] Add _cat/ml/data_frame/analytics API (#52260 ) (#52312 )	2020-02-13 16:55:47 +01:00
Marios Trivyzas	ea6f0e39bc	[Tests] Update skip version for YAML tests (#52310 ) Update skip versions upper boundary to match the release or intented release version of the feature/fix.	2020-02-13 15:36:31 +01:00
Julie Tibshirani	f0668cabbc	Adjust the 'skip' version in flattened REST tests. (#52293 ) I forgot to adjust it after backporting the flattened fields feature.	2020-02-12 15:17:44 -08:00
James Rodewig	3f151d1d75	[DOCS] Add redirects, update JSON spec to fix docs build (#51747 ) Docs build [#11556][0] broke due to several outdated or incorrect links in the JSON REST spec. This fixes those links where possible and adds redirects. [0]: https://elasticsearch-ci.elastic.co/job/elastic+docs+master+build/11556/	2020-02-12 08:30:59 -05:00
David Roberts	473468d763	[ML] Better error when persistent task assignment disabled (#52014 ) Changes the misleading error message when attempting to open a job while the "cluster.persistent_tasks.allocation.enable" setting is set to "none" to a clearer message that names the setting. Closes #51956	2020-02-11 15:23:21 +00:00
Igor Motov	667e1a5225	Add Boxplot Aggregation (#52174 ) Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112	2020-02-11 09:38:17 -05:00
Ignacio Vera	80e3c97210	Upgrade to lucene-8.5.0-snapshot-d62f6307658 (#52039 ) (#52130 )	2020-02-10 10:13:22 +01:00
David Roberts	1cefafdd14	[ML] Add new categorization stats to model_size_stats (#52009 ) This change adds support for the following new model_size_stats fields: - categorized_doc_count - total_category_count - frequent_category_count - rare_category_count - dead_category_count - categorization_status Backport of #51879	2020-02-10 09:10:50 +00:00
Jason Tedor	25daf5f1e1	Add autoscaling API skelton (#51564 ) The main purpose of this commit is to add a single autoscaling REST endpoint skeleton, for the purpose of starting to build out the build and testing infrastructure that will surround it. For example, rather than commiting a fully-functioning autoscaling API, we introduce here the skeleton so that we can start wiring up the build and testing infrastructure, establish security roles/permissions, an so on. This way, in a forthcoming PR that introduces actual functionality, that PR will be smaller and have less distractions around that sort of infrastructure.	2020-02-06 21:55:01 -05:00
Martijn Laarman	898dd0b9cc	Cat.ml.* introduces an additional depths to namespace API's (#51981 ) Not all clients support this e.g if the java high level rest client were to map this it would look like `client.cat().ml().api()` which hinders discoverability. (cherry picked from commit 21cdabf09dc8305ce2f5e3b6cb193f67137d8bdb)	2020-02-06 13:16:59 +01:00
Benjamin Trent	79f143907a	[7.x] [ML] add _cat/ml/trained_models API (#51529 ) (#51936 ) * [ML] add _cat/ml/trained_models API (#51529) This adds _cat/ml/trained_models.	2020-02-05 08:26:44 -05:00
Adrien Grand	ad9d2f1922	Move analysis/mappings stats to cluster-stats. (#51875 ) Closes #51138	2020-02-05 11:02:25 +01:00
debadair	c0156cbb5d	Backporting updates to ILM org, overview, & GS (#51898 ) * [DOCS] Align with ILM API docs (#48705) * [DOCS] Reconciled with Snapshot/Restore reorg * [DOCS] Split off ILM overview to a separate topic. (#51287) * [DOCS} Split off overview to a separate topic. * [DOCS] Incorporated feedback from @jrodewig. * [DOCS] Edit ILM GS tutorial (#51513) * [DOCS] Edit ILM GS tutorial * [DOCS] Incorporated review feedback from @andreidan. * [DOCS] Removed test link & fixed anchor & title. * Update docs/reference/ilm/getting-started-ilm.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Fixed glossary merge error. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-02-04 16:45:18 -08:00
Hendrik Muhs	b7aace44f3	mark transform API's stable (#51862 ) mark transform API's stable, meaning making transform GA for the next minor release	2020-02-04 16:13:47 +01:00
Benjamin Trent	d293980a09	[7.x] [ML] add GET _cat/ml/datafeeds (#51500 ) (#51829 ) * [ML] add GET _cat/ml/datafeeds (#51500) This adds GET _cat/ml/datafeeds && _cat/ml/datafeeds/{datafeed_id} * fixing for java8 compilation	2020-02-03 17:16:33 -05:00
Jonathan Budzenski	8fa4a40bdf	[rest spec] fill in documentation links for security.{put,delete}_privileges (#48482 ) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-03 10:53:50 -06:00
James Rodewig	4ea7297e1e	[DOCS] Change http://elastic.co -> https (#48479 ) (#51812 ) Co-authored-by: Jonathan Budzenski <jon@budzenski.me>	2020-02-03 09:50:11 -05:00
Karel Minarik	050c4d4c89	Fixes for the REST specification (#51791 ) * REST: Test: Fix the `accept_enterprise` parameter for Get License API (#51527) The Get License API specifies the `accept_enterprise` parameter as a `boolean`: `0ca5cb8cb6/x-pack/plugin/src/test/resources/rest-api-spec/api/license.get.json (L22-L27)` In the test, a `string` is passed however, which makes the test compilation fail in the Go client. (cherry picked from commit e2a2169b3d44592057c143253bb56375ed3e4268) * Fix the SQL API documentation in REST specification (#51534) This patch fixes the SQL REST API documentation to conform to the current schema. (cherry picked from commit c8b6a849852699883086a6ada42279f2f68d7e07) * Fix the "slices" parameter for the Delete By Query API in the REST specification (#51535) This patch updates the `type` parameter in the Delete By Query API: according to [the documentation](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-delete-by-query.html#docs-delete-by-query-slice), it can be set to "auto", but the type in the documentation allows only numerical values. This prevents people from setting the parameter to "auto" eg. in the Go client, which generates source from the specification, and sets the corresponding Go type as number. The patch uses the `\|` notation, which we have discussed previously for encoding a "polymorphic" parameter like this. Related: https://github.com/elastic/go-elasticsearch/issues/77 * Fix the Enrich API documentation in REST specification (#51528) This patch fixes the REST API documentation for the Enrich APIs to conform to the current schema. (cherry picked from commit 59f28f4f2feeba3f6d2f0b632410577eacb28121)	2020-02-02 15:28:08 +01:00
Benjamin Trent	e372854d43	[ML][Inference] Fix model pagination with models as resources (#51573 ) (#51736 ) This adds logic to handle paging problems when the ID pattern + tags reference models stored as resources. Most of the complexity comes from the issue where a model stored as a resource could be at the start, or the end of a page or when we are on the last page.	2020-01-31 07:52:19 -05:00
Albert Zaharovits	f25b6cc2eb	Add new 'maintenance' index privilege #50643 This commit creates a new index privilege named `maintenance`. The privilege grants the following actions: `refresh`, `flush` (also synced-`flush`), and `force-merge`. Previously the actions were only under the `manage` privilege which in some situations was too permissive. Co-authored-by: Amir H Movahed <arhd83@gmail.com>	2020-01-30 11:59:11 +02:00
Julie Tibshirani	9dcc3ef7e6	Always use one shard in vector REST tests. (#51643 ) This PR tries to address the intermittent vector test failures on 7.x by making sure we create indices with one shard. The fix is based on this theory as to what's happening: * On 7.x, the default number of shards is 1, but in REST tests we randomly use 2 in order to cover the multiple shards case. In the failing test run, we use 2 shards and all documents end up on only one shard. * During a search, the response from the empty shard doesn't produce deprecation warnings because we never try to execute the script. If not all shard responses contain the warning headers, then certain deprecation warnings can be lost (due to the bug described in #33936). Addresses #50716. Relates to #50061.	2020-01-29 12:24:41 -08:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
Aleksandr Maus	79875ce4d9	Initial EQL rest API implementation (#49768 )	2020-01-27 15:11:41 -05:00
Benjamin Trent	bf53ca3380	[7.x] [ML] Add _cat/ml/anomaly_detectors API (#51364 ) (#51408 ) [ML] Add _cat/ml/anomaly_detectors API (#51364)	2020-01-24 11:54:22 -05:00
Benjamin Trent	fc994d9ce1	[ML][Inference] Adds validations for model PUT (#51376 ) (#51409 ) Adds validations making sure that * `input.field_names` is not empty * `ensemble.trained_models` is not empty * `tree.feature_names` is not empty closes https://github.com/elastic/elasticsearch/issues/51354	2020-01-24 09:29:12 -05:00
Benjamin Trent	76660a5a4f	[7.x] [ML][Inference] add tags url param to GET (#51330 ) (#51404 ) * [ML][Inference] add tags url param to GET (#51330) Adds a new URL parameter, `tags` to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those who contain all the provided tags.	2020-01-24 08:26:58 -05:00
Dimitris Athanasiou	59687a9384	[7.x][ML] Validate classification dependent_variable cardinality is at lea… (#51232 ) (#51309 ) Data frame analytics classification currently only supports 2 classes for the dependent variable. We were checking that the field's cardinality is not higher than 2 but we should also check it is not less than that as otherwise the process fails. Backport of #51232	2020-01-22 16:51:16 +02:00
Nik Everett	ca15a3f5a8	Add "did you mean" to unknown queries (#51177 ) (#51254 ) This replaces the message we return for unknown queries with the standard one that we use for unknown fields from `ObjectParser`. This is nice because it includes "did you mean". One day we might convert parsing queries to using object parser, but that looks complex. This change is much smaller and seems useful.	2020-01-21 12:45:52 -05:00
Adrien Grand	1a73d8329c	Disable xpack/15_basic/Usage stats for mappings. Relates #51127	2020-01-20 18:05:26 +01:00
Adrien Grand	45d7bdcfd7	Add analysis components and mapping types to the usage API. (#51062 ) Knowing about used analysis components and mapping types would be incredibly useful in order to know which ones may be deprecated or should get more love. Some field types also act as a proxy to know about feature usage of some APIs like the `percolator` or `completion` fields types for percolation and the completion suggester, respectively.	2020-01-16 09:56:41 +01:00
Tomas Della Vedova	5b6fa79fd8	[ML] Removed key value from the catch regex test (#50977 ) (#51021 )	2020-01-15 08:50:59 +01:00
Nik Everett	fc5fde7950	Add "did you mean" to ObjectParser (#50938 ) (#50985 ) Check it out: ``` $ curl -u elastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_update/foo?pretty -d'{ "dac": {} }' { "error" : { "root_cause" : [ { "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" } ], "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" }, "status" : 400 } ``` The tricky thing about implementing this is that x-content doesn't depend on Lucene. So this works by creating an extension point for the error message using SPI. Elasticsearch's server module provides the "spell checking" implementation. s	2020-01-14 17:53:41 -05:00
Tim Vernum	c2acb8830a	Add max_resource_units to enterprise license (#50910 ) The enterprise license type must have "max_resource_units" and may not have "max_nodes". This change adds support for this new field, validation that the field is present if-and-only-if the license is enterprise and bumps the license version number to reflect the new field. Includes a BWC layer to return "max_nodes: ${max_resource_units}" in the GET license API. Backport of: #50735	2020-01-14 12:37:05 +11:00
Benjamin Trent	fa116a6d26	[7.x] [ML][Inference] PUT API (#50852 ) (#50887 ) * [ML][Inference] PUT API (#50852) This adds the `PUT` API for creating trained models that support our format. This includes * HLRC change for the API * API creation * Validations of model format and call * fixing backport	2020-01-12 10:59:11 -05:00
Julie Tibshirani	3bac1dc414	Adjust the skip version in flattened field telemetry tests. We forgot to adjust the version when backporting the commit to 7.x.	2020-01-10 10:36:41 -08:00
Dimitris Athanasiou	422422a2bc	[7.x][ML] Reuse SourceDestValidator for data frame analytics (#50841 ) (#50850 ) This commit removes validation logic of source and dest indices for data frame analytics and replaces it with using the common `SourceDestValidator` class which is already used by transforms. This way the validations and their messages become consistent while we reduce code. This means that where these validations fail the error messages will be slightly different for data frame analytics. Backport of #50841	2020-01-10 14:24:13 +02:00
Benjamin Trent	3e014d39c2	[Transform] fail to start/put on missing pipeline (#50701 ) (#50795 ) If a pipeline referenced by a transform does not exist, we should not allow the transform to be created. We do allow the pipeline existence check to be skipped with defer_validations, but if the pipeline still does not exist on `_start`, the pipeline will fail to start. relates: #50135	2020-01-09 10:33:22 -05:00

1 2 3 4 5 ...

646 Commits