OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	b70ebdeb96	[7.x][ML] DF Analytics _explain API should skip object fields (#51115 ) (#51147 ) Object fields cannot be used as features. At the moment _explain API includes them and even worse it allows it does not error when an object field is excluded. This creates the expectation to the user that all children fields will also be excluded while it's not the case. This commit omits object fields from the _explain API and also adds an error if an object field is included or excluded. Backport of #51115	2020-01-17 14:02:59 +02:00
Christoph Büscher	d291f189a8	Fix hardcoded version replacement in put-dfanalytics.asciidoc #51053 The version replacement for the code snippet should replace 7.6 with the current version, but doesn't match because of a missing whitespace. Closes #51052	2020-01-15 18:09:37 +01:00
Przemysław Witek	b4a631277a	Add missing docs for new evaluation metrics (#50967 ) (#51041 )	2020-01-15 15:53:42 +01:00
Dimitris Athanasiou	1d8cb3c741	[7.x][ML] Add num_top_feature_importance_values param to regression and classi… (#50914 ) (#50976 ) Adds a new parameter to regression and classification that enables computation of importance for the top most important features. The computation of the importance is based on SHAP (SHapley Additive exPlanations) method. Backport of #50914	2020-01-14 16:46:09 +02:00
István Zoltán Szabó	4f150e4961	[7.x][DOCS] Moves analysis resources to PUT DFA API docs (#50793 )	2020-01-09 16:21:35 +01:00
István Zoltán Szabó	71afeec7d0	Revert "[DOCS] Moves analysis resources to PUT DFA API docs (#50704 )" This reverts commit `4e1107d5d7`.	2020-01-09 14:31:35 +01:00
István Zoltán Szabó	4e1107d5d7	[DOCS] Moves analysis resources to PUT DFA API docs (#50704 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-01-09 14:13:37 +01:00
István Zoltán Szabó	0ac6786f41	[DOCS] Forms role and privilege requirements as bulleted lists in DFA API docs (#50732 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-09 10:45:18 +01:00
Dimitris Athanasiou	ca0828ba07	[7.x][ML] Implement force deleting a data frame analytics job (#50553 ) (#50589 ) Adds a `force` parameter to the delete data frame analytics request. When `force` is `true`, the action force-stops the jobs and then proceeds to the deletion. This can be used in order to delete a non-stopped job with a single request. Closes #48124 Backport of #50553	2020-01-03 13:46:02 +02:00
István Zoltán Szabó	a34b3f133c	[DOCS] Specifies the possible data types of classification dependent_variable (#50582 )	2020-01-03 10:42:56 +01:00
István Zoltán Szabó	5759a263cb	[DOCS] Adds GET, GET stats and DELETE inference APIs (#50224 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-12-18 09:18:56 +01:00
István Zoltán Szabó	8f36bfa37f	[7.x][DOCS] Changes hyperparam optimization section ID (#50173 )	2019-12-13 12:22:50 +01:00
István Zoltán Szabó	7611b3c9be	[7.x][DOCS] Moves data frame analytics job resource definitions into APIs (#50165 ) * [7.x][DOCS] Moves data frame analytics job resource definitions into APIs.	2019-12-13 11:48:21 +01:00
Dimitris Athanasiou	8891f4db88	[7.x][ML] Introduce randomize_seed setting for regression and classification (#49990 ) (#50023 ) This adds a new `randomize_seed` for regression and classification. When not explicitly set, the seed is randomly generated. One can reuse the seed in a similar job in order to ensure the same docs are picked for training. Backport of #49990	2019-12-10 15:29:19 +02:00
István Zoltán Szabó	63d3933787	[DOCS] Fixes classification evaluation example response. (#49905 )	2019-12-06 13:25:40 +01:00
István Zoltán Szabó	f4b3bb7d6b	[DOCS] Adds an example of preprocessing actions to the PUT DFA API docs (#49831 )	2019-12-05 14:16:38 +01:00
Dimitris Athanasiou	4edb2e7bb6	[7.x][ML] Add optional source filtering during data frame reindexing (#49690 ) (#49718 ) This adds a `_source` setting under the `source` setting of a data frame analytics config. The new `_source` is reusing the structure of a `FetchSourceContext` like `analyzed_fields` does. Specifying includes and excludes for source allows selecting which fields will get reindexed and will be available in the destination index. Closes #49531 Backport of #49690	2019-11-29 16:10:44 +02:00
lcawl	777431265b	[DOCS] Fixes typo in ML resources	2019-11-26 10:28:59 -08:00
lcawl	a42003b95b	[DOCS] Fixes data type formatting	2019-11-26 08:22:50 -08:00
Dimitris Athanasiou	8eaee7cbdc	[7.x][ML] Explain data frame analytics API (#49455 ) (#49504 ) This commit replaces the _estimate_memory_usage API with a new API, the _explain API. The API consolidates information that is useful before creating a data frame analytics job. It includes: - memory estimation - field selection explanation Memory estimation is moved here from what was previously calculated in the _estimate_memory_usage API. Field selection is a new feature that explains to the user whether each available field was selected to be included or not in the analysis. In the case it was not included, it also explains the reason why. Backport of #49455	2019-11-22 22:06:10 +02:00
István Zoltán Szabó	c2f52015d3	[DOCS] Removes best practice about fields that are highly correlated to the dependent variable. (#48935 )	2019-11-11 16:01:21 +01:00
István Zoltán Szabó	91888959e8	[DOCS] Extends analyzed_fields description in PUT DFA API docs. (#48307 )	2019-11-11 15:55:12 +01:00
István Zoltán Szabó	3c9bd13dca	[DOCS] Adds classification type DFA API docs and ml-shared.asciidoc (#48241 )	2019-11-06 07:41:38 -05:00
István Zoltán Szabó	70765dfb05	[DOCS] Adds classification type evaluation docs to the DFA evaluation API (#47657 )	2019-11-06 07:38:33 -05:00
David Roberts	984323783e	[ML][7.x] Add lazy assignment job config option (#47993 ) This change adds: - A new option, allow_lazy_open, to anomaly detection jobs - A new option, allow_lazy_start, to data frame analytics jobs Both work in the same way: they allow a job to be opened/started even if no ML node exists that can accommodate the job immediately. In this situation the job waits in the opening/starting state until ML node capacity is available. (The starting state for data frame analytics jobs is new in this change.) Additionally, the ML nightly maintenance tasks now creates audit warnings for ML jobs that are unassigned. This means that jobs that cannot be assigned to an ML node for a very long time will show a yellow warning triangle in the UI. A final change is that it is now possible to close a job that is not assigned to a node without using force. This is because previously jobs that were open but not assigned to a node were an aberration, whereas after this change they'll be relatively common.	2019-10-15 06:55:11 +01:00
István Zoltán Szabó	9eac8bf2a8	[DOCS] Adds supported fields section to the PUT DFA API description (#47842 )	2019-10-10 12:42:54 +02:00
István Zoltán Szabó	6f4b7e9a7f	[DOCS] Extends the analyzed_fields description in the PUT DFA API docs (#47791 )	2019-10-09 18:14:58 +02:00
Lisa Cawley	39ef795085	[DOCS] Cleans up links to security content (#47610 ) (#47703 )	2019-10-07 15:23:19 -07:00
Dimitris Athanasiou	7667ea5f6f	[7.x][ML] Additional outlier detection parameters (#47600 ) (#47669 ) Adds the following parameters to `outlier_detection`: - `compute_feature_influence` (boolean): whether to compute or not feature influence scores - `outlier_fraction` (double): the proportion of the data set assumed to be outlying prior to running outlier detection - `standardization_enabled` (boolean): whether to apply standardization to the feature values Backport of #47600	2019-10-07 18:21:33 +03:00
István Zoltán Szabó	033aa9cf9b	[DOCS] Adds examples to the PUT dfa and the evaluate dfa APIs (#46966 ) * [DOCS] Adds examples to the PUT dfa and the evaluate dfa APIs. * [DOCS] Removes extra lines from examples. * Update docs/reference/ml/df-analytics/apis/evaluate-dfanalytics.asciidoc Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Update docs/reference/ml/df-analytics/apis/put-dfanalytics.asciidoc Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * [DOCS] Explains examples.	2019-10-02 10:33:45 +02:00
István Zoltán Szabó	6a9f04ee76	[DOCS] Fixes typos in the PUT dfa and the evaluate dfa documentation. (#47348 )	2019-10-02 09:52:29 +02:00
István Zoltán Szabó	170b102ab5	[DOCS] Changes wording to move away from data frame terminology in the ES repo (#47093 ) * [DOCS] Changes wording to move away from data frame terminology in the ES repo. Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-10-01 08:08:17 +02:00
István Zoltán Szabó	3be51fbdf7	[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176 ) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com>	2019-09-19 09:23:18 +02:00
István Zoltán Szabó	fe8f33a8e1	[DOCS] Adds outlier detection params to the data frame analytics resources (#46323 ) * [DOCS] Adds outlier detection params to the data frame analytics resources. Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-09-16 14:23:23 +02:00
James Rodewig	e253ee6ba6	[DOCS] Change // CONSOLE comments to [source,console] (#46440 ) (#46494 )	2019-09-09 12:35:50 -04:00
James Rodewig	f04573f8e8	[DOCS] [5 of 5] Change // TESTRESPONSE comments to [source,console-results] (#46449 ) (#46459 )	2019-09-06 16:09:09 -04:00
James Rodewig	bb7bff5e30	[DOCS] Replace "// TESTRESPONSE" magic comments with "[source,console-result] (#46295 ) (#46418 )	2019-09-06 09:22:08 -04:00
István Zoltán Szabó	8208ffa666	[DOCS] Adds progress parameter description to the GET stats data frame analytics API doc. (#46434 )	2019-09-06 15:18:57 +02:00
István Zoltán Szabó	a75348d1fb	[DOCS] [PUT DFA] Documents inline the child params of source and dest (#45649 ) * [DOCS] [PUT DFA] Documents inline the child params of source and dest. * [DOCS] Fixes indentation issues and amends dfa definitions.	2019-08-29 15:09:02 +02:00
Dimitris Athanasiou	dd6c13fdf9	[ML] Add description to DF analytics (#45774 ) (#46019 )	2019-08-27 15:48:59 +03:00
Dimitris Athanasiou	be554fe5f0	[7.x][ML] Improve progress reportings for DF analytics (#45856 ) (#45910 ) Previously, the stats API reports a progress percentage for DF analytics tasks that are running and are in the `reindexing` or `analyzing` state. This means that when the task is `stopped` there is no progress reported. Thus, one cannot distinguish between a task that never run to one that completed. In addition, there are blind spots in the progress reporting. In particular, we do not account for when data is loaded into the process. We also do not account for when results are written. This commit addresses the above issues. It changes progress to being a list of objects, each one describing the phase and its progress as a percentage. We currently have 4 phases: reindexing, loading_data, analyzing, writing_results. When the task stops, progress is persisted as a document in the state index. The stats API now reports progress from in-memory if the task is running, or returns the persisted document (if there is one).	2019-08-23 23:04:39 +03:00
Przemysław Witek	7512337922	[7.x] Allow the user to specify 'query' in Evaluate Data Frame request (#45775 ) (#45825 )	2019-08-22 11:14:26 +02:00
Przemysław Witek	5faa012fd6	[7.x] Add docs for HLRC for Estimate memory usage API (#45538 ) (#45783 )	2019-08-21 14:27:36 +02:00
Przemysław Witek	df574e5168	[7.x] Implement ml/data_frame/analytics/_estimate_memory_usage API endpoint (#45188 ) (#45510 )	2019-08-14 08:26:03 +02:00
István Zoltán Szabó	cd7ba9f302	[DOCS] Amends data frame analytics resources, GET, and PUT API docs (#44806 ) This PR addresses the feedback in https://github.com/elastic/ml-team/issues/175#issuecomment-512215731. * Adds an example to `analyzed_fields` * Includes `source` and `dest` objects inline in the resource page * Lists `model_memory_limit` in the PUT API page * Amends the `analysis` section in the resource page * Removes Properties headings in subsections	2019-07-26 11:52:43 +02:00
Lisa Cawley	8445c41004	[DOCS] Moves content to ML anomaly-detection folder (#44520 ) (#44530 )	2019-07-18 08:44:52 -07:00
Lisa Cawley	213af8411f	[DOCS] Fixes query default value (#44572 )	2019-07-18 08:18:58 -07:00
Lisa Cawley	53514b0477	[DOCS] Separates data frame analytics APIs (#44451 )	2019-07-16 13:33:23 -07:00

1 2

98 Commits