OpenSearch

Commit Graph

Author	SHA1	Message	Date
Przemysław Witek	bd761cce1d	[ML] Validate that AucRoc has the data necessary to be calculated (#63302 ) (#63454 )	2020-10-08 09:52:15 +02:00
Lisa Cawley	8f76c89cd3	[7.x][DOCS] Add feature_importance_baseline to get trained model API (#63279 ) (#63336 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2020-10-06 10:08:34 -07:00
István Zoltán Szabó	a3a373b67f	[DOCS] Adds delta and offset parameters to Evaluate DFA API docs (#63317 ) (#63329 )	2020-10-06 16:49:08 +02:00
Lisa Cawley	4de6104dae	[DOCS] Fix titles for ML APIs (#63152 ) (#63207 )	2020-10-02 14:01:01 -07:00
István Zoltán Szabó	8278bdb7de	[DOCS] Updates trained models API docs titles. (#63165 )	2020-10-02 10:16:19 -07:00
Benjamin Trent	cfcf973259	[7.x] [ML] renames /inference apis to /trained_models (#63097 ) (#63136 ) * [ML] renames /inference apis to /trained_models (#63097) This commit renames all `inference` CRUD APIs to `trained_models`. This aligns with internal terminology, documentation, and use-cases.	2020-10-02 07:34:28 -04:00
Przemysław Witek	4366d58564	[7.x] [ML] Implement AucRoc metric for classification (#60502 ) (#63051 )	2020-09-30 12:55:52 +02:00
Lisa Cawley	fa48b5c836	[DOCS] Formatting fix in get trained model API (#62643 )	2020-09-21 08:22:40 -07:00
Benjamin Trent	0f142c6afc	[ML] allow multiple wildcard values for GET Calendars, Events, and DELETE forecasts (#62563 ) (#62629 ) This commit adjusts the following APIs so that they support not only the `_all` case but also wildcard-patterned IDs. - `GET _ml/calendars/<calendar_id>/events` - `GET _ml/calendars/<calendar_id>` - `GET _ml/anomaly_detectors/<job_id>/model_snapshots/<snapshot_id>` - `DELETE _ml/anomaly_detectors/<job_id>/_forecast/<forecast_id>`	2020-09-18 11:06:07 -04:00
Benjamin Trent	e163559e4c	[7.x] [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922 ) (#62620 ) * [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922) Adds new flag include to the get trained models API The flag initially has two valid values: definition, total_feature_importance. Consequently, the old include_model_definition flag is now deprecated. When total_feature_importance is included, the total_feature_importance field is included in the model metadata object. Including definition is the same as previously setting include_model_definition=true. * fixing test * Update x-pack/plugin/core/src/test/java/org/elasticsearch/xpack/core/ml/action/GetTrainedModelsRequestTests.java	2020-09-18 10:07:35 -04:00
Lisa Cawley	6320967546	[DOCS] Minor typo in ML API (#62414 )	2020-09-15 13:20:55 -07:00
David Roberts	969a1c558b	[ML] Include the "properties" layer in find_file_structure mappings (#62158 ) Previously the "mappings" field of the response from the find_file_structure endpoint was not a drop-in for the mappings format of the create index endpoint - the "properties" layer was missing. The reason for omitting it initially was that the assumption was that the find_file_structure endpoint would only ever return very simple mappings without any nested objects. However, this will not be true in the future, as we will improve mappings detection for complex JSON objects. As a first step it makes sense to move the returned mappings closer to the standard format. This is a small building block towards fixing #55616	2020-09-10 09:33:42 +01:00
Lisa Cawley	1eb4595a29	[DOCS] Removes inference from trained model API text (#62125 )	2020-09-09 10:13:32 -07:00
Lisa Cawley	78b955eb86	[DOCS] Fix from and size descriptions for model APIs (#62128 )	2020-09-08 12:56:36 -07:00
Lisa Cawley	f0e7d88699	[DOCS] Fix allow_no_match description for model APIs (#62008 )	2020-09-08 08:15:16 -07:00
István Zoltán Szabó	b07b75ce14	[DOCS] Removes inference from the names of trained model APIs. (#62036 ) (#62041 ) # Conflicts: # docs/reference/ml/df-analytics/apis/get-inference-trained-model.asciidoc	2020-09-07 12:14:13 +02:00
Lisa Cawley	2789b8e6c4	[DOCS] Refresh machine learning custom URL example (#61826 ) (#61950 )	2020-09-04 09:44:55 -07:00
Lisa Cawley	6d6f5d4acc	[DOCS] Per-partition categorization (#61506 )	2020-08-26 17:10:01 -07:00
lcawl	5fa839b906	[DOCS] Fix typo in update anomaly detection job API	2020-08-25 17:13:38 -07:00
Benjamin Trent	1ae2923632	[7.x] [ML] adding docs + hlrc for data frame analysis feature_processors (#61149 ) (#61493 ) * [ML] adding docs + hlrc for data frame analysis feature_processors (#61149) Adds HLRC and some docs for the new feature_processors field in Data frame analytics. Co-authored-by: Przemysław Witek <przemyslaw.witek@elastic.co> Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-08-24 12:56:21 -04:00
James Rodewig	60876a0e32	[DOCS] Replace Wikipedia links with attribute (#61171 ) (#61209 )	2020-08-17 11:27:04 -04:00
James Rodewig	a761985fab	[DOCS] Move script and stored fields content to search fields page (#60826 ) (#60835 ) Changes: * Moves `Retrieve selected fields` to its own page and adds a title abbreviation. * Adds existing script and stored fields content to `Retrieve selected fields` * Adds a xref for `Retrieve selected fields` to `Search your data` * Adds related redirects and updates existing xrefs	2020-08-06 13:06:06 -04:00
István Zoltán Szabó	35b9f2b46b	[DOCS] Adds inference phase to get DFA job stats. (#60737 )	2020-08-05 16:26:02 +02:00
Przemysław Witek	0afa1bd972	Deprecate allow_no_jobs and allow_no_datafeeds in favor of allow_no_match (#60601 ) (#60727 )	2020-08-05 13:39:40 +02:00
James Rodewig	aba785cb6e	[DOCS] Update my-index examples (#60132 ) (#60248 ) Changes the following example index names to `my-index-000001` for consistency: * `my-index` * `my_index` * `myindex`	2020-07-27 15:58:26 -04:00
Lisa Cawley	2665bfffce	[DOCS] Fix security links in machine learning APIs (#60098 ) (#60152 )	2020-07-23 16:43:10 -07:00
James Rodewig	988e8c8fc6	[DOCS] Swap `[float]` for `[discrete]` (#60134 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 12:42:33 -04:00
James Rodewig	b302b09b85	[DOCS] Reformat snippets to use two-space indents (#59973 ) (#59994 )	2020-07-21 15:49:58 -04:00
Przemysław Witek	283a1f605c	Rename binary_soft_classification evaluation to outlier_detection (#59951 ) (#59970 )	2020-07-21 15:15:04 +02:00
Lisa Cawley	fb212269ce	[DOCS] Changes level offset of anomaly detection pages (#59911 ) (#59940 )	2020-07-20 17:04:59 -07:00
Lisa Cawley	9633d503d8	[DOCS] Changes level offset for anomaly detection APIs (#59920 ) (#59928 )	2020-07-20 13:10:54 -07:00
Lisa Cawley	8f8d24b3c1	[DOCS] Changes level offset in data frame analytics APIs (#59919 ) (#59923 )	2020-07-20 13:06:29 -07:00
Benjamin Trent	a28547c4b4	[7.x] [ML] add new `custom` field to trained model processors (#59542 ) (#59700 ) * [ML] add new `custom` field to trained model processors (#59542) This commit adds the new configurable field `custom`. `custom` indicates if the preprocessor was submitted by a user or automatically created by the analytics job. Eventually, this field will be used in calculating feature importance. When `custom` is true, the feature importance for the processed fields is calculated. When `false` the current behavior is the same (we calculate the importance for the originating field/feature). This also adds new required methods to the preprocessor interface. If users are to supply their own preprocessors in the analytics job configuration, we need to know the input and output field names.	2020-07-16 10:57:38 -04:00
Przemysław Witek	df4fea79cb	Add a "verbose" option to the data frame analytics stats endpoint (#59589 ) (#59621 )	2020-07-16 09:51:31 +02:00
Dimitris Athanasiou	b2243337d8	[7.x][ML] Data frame analytics max_num_threads setting (#59254 ) (#59308 ) This adds a setting to data frame analytics jobs called `max_num_threads`. The setting expects a positive integer. It specifies the maximum number of threads that may be used by the analysis. Note that the actual number of threads used is limited by the number of processors on the node where the job is assigned. Also, the process may use a couple more threads for operational functionality that is not the analysis itself. This setting may also be updated for a stopped job. More threads may reduce the time it takes to complete the job at the cost of using more CPU. Backport of #59254 and #57274	2020-07-09 19:15:46 +03:00
James Rodewig	6ed356ffc3	[DOCS] Replace `datatype` with `data type` (#58972 ) (#59184 )	2020-07-07 14:59:35 -04:00
Przemysław Witek	f35ad0d4e1	Report peak model memory in ModelSizeStats (#59017 ) (#59055 )	2020-07-06 12:55:12 +02:00
Benjamin Trent	b9d9964d10	[ML] add exponent output aggregator to inference (#58933 ) (#59016 ) * [ML] add exponent output aggregator to inference * fixing docs Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-03 14:51:00 -04:00
Przemysław Witek	751e84e4c8	Rename regression evaluation metrics to make the names consistent with loss functions (#58887 ) (#58927 )	2020-07-02 17:35:55 +02:00
Przemysław Witek	909649dd15	[7.x] Implement pseudo Huber loss (PseudoHuber) evaluation metric for regression analysis (#58734 ) (#58825 )	2020-07-01 14:52:06 +02:00
Przemysław Witek	9ea9b7bd3b	[7.x] Implement MSLE (MeanSquaredLogarithmicError) evaluation metric for regression analysis (#58684 ) (#58731 )	2020-06-30 14:09:11 +02:00
István Zoltán Szabó	13aa8b8d9a	[DOCS] Updates results_field description in the inference processor docs (#58554 )	2020-06-29 13:15:15 +02:00
Przemysław Witek	3f7c45472e	[7.x] Introduce DataFrameAnalyticsConfig update API (#58302 ) (#58648 )	2020-06-29 10:56:11 +02:00
Dimitris Athanasiou	1817b896c9	[7.x][ML] Add status and increased estimate to memory usage (#58588 ) (#58606 ) Adds parsing of `status` and `memory_reestimate_bytes` to data frame analytics `memory_usage`. When the training surpasses the model memory limit, the status will be set to `hard_limit` and `memory_reestimate_bytes` can be used to update the job's limit in order to restart the job. Backport of #58588	2020-06-28 16:27:26 +03:00
István Zoltán Szabó	3169e4c70e	[DOCS] Updates screenshots in ML population analysis (#58318 )	2020-06-23 09:05:08 +02:00
Benjamin Trent	bf8641aa15	[7.x] [ML] calculate cache misses for inference and return in stats (#58252 ) (#58363 ) When a local model is constructed, the cache miss count is incremented. When a user calls _stats, we will include the sum of the cache miss counts across ALL nodes. This statistic is important when compared against the inference_count. If the cache miss count is near the inference_count, it indicates that the cache is overburdened or inappropriately configured.	2020-06-19 09:46:51 -04:00
Przemysław Witek	7a1300a09e	[7.x] Make ModelPlotConfig.annotations_enabled default to ModelPlotConfig.enabled if unset (#57808 ) (#57815 )	2020-06-08 17:41:12 +02:00
David Kyle	08d1286de7	[7.x] Delete expired data by job (#57337 ) (#57796 ) Deleting expired data can take a long time leading to timeouts if there are many jobs. Often the problem is due to a few large jobs which prevent the regular maintenance of the remaining jobs. This change adds a job_id parameter to the delete expired data endpoint to help clean up those problematic jobs.	2020-06-08 13:00:23 +01:00
David Roberts	1d64d55a86	[7.x][ML] Add per-partition categorization option (#57723 ) This PR adds the initial Java side changes to enable use of the per-partition categorization functionality added in elastic/ml-cpp#1293. There will be a followup change to complete the work, as there cannot be any end-to-end integration tests until elastic/ml-cpp#1293 is merged, and also elastic/ml-cpp#1293 does not implement some of the more peripheral functionality, like stop_on_warn and per-partition stats documents. The changes so far cover REST APIs, results object formats, HLRC and docs. Backport of #57683	2020-06-06 08:15:17 +01:00
Dimitris Athanasiou	f49a14ce6f	[7.x][ML] Fix race condition when force stopping DF analytics job (#57680 ) (#57717 ) When we force delete a DF analytics job, we currently first force stop it and then we proceed with deleting the job config. This may result in logging errors if the job config is deleted before it is retrieved while the job is starting. Instead of force stopping the job, it would make more sense to try to stop the job gracefully first. So we now try that out first. If normal stop fails, then we resort to force stopping the job to ensure we can go through with the delete. In addition, this commit introduces `timeout` for the delete action and makes use of it in the child requests. Backport of #57680	2020-06-05 17:50:01 +03:00


298 Commits