OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	e8c0351fd8	[7.x][ML] Allow force stopping failed and stopping DF analytics (#54650 ) (#54712 ) Force stopping a failed job used to work but it now puts the job in `stopping` state and hangs. In addition, force stopping a `stopping` job is not handled. This commit addresses those issues with force stopping data frame analytics. It inlines the approach with that followed for anomaly detection jobs. Backport of #54650	2020-04-03 16:08:06 +03:00
Christoph Büscher	9f22c0d37c	Fix Eclipse compile problem in ModelLoadingService (#54670 ) Current Eclipse 4.14.0 cannot deal with the direct lambda notation, changing to an exlicite one.	2020-04-03 11:56:30 +02:00
Benjamin Trent	6e73f67f3b	[ML] unmute categorization test for native backport (#54679 )	2020-04-02 17:08:19 -04:00
Benjamin Trent	7fe38935f6	[ML] add training_percent to analytics process params (#54605 ) (#54678 ) This adds training_percent parameter to the analytics process for Classification and Regression. This parameter is then used to give more accurate memory estimations. See native side pr: elastic/ml-cpp#1111	2020-04-02 17:08:06 -04:00
Benjamin Trent	4a1610265f	[7.x] [ML] add new inference_config field to trained model config (#54421 ) (#54647 ) * [ML] add new inference_config field to trained model config (#54421) A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. The inference processor can still override whatever is set as the default in the trained model config. * fixing for backport	2020-04-02 12:25:10 -04:00
Benjamin Trent	65233383f6	[7.x] [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121 ) (#54645 ) * [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121) Secondary authorization headers are to be used to facilitate Kibana spaces support + ML jobs/datafeeds. Now on PUT/Update/Preview datafeed, and PUT data frame analytics the secondary authorization is preferred over the primary (if provided). closes https://github.com/elastic/elasticsearch/issues/53801 * fixing for backport	2020-04-02 11:20:25 -04:00
David Roberts	4b4800e096	[ML] Take more care that normalize processes use unique named pipes (#54641 ) When one of ML's normalize processes fails to connect to the JVM quickly enough and another normalize process for the same job starts shortly afterwards it is possible that their named pipes can get mixed up. This change avoids the risk of that by adding an incrementing counter value into the named pipe names used for normalize processes. Backport of #54636	2020-04-02 14:25:31 +01:00
Benjamin Trent	eb31be0e71	[7.x] [ML] add num_matches and preferred_to_categories to category defintion objects (#54214 ) (#54639 ) * [ML] add num_matches and preferred_to_categories to category defintion objects (#54214) This adds two new fields to category definitions. - `num_matches` indicating how many documents have been seen by this category - `preferred_to_categories` indicating which other categories this particular category supersedes when messages are categorized. These fields are only guaranteed to be up to date after a `_flush` or `_close` native change: https://github.com/elastic/ml-cpp/pull/1062 * adjusting for backport	2020-04-02 09:09:19 -04:00
Mayya Sharipova	bf4857d9e0	Search hit refactoring (#41656 ) (#54584 ) Refactor SearchHit to have separate document and meta fields. This is a part of bigger refactoring of issue #24422 to remove dependency on MapperService to check if a field is metafield. Relates to PR: #38373 Relates to issue #24422 Co-authored-by: sandmannn <bohdanpukalskyi@gmail.com>	2020-04-01 15:19:00 -04:00
Przemysław Witek	1fe2705826	Skip daily maintenance activity if upgrade mode is enabled (#54565 ) (#54571 )	2020-04-01 13:29:34 +02:00
Jason Tedor	63e5f2b765	Rename META_DATA to METADATA This is a follow up to a previous commit that renamed MetaData to Metadata in all of the places. In that commit in master, we renamed META_DATA to METADATA, but lost this on the backport. This commit addresses that.	2020-03-31 17:30:51 -04:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
David Roberts	b8f06df53f	[ML] Fix bug, add tests, improve estimates for estimate_model_memory (#54508 ) This PR: 1. Fixes the bug where a cardinality estimate of zero could cause a 500 status 2. Adds tests for that scenario and a few others 3. Adds sensible estimates for the cases that were previously TODO Backport of #54462	2020-03-31 17:59:38 +01:00
David Kyle	9150e77269	[7.x] Remove unused environment from anomaly detector classes (#54399 ) (#54456 )	2020-03-31 16:55:37 +01:00
Dimitris Athanasiou	e4230c533c	[7.x][ML] Move DFA MemoryUsage to stats.common pkg (#54492 ) (#54512 ) This belongs in stats.common Backport of #54492	2020-03-31 18:36:05 +03:00
Dimitris Athanasiou	6d96ca9bc8	[7.x][ML] Reenable classification and regression integ tests (#54489 ) (#54494 ) Relates #54401 Backport of #54489	2020-03-31 17:50:08 +03:00
Dimitris Athanasiou	b4b54efa73	[7.x][ML] Hyperparameter names should match config (#54401 ) (#54435 ) Java side of elastic/ml-cpp#1096 Backport of #54401	2020-03-30 23:32:40 +03:00
Przemysław Witek	3c604da7f6	[7.x] Create an annotation when a model snapshot is stored (#53783 ) (#54405 )	2020-03-30 15:17:08 +02:00
Martijn van Groningen	4b4fbc160d	Refactor AliasOrIndex abstraction. (#54394 ) Backport of #53982 In order to prepare the `AliasOrIndex` abstraction for the introduction of data streams, the abstraction needs to be made more flexible, because currently it really can be only an alias or an index. * Renamed `AliasOrIndex` to `IndexAbstraction`. * Introduced a `IndexAbstraction.Type` enum to indicate what a `IndexAbstraction` instance is. * Replaced the `isAlias()` method that returns a boolean with the `getType()` method that returns the new Type enum. * Moved `getWriteIndex()` up from the `IndexAbstraction.Alias` to the `IndexAbstraction` interface. * Moved `getAliasName()` up from the `IndexAbstraction.Alias` to the `IndexAbstraction` interface and renamed it to `getName()`. * Removed unnecessary casting to `IndexAbstraction.Alias` by just checking the `getType()` method. Relates to #53100	2020-03-30 10:12:16 +02:00
Jason Tedor	cf68ac8a2c	Do not stash environment in machine learning (#54371 ) Today the machine learning plugin stashes a copy of the environment in its constructor, and uses the stashed copy to construct its components even though it is provided with an environment to create these components. What is more, the environment it creates in its constructor is not fully initialized, as it does not have the final copy of the settings, but the environment passed in while creating components does. This commit removes that stashed copy of the environment.	2020-03-28 12:46:16 -04:00
Stuart Tettemer	1630de4a42	Scripting: stats per context in nodes stats (#54008 ) (#54357 ) Adds script cache stats to `_node/stats`. If using the general cache: ``` "script_cache": { "sum": { "compilations": 12, "cache_evictions": 9, "compilation_limit_triggered": 5 } } ``` If using context caches: ``` "script_cache": { "sum": { "compilations": 13, "cache_evictions": 9, "compilation_limit_triggered": 5 }, "contexts": [ { "context": "aggregation_selector", "compilations": 8, "cache_evictions": 6, "compilation_limit_triggered": 3 }, { "context": "aggs", "compilations": 5, "cache_evictions": 3, "compilation_limit_triggered": 2 }, ``` Backport of: 32f46f2 Refs: #50152	2020-03-27 12:26:00 -06:00
Przemysław Witek	2eb079b67f	Add version guards around ML hidden indices settings (#54322 )	2020-03-27 14:50:57 +01:00
Przemysław Witek	d40afc7871	[7.x] Do not fail Evaluate API when the actual and predicted fields' types differ (#54255 ) (#54319 )	2020-03-27 10:05:19 +01:00
William Brafford	14204f8381	Use set-based interface for NodesStatsRequest (#53637 ) (#54141 ) The NodesStatsRequest class uses a set of strings for its internal serialization. This commit updates the class's interface so that we no longer use hard-coded getters and setters, but rather methods that add strings directly. For example, the old way of adding "os" metrics to a request would be to call request.os(true). The new way of doing this is to call request.addMetric("os"). For the time being, the canonical list of metrics is an enum in NodesStatsRequest. This will eventually be replaced with something pluggable.	2020-03-26 14:41:49 -04:00
Dimitris Athanasiou	13368aae37	[7.x][ML] DF Analytics should always display operational stats (#54210 ) (#54290 ) This commit populates the _stats API response with sensible "empty" `data_counts` and `memory_usage` objects when the job itself has not started reporting them. Backport of #54210	2020-03-26 20:03:14 +02:00
Dimitris Athanasiou	cc981fa377	[7.x][ML] Get ML filters size should default to 100 (#54207 ) (#54278 ) When get filters is called without setting the `size` paramter only up to 10 filters are returned. However, 100 filters should be returned. This commit fixes this and adds an integ test to guard it. It seems this was accidentally broken in #39976. Closes #54206 Backport of #54207	2020-03-26 17:51:43 +02:00
Mark Vieira	7728ccd920	Encore consistent compile options across all projects (#54120 ) (cherry picked from commit ddd068a7e92dc140774598664efdc15155ab05c2)	2020-03-25 08:24:21 -07:00
Dimitris Athanasiou	ba09a778dc	[7.x][ML] Unmute classification cardinality integ test (#54165 ) (#54173 ) Adjusts test to work for new cardinality limit. Backport of #54165	2020-03-25 15:00:34 +02:00
Benjamin Trent	ef05a4f416	[ML] relaxing parameters on stratified split test (#54127 ) (#54168 ) Relaxing the error rate a bit on two of the tests. Ran 1000s of times locally and never had a failure after these changes. closes https://github.com/elastic/elasticsearch/issues/54122	2020-03-25 08:06:15 -04:00
Tanguy Leroux	3a3930c7ec	Mute TooManyJobsIT.testCloseFailedJob on 7.x (#54163 ) Relates #54162	2020-03-25 12:44:41 +01:00
Jason Tedor	381d7586e4	Introduce formal role for remote cluster client (#54138 ) This commit introduce a formal role for identifying nodes that are capable of making connections to remote clusters. Relates #53924	2020-03-24 21:59:43 -04:00
David Roberts	7667004b20	[ML] Add a model memory estimation endpoint for anomaly detection (#54129 ) A new endpoint for estimating anomaly detection job model memory requirements: POST _ml/anomaly_detectors/estimate_model_memory Backport of #53507	2020-03-24 22:55:11 +00:00
Dimitris Athanasiou	c141c1dd89	[7.x][ML] Stratified cross validation split for classification (#54087 ) (#54104 ) As classification now works for multiple classes, randomly picking training/test data frame rows is not good enough. This commit introduces a stratified cross validation splitter that maintains the proportion of the each class in the dataset in the sample that is used for training the model. Backport of #54087	2020-03-24 18:47:36 +02:00
David Roberts	1421471556	[ML] Introduce a "starting" datafeed state for lazy jobs (#54065 ) It is possible for ML jobs to open lazily if the "allow_lazy_open" option in the job config is set to true. Such jobs wait in the "opening" state until a node has sufficient capacity to run them. This commit fixes the bug that prevented datafeeds for jobs lazily waiting assignment from being started. The state of such datafeeds is "starting", and they can be stopped by the stop datafeed API while in this state with or without force. Backport of #53918	2020-03-24 13:00:04 +00:00
Dimitris Athanasiou	be20bb5755	[7.x][ML] No refresh on indexing DFA stats (#53977 ) (#54064 ) When we index data frame analytics stats docs we do not need to refresh immediately. Backport of #53977	2020-03-24 13:13:03 +02:00
Dimitris Athanasiou	5ce7c99e74	[7.x][ML] Data frame analytics data counts (#53998 ) (#54031 ) This commit instruments data frame analytics with stats for the data that are being analyzed. In particular, we count training docs, test docs, and skipped docs. In order to account docs with missing values as skipped docs for analyses that do not support missing values, this commit changes the extractor so that it only ignores docs with missing values when it collects the data summary, which is used to estimate memory usage. Backport of #53998	2020-03-24 11:30:43 +02:00
Benjamin Trent	19af869243	[ML] adds multi-class feature importance support (#53803 ) (#54024 ) Adds multi-class feature importance calculation. Feature importance objects are now mapped as follows (logistic) Regression: ``` { "feature_name": "feature_0", "importance": -1.3 } ``` Multi-class [class names are `foo`, `bar`, `baz`] ``` { “feature_name”: “feature_0”, “importance”: 2.0, // sum(abs()) of class importances “foo”: 1.0, “bar”: 0.5, “baz”: -0.5 }, ``` For users to get the full benefit of aggregating and searching for feature importance, they should update their index mapping as follows (before turning this option on in their pipelines) ``` "ml.inference.feature_importance": { "type": "nested", "dynamic": true, "properties": { "feature_name": { "type": "keyword" }, "importance": { "type": "double" } } } ``` The mapping field name is as follows `ml.<inference.target_field>.<inference.tag>.feature_importance` if `inference.tag` is not provided in the processor definition, it is not part of the field path. `inference.target_field` is defaulted to `ml.inference`. //cc @lcawl ^ Where should we document this? If this makes it in for 7.7, there shouldn't be any feature_importance at inference BWC worries as 7.7 is the first version to have it.	2020-03-23 18:49:07 -04:00
Benjamin Trent	d276058c6c	[ML] adjusting feature importance mapping for multi-class support (#53821 ) (#54013 ) Feature importance storage format is changing to encompass multi-class. Feature importance objects are now mapped as follows (logistic) Regression: ``` { "feature_name": "feature_0", "importance": -1.3 } ``` Multi-class [class names are `foo`, `bar`, `baz`] ``` { “feature_name”: “feature_0”, “importance”: 2.0, // sum(abs()) of class importances “foo”: 1.0, “bar”: 0.5, “baz”: -0.5 }, ``` This change adjusts the mapping creation for analytics so that the field is mapped as a `nested` type. Native side change: https://github.com/elastic/ml-cpp/pull/1071	2020-03-23 15:50:12 -04:00
Przemysław Witek	88c5d520b3	[7.x] Verify that the field is aggregatable before attempting cardinality aggregation (#53874 ) (#54004 )	2020-03-23 19:36:33 +01:00
Dimitris Athanasiou	965af3a68b	[7.x][ML] Delete DF analytics stats upon job deletion (#53933 ) (#53997 ) Since a data frame analytics job may have associated docs in the .ml-stats-* indices, when the job is deleted we should delete those docs too. Backport of #53933	2020-03-23 19:55:36 +02:00
Ryan Ernst	960d1fb578	Revert "Introduce system index APIs for Kibana (#53035 )" (#53992 ) This reverts commit `c610e0893d`. backport of #53912	2020-03-23 10:29:35 -07:00
Dimitris Athanasiou	3873510332	[7.x][ML] Refactor DFA custom processor to cross validation splitter (#53915 ) (#53956 ) While `CustomProcessor` is generic and allows for flexibility, there are new requirements that make cross validation a concept it's hard to abstract behind custom processor. In particular, we would like to add data_counts to the DFA jobs stats. Counting training VS. test docs would be a useful statistic. We would also want to add a different cross validation strategy for multiclass classification. This commit renames custom processors to cross validation splitters which allows for those enhancements without cryptically doing things as a side effect of the abstract custom processing. Backport of #53915	2020-03-23 17:15:14 +02:00
Przemysław Witek	a68071dbba	[7.x] Delete empty .ml-state* indices during nightly maintenance task. (#53587 ) (#53849 )	2020-03-20 13:08:36 +01:00
Alan Woodward	d23112f441	Report parser name and location in XContent deprecation warnings (#53805 ) It's simple to deprecate a field used in an ObjectParser just by adding deprecation markers to the relevant ParseField objects. The warnings themselves don't currently have any context - they simply say that a deprecated field has been used, but not where in the input xcontent it appears. This commit adds the parent object parser name and XContentLocation to these deprecation messages. Note that the context is automatically stripped from warning messages when they are asserted on by integration tests and REST tests, because randomization of xcontent type during these tests means that the XContentLocation is not constant	2020-03-20 11:52:55 +00:00
Dimitris Athanasiou	60153c5433	[7.x][ML] Data frame analytics analysis stats (#53788 ) (#53844 ) Adds parsing and indexing of analysis instrumentation stats. The latest one is also returned from the get-stats API. Note that we chose to duplicate objects even where they are currently similar. There are already ideas on how these will diverge in the future and while the duplication looks ugly at the moment, it is the option that offers the highest flexibility. Backport of #53788	2020-03-20 12:11:53 +02:00
Benjamin Trent	433952b595	[7.x] [ML] only retry persistence failures when the failure is intermittent and stop retrying when analytics job is stopping (#53725 ) (#53808 ) * [ML] only retry persistence failures when the failure is intermittent and stop retrying when analytics job is stopping (#53725) This fixes two issues: - Results persister would retry actions even if they are not intermittent. An example of an persistent failure is a doc mapping problem. - Data frame analytics would continue to retry to persist results even after the job is stopped. closes https://github.com/elastic/elasticsearch/issues/53687	2020-03-19 13:56:41 -04:00
Jake Landis	db3420d757	[7.x] Optimize which Rest resources are used by the Rest tests… (#53766 ) This should help with Gradle's incremental compile such that projects only depend upon the resources they use. related #52114	2020-03-19 12:28:59 -05:00
Benjamin Trent	2ccb963f1d	Create GET _cat/transforms API Issue (#53643 ) (#53726 ) Adds new` _cat/transform` and `_cat/transform/{transform_id}` endpoints.	2020-03-18 10:45:28 -04:00
Przemysław Witek	ec13c093df	Make ML index aliases hidden (#53160 ) (#53710 )	2020-03-18 10:28:45 +01:00
Przemysław Witek	376b2ae735	[7.x] Make classification evaluation metrics work when there is field mapping type mismatch (#53458 ) (#53601 )	2020-03-16 15:38:56 +01:00
Dimitris Athanasiou	94da4ca3fc	[7.x][ML] Extend classification to support multiple classes (#53539 ) (#53597 ) Prepares classification analysis to support more than just two classes. It introduces a new parameter to the process config which dictates the `num_classes` to the process. It also changes the max classes limit to `30` provisionally. Backport of #53539	2020-03-16 15:00:54 +02:00
Benjamin Trent	1262ab2762	[ML] [Inference] fix number inference models returned in x-pack info call (#53540 ) (#53560 ) the ML portion of the x-pack info API was erroneously counting configuration documents and definition documents. The underlying implementation of our storage separates the two out. This PR filters the query so that only trained model config documents are counted.	2020-03-13 16:53:34 -04:00
Benjamin Trent	4e43ede735	[ML] renaming inference processor field field_mappings to new name field_map (#53433 ) (#53502 ) This renames the `inference` processor configuration field `field_mappings` to `field_map`. `field_mappings` is now deprecated.	2020-03-13 15:40:57 -04:00
Tom Veasey	690099553c	[7.x][ML] Adds the class_assignment_objective parameter to classification (#53552 ) Adds a new parameter for classification that enables choosing whether to assign labels to maximise accuracy or to maximise the minimum class recall. Fixes #52427.	2020-03-13 17:35:51 +00:00
Benjamin Trent	89668c5ea0	[ML][Inference] adds new default_field_map field to trained models (#53294 ) (#53419 ) Adds a new `default_field_map` field to trained model config objects. This allows the model creator to supply field map if it knows that there should be some map for inference to work directly against the training data. The use case internally is having analytics jobs supply a field mapping for multi-field fields. This allows us to use the model "out of the box" on data where we trained on `foo.keyword` but the `_source` only references `foo`.	2020-03-11 13:49:39 -04:00
Przemysław Witek	8c4c19d310	Perform evaluation in multiple steps when necessary (#53295 ) (#53409 )	2020-03-11 15:36:38 +01:00
Przemysław Witek	063957b7d8	Simplify "refresh" calls. (#53385 ) (#53393 )	2020-03-11 12:26:11 +01:00
Dimitris Athanasiou	cc7751eb16	[7.x][ML] Add ILM policy to ml stats indices (#53349 ) (#53392 ) Adds a size based ILM policy to automatically rollover ml stats indices. Backport of #53349	2020-03-11 13:01:34 +02:00
Dimitris Athanasiou	0fd0516d0d	[7.x][ML] Rename data frame analytics maximum_number_trees to max_trees (#53300 ) (#53390 ) Deprecates `maximum_number_trees` parameter of classification and regression and replaces it with `max_trees`. Backport of #53300	2020-03-11 12:45:27 +02:00
David Roberts	532a720e1b	[ML] Skeleton estimate_model_memory endpoint for anomaly detection (#53386 ) This is a partial implementation of an endpoint for anomaly detector model memory estimation. It is not complete, lacking docs, HLRC and sensible numbers for many anomaly detector configurations. These will be added in a followup PR in time for 7.7 feature freeze. A skeleton endpoint is useful now because it allows work on the UI side of the change to commence. The skeleton endpoint handles the same cases that the old UI code used to handle, and produces very similar estimates for these cases. Backport of #53333	2020-03-11 10:20:00 +00:00
Przemysław Witek	d54d7f2be0	[7.x] Implement ILM policy for .ml-state* indices (#52356 ) (#53327 )	2020-03-10 14:24:18 +01:00
Benjamin Trent	856d9bfbc1	[ML] fixing data frame analysis test when two jobs are started in succession quickly (#53192 ) (#53332 ) A previous change (#53029) is causing analysis jobs to wait for certain indices to be made available. While this it is good for jobs to wait, they could fail early on _start. This change will cause the persistent task to continually retry node assignment when the failure is due to shards not being available. If the shards are not available by the time `timeout` is reached by the predicate, it is treated as a _start failure and the task is canceled. For tasks seeking a new assignment after a node failure, that behavior is unchanged. closes #53188	2020-03-10 08:30:47 -04:00
Mayya Sharipova	f96ad5c32d	Mute testSingleNumericFeatureAndMixedTrainingAndNonTrainingRows	2020-03-06 12:48:05 -05:00
Mark Vieira	09a3f45880	Mute ClassificationIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet Signed-off-by: Mark Vieira <portugee@gmail.com>	2020-03-06 07:38:04 -08:00
James Baiera	01f00df5cd	Mute RegressionIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet	2020-03-06 07:37:57 -08:00
Dimitris Athanasiou	9abf537527	[7.x][ML] Improve DF analytics audits and logging (#53179 ) (#53218 ) Adds audits for when the job starts reindexing, loading data, analyzing, writing results. Also adds some info logging. Backport of #53179	2020-03-06 13:47:27 +02:00
Benjamin Trent	af0b1c2860	[ML] Fix minor race condition in dataframe analytics _stop (#53029 ) (#53164 ) Tests have been periodically failing due to a race condition on checking a recently `STOPPED` task's state. The `.ml-state` index is not created until the task has already been transitioned to `STARTED`. This allows the `_start` API call to return. But, if a user (or test) immediately attempts to `_stop` that job, the job could stop and the task removed BEFORE the `.ml-state\|stats` indices are created/updated. This change moves towards the task cleaning up itself in its main execution thread. `stop` flips the flag of the task to `isStopping` and now we check `isStopping` at every necessary method. Allowing the task to gracefully stop. closes #53007	2020-03-05 09:59:18 -05:00
Benjamin Trent	181ee3ae0b	[ML] specifying missing_field_value value and using it instead of empty_string (#53108 ) (#53165 ) For analytics, we need a consistent way of indicating when a value is missing. Inheriting from anomaly detection, analysis sent `""` when a field is missing. This works fine with numbers, but the underlying analytics process actually treats `""` as a category in categorical values. Consequently, you end up with this situation in the resulting model ``` { "frequency_encoding" : { "field" : "RainToday", "feature_name" : "RainToday_frequency", "frequency_map" : { "" : 0.009844409027270245, "No" : 0.6472019970785184, "Yes" : 0.6472019970785184 } } } ``` For inference this is a problem, because inference will treat missing values as `null`. And thus not include them on the infer call against the model. This PR takes advantage of our new `missing_field_value` option and supplies `\0` as the value.	2020-03-05 09:50:52 -05:00
David Roberts	01504df876	[TEST] Force close failed job before skipping test (#53128 ) The assumption added in #52631 skips a problematic test if it fails to create the required conditions for the scenario it is supposed to be testing. (This happens very rarely.) However, before skipping the test it needs to remove the failed job it has created because the standard test cleanup code treats failed jobs as fatal errors. Closes #52608	2020-03-05 10:52:41 +00:00
Jay Modi	c610e0893d	Introduce system index APIs for Kibana (#53035 ) This commit introduces a module for Kibana that exposes REST APIs that will be used by Kibana for access to its system indices. These APIs are wrapped versions of the existing REST endpoints. A new setting is also introduced since the Kibana system indices' names are allowed to be changed by a user in case multiple instances of Kibana use the same instance of Elasticsearch. Additionally, the ThreadContext has been extended to indicate that the use of system indices may be allowed in a request. This will be built upon in the future for the protection of system indices. Backport of #52385	2020-03-03 14:11:36 -07:00
Yang Wang	70814daa86	Allow _rollup_search with read privilege (#52043 ) (#53047 ) Currently _rollup_search requires manage privilege to access. It should really be a read only operation. This PR changes the requirement to be read indices privilege. Resolves: #50245	2020-03-03 22:29:54 +11:00
Mark Vieira	f8396e8d15	Mute RunDataFrameAnalyticsIT.testStopOutlierDetectionWithEnoughDocumentsToScroll Signed-off-by: Mark Vieira <portugee@gmail.com>	2020-03-02 09:21:55 -08:00
Lisa Cawley	4fbe1b0550	[DOCS] Adds cat anomaly detectors API (#52866 ) (#52970 )	2020-03-02 07:28:55 -08:00
Dimitris Athanasiou	85b4e45093	[7.x]ML] Parse and report memory usage for DF Analytics (#52778 ) (#52980 ) Adds reporting of memory usage for data frame analytics jobs. This commit introduces a new index pattern `.ml-stats-*` whose first concrete index will be `.ml-stats-000001`. This index serves to store instrumentation information for those jobs. Backport of #52778 and #52958	2020-02-29 13:03:40 +02:00
Benjamin Trent	19a6c5d980	[7.x] [ML][Inference] Add support for multi-value leaves to the tree model (#52531 ) (#52901 ) * [ML][Inference] Add support for multi-value leaves to the tree model (#52531) This adds support for multi-value leaves. This is a prerequisite for multi-class boosted tree classification.	2020-02-27 14:05:28 -05:00
Benjamin Trent	eac38e9847	[ML] Add indices_options to datafeed config and update (#52793 ) (#52905 ) This adds a new configurable field called `indices_options`. This allows users to create or update the indices_options used when a datafeed reads from an index. This is necessary for the following use cases: - Reading from frozen indices - Allowing certain indices in multiple index patterns to not exist yet These index options are available on datafeed creation and update. Users may specify them as URL parameters or within the configuration object. closes https://github.com/elastic/elasticsearch/issues/48056	2020-02-27 13:43:25 -05:00
David Kyle	d8bdf31110	Revert "Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart" This reverts commit `ad3a3b1af9`.	2020-02-27 12:38:13 +00:00
David Kyle	6e5e64559a	Unwrap cause from remote ActionTransportExceptions (#52842 ) (#52878 ) And log the cause	2020-02-27 11:58:28 +00:00
Yang Wang	14c21aedd2	Simplify ml license checking with XpackLicenseState internals (#52684 ) (#52863 ) This change removes TrainedModelConfig#isAvailableWithLicense method with calls to XPackLicenseState#isAllowedByLicense. Please note there are subtle changes to the code logic. But they are the right changes: * Instead of Platinum license, Enterprise license nows guarantees availability. * No explicit check when the license requirement is basic. Since basic license is always available, this check is unnecessary. * Trial license is always allowed.	2020-02-27 14:14:16 +11:00
Jake Landis	b4179a8814	[7.x] Refactor watcher tests (#52799 ) (#52844 ) This PR moves the majority of the Watcher REST tests under the Watcher x-pack plugin. Specifically, moves the Watcher tests from: x-pack/plugin/test x-pack/qa/smoke-test-watcher x-pack/qa/smoke-test-watcher-with-security x-pack/qa/smoke-test-monitoring-with-watcher to: x-pack/plugin/watcher/qa/rest (/test and /qa/smoke-test-watcher) x-pack/plugin/watcher/qa/with-security x-pack/plugin/watcher/qa/with-monitoring Additionally, this disables Watcher from the main x-pack test cluster and consolidates the stop/start logic for the tests listed. No changes to the tests (beyond moving them) are included. 3rd party tests and doc tests (which also touch Watcher) are not included in the changes here.	2020-02-26 15:57:10 -06:00
Lisa Cawley	b788ec7157	[DOCS] Adds cat datafeeds API (#52738 )	2020-02-26 09:28:57 -08:00
David Kyle	ad3a3b1af9	Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart	2020-02-26 14:31:00 +00:00
Jake Landis	8d311297ca	[7.x] Smarter copying of the rest specs and tests (#52114 ) (#52798 ) * Smarter copying of the rest specs and tests (#52114) This PR addresses the unnecessary copying of the rest specs and allows for better semantics for which specs and tests are copied. By default the rest specs will get copied if the project applies `elasticsearch.standalone-rest-test` or `esplugin` and the project has rest tests or you configure the custom extension `restResources`. This PR also removes the need for dozens of places where the x-pack specs were copied by supporting copying of the x-pack rest specs too. The plugin/task introduced here can also copy the rest tests to the local project through a similar configuration. The new plugin/task allows a user to minimize the surface area of which rest specs are copied. Per project can be configured to include only a subset of the specs (or tests). Configuring a project to only copy the specs when actually needed should help with build cache hit rates since we can better define what is actually in use. However, project level optimizations for build cache hit rates are not included with this PR. Also, with this PR you can no longer use the includePackaged flag on integTest task. The following items are included in this PR: * new plugin: `elasticsearch.rest-resources` * new tasks: CopyRestApiTask and CopyRestTestsTask - performs the copy * new extension 'restResources' ``` restResources { restApi { includeCore 'foo' , 'bar' //will include the core specs that start with foo and bar includeXpack 'baz' //will include x-pack specs that start with baz } restTests { includeCore 'foo', 'bar' //will include the core tests that start with foo and bar includeXpack 'baz' //will include the x-pack tests that start with baz } } ```	2020-02-26 08:13:41 -06:00
David Kyle	37be695d5c	[ML] Handle failed datafeed in MlDistributedFailureIT (#52631 ) (#52789 )	2020-02-26 08:18:37 +00:00
David Roberts	cf122d13b8	[ML] Use event.timezone in file_structure_finder ingest pipeline (#52720 ) This is because beat.timezone was renamed to event.timezone in elastic/beats#9458	2020-02-25 12:33:53 +00:00
David Kyle	de3d674bb7	Revert "Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart" This reverts commit `c4d91143ac`.	2020-02-24 15:22:49 +00:00
David Kyle	044a4e127a	[ML] Add reason to DataFrameAnalyticsTask setFailed log message (#52659 ) (#52707 )	2020-02-24 15:21:51 +00:00
Benjamin Trent	afd90647c9	[ML] Adds feature importance to option to inference processor (#52218 ) (#52666 ) This adds machine learning model feature importance calculations to the inference processor. The new flag in the configuration matches the analytics parameter name: `num_top_feature_importance_values` Example: ``` "inference": { "field_mappings": {}, "model_id": "my_model", "inference_config": { "regression": { "num_top_feature_importance_values": 3 } } } ``` This will write to the document as follows: ``` "inference" : { "feature_importance" : { "FlightTimeMin" : -76.90955548511226, "FlightDelayType" : 114.13514762158526, "DistanceMiles" : 13.731580450792187 }, "predicted_value" : 108.33165831875137, "model_id" : "my_model" } ``` This is done through calculating the [SHAP values](https://arxiv.org/abs/1802.03888). It requires that models have populated `number_samples` for each tree node. This is not available to models that were created before 7.7. Additionally, if the inference config is requesting feature_importance, and not all nodes have been upgraded yet, it will not allow the pipeline to be created. This is to safe-guard in a mixed-version environment where only some ingest nodes have been upgraded. NOTE: the algorithm is a Java port of the one laid out in ml-cpp: https://github.com/elastic/ml-cpp/blob/master/lib/maths/CTreeShapFeatureImportance.cc usability blocked by: https://github.com/elastic/ml-cpp/pull/991	2020-02-21 18:42:31 -05:00
Jack Conradson	c4d91143ac	Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart Relates: #52654	2020-02-21 09:32:19 -08:00
Jay Modi	f3f6ff97ee	Single instance of the IndexNameExpressionResolver (#52604 ) This commit modifies the codebase so that our production code uses a single instance of the IndexNameExpressionResolver class. This change is being made in preparation for allowing name expression resolution to be augmented by a plugin. In order to remove some instances of IndexNameExpressionResolver, the single instance is added as a parameter of Plugin#createComponents and PersistentTaskPlugin#getPersistentTasksExecutor. Backport of #52596	2020-02-21 07:50:02 -07:00
Przemysław Witek	b84e8db7b5	[7.x] Rename .ml-state index to .ml-state-000001 to support rollover (#52510 ) (#52595 )	2020-02-21 08:55:59 +01:00
Yang Wang	4bc7545e43	Add enterprise mode and refactor license check (#51864 ) (#52115 ) Add enterprise operation mode to properly map enterprise license. Aslo refactor XPackLicenstate class to consolidate license status and mode checks. This class has many sychronised methods to check basically three things: * Minimum operation mode required * Whether security is enabled * Whether current license needs to be active Depends on the actual feature, either 1, 2 or all of above checks are performed. These are now consolidated in to 3 helper methods (2 of them are new). The synchronization is pushed down to the helper methods so actual checking methods no longer need to worry about it. resolves: #51081	2020-02-21 14:18:18 +11:00
Benjamin Trent	2a5c181dda	[ML][Inference] don't return inflated definition when storing trained models (#52573 ) (#52580 ) When `PUT` is called to store a trained model, it is useful to return the newly create model config. But, it is NOT useful to return the inflated definition. These definitions can be large and returning the inflated definition causes undo work on the server and client side. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-20 19:47:29 -05:00
Benjamin Trent	013d5c2d24	[ML] Adds support for a global calendar via `_all` (#50372 ) (#52578 ) This adds `_all` to Calendar searches. This enables users to supply the `_all` string in the `job_ids` array when creating a Calendar. That calendar will now be applied to all jobs (existing and newly created). Closes #45013 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-20 17:22:59 -05:00
David Kyle	7bbe5c8464	[Ml] Validate tree feature index is within range (#52514 ) This changes the tree validation code to ensure no node in the tree has a feature index that is beyond the bounds of the feature_names array. Specifically this handles the situation where the C++ emits a tree containing a single node and an empty feature_names list. This is valid tree used to centre the data in the ensemble but the validation code would reject this as feature_names is empty. This meant a broken workflow as you cannot GET the model and PUT it back	2020-02-19 14:41:43 +00:00
Przemysław Witek	7cd997df84	[ML] Make ml internal indices hidden (#52423 ) (#52509 )	2020-02-19 14:02:32 +01:00
David Roberts	9c49868bc5	[TEST] Use busy asserts in ML distributed failure test (#52461 ) When changing a job state using a mechanism that doesn't wait for the desired state to be reached within the production code the test code needs to loop until the cluster state has been updated. Closes #52451	2020-02-18 11:17:37 +00:00
David Roberts	48ccf36db9	[ML] Increase assertBusy timeout in ML node failure tests (#52425 ) Following the change to store cluster state in Lucene indices (#50907) it can take longer for all the cluster state updates associated with node failure scenarios to be processed during internal cluster tests where several nodes all run in the same JVM.	2020-02-17 17:04:18 +00:00
Dimitris Athanasiou	ad56802ac6	[7.x][ML] Refactor ML mappings and templates into JSON resources (#51… (#52353 ) ML mappings and index templates have so far been created programmatically. While this had its merits due to static typing, there is consensus it would be clear to maintain those in json files. In addition, we are going to adding ILM policies to these indices and the component for a plugin to register ILM policies is `IndexTemplateRegistry`. It expects the templates to be in resource json files. For the above reasons this commit refactors ML mappings and index templates into json resource files that are registered via `MlIndexTemplateRegistry`. Backport of #51765	2020-02-14 17:16:06 +02:00
Przemysław Witek	0da3af7581	[7.x] [ML] Add _cat/ml/data_frame/analytics API (#52260 ) (#52312 )	2020-02-13 16:55:47 +01:00
Jay Modi	5bcc6fce5c	Remove DeprecationLogger from route objects (#52285 ) This commit removes the need for DeprecatedRoute and ReplacedRoute to have an instance of a DeprecationLogger. Instead the RestController now has a DeprecationLogger that will be used for all deprecated and replaced route messages. Relates #51950 Backport of #52278	2020-02-12 15:05:41 -07:00
Benjamin Trent	2a968f4f2b	[ML] job results provider refactoring (#52012 ) (#52238 ) During a bug hunt, I caught a handful of things (unrelated to the bug) that could be potential issues: 1. Needlessly wrapping in exception handling (minor cleanup) 2. Potential of notifying listeners of a failure multiple times + even trying to notify of a success after a failure notification	2020-02-11 17:54:44 -05:00
David Roberts	d1d9c40e71	[ML] Switch poor categorization audit warning to use status field (#52195 ) In #51146 a rudimentary check for poor categorization was added to 7.6. This change replaces that warning based on a Java-side check with a new one based on the categorization_status field that the ML C++ sets. categorization_status was added in 7.7 and above by #51879, so this new warning based on more advanced conditions will also be in 7.7 and above. Closes #50749	2020-02-11 15:33:27 +00:00
David Roberts	473468d763	[ML] Better error when persistent task assignment disabled (#52014 ) Changes the misleading error message when attempting to open a job while the "cluster.persistent_tasks.allocation.enable" setting is set to "none" to a clearer message that names the setting. Closes #51956	2020-02-11 15:23:21 +00:00
Dimitris Athanasiou	6086fadf00	[7.x][ML] Prepare to hold additional stats in DF Analytics task (#52134 ) (#52187 ) Refactors `DataFrameAnalyticsTask` to hold a `StatsHolder` object. That just has a `ProgressTracker` for now but this is paving the way to add additional stats like memory usage, analysis stats, etc. Backport #52134	2020-02-11 11:18:45 +02:00
Dimitris Athanasiou	cbebc26f50	[7.x][ML] Retry persisting DF Analytics results (#52048 ) (#52160 ) Employs `ResultsPersisterService` from `DataFrameRowsJoiner` in order to add retries when a data frame analytics job is persisting the results to the destination data frame. Backport of #52048	2020-02-11 09:55:00 +02:00
Przemysław Witek	c7cc383d33	[7.x] Update persistent state document in the index the document belongs to (#51751 ) (#52145 )	2020-02-10 16:32:34 +01:00
David Roberts	1cefafdd14	[ML] Add new categorization stats to model_size_stats (#52009 ) This change adds support for the following new model_size_stats fields: - categorized_doc_count - total_category_count - frequent_category_count - rare_category_count - dead_category_count - categorization_status Backport of #51879	2020-02-10 09:10:50 +00:00
Jay Modi	3edadfefd0	RestHandlers declare handled routes (#52123 ) This commit changes how RestHandlers are registered with the RestController so that a RestHandler no longer needs to register itself with the RestController. Instead the RestHandler interface has new methods which when called provide information about the routes (method and path combinations) that are handled by the handler including any deprecated and/or replaced combinations. This change also makes the publication of RestHandlers safe since they no longer publish a reference to themselves within their constructors. Closes #51622 Co-authored-by: Jason Tedor <jason@tedor.me> Backport of #51950	2020-02-09 22:48:32 -07:00
Benjamin Trent	dffcd021df	[7.x] [ML] Add bwc serialization unit test scaffold (#51889 ) (#52061 ) * [ML] Add bwc serialization unit test scaffold (#51889) Adds new `AbstractBWCSerializationTestCase` which provides easy scaffolding for BWC serialization unit tests. These are no replacement for true BWC tests (which execute actual old code). These tests do provide some good coverage for the current code when serializing to/from old versions. * removing unnecessary override for 7.series branch * adding necessary import Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-07 17:17:11 -05:00
Benjamin Trent	846f87a26e	[ML] allow close/stop for jobs/datafeeds with missing configs (#51888 ) (#51997 ) If the configs are removed (by some horrific means), we should still allow tasks to be cleaned up easily. Datafeeds and jobs with missing configs are now visible in their respective _stats calls and can be stopped/closed.	2020-02-06 12:10:18 -05:00
Benjamin Trent	79f143907a	[7.x] [ML] add _cat/ml/trained_models API (#51529 ) (#51936 ) * [ML] add _cat/ml/trained_models API (#51529) This adds _cat/ml/trained_models.	2020-02-05 08:26:44 -05:00
Julie Tibshirani	38ce428831	Create a class to hold field capabilities for one index. (#51844 ) Currently, the same class `FieldCapabilities` is used both to represent the capabilities for one index, and also the merged capabilities across indices. To help clarify the logic, this PR proposes to create a separate class `IndexFieldCapabilities` for the capabilities in one index. The refactor will also help when adding `source_path` information in #49264, since the merged source path field will have a different structure from the field for a single index. Individual changes: * Add a new class IndexFieldCapabilities. * Remove extra constructor from FieldCapabilities. * Combine the add and merge methods in FieldCapabilities.Builder.	2020-02-04 11:24:57 -08:00
David Roberts	9d55c45b5a	[ML] Improve multiline_start_pattern for CSV in find_file_structure (#51737 ) The work to switch file upload over to treating delimited files like semi-structured text and using the ingest pipeline for CSV parsing makes the multi-line start pattern used for delimited files much more critical than it used to be. Previously it was always based on the time field, even if that was towards the end of the columns, and no multi-line pattern was created if no timestamp was detected. This change improves the multi-line start pattern by: 1. Never creating a multi-line pattern if the sample contained only single line records. This improves the import efficiency in a common case. 2. Choosing the leftmost field that has a well-defined pattern, whether that be the time field or a boolean/numeric field. This reduces the risk of a field with newlines occurring earlier, and also means the algorithm doesn't automatically fail for data without a timestamp.	2020-02-04 12:37:48 +00:00
Benjamin Trent	d293980a09	[7.x] [ML] add GET _cat/ml/datafeeds (#51500 ) (#51829 ) * [ML] add GET _cat/ml/datafeeds (#51500) This adds GET _cat/ml/datafeeds && _cat/ml/datafeeds/{datafeed_id} * fixing for java8 compilation	2020-02-03 17:16:33 -05:00
David Roberts	d5d8fb26fa	[TEST] Remove obsolete test trace logging from NetworkDisruptionIT (#51746 ) The issue this logging was added to fix (#49908) was closed in December and the problem has not recurred so this logging is no longer needed.	2020-02-03 11:25:53 +00:00
Dimitris Athanasiou	55b5c8f703	[7.x][ML] Remove index.unassigned.node_left.delayed_timeout setting from M… (#51740 ) (#51764 ) This setting was introduced with the purpose of reducing the time took by tests that shut nodes down. Tests like `MlDistributedFailureIT` and `NetworkDisruptionIT`. However, it is unfortunate to have to set the value to an explicit value in production. In addition, and most important, the dynamically choosing the value for this setting makes it impossible to adopt static index template configs that we register via `IndexTemplateRegistry`, which we need to use in order to start registering ILM policies for the ML indices. This commit removes this setting from our templates. I run the tests a few times and could not see execution time differing significantly. Backport of #51740	2020-01-31 20:28:29 +02:00
Benjamin Trent	e372854d43	[ML][Inference] Fix model pagination with models as resources (#51573 ) (#51736 ) This adds logic to handle paging problems when the ID pattern + tags reference models stored as resources. Most of the complexity comes from the issue where a model stored as a resource could be at the start, or the end of a page or when we are on the last page.	2020-01-31 07:52:19 -05:00
Gordon Brown	10c8179351	Use exclusions list instead of fake system indices (#51586 ) This commit switches the strategy for managing dot-prefixed indices that should be hidden indices from using "fake" system indices to an explicit exclusions list that must be updated when those indices are converted to hidden indices.	2020-01-30 16:31:27 -07:00
Benjamin Trent	1380dd439a	[7.x] [ML][Inference] Fix weighted mode definition (#51648 ) (#51695 ) * [ML][Inference] Fix weighted mode definition (#51648) Weighted mode inaccurately assumed that the "max value" of the input values would be the maximum class value. This does not make sense. Weighted Mode should know how many classes there are. Hence the new parameter `num_classes`. This indicates what the maximum class value to be expected.	2020-01-30 15:33:25 -05:00
Henning Andersen	149b68d850	[ML] Fix possible race condition starting datafeed (#51646 ) Datafeeds being closed while starting could result in and NPE. This was handled as any other failure, masking out the NPE. However, this conflicts with the changes in #50886. Related to #50886 and #51302	2020-01-30 08:23:45 +01:00
Przemysław Witek	683170b007	Increase the number of indexed documents to increase a chance that there are at least 2 training rows. (#51607 ) (#51615 )	2020-01-29 17:17:19 +01:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492 ) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but by parsing CSV in the ingest pipeline it makes the Kibana file upload functionality more easily interchangable with Filebeat such that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
David Roberts	3c223ceea1	[ML] Fix 2 digit year regex in find_file_structure (#51469 ) The DATE and DATESTAMP Grok patterns match 2 digit years as well as 4 digit years. The pattern determination in find_file_structure worked correctly in this case, but the regex used to create a multi-line start pattern was assuming a 4 digit year. Also, the quick rule-out patterns did not always correctly consider 2 digit years, meaning that detection was inconsistent. This change fixes both problems, and also extends the tests for DATE and DATESTAMP to check both 2 and 4 digit years.	2020-01-27 17:23:18 +00:00
Przemysław Witek	dd3e2f1e18	[7.x] Update quantiles document in the index the document belongs to (#51135 ) (#51415 )	2020-01-27 10:13:02 +01:00
Benjamin Trent	bf53ca3380	[7.x] [ML] Add _cat/ml/anomaly_detectors API (#51364 ) (#51408 ) [ML] Add _cat/ml/anomaly_detectors API (#51364)	2020-01-24 11:54:22 -05:00
Benjamin Trent	fc994d9ce1	[ML][Inference] Adds validations for model PUT (#51376 ) (#51409 ) Adds validations making sure that * `input.field_names` is not empty * `ensemble.trained_models` is not empty * `tree.feature_names` is not empty closes https://github.com/elastic/elasticsearch/issues/51354	2020-01-24 09:29:12 -05:00
Benjamin Trent	76660a5a4f	[7.x] [ML][Inference] add tags url param to GET (#51330 ) (#51404 ) * [ML][Inference] add tags url param to GET (#51330) Adds a new URL parameter, `tags` to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those who contain all the provided tags.	2020-01-24 08:26:58 -05:00
Dimitris Athanasiou	3443d69883	[7.x][ML] Rename DataFrameAnalyticsIndex to DestinationIndex (#51353 ) (#51356 ) As we prepare to introduce a new index for storing additional information about data frame analytics jobs (e.g. intrumentation), renaming this class to `DestinationIndex` better captures what it does and leaves its prior name available for a more suitable use. Backport of #51353 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-01-24 09:51:48 +02:00
David Kyle	0ac03ac5e7	[ML] Add parsers for inference configuration classes (#51300 )	2020-01-22 17:03:01 +00:00
David Kyle	ca4b90a001	[ML] Calculate results and snapshot retention using latest bucket timestamps (#51061 ) (#51301 ) The retention period is calculated relative to the last bucket result or snapshot time rather than wall clock	2020-01-22 14:52:33 +00:00
Dimitris Athanasiou	59687a9384	[7.x][ML] Validate classification dependent_variable cardinality is at lea… (#51232 ) (#51309 ) Data frame analytics classification currently only supports 2 classes for the dependent variable. We were checking that the field's cardinality is not higher than 2 but we should also check it is not less than that as otherwise the process fails. Backport of #51232	2020-01-22 16:51:16 +02:00
Benjamin Trent	2a73e849d6	[ML][Inference] fixing ingest IT tests (#51267 ) (#51311 ) Converts InferenceIngestIT into a `ESRestTestCase`. closes #51201	2020-01-22 09:50:17 -05:00
David Roberts	932c63297f	[ML] Fix possible race condition when starting datafeed (#51302 ) The ID of the datafeed's associated job was being obtained frequently by looking up the datafeed task in a map that was being modified in other threads. This could lead to NPEs if the datafeed stopped running at an unexpected time. This change reduces the number of places where a datafeed's associated job ID is looked up to avoid the possibility of failures when the datafeed's task is removed from the map of running tasks during multi-step operations in other threads. Fixes #51285	2020-01-22 11:40:39 +00:00
Przemysław Witek	bfcfcdee33	[7.x] Do not copy mapping from dependent variable to prediction field in regression analysis (#51227 ) (#51288 )	2020-01-22 12:36:24 +01:00
Benjamin Trent	a9b2bc525e	[ML] address two edge cases for categorization.GrokPatternCreator#findBestGrokMatchFromExamples (#51168 ) (#51255 ) There are two edge cases that can be ran into when example input is matched in a weird way. 1. Recursion depth could continue many many times, resulting in a HUGE runtime cost. I put a limit of 10 recursions (could be adjusted I suppose). 2. If there are no "fixed regex bits", exploring the grok space would result in a fence-post error during runtime (with assertions turned off)	2020-01-21 10:29:29 -05:00
David Roberts	0fa7db9a95	[ML] Make datafeeds work with nanosecond time fields (#51180 ) Allows ML datafeeds to work with time fields that have the "date_nanos" type _and make use of the extra precision_. (Previously datafeeds only worked with time fields that were exact multiples of milliseconds. So datafeeds would work with "date_nanos" only if the extra precision over "date" was not used.) Relates #49889	2020-01-21 09:59:50 +00:00
Jay Modi	107989df3e	Introduce hidden indices (#51164 ) This change introduces a new feature for indices so that they can be hidden from wildcard expansion. The feature is referred to as hidden indices. An index can be marked hidden through the use of an index setting, `index.hidden`, at creation time. One primary use case for this feature is to have a construct that fits indices that are created by the stack that contain data used for display to the user and/or intended for querying by the user. The desire to keep them hidden is to avoid confusing users when searching all of the data they have indexed and getting results returned from indices created by the system. Hidden indices have the following properties: * API calls for all indices (empty indices array, _all, or ) will not return hidden indices by default. Wildcard expansion will not return hidden indices by default unless the wildcard pattern begins with a `.`. This behavior is similar to shell expansion of wildcards. * REST API calls can enable the expansion of wildcards to hidden indices with the `expand_wildcards` parameter. To expand wildcards to hidden indices, use the value `hidden` in conjunction with `open` and/or `closed`. * Creation of a hidden index will ignore global index templates. A global index template is one with a match-all pattern. * Index templates can make an index hidden, with the exception of a global index template. * Accessing a hidden index directly requires no additional parameters. Backport of #50452	2020-01-17 10:09:01 -07:00
David Roberts	295665b1ea	[ML] Add audit warning for 1000 categories found early in job (#51146 ) If 1000 different category definitions are created for a job in the first 100 buckets it processes then an audit warning will now be created. (This will cause a yellow warning triangle in the ML UI's jobs list.) Such a large number of categories suggests that the field that categorization is working on is not well suited to the ML categorization functionality.	2020-01-17 16:28:45 +00:00
Przemysław Witek	da73c9104e	[ML] Fix tests randomly failing on CI (#51142 ) (#51150 )	2020-01-17 14:58:58 +01:00
Dimitris Athanasiou	b70ebdeb96	[7.x][ML] DF Analytics _explain API should skip object fields (#51115 ) (#51147 ) Object fields cannot be used as features. At the moment _explain API includes them and even worse it allows it does not error when an object field is excluded. This creates the expectation to the user that all children fields will also be excluded while it's not the case. This commit omits object fields from the _explain API and also adds an error if an object field is included or excluded. Backport of #51115	2020-01-17 14:02:59 +02:00
Przemysław Witek	b1a526d5e9	[7.x] [ML] Update DFA progress document in the index the document belongs to (#51111 ) (#51117 )	2020-01-17 08:12:54 +01:00
Tom Veasey	32ec934b15	[7.x][ML] Assert top classes are ordered by score (#51028 ) Backport #51003.	2020-01-16 12:23:15 +00:00
David Roberts	1536c3e622	[TEST] Increase ML distributed test job open timeout (#50998 ) There have been occasional failures, presumably due to too many tests running in parallel, caused by jobs taking around 15 seconds to open. (You can see the job open successfully during the cleanup phase shortly after the failure of the test in these cases.) This change increases the wait time from 10 seconds to 20 seconds to reduce the risk of this happening.	2020-01-15 08:58:55 +00:00
Nik Everett	fc5fde7950	Add "did you mean" to ObjectParser (#50938 ) (#50985 ) Check it out: ``` $ curl -u elastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_update/foo?pretty -d'{ "dac": {} }' { "error" : { "root_cause" : [ { "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" } ], "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" }, "status" : 400 } ``` The tricky thing about implementing this is that x-content doesn't depend on Lucene. So this works by creating an extension point for the error message using SPI. Elasticsearch's server module provides the "spell checking" implementation. s	2020-01-14 17:53:41 -05:00
Benjamin Trent	72c270946f	[ML][Inference] Adding classification_weights to ensemble models (#50874 ) (#50994 ) * [ML][Inference] Adding classification_weights to ensemble models classification_weights are a way to allow models to prefer specific classification results over others this might be advantageous if classification value probabilities are a known quantity and can improve model error rates.	2020-01-14 12:40:25 -05:00
Tom Veasey	de5713fa4b	[ML] Disable invalid assertion (#50988 ) Backport #50986.	2020-01-14 17:35:00 +00:00
David Kyle	7f309a18f1	[7.x][ML] Explicitly require a OriginSettingClient in ML results iterators (#50981 ) In classes where the client is used directly rather than through a call to executeAsyncWithOrigin explicitly require the client to be OriginSettingClient rather than using the Client interface. Also remove calls to deprecated ClientHelper.clientWithOrigin() method.	2020-01-14 17:14:39 +00:00
Dimitris Athanasiou	1d8cb3c741	[7.x][ML] Add num_top_feature_importance_values param to regression and classi… (#50914 ) (#50976 ) Adds a new parameter to regression and classification that enables computation of importance for the top most important features. The computation of the importance is based on SHAP (SHapley Additive exPlanations) method. Backport of #50914	2020-01-14 16:46:09 +02:00
Przemysław Witek	9c6ffdc2be	[7.x] Handle nested and aliased fields correctly when copying mapping. (#50918 ) (#50968 )	2020-01-14 14:43:39 +01:00
Benjamin Trent	eb8fd44836	[ML][Inference] minor fixes for created_by, and action permission (#50890 ) (#50911 ) The system created and models we provide now use the `_xpack` user for uniformity with our other features The `PUT` action is now an admin cluster action And XPackClient class now references the action instance.	2020-01-13 07:59:31 -05:00
Benjamin Trent	fa116a6d26	[7.x] [ML][Inference] PUT API (#50852 ) (#50887 ) * [ML][Inference] PUT API (#50852) This adds the `PUT` API for creating trained models that support our format. This includes * HLRC change for the API * API creation * Validations of model format and call * fixing backport	2020-01-12 10:59:11 -05:00
Dimitris Athanasiou	422422a2bc	[7.x][ML] Reuse SourceDestValidator for data frame analytics (#50841 ) (#50850 ) This commit removes validation logic of source and dest indices for data frame analytics and replaces it with using the common `SourceDestValidator` class which is already used by transforms. This way the validations and their messages become consistent while we reduce code. This means that where these validations fail the error messages will be slightly different for data frame analytics. Backport of #50841	2020-01-10 14:24:13 +02:00
Jake Landis	de6f132887	[7.x] Foreach processor - fork recursive call (#50514 ) (#50773 ) A very large number of recursive calls can cause a stack overflow exception. This commit forks the recursive calls for non-async processors. Once forked, each thread will handle at most 10 recursive calls to help keep the stack size and thread count down to a reasonable size.	2020-01-09 13:21:18 -06:00
Benjamin Trent	cc0e64572a	[ML][Inference][HLRC] Add necessary lang ident classes (#50705 ) (#50794 ) This adds the necessary named XContent classes to the HLRC for the lang ident model. This is so the HLRC can call `GET _ml/inference/lang_ident_model_1?include_definition=true` without XContent parsing errors. The constructors are package private as since this classes are used exclusively within the pre-packaged model (and require the specific weights, etc. to be of any use).	2020-01-09 10:33:38 -05:00
Adrien Grand	31158ab3d5	Add per-field metadata. (#50333 ) This PR adds per-field metadata that can be set in the mappings and is later returned by the field capabilities API. This metadata is completely opaque to Elasticsearch but may be used by tools that index data in Elasticsearch to communicate metadata about fields with tools that then search this data. A typical example that has been requested in the past is the ability to attach a unit to a numeric field. In order to not bloat the cluster state, Elasticsearch requires that this metadata be small: - keys can't be longer than 20 chars, - values can only be numbers or strings of no more than 50 chars - no inner arrays or objects, - the metadata can't have more than 5 keys in total. Given that metadata is opaque to Elasticsearch, field capabilities don't try to do anything smart when merging metadata about multiple indices, the union of all field metadatas is returned. Here is how the meta might look like in mappings: ```json { "properties": { "latency": { "type": "long", "meta": { "unit": "ms" } } } } ``` And then in the field capabilities response: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms" ] } } } } ``` When there are no conflicts, values are arrays of size 1, but when there are conflicts, Elasticsearch includes all unique values in this array, without giving ways to know which index has which metadata value: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms", "ns" ] } } } } ``` Closes #33267	2020-01-08 16:21:18 +01:00
Benjamin Trent	060e0a6277	[ML][Inference] Add support for models shipped as resources (#50680 ) (#50700 ) This adds support for models that are shipped as resources in the ML plugin. The first of which is the `lang_ident` model.	2020-01-07 09:21:59 -05:00
Przemysław Witek	4116452d90	Implement testStopAndRestart for ClassificationIT (#50585 ) (#50698 )	2020-01-07 13:41:37 +01:00
David Roberts	35453e2b0e	[ML] Improve uniqueness of result document IDs (#50644 ) Switch from a 32 bit Java hash to a 128 bit Murmur hash for creating document IDs from by/over/partition field values. The 32 bit Java hash was not sufficiently unique, and could produce identical numbers for relatively common combinations of by/partition field values such as L018/128 and L017/228. Fixes #50613	2020-01-07 10:24:45 +00:00
David Roberts	46d600c446	[ML] Fix off-by-one error in ml_classic tokenizer end offset (#50655 ) The end offset of a tokenizer is supposed to point one past the end of the input, not to the end character of the input. The ml_classic tokenizer was erroneously doing the latter.	2020-01-07 10:14:59 +00:00
Benjamin Trent	06cea5136e	[ML] construct new random generator on each persistence call (#50657 ) (#50684 ) Sharing a random generator may cause test failures as non-threadsafe random generators are periodically utilized in tests (see: https://github.com/elastic/elasticsearch/issues/50651) This change constructs a calls `Randomness.get()` within the `bulkIndexWithRetry` method so that the returned `Random` object is only used in a single thread. Before, the member variable could have been used between threads, which caused test failures.	2020-01-06 16:26:29 -05:00
Benjamin Trent	5ab9e75e28	[7.x] [ML][Inference] lang_ident model (#50292 ) (#50675 ) * [ML][Inference] lang_ident model (#50292) This PR contains a java port of Google's CLD3 compact NN model https://github.com/google/cld3 The ported model is formatted to fit within our inference model formatting and stored as a resource in the `:xpack:ml:` plugin and is under basic license. The model is broken up into two major parts: - Preprocessing through the custom embedding (based on CLD3's embedding layer) - Pushing the embedded text through the two layers of fully connected shallow NN. Main differences between this port and CLD3: - We take advantage of Java's internal Unicode handling where possible (i.e. codepoints, characters, decoders, etc.) - We do not trim down input text by removing duplicated tokens - We do not encode doubles/floats as longs/integers.	2020-01-06 16:24:03 -05:00
Benjamin Trent	f52af7977d	[ML][Inference] minor cleanup for inference (#50444 ) (#50676 )	2020-01-06 14:05:04 -05:00
Dimitris Athanasiou	ca0828ba07	[7.x][ML] Implement force deleting a data frame analytics job (#50553 ) (#50589 ) Adds a `force` parameter to the delete data frame analytics request. When `force` is `true`, the action force-stops the jobs and then proceeds to the deletion. This can be used in order to delete a non-stopped job with a single request. Closes #48124 Backport of #50553	2020-01-03 13:46:02 +02:00
Przemysław Witek	8917c05df8	[7.x] Synchronize processInStream.close() call (#50581 )	2020-01-03 10:23:51 +01:00
Przemysław Witek	4ecabe496f	Mute testStopAndRestart test case (#50551 )	2020-01-02 15:28:20 +01:00
Christoph Büscher	1599af8428	Fix type conversion problem in Eclipse (#50549 ) Eclipse 4.13 shows a type mismatch error in the affected line because it cannot correctly infer the boolean return type for the method call. Assigning return value to a local variable resolves this problem.	2020-01-02 14:29:20 +01:00
Przemysław Witek	3e3a93002f	[7.x] Fix accuracy metric (#50310 ) (#50433 )	2019-12-20 15:34:38 +01:00
Przemysław Witek	14d95aae46	[7.x] Make each analysis report desired field mappings to be copied (#50219 ) (#50428 )	2019-12-20 15:10:33 +01:00
Przemysław Witek	5bb668b866	[7.x] Get rid of maxClassesCardinality internal parameter (#50418 ) (#50423 )	2019-12-20 14:24:23 +01:00
Przemysław Witek	cc4bc797f9	[7.x] Implement `precision` and `recall` metrics for classification evaluation (#49671 ) (#50378 )	2019-12-19 18:55:05 +01:00
Dimitris Athanasiou	d3c83cd55a	[7.x][ML] Refresh state index before completing data frame analytics job (#50322 ) (#50324 ) In order to ensure any persisted model state is searchable by the moment the job reports itself as `stopped`, we need to refresh the state index before completing. This should fix the occasional failures we see in #50168 and #50313 where the model state appears missing. Closes #50168 Closes #50313 Backport of #50322	2019-12-18 22:19:59 +00:00
Benjamin Trent	4396a1f78b	[ML][Inference] fix support for nested fields (#50258 ) (#50335 ) This fixes support for nested fields We now support fully nested, fully collapsed, or a mix of both on inference docs. ES mappings allow the `_source` to be any combination of nested objects + dot delimited fields. So, we should do our best to find the best path down the Map for the desired field.	2019-12-18 15:47:06 -05:00
Dimitris Athanasiou	447bac27d2	[7.x][ML] Delete unused data frame analytics state (#50243 ) (#50280 ) This commit adds removal of unused data frame analytics state from the _delete_expired_data API (and in extend th ML daily maintenance task). At the moment the potential state docs include the progress document and state for regression and classification analyses. Backport of #50243	2019-12-18 12:30:11 +00:00
Przemysław Witek	ac974c35c0	Pass processConnectTimeout to the method that fetches C++ process' PID (#50276 ) (#50290 )	2019-12-17 21:32:37 +01:00
David Kyle	098f540f9d	[ML] Remove usage of base action logger in ml actions (#50074 ) (#50236 )	2019-12-17 13:03:27 +00:00
David Kyle	5542686283	[ML] Wait for green after opening job in NetworkDisruptionIT (#50232 ) Closes #49908	2019-12-16 14:55:58 +00:00
Dimitris Athanasiou	73add726d7	[7.x][ML] Fix exception when field is not included and excluded at the same time (#50192 ) (#50223 ) Executing the data frame analytics _explain API with a config that contains a field that is not in the includes list but at the same time is the excludes list results to trying to remove the field twice from the iterator. That causes an `IllegalStateException`. This commit fixes this issue and adds a test that captures the scenario. Backport of #50192	2019-12-16 11:30:06 +00:00
Benjamin Trent	4805d8ac7d	[ML][Inference] Adding a warning_field for warning msgs. (#49838 ) (#50183 ) This adds a new field for the inference processor. `warning_field` is a place for us to write warnings provided from the inference call. When there are warnings we are not going to write an inference result. The goal of this is to indicate that the data provided was too poor or too different for the model to make an accurate prediction. The user could optionally include the `warning_field`. When it is not provided, it is assumed no warnings were desired to be written. The first of these warnings is when ALL of the input fields are missing. If none of the trained fields are present, we don't bother inferencing against the model and instead provide a warning stating that the fields were missing. Also, this adds checks to not allow duplicated fields during processor creation.	2019-12-13 10:39:51 -05:00
Benjamin Trent	41736dd6c3	[ML] retry bulk indexing of state docs (#50149 ) (#50185 ) This exchanges the direct use of the `Client` for `ResultsPersisterService`. State doc persistence will now retry. Failures to persist state will still not throw, but will be audited and logged.	2019-12-13 10:39:34 -05:00
Dimitris Athanasiou	fe3c9e71d1	[7.x][ML] Fix DFA explain API timeout when source index is missing (#50176 ) (#50180 ) This commit fixes a bug that caused the data frame analytics _explain API to time out in a multi-node setup when the source index was missing. When we try to create the extracted fields detector, we check the index settings. If the index is missing that responds with a failure that could be wrapped as a remote exception. While we unwrapped correctly to check if the cause was an `IndexNotFoundException`, we then proceeded to cast the original exception instead of the cause. Backport of #50176	2019-12-13 17:00:55 +02:00
Dimitris Athanasiou	e6cbcf7f7c	[7.x] [ML] Persist/restore state for DFA classification (#50040 ) (#50147 ) This commit adds state persist/restore for data frame analytics classification jobs. Backport of #50040	2019-12-13 10:33:19 +02:00
Benjamin Trent	d7ffa7f8f7	[7.x][ML] Add graceful retry for anomaly detector result indexing failures(#49508 ) (#50145 ) * [ML] Add graceful retry for anomaly detector result indexing failures (#49508) All results indexing now retry the amount of times configured in `xpack.ml.persist_results_max_retries`. The retries are done in a semi-random, exponential backoff. * fixing test	2019-12-12 12:24:58 -05:00
Benjamin Trent	c043aa887f	[ML][Inference] Simplify inference processor options (#50105 ) (#50146 ) * [ML][Inference] Simplify inference processor options * addressing pr comments	2019-12-12 11:13:55 -05:00
David Roberts	13e47df97d	[TEST] Increase timeout for ML internal cluster cleanup (#50142 ) Closes #48511	2019-12-12 15:38:22 +00:00
David Kyle	7d4118dc4e	Enable trace logging in failing ml NetworkDisruptionIT https://github.com/elastic/elasticsearch/issues/49908	2019-12-12 11:16:01 +00:00
Dimitris Athanasiou	03ecaae221	[7.x][ML] Avoid classification integ test training on single class (#50072 ) (#50078 ) The `ClassificationIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet` test was previously set up to just have 10 rows. With `training_percent` of 50%, only 5 rows will be used for training. There is a good chance that all 5 rows will be of one class which results to failure. This commit increases the rows to 100. Now 50 rows should be used for training and the chance of failure should be very small. Backport of #50072	2019-12-11 18:50:26 +02:00
David Turner	285eacd267	Use more specific loggers in subclasses of TMNA (#50076 ) Adjusts the subclasses of `TransportMasterNodeAction` to use their own loggers instead of the one for the base class. Relates #50056. Partial backport of #46431 to 7.x.	2019-12-11 15:07:47 +00:00
Przemysław Witek	9b116c8fef	A few improvements to AnalyticsProcessManager class that make the code more readable. (#50026 ) (#50069 )	2019-12-11 09:35:05 +01:00
Dimitris Athanasiou	8891f4db88	[7.x][ML] Introduce randomize_seed setting for regression and classification (#49990 ) (#50023 ) This adds a new `randomize_seed` for regression and classification. When not explicitly set, the seed is randomly generated. One can reuse the seed in a similar job in order to ensure the same docs are picked for training. Backport of #49990	2019-12-10 15:29:19 +02:00
Benjamin Trent	0b6ce9683c	[ML] Use query in cardinality check (#49939 ) (#49984 ) When checking the cardinality of a field, the query should be take into account. The user might know about some bad data in their index and want to filter down to the target_field values they care about.	2019-12-09 10:14:41 -05:00
Przemysław Witek	0965a10468	[7.x] Pass `prediction_field_type` to C++ analytics process (#49861 ) (#49981 )	2019-12-09 14:43:01 +01:00
Benjamin Trent	049d854360	[ML][Inference] adjust so target_field always has inference result and optionally allow new top classes field in the classification config (#49923 ) (#49982 )	2019-12-09 08:29:45 -05:00
Dimitris Athanasiou	e4f838e764	[7.x][ML] Update expected mem estimate in explain API integ test (#49924 ) (#49979 ) Work in progress in the c++ side is increasing memory estimates a bit and this test fails. At the time of this commit the mem estimate when there is no source query is a about 2Mb. So I am relaxing the test to assert memory estimate is less than 1Mb instead of 500Kb. Backport of #49924	2019-12-09 11:52:06 +02:00
Przemysław Witek	e60837aa3b	[7.x] Log whole analytics stats when the state assertion fails (#49906 ) (#49911 )	2019-12-06 14:31:17 +01:00
Przemysław Witek	1d8e3d69d7	Make only a part of `stop()` method a critical section. (#49756 ) (#49788 )	2019-12-03 09:54:16 +01:00
Dimitris Athanasiou	4edb2e7bb6	[7.x][ML] Add optional source filtering during data frame reindexing (#49690 ) (#49718 ) This adds a `_source` setting under the `source` setting of a data frame analytics config. The new `_source` is reusing the structure of a `FetchSourceContext` like `analyzed_fields` does. Specifying includes and excludes for source allows selecting which fields will get reindexed and will be available in the destination index. Closes #49531 Backport of #49690	2019-11-29 16:10:44 +02:00
Dimitris Athanasiou	c23a2187da	[7.x][ML] Only report complete writing_results progress after completion (#49551 ) (#49577 ) We depend on the number of data frame rows in order to report progress for the writing of results, the last phase of a job run. However, results include other objects than just the data frame rows (e.g, progress, inference model, etc.). The problem this commit fixes is that if we receive the last data frame row results we'll report that progress is complete even though we still have more results to process potentially. If the job gets stopped for any reason at this point, we will not be able to restart the job properly as we'll think that the job was completed. This commit addresses this by limiting the max progress we can report for the writing_results phase before the results processor completes to 98. At the end, when the process is done we set the progress to 100. The commit also improves failure capturing and reporting in the results processor. Backport of #49551	2019-11-26 12:20:37 +02:00
Benjamin Trent	688c78c589	[ML] Stop timing stats failure propagation (#49495 ) (#49501 )	2019-11-25 10:09:30 -05:00
David Roberts	62811c2272	[ML] Add default categorization analyzer definition to ML info (#49545 ) The categorization job wizard in the ML UI will use this information when showing the effect of the chosen categorization analyzer on a sample of input.	2019-11-25 13:39:16 +00:00
Dimitris Athanasiou	aca38f6882	[7.x][ML] DFA jobs should accept excluding an unsupported field (#49535 ) (#49544 ) Before this change excluding an unsupported field resulted in an error message that explained the excluded field could not be detected as if it doesn't exist. This error message is confusing. This commit commit changes this so that there is no error in this scenario. When excluding a field that does exist but has been automatically been excluded from the analysis there is no harm (unlike excluding a missing field which could be a typo). Backport of #49535	2019-11-25 15:13:00 +02:00
Dimitris Athanasiou	c149c64dc4	[7.x][ML] Apply source query on data frame analytics memory estimation (#49517 ) (#49532 ) Closes #49454 Backport of #49517	2019-11-25 12:51:57 +02:00
Dimitris Athanasiou	8eaee7cbdc	[7.x][ML] Explain data frame analytics API (#49455 ) (#49504 ) This commit replaces the _estimate_memory_usage API with a new API, the _explain API. The API consolidates information that is useful before creating a data frame analytics job. It includes: - memory estimation - field selection explanation Memory estimation is moved here from what was previously calculated in the _estimate_memory_usage API. Field selection is a new feature that explains to the user whether each available field was selected to be included or not in the analysis. In the case it was not included, it also explains the reason why. Backport of #49455	2019-11-22 22:06:10 +02:00
Benjamin Trent	a7477ad7c3	[7.x] [ML][Inference] compressing model definition and lazy parsing (#49269 ) (#49446 ) * [ML][Inference] compressing model definition and lazy parsing (#49269) * [ML][Inference] compressing model definition and lazy parsing * addressing PR comments * adding commons io * implementing simplified bounded stream * adjusting for type inclusion	2019-11-21 15:32:32 -05:00
Benjamin Trent	d41b2e3f38	[ML][Inference] allowing per-model licensing (#49398 ) (#49435 ) * [ML][Inference] allowing per-model licensing * changing to internal action + removing pre-mature opt	2019-11-21 09:46:34 -05:00
Przemysław Witek	c7ac2011eb	[7.x] Implement accuracy metric for multiclass classification (#47772 ) (#49430 )	2019-11-21 15:01:18 +01:00
David Roberts	20558cf61c	[ML] Fix simultaneous stop and force stop datafeed (#49367 ) If a datafeed is stopped normally and force stopped at the same time then it is possible that the force stop removes the persistent task while the normal stop is performing actions. Currently this causes the normal stop to error, but since stopping a stopped datafeed is not an error this doesn't make sense. Instead the force stop should just take precedence. This is a followup to #49191 and should really have been included in the changes in that PR.	2019-11-20 12:52:47 +00:00
Przemysław Witek	9c0ec7ce23	[7.x] Make AnalyticsProcessManager class more robust (#49282 ) (#49356 )	2019-11-20 10:08:16 +01:00
Dimitris Athanasiou	4d6e037e90	[7.x][ML] Extract creation of DFA field extractor into a factory (#49315 ) (#49329 ) This commit moves the async calls required to retrieve the components that make up `ExtractedFieldsExtractor` out of `DataFrameDataExtractorFactory` and into a dedicated `ExtractorFieldsExtractorFactory` class. A few more refactorings are performed: - The detector no longer needs the results field. Instead, it knows whether to use it or not based on whether the task is restarting. - We pass more accurately whether the task is restarting or not. - The validation of whether fields that have a cardinality limit are valid is now performed in the detector after retrieving the respective cardinalities. Backport of #49315	2019-11-20 10:02:42 +02:00
Przemysław Witek	42bb8ae525	[7.x] Extract indexData method out of RegressionIT tests (#49306 ) (#49313 )	2019-11-19 22:47:12 +01:00
Benjamin Trent	19602fd573	[ML][Inference] changing setting to be memorySizeSettting (#49259 ) (#49302 )	2019-11-19 07:56:40 -05:00
David Roberts	a5204c1c80	[ML] Fixes for stop datafeed edge cases (#49284 ) The following edge cases were fixed: 1. A request to force-stop a stopping datafeed is no longer ignored. Force-stop is an important recovery mechanism if normal stop doesn't work for some reason, and needs to operate on a datafeed in any state other than stopped. 2. If the node that a datafeed is running on is removed from the cluster during a normal stop then the stop request is retried (and will likely succeed on this retry by simply cancelling the persistent task for the affected datafeed). 3. If there are multiple simultaneous force-stop requests for the same datafeed we no longer fail the one that is processed second. The previous behaviour was wrong as stopping a stopped datafeed is not an error, so stopping a datafeed twice simultaneously should not be either. Backport of #49191	2019-11-19 10:51:46 +00:00
Benjamin Trent	eefe7688ce	[7.x][ML] ML Model Inference Ingest Processor (#49052 ) (#49257 ) * [ML] ML Model Inference Ingest Processor (#49052) * [ML][Inference] adds lazy model loader and inference (#47410) This adds a couple of things: - A model loader service that is accessible via transport calls. This service will load in models and cache them. They will stay loaded until a processor no longer references them - A Model class and its first sub-class LocalModel. Used to cache model information and run inference. - Transport action and handler for requests to infer against a local model Related Feature PRs: * [ML][Inference] Adjust inference configuration option API (#47812) * [ML][Inference] adds logistic_regression output aggregator (#48075) * [ML][Inference] Adding read/del trained models (#47882) * [ML][Inference] Adding inference ingest processor (#47859) * [ML][Inference] fixing classification inference for ensemble (#48463) * [ML][Inference] Adding model memory estimations (#48323) * [ML][Inference] adding more options to inference processor (#48545) * [ML][Inference] handle string values better in feature extraction (#48584) * [ML][Inference] Adding _stats endpoint for inference (#48492) * [ML][Inference] add inference processors and trained models to usage (#47869) * [ML][Inference] add new flag for optionally including model definition (#48718) * [ML][Inference] adding license checks (#49056) * [ML][Inference] Adding memory and compute estimates to inference (#48955) * fixing version of indexed docs for model inference	2019-11-18 13:19:17 -05:00
Przemysław Witek	5f9965e4b8	Lower minimum model memory limit value from 1MB to 1kB. (#49227 ) (#49242 )	2019-11-18 14:58:20 +01:00
Dimitris Athanasiou	805c31e19e	[7.x][ML] Avoid NPE when node load is calculated on job assignment (#49186 ) (#49214 ) This commit fixes a NPE problem as reported in #49150. But this problem uncovered that we never added proper handling of state for data frame analytics tasks. In this commit we improve the `MlTasks.getDataFrameAnalyticsState` method to handle null tasks and state tasks properly. Closes #49150 Backport of #49186	2019-11-18 10:33:07 +02:00
Przemysław Witek	150db2b544	Throw an exception when memory usage estimation endpoint encounters empty data frame. (#49143 ) (#49164 )	2019-11-18 07:52:57 +01:00
Rory Hunter	c46a0e8708	Apply 2-space indent to all gradle scripts (#49071 ) Backport of #48849. Update `.editorconfig` to make the Java settings the default for all files, and then apply a 2-space indent to all `*.gradle` files. Then reformat all the files.	2019-11-14 11:01:23 +00:00
Przemysław Witek	e6ad3c29fd	Do not throw exceptions resulting from persisting datafeed timing stats. (#49044 ) (#49050 )	2019-11-13 20:23:13 +01:00
Christoph Büscher	6119f0aaa2	Fix Eclipse compilation in DataFrameDataExtractorTests (#48942 )	2019-11-11 16:17:55 +01:00
Dimitris Athanasiou	dfc6a13b44	[7.x][ML] Handle nested arrays in source fields (#48885 ) (#48889 ) Backport of #48885	2019-11-07 07:30:50 +02:00
David Roberts	c03f7ba74c	[TEST] Mute TimeoutCheckerTests.testWatchdog Due to https://github.com/elastic/elasticsearch/issues/48861	2019-11-05 11:49:46 +00:00
Dimitris Athanasiou	f2d4c94a9c	[7.x][ML] Deduplicate multi-fields for data frame analytics (#48799 ) (#48806 ) In the case multi-fields exist in the source index, we pick all variants of them in our extracted fields detection for data frame analytics. This means we may have multiple instances of the same feature. The worse consequence of this is when the dependent variable (for regression or classification) is also duplicated which means we train a model on the dependent variable itself. Now that #48770 is merged, this commit is adding logic to only select one variant of multi-fields. Closes #48756 Backport of #48799	2019-11-01 16:53:05 +02:00
Dimitris Athanasiou	1f662e0b12	[7.x][ML] Prevent fetching multi-field from source (#48770 ) (#48797 ) Aggregatable mutli-fields are at the moment wrongly mapped as normal doc_value fields and thus they support fetching from source. However, they do not exist in the source. This results to failure to extract such fields. This commit fixes this bug. While a fix could be worked out on top of the existing code, it is evident the extraction logic has become difficult to understand and maintain. As we also want to deduplicate multi-fields for data frame analytics, it seemed appropriate to refactor the code to simplify and better handle the extraction of multi-fields. Relates #48756 Backport of #48770	2019-11-01 14:18:03 +02:00
David Roberts	c3063c4e1f	[ML] Make the URL of the ML C++ Ivy repo configurable (#48702 ) At present the ML C++ artifact is always downloaded from S3. This change adds an option to configure the location. (The intention is to use a file:/// URL to pick up the artifact built in a Docker container in ml-cpp PR builds so that C++ changes that will break Java integration tests can be detected before the ml-cpp PRs are merged.) Relates elastic/ml-cpp#766	2019-10-31 09:21:44 +00:00
Dimitris Athanasiou	919596b2e8	[7.x][ML] Move field extraction logic to its own package (#48709 ) (#48712 ) Moves common field extraction logic to its own package so that it can be used both for anomaly detection and data frame analytics. In preparation for refactoring extraction fields to be simpler and to support multi-fields properly. Backport of #48709	2019-10-31 02:41:00 +02:00
Benjamin Trent	c9ead80c31	[7.x] [ML][Inference] separating definition and config object storage (#48651 ) (#48695 ) * [ML][Inference] separating definition and config object storage (#48651) This separates out the `definition` object from being stored within the configuration object in the index. This allows us to gather the config object without decompressing a potentially large definition. Additionally, `input` is moved to the TrainedModelConfig object and out of the definition. This is so the trained input fields are accessible outside the potentially large model definition.	2019-10-30 13:27:29 -04:00
Przemysław Witek	7c944d26c5	[7.x] Assert that the results of classification analysis can be evaluated using _evaluate API. (#48626 ) (#48634 )	2019-10-29 16:20:56 +01:00
Przemysław Witek	7e30277a37	Mute RegressionIT.testStopAndRestart (#48575 ) (#48576 )	2019-10-28 13:08:11 +01:00
Martijn van Groningen	b034153df7	Change grok watch dog to be Matcher based instead of thread based. (#48346 ) There is a watchdog in order to avoid long running (and expensive) grok expressions. Currently the watchdog is thread based, threads that run grok expressions are registered and after completion unregister. If these threads stay registered for too long then the watch dog interrupts these threads. Joni (the library that powers grok expressions) has a mechanism that checks whether the current thread is interrupted and if so abort the pattern matching. Newer versions have an additional method to abort long running pattern matching inside joni. Instead of checking the thread's interrupted flag, joni now also checks a volatile field that can be set via a `Matcher` instance. This is more efficient method for aborting long running matches. (joni checks each 30k iterations whether interrupted flag is set vs. just checking a volatile field) Recently we upgraded to a recent joni version (#47374), and this PR is a followup of that PR. This change should also fix #43673, since it appears when unit tests are ran the a test runner thread's interrupted flag may already have been set, due to some thread reuse.	2019-10-24 15:34:01 +02:00
Przemysław Witek	149537a165	Assert that inference model has been persisted (#48332 ) (#48453 )	2019-10-24 14:18:43 +02:00
Przemyslaw Gomulka	aaa6209be6	[7.x] [Java.time] Calculate week of a year with ISO rules BACKPORT(#48209 ) (#48349 ) Reverting the change introducing IsoLocal.ROOT and introducing IsoCalendarDataProvider that defaults start of the week to Monday and requires minimum 4 days in first week of a year. This extension is using java SPI mechanism and defaults for Locale.ROOT only. It require jvm property java.locale.providers to be set with SPI,COMPAT closes #41670 backport #48209	2019-10-23 17:39:38 +02:00
Przemysław Witek	60d8ecb2b7	Mute ClassificationIT tests (#48338 ) (#48339 )	2019-10-22 12:45:50 +02:00
Przemysław Witek	2db2b945ec	[7.x] Change format of MulticlassConfusionMatrix result to be more self-explanatory (#48174 ) (#48294 )	2019-10-21 22:07:19 +02:00
Benjamin Trent	abd1b5118f	[ML] fixing tests (#48084 ) (#48253 ) * [ML] fixing tests * unmuting tests * reverting outlier detection job changes	2019-10-21 09:21:06 -04:00
rsarawgi	5e4dd0fd2e	[ML] Removing usages of ToXContentParams.INCLUDE_TYPE (#48165 ) Removing the option of ToXContentParams.INCLUDE_TYPE and replacing them with ToXContentParams.FOR_INTERNAL_STORAGE Closes #48057	2019-10-18 14:49:26 +01:00
Przemysław Witek	28f68fa221	Make num_top_classes parameter's default value equal to 2 (#48119 ) (#48201 )	2019-10-17 18:43:15 +02:00
Dimitris Athanasiou	e0489fc328	[7.x][ML] Always refresh dest index before starting analytics process (#48090 ) (#48196 ) If a job stops right after reindexing is finished but before we refresh the destination index, we don't refresh at all. If the job is started again right after, it jumps into the analyzing state. However, the data is still not searchable. This is why we were seeing test failures that we start the process expecting X rows (where X is lower than the expected number of docs) and we end up getting X+. We fix this by moving the refresh of the dest index right before we start the process so it always ensures the data is searchable. Closes #47612 Backport of #48090	2019-10-17 17:20:19 +01:00
Benjamin Trent	ee110c2d42	[ML] Muting tests due to #48085 (#48086 ) (#48154 )	2019-10-16 15:46:50 -04:00
Benjamin Trent	0dddbb5b42	[ML] Parse and index inference model (#48016 ) (#48152 ) This adds parsing an inference model as a possible result of the analytics process. When we do parse such a model we persist a `TrainedModelConfig` into the inference index that contains additional metadata derived from the running job.	2019-10-16 15:46:20 -04:00
Przemysław Witek	8f815240b3	[7.x] Allow integer types for classification's dependent variable (#47902 ) (#48080 )	2019-10-16 11:09:56 +02:00
David Roberts	d9c7e3847e	[TEST] Don't assert order of data frame analytics audit messages (#48065 ) Audit messages are stored with millisecond timestamps. If two messages have the same millisecond timestamp then asserting on their order is impossible given the information available. This PR changes the assertion on audit messages in the native data frame analytics tests to assert that the expected audit messages exist in any order. Fixes #48035	2019-10-15 19:59:52 +01:00
Przemysław Witek	eaa56344b5	Verify that the failure reason of analytics process is empty (#48042 ) (#48071 )	2019-10-15 18:33:20 +02:00
Przemysław Witek	620bd9d224	Enable test testSingleNumericFeatureAndMixedTrainingAndNonTrainingRows_TopClassesRequested now that top classes are correctly reported by C++. (#48043 ) (#48053 )	2019-10-15 14:49:16 +02:00
David Roberts	984323783e	[ML][7.x] Add lazy assignment job config option (#47993 ) This change adds: - A new option, allow_lazy_open, to anomaly detection jobs - A new option, allow_lazy_start, to data frame analytics jobs Both work in the same way: they allow a job to be opened/started even if no ML node exists that can accommodate the job immediately. In this situation the job waits in the opening/starting state until ML node capacity is available. (The starting state for data frame analytics jobs is new in this change.) Additionally, the ML nightly maintenance tasks now creates audit warnings for ML jobs that are unassigned. This means that jobs that cannot be assigned to an ML node for a very long time will show a yellow warning triangle in the UI. A final change is that it is now possible to close a job that is not assigned to a node without using force. This is because previously jobs that were open but not assigned to a node were an aberration, whereas after this change they'll be relatively common.	2019-10-15 06:55:11 +01:00
David Roberts	1ca25bed38	[ML][7.x] Add option to stop datafeed that finds no data (#47995 ) Adds a new datafeed config option, max_empty_searches, that tells a datafeed that has never found any data to stop itself and close its associated job after a certain number of real-time searches have returned no data. Backport of #47922	2019-10-14 17:19:13 +01:00
David Roberts	46ae86ac31	[ML] Fix detection of syslog-like timestamp in find_file_structure (#47970 ) Usually syslog timestamps have two spaces before a single digit day-of-month. However, in some non-syslog cases where syslog-like timestamps are used there is only one space. The grok pattern supports this, so the timestamp parser should too. This change makes the find_file_structure endpoint do this. Also fixes another problem that the same test case exposed in the find_file_structure endpoint, which was that the exclude_lines_pattern for delimited files was always created on the assumption the delimiter was a comma. Now it is based on the actual delimiter.	2019-10-13 20:07:54 +01:00
Benjamin Trent	627faf1850	[7.x] [ML][Analytics] fix bug where regression deleted early does not delete state (#47885 ) (#47914 ) * [ML][Analytics] fix bug where regression deleted early does not delete state (#47885) * [ML][Analytics] fix bug where regression deleted early does not delete state * Fixing ml with security test failure * fixing for older java	2019-10-11 15:11:16 -04:00
Przemysław Witek	c62fe8c344	Require that the dependent variable column has at most 2 distinct values in classfication analysis. (#47858 ) (#47906 )	2019-10-11 14:57:08 +02:00
Igor Motov	b5afa95fd8	Fix Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart Tracked by #47612	2019-10-10 18:17:01 +04:00

... 3 4 5 6 7 ...

946 Commits