OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is beneficial because it will also work once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (it was also renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
David Kyle	ef5007b6d8	[ML] Remove unused last_data_time member from Job (#34262 )	2018-10-04 13:16:14 +01:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Lee Hinman	2d9cb21490	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-01 14:10:09 -06:00
Benjamin Trent	96be057195	Removing unused ML parameters (#34159 )	2018-10-01 08:09:46 -07:00
David Roberts	a1d2ded98d	[ML] Fix unit test deadlock problem (#34174 ) This change fixes a potential deadlock problem in the unit test introduced in #34117. It also removes a piece of debug code and corrects a docs formatting problem that were both added in that same PR.	2018-10-01 15:35:37 +01:00
Lee Hinman	6ea396a476	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-28 15:40:12 -06:00
David Roberts	f709c2f694	[ML] Add a timeout option to file structure finder (#34117 ) This can be used to restrict the amount of CPU a single structure finder request can use. The timeout is not implemented precisely, so requests may run for slightly longer than the timeout before aborting. The default is 25 seconds, which is a little below Kibana's default timeout of 30 seconds for calls to Elasticsearch APIs.	2018-09-28 17:32:35 +01:00
Lee Hinman	a26cc1a242	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-27 11:00:37 -06:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases where we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
Ed Savage	cc70352b3f	[ML] Modify thresholds for normalization triggers (#33663 ) The (arbitrary) threshold factors used to judge if scores have changed significantly enough to trigger a look-back renormalization have been changed to values that reduce the frequency of such renormalizations. Added a clause to treat changes in scores as a 'big change' if it would result in a change of severity reported in the UI. Also altered the clause affecting small scores so that a change should be considered big if scores have changed by at least 1.5. Relates https://github.com/elastic/machine-learning-qa/issues/263	2018-09-25 15:30:10 +01:00
David Roberts	dfe5af0411	[ML] Return both Joda and Java formats from structure finder (#33900 ) Previously the timestamp_formats field in the response from the find_file_structure endpoint contained Joda timestamp formats. This change makes that clear by renaming the field to joda_timestamp_formats, and also adds a java_timestamp_formats field containing the equivalent Java time format strings.	2018-09-25 12:52:51 +01:00
Benjamin Trent	74d7be805a	Make certain ML node settings dynamic (#33565 ) (#33961 ) * Make certain ML node settings dynamic (#33565) * Changing to pull in updating settings and pass to constructor * adding note about only newly opened jobs getting updated value	2018-09-24 12:54:32 -07:00
Lee Hinman	243e863f6e	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-24 10:33:51 -06:00
David Roberts	b89551c452	[ML] Display integers without .0 in file structure field stats (#33947 ) Previously numeric values in the field_stats created by the find_file_structure endpoint were always output with a decimal point. This looked unfriendly and unnatural for fields that clearly store integer values. This change converts integer values to type Integer before output in the file structure field stats.	2018-09-22 15:48:59 +01:00
Benjamin Trent	e17bd8e913	Removing poor randomization for node name (#33918 )	2018-09-21 04:49:20 -07:00
Christoph Büscher	b654d986d7	Add OneStatementPerLineCheck to Checkstyle rules (#33682 ) This change adds the OneStatementPerLineCheck to our checkstyle precommit checks. This rule restricts the number of statements per line to one. The reasoning behind this is that it is very difficult to read multiple statements on one line. People seem to mostly use it in short lambdas and switch statements in our code base, but just going through the changes already uncovered some actual problems in randomization in test code, so I think it's worth it.	2018-09-21 11:52:31 +02:00
Dimitris Athanasiou	8e3a0fad9d	[ML] Refactor job deletion logic into the transport action (#33891 ) The job deletion logic was scattered around a few places: the transport action, the job manager and the deletion task. Overloading the task with deletion logic also meant extra dependencies in the core package which should be unnecessary. This commit consolidates all this logic into the transport action and replaces the deletion task with a plain one that need not be aware of deletion logic.	2018-09-20 15:48:42 +01:00
Benjamin Trent	4767a016a5	Adding node_count to ML Usage (#33850 ) (#33863 )	2018-09-19 13:35:09 -07:00
Lee Hinman	81e9150c7a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-19 09:43:26 -06:00
Alan Woodward	5107949402	Allow TokenFilterFactories to rewrite themselves against their preceding chain (#33702 ) We currently special-case SynonymFilterFactory and SynonymGraphFilterFactory, which need to know their predecessors in the analysis chain in order to correctly analyze their synonym lists. This special-casing doesn't work with Referring filter factories, such as the Multiplexer or Conditional filters. We also have a number of filters (eg the Multiplexer) that will break synonyms when they appear before them in a chain, because they produce multiple tokens at the same position. This commit adds two methods to the TokenFilterFactory interface. * `getChainAwareTokenFilterFactory()` allows a filter factory to rewrite itself against its preceding filter chain, or to resolve references to other filters. It replaces `ReferringFilterFactory` and `CustomAnalyzerProvider.checkAndApplySynonymFilter`, and by default returns `this`. * `getSynonymFilter()` defines whether or not a filter should be applied when building a synonym list `Analyzer`. By default it returns `true`. Fixes #33609	2018-09-19 15:52:14 +01:00
Benjamin Trent	4190a9f1e9	Delete custom index if the only contained job is deleted (#33788 ) * Delete custom index if the only contained job is deleted	2018-09-19 07:42:26 -07:00
Lee Hinman	e6cbaa5a78	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-14 16:27:37 -06:00
David Roberts	568ac10ca6	[ML] Allow overrides for some file structure detection decisions (#33630 ) This change modifies the file structure detection functionality such that some of the decisions can be overridden with user supplied values. The fields that can be overridden are: - charset - format - has_header_row - column_names - delimiter - quote - should_trim_fields - grok_pattern - timestamp_field - timestamp_format If an override makes finding the file structure impossible then the endpoint will return an exception.	2018-09-14 09:29:11 +01:00
Benjamin Trent	7e51b960fb	Adding index refresh (#33647 )	2018-09-13 10:44:33 -07:00
Colin Goodheart-Smithe	8e59de3eb2	Merge branch 'master' into index-lifecycle	2018-09-13 09:46:14 +01:00
Jay Modi	20c6c9c542	Address license state update/read thread safety (#33396 ) This change addresses some issues regarding thread safety around updates and method calls on the XPackLicenseState object. There exists a possibility that there could be a concurrent update to the XPackLicenseState when there is a scheduled check to see if the license is expired and a cluster state update. In order to address this, the update method now has a synchronized block where member variables are updated. Each method that reads these variables is now also synchronized. Along with the above change, there was a consistency issue around security calls to the license state. The majority of security checks make two calls to the license state, which could result in incorrect behavior due to the checks being made against different license states. The majority of this behavior was introduced for 6.3 with the inclusion of x-pack in the default distribution. In order to resolve the majority of these cases, the `isSecurityEnabled` method is no longer public and the logic is also included in individual methods about security such as `isAuthAllowed`. There were a few cases where this did not remove multiple calls on the license state, so a new method has been added which creates a copy of the current license state that will not change. Callers can use this copy of the license state to make decisions based on a consistent view of the license state.	2018-09-12 13:08:09 -06:00
David Roberts	8e05ce567f	[ML] Rename input_fields to column_names in file structure (#33568 ) This change tightens up the meaning of the "input_fields" field in the file structure finder output. Previously it was permitted but not calculated for JSON and XML files. Following this change the field is called "column_names" and is only permitted for delimited files. Additionally the way the column names are set for headerless delimited files is refactored to encapsulate the way they're named to one line of the code rather than having the same logic in two places.	2018-09-11 08:46:26 +01:00
Colin Goodheart-Smithe	cdc4f57a77	Merge branch 'master' into index-lifecycle	2018-09-10 21:30:44 +01:00
Dimitris Athanasiou	fcb15b0ce3	[ML] Get job stats request should filter non-ML job tasks (#33516 ) When requesting job stats for `_all`, all ES tasks are accepted, resulting in loads of cluster traffic and a memory overhead. This commit correctly filters out non-ML job tasks. Closes #33515	2018-09-09 22:53:03 +01:00
Nhat Nguyen	94e4cb64c2	Bootstrap a new history_uuid when force allocating a stale primary (#33432 ) This commit ensures that we bootstrap a new history_uuid when force allocating a stale primary. A stale primary should never be the source of an operation-based recovery to another shard which exists before the forced-allocation. Closes #26712	2018-09-08 19:29:31 -04:00
David Roberts	e42cc5cd8c	[ML] Add a file structure determination endpoint (#33471 ) This endpoint accepts an arbitrary file in the request body and attempts to determine the structure. If successful it also proposes mappings that could be used when indexing the file's contents, and calculates simple statistics for each of the fields that are useful in the data preparation step prior to configuring machine learning jobs.	2018-09-07 17:41:57 +01:00
Colin Goodheart-Smithe	017ffe5d12	Merge branch 'master' into index-lifecycle	2018-09-07 10:59:10 +01:00
Jim Ferenczi	79cd6385fe	Collapse package structure for metrics aggs (#33463 ) This change collapses all metrics aggregations classes into a single package `org.elasticsearch.aggregations.metrics`. It also restricts the visibility of some classes (aggregators and factories) that should not be used outside of the package. Relates #22868	2018-09-07 10:58:06 +02:00
David Roberts	0849b98f60	[ML] Rename log structure to file structure (#33421 ) Many files supplied to the upcoming ML data preparation functionality will not be "log" files. For example, CSV files are generally not "log" files. Therefore it makes sense to rename the library that determines the structure of these files. Although "file structure" could be considered too broad, as the library currently only works with a few text formats, in the future it may be extended to work with more formats.	2018-09-06 09:13:08 +01:00
Tal Levy	b5f7fb6882	Merge branch 'master' into index-lifecycle	2018-09-05 12:56:58 -07:00
David Roberts	a296829205	[ML] Add field stats to log structure finder (#33351 ) The log structure endpoint will return these in addition to pure structure information so that it can be used to drive pre-import data visualizer functionality. The statistics for every field are count, cardinality (distinct count) and top hits (most common values). Extra statistics are calculated if the field is numeric: min, max, mean and median.	2018-09-05 12:57:20 +01:00
Colin Goodheart-Smithe	f00a28a909	Merge branch 'master' into index-lifecycle	2018-09-05 09:48:48 +01:00
Nik Everett	ebd5eb6dc2	ML: Fix build after HLRC change I recently merged a HLRC change that passed the PR builds but didn't compile after merging. Sad time. This fixes the compilation.	2018-09-04 11:10:44 -04:00
Sohaib Iftikhar	761e8c461f	HLRC: Add delete by query API (#32782 ) Adds the delete-by-query API to the High Level REST Client.	2018-09-04 08:56:26 -04:00
Dimitris Athanasiou	1457b07a06	[ML] The sort field on get records should default to the record_score (#33358 ) This is not changing the behaviour as when the sort field was set to `influencer_score` the secondary sort would be used and that was using the `record_score` at the highest priority.	2018-09-04 11:38:24 +01:00
David Roberts	84eaac79d7	[ML] Minor improvements to categorization Grok pattern creation (#33353 ) 1. The TOMCAT_DATESTAMP format needs to be checked before TIMESTAMP_ISO8601, otherwise TIMESTAMP_ISO8601 will match the start of the Tomcat datestamp. 2. Exclude more characters before and after numbers. For example, in 1.2.3 we don't want to match 1.2 as a float.	2018-09-04 09:43:49 +01:00
Alpar Torok	7f7e8fd733	Disable assemble task instead of removing it (#33348 )	2018-09-04 07:32:14 +03:00
Benjamin Trent	767d8e0801	[ML] Delete forecast API (#31134 ) (#33218 ) * Delete forecast API (#31134)	2018-09-03 19:06:18 -05:00
Colin Goodheart-Smithe	e2c1beb1be	Merge branch 'master' into index-lifecycle	2018-09-03 10:01:16 +01:00
Nhat Nguyen	b93507608a	Merge branch 'master' into ccr * master: Mute test watcher usage stats output [Rollup] Fix FullClusterRestart test Adjust soft-deletes version after backport into 6.5 completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194) Fix AwaitsFix issue number Mute SmokeTestWatcherWithSecurityIT testsi drop `index.shard.check_on_startup: fix` (#32279) tracked at [DOCS] Moves ml folder from x-pack/docs to docs (#33248) [DOCS] Move rollup APIs to docs (#31450) [DOCS] Rename X-Pack Commands section (#33005) TEST: Disable soft-deletes in ParentChildTestCase Fixes SecurityIntegTestCase so it always adds at least one alias (#33296) Fix pom for build-tools (#33300) Lazy evaluate java9home (#33301) SQL: test coverage for JdbcResultSet (#32813) Work around to be able to generate eclipse projects (#33295) Highlight that index_phrases only works if no slop is used (#33303) Different handling for security specific errors in the CLI. Fix for https://github.com/elastic/elasticsearch/issues/33230 (#33255) [ML] Refactor delimited file structure detection (#33233) SQL: Support multi-index format as table identifier (#33278) MINOR: Remove Dead Code from PathTrie (#33280) Enable forbiddenapis server java9 (#33245)	2018-08-31 19:03:04 -04:00
Colin Goodheart-Smithe	3eef74d5d5	Merge branch 'master' into index-lifecycle	2018-08-31 14:45:22 +01:00
David Roberts	7345878d33	[ML] Refactor delimited file structure detection (#33233 ) 1. Use the term "delimited" rather than "separated values" 2. Use a single factory class with arguments to specify the delimiter and identification constraints This change makes it easier to add support for other delimiter characters.	2018-08-31 08:48:45 +01:00
Nhat Nguyen	5632e31c74	Merge branch 'master' into ccr * master: Painless: Add Bindings (#33042) Update version after client credentials backport Fix forbidden apis on FIPS (#33202) Remote 6.x transport BWC Layer for `_shrink` (#33236) Test fix - Graph HLRC tests needed another field adding to randomisation exception list HLRC: Add ML Get Records API (#33085) [ML] Fix character set finder bug with unencodable charsets (#33234) TESTS: Fix overly long lines (#33240) Test fix - Graph HLRC test was missing field name to be excluded from randomisation logic Remove unsupported group_shard_failures parameter (#33208) Update BucketUtils#suggestShardSideQueueSize signature (#33210) Parse PEM Key files leniantly (#33173) INGEST: Add Pipeline Processor (#32473) Core: Add java time xcontent serializers (#33120) Consider multi release jars when running third party audit (#33206) Update MSI documentation (#31950) HLRC: create base timed request class (#33216) [DOCS] Fixes command page titles HLRC: Move ML protocol classes into client ml package (#33203) Scroll queries asking for rescore are considered invalid (#32918) Painless: Fix Semicolon Regression (#33212) ingest: minor - update test to include dissect (#33211) Switch remaining LLREST usage to new style Requests (#33171) HLREST: add reindex API (#32679)	2018-08-29 12:30:24 -04:00
Gordon Brown	454ce99b01	Merge branch 'master' into index-lifecycle	2018-08-29 08:28:23 -06:00
David Roberts	22415fa2de	[ML] Fix character set finder bug with unencodable charsets (#33234 ) Some character sets cannot be encoded and this was tripping up the binary data check in the ML log structure character set finder. The fix is to assume that if ICU4J identifies that some bytes correspond to a character set that cannot be encoded and those bytes contain zeroes then the data is binary rather than text. Fixes #33227	2018-08-29 14:56:02 +01:00
Tal Levy	5783545222	Merge branch 'master' into index-lifecycle	2018-08-27 08:19:05 -07:00
Nhat Nguyen	75304f405b	Merge branch 'master' into ccr * master: Add proxy support to RemoteClusterConnection (#33062) TEST: Skip assertSeqNos for closed shards (#33130) TEST: resync operation on replica should acquire shard permit (#33103) Switch remaining x-pack tests to new style Requests (#33108) Switch remaining tests to new style Requests (#33109) Switch remaining ml tests to new style Requests (#33107) Build: Line up IDE detection logic Security index expands to a single replica (#33131) HLRC: request/response homogeneity and JavaDoc improvements (#33133) Checkstyle! [Test] Fix sporadic failure in MembershipActionTests Revert "Do NOT allow termvectors on nested fields (#32728)" [Rollup] Move toAggCap() methods out of rollup config objects (#32583) Fix race condition in scheduler engine test	2018-08-25 21:41:53 -04:00
Nik Everett	8bee6b3a92	Switch remaining ml tests to new style Requests (#33107 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `x-pack/plugin/ml/qa/native-multi-node-tests`, `x-pack/plugin/ml/qa/single-node-tests` projects to use the new versions.	2018-08-24 16:36:40 -04:00
Tal Levy	74312be0ea	Merge branch 'master' into index-lifecycle	2018-08-24 12:41:12 -07:00
Jason Tedor	91a052b617	Merge branch 'master' into ccr * master: Add hook to skip asserting x-content equivalence (#33114) Muted testListenersThrowingExceptionsDoNotCauseOtherListenersToBeSkipped [Rollup] Move getMetadata() methods out of rollup config objects (#32579) Muted testEmptyAuthorizedIndicesSearchForAllDisallowNoIndices Update Google Cloud Storage Library for Java (#32940) Remove unsupported Version.V_5_* (#32937)	2018-08-24 06:55:10 -04:00
Jim Ferenczi	f4e9729d64	Remove unsupported Version.V_5_* (#32937 ) This change removes the es 5x version constants and their usages.	2018-08-24 09:51:21 +02:00
Martijn van Groningen	82592dda5a	Merge remote-tracking branch 'es/master' into ccr * es/master: (62 commits) [DOCS] Add docs for Application Privileges (#32635) Add versions 5.6.12 and 6.4.1 Do NOT allow termvectors on nested fields (#32728) [Rollup] Return empty response when aggs are missing (#32796) [TEST] Add some ACL yaml tests for Rollup (#33035) Move non duplicated actions back into xpack core (#32952) Test fix - GraphExploreResponseTests should not randomise array elements Closes #33086 Use `addIfAbsent` instead of checking if an element is contained TESTS: Fix Random Fail in MockTcpTransportTests (#33061) HLRC: Fix Compile Error From Missing Throws (#33083) [DOCS] Remove reload password from docs cf. #32889 HLRC: Add ML Get Buckets API (#33056) Watcher: Improve error messages for CronEvalTool (#32800) Search: Support of wildcard on docvalue_fields (#32980) Change query field expansion (#33020) INGEST: Cleanup Redundant Put Method (#33034) SQL: skip uppercasing/lowercasing function tests for AZ locales as well (#32910) Fix the default pom file name (#33063) Switch ml basic tests to new style Requests (#32483) Switch some watcher tests to new style Requests (#33044) ...	2018-08-24 12:22:11 +07:00
Gordon Brown	1f13c77b49	Merge branch 'master' into index-lifecycle	2018-08-23 11:52:59 -06:00
Nik Everett	0cc99d270c	Switch ml basic tests to new style Requests (#32483 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `x-pack/qa/ml-basic-multi-node` project to use the new versions.	2018-08-22 14:23:43 -04:00
Alpar Torok	82d10b484a	Run forbidden api checks with runtimeJavaVersion (#32947 ) Run forbidden APIs checks with runtime Java version	2018-08-22 09:05:22 +03:00
Nik Everett	2c81d7f77e	Build: Rework shadow plugin configuration (#32409 ) This reworks how we configure the `shadow` plugin in the build. The major change is that we no longer bundle dependencies in the `compile` configuration, instead we bundle dependencies in the new `bundle` configuration. This feels more right because it is a little more "opt in" rather than "opt out" and the name of the `bundle` configuration is a little more obvious. As a neat side effect of this, the `runtimeElements` configuration used when one project depends on another now contains exactly the dependencies needed to run the project so you no longer need to reference projects that use the shadow plugin like this: ``` testCompile project(path: ':client:rest-high-level', configuration: 'shadow') ``` You can instead use the much more normal: ``` testCompile "org.elasticsearch.client:elasticsearch-rest-high-level-client:${version}" ```	2018-08-21 20:03:28 -04:00
Nik Everett	fcf8cadd9a	Switch some x-pack tests to new style Requests (#32500 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `x-pack/qa/audit-tests`, `x-pack/qa/ml-disabled`, and `x-pack/qa/multi-node` projects to use the new versions.	2018-08-21 14:48:53 -04:00
Jason Tedor	28d12b05b7	Move ML tests to be sub-projects of ML (#33026 ) This commit moves the ML QA tests to be a sub-project of ML. The purpose of this refactoring is to enable ML developers to run :x-pack:plugin:ml:check and run the vast majority of the ML tests with a single command (this still does not contain the ML REST tests, nor the upgrade tests). This simplifies local development for faster iteration.	2018-08-21 12:23:21 -04:00
Benjamin Trent	3f91bbfa6b	[ML] Allowing _close to accept body payloads for options (#32989 ) (#33000 )	2018-08-21 08:08:26 -05:00
Jason Tedor	b08d02e3b7	Implement CCR licensing (#33002 ) This commit implements licensing for CCR. CCR will require a platinum license, and administrative endpoints will be disabled when a license is non-compliant.	2018-08-20 23:33:18 -04:00
Jason Tedor	9050c7e846	Generalize remote license checker (#32971 ) Machine learning has baked a remote license checker for use in checking license compatibility of a remote license. This remote license checker has general usage for any feature that relies on a remote cluster. For example, cross-cluster replication will pull changes from a remote cluster and require that the local and remote clusters have platinum licenses. This commit generalizes the remote cluster license check for use in cross-cluster replication.	2018-08-20 15:33:29 -04:00
Alpar Torok	4b34b3f4aa	Set forbidden APIs target compatibility to compiler java version (#32935 ) Set forbidden apis target compatibility to compiler version Fix outstanding deprecation	2018-08-20 09:27:02 +03:00
Benjamin Trent	9cec4aa14b	[ML] fix updating opened jobs scheduled events (#31651 ) (#32881 ) * ML: fix updating opened jobs scheduled events (#31651) * Adding UpdateParamsTests license header * Adding integration test and addressing PR comments * addressing test and job names	2018-08-17 07:21:17 -05:00
David Roberts	5ba04e23fc	[ML] Add log structure finder functionality (#32788 ) This change adds a library to ML that can be used to deduce a log file's structure given only a sample of the log file. Eventually this will be used to add an endpoint to ML to make the functionality available to end users, but this will follow in a separate change. The functionality is split into a library so that it can also be used by a command line tool without requiring the command line tool to include all server code.	2018-08-15 18:04:21 +01:00
Lee Hinman	48281ac5bc	Use generic AcknowledgedResponse instead of extended classes (#32859 ) This removes custom Response classes that extend `AcknowledgedResponse` and do nothing, these classes are not needed and we can directly use the non-abstract super-class instead. While this appears to be a large PR, no code has actually changed, only class names have been changed and entire classes removed.	2018-08-15 08:06:14 -06:00
Ed Savage	8ce1ab3ed9	[ML] Removing old per-partition normalization code (#32816 ) Per-partition normalization is an old, undocumented feature that was never used by clients. It has been superseded by per-partition maximum scoring. To maintain communication compatibility with nodes prior to 6.5 it is necessary to maintain/cope with the old wire format.	2018-08-15 13:13:32 +01:00
Ed Savage	d147cd72cc	[ML] Partition-wise maximum scores (#32748 ) Added infrastructure to push through the 'person name field value' to the normalizer process. This is required by the normalizer to retrieve the maximum scores for individual partitions.	2018-08-13 10:31:17 +01:00
Benjamin Trent	b08416b899	Clear Job#finished_time when it is opened (#32605 ) (#32755 ) * Clear Job#finished_time when it is opened (#32605) * not returning failure when Job#finished_time is not reset * Changing error log string and source string	2018-08-10 13:52:00 -05:00
Dimitris Athanasiou	c7b1ba33aa	[ML] Refactor ProcessCtrl into Autodetect and Normalizer builders (#32720 ) This moves the helper functionality for creating the autodetect and normalizer processes into corresponding builders.	2018-08-10 17:28:20 +01:00
David Roberts	ae0c303dad	Move icu4j and super-csv version numbers to versions file (#32769 ) The upcoming ML log structure finder functionality will use these libraries, and it makes sense to use the same versions that are being used elsewhere in Elasticsearch. This is especially true with icu4j, which is pretty big.	2018-08-10 12:19:06 +01:00
Nicholas Knize	e162127ff3	Upgrade to Lucene-7.5.0-snapshot-13b9e28f9d The main feature is the inclusion of bkd backed geo_shape with INTERSECT, DISJOINT, WITHIN bounding box and polygon query support.	2018-08-09 11:15:02 -05:00
Dimitris Athanasiou	f30bb0ebf8	[ML] Remove multiple_bucket_spans (#32496 ) This commit removes the never released multiple_bucket_spans configuration parameter. This is now replaced with the new multibucket feature that requires no configuration.	2018-08-02 11:25:56 +01:00
David Kyle	15679315e3	[ML] Rename JobProvider to JobResultsProvider (#32551 )	2018-08-02 09:53:47 +01:00
Benjamin Trent	9fb790dcc3	[ML] Fix thread leak when waiting for job flush (#32196 ) (#32541 )	2018-08-01 10:38:04 -05:00
Armin Braun	4b199dde8d	NETWORKING: Fix Netty Leaks by upgrading to 4.1.28 (#32511 ) * Upgrade to `4.1.28` since the problem reported in #32487 is a bug in Netty itself (see https://github.com/netty/netty/issues/7337) * Fixed other leaks in test code that now showed up due to improvements in leak reporting in the newer version * Needed to extend permissions for netty common package because it now sets a classloader at runtime after changes in `63bae0956a` * Adjusted forbidden APIs check accordingly * Closes #32487	2018-08-01 02:34:58 +02:00
David Roberts	0afa265ac9	[ML] Consistent pattern for strict/lenient parser names (#32399 ) Previously we had two patterns for naming of strict and lenient parsers. Some classes had CONFIG_PARSER and METADATA_PARSER, and used an enum to pass the parser type to nested parsers. Other classes had STRICT_PARSER and LENIENT_PARSER and used ternary operators to pass the parser type to nested parsers. This change makes all ML classes use the second of the patterns described above.	2018-07-26 16:55:40 +01:00
Christoph Büscher	35ae87125d	Remove some dead code (#31993 ) Removing some dead code or suppressing warnings where appropriate. Most of the time the variable tested for null is dereferenced earlier or never used before.	2018-07-26 17:12:51 +02:00
Tim Vernum	387c3c7f1d	Introduce Application Privileges with support for Kibana RBAC (#32309 ) This commit introduces "Application Privileges" to the X-Pack security model. Application Privileges are managed within Elasticsearch, and can be tested with the _has_privileges API, but do not grant access to any actions or resources within Elasticsearch. Their purpose is to allow applications outside of Elasticsearch to represent and store their own privileges model within Elasticsearch roles. Access to manage application privileges is handled in a new way that grants permission to specific application names only. This lays the foundation for more OLS on cluster privileges, which is implemented by allowing a cluster permission to inspect not just the action being executed, but also the request to which the action is applied. To support this, a "conditional cluster privilege" is introduced, which is like the existing cluster privilege, except that it has a Predicate over the request as well as over the action name. Specifically, this adds - GET/PUT/DELETE actions for defining application level privileges - application privileges in role definitions - application privileges in the has_privileges API - changes to the cluster permission class to support checking of request objects - a new "global" element on role definition to provide cluster object level security (only for manage application privileges) - changes to `kibana_user`, `kibana_dashboard_only_user` and `kibana_system` roles to use and manage application privileges Closes #29820 Closes #31559	2018-07-24 10:34:46 -06:00
Nik Everett	e6b9f59e4e	Build: Shadow x-pack:protocol into x-pack:plugin:core (#32240 ) This bundles the x-pack:protocol project into the x-pack:plugin:core project because we'd like folks to consider it an implementation detail of our build rather than a separate artifact to be managed and depended on. It is now bundled into both x-pack:plugin:core and client:rest-high-level. To make this work I had to fix a few things. Firstly, I had to make PluginBuildPlugin work with the shadow plugin. In that case we have to bundle only the `shadow` dependencies and the shadow jar. Secondly, every reference to x-pack:plugin:core has to use the `shadow` configuration. Without that the reference is missing all of the un-shadowed dependencies. I tried to make it so that applying the shadow plugin automatically redefines the `default` configuration to mirror the `shadow` configuration which would allow us to use bare project references to the x-pack:plugin:core project but I couldn't make it work. It'd look like it works but then fail for transitive dependencies anyway. I think it is still a good thing to do but I don't have the willpower to do it now. Finally, I had to fix an issue where Eclipse and IntelliJ didn't properly reference shadowed transitive dependencies. Neither IDE supports shadowing natively so they have to reference the shadowed projects. We fix this by detecting `shadow` dependencies when in "Intellij mode" or "Eclipse mode" and adding `runtime` dependencies to the same target. This convinces IntelliJ and Eclipse to play nice.	2018-07-24 11:53:04 -04:00
David Kyle	99426eb4f8	[ML] Extract persistent task methods from MlMetadata (#32319 ) Move ML persistent task helper functions to the new class MlTasks and remove MLMetadataField after moving the string constant to MlMetadata.	2018-07-24 15:22:57 +01:00
Christoph Büscher	ff87b7aba4	Remove unnecessary warning suppressions (#32250 )	2018-07-23 11:31:04 +02:00
David Kyle	ac960bfa6b	[ML] Use default request durability for .ml-state index (#32233 ) The initial decision to use async durability was made a long time ago for performance reasons. That argument no longer applies and we prefer the safety of request durability.	2018-07-20 15:49:37 +01:00
Tim Vernum	c32981db6b	Detect old trial licenses and mimic behaviour (#32209 ) Prior to 6.3 a trial license defaulted to security enabled. Since 6.3 they default to security disabled. If a cluster is upgraded from <6.3 to >6.3, then we detect this and mimic the old behaviour with respect to security.	2018-07-20 10:09:28 +10:00
David Roberts	99c2a82c04	[ML] Move analyzer dependencies out of categorization config (#32123 ) The ML config classes will shortly be moved to the X-Pack protocol library to allow the ML APIs to be moved to the high level REST client. Dependencies on server functionality should be removed from the config classes before this is done. This change is entirely about moving code between packages. It does not add or remove any functionality or tests.	2018-07-17 15:01:12 +01:00
Armin Braun	ed3b44fb4c	Handle TokenizerFactory TODOs (#32063 ) * Don't replace TokenizerFactory with Supplier; this approach was rejected in #32063 * Remove unused parameter from constructor	2018-07-17 14:14:02 +02:00
David Roberts	d2461643cd	[ML] Move open job failure explanation out of root cause (#31925 ) When an ML job cannot be allocated to a node the exception contained an explanation of why the job couldn't be allocated to each node in the cluster. For large clusters this was not particularly easy to read and made the error displayed in the UI look very scary. This commit changes the structure of the error to an outer ElasticsearchException with a high level message and an inner IllegalStateException containing the detailed explanation. Because the definition of root cause is the innermost ElasticsearchException the detailed explanation will not be the root cause (which is what Kibana displays). Fixes #29950	2018-07-13 08:57:33 +01:00
Nik Everett	dcbb1154bf	HLRest: Move xPackInfo() to xPack().info() (#31905 ) Originally I put the X-Pack info object into the top level rest client object. I did that because we thought we'd like to squash `xpack` from the name of the X-Pack APIs now that it is part of the default distribution. We still kind of want to do that, but at least for now we feel like it is better to keep the high level rest client aligned with the other language clients like C# and Python. This shifts the X-Pack info API to align with its json spec file. Relates to #31870	2018-07-10 13:01:28 -04:00
Nik Everett	fb27f3e7f0	HLREST: Add x-pack-info API (#31870 ) This is the first x-pack API we're adding to the high level REST client so there is a lot to talk about here! = Open source The client for these APIs is open source. We're taking the previously Elastic licensed files used for the `Request` and `Response` objects and relicensing them under the Apache 2 license. The implementation of these features is staying under the Elastic license. This lines up with how the rest of the Elasticsearch language clients work. = Location of the new files We're moving all of the `Request` and `Response` objects that we're relicensing to the `x-pack/protocol` directory. We're adding a copy of the Apache 2 license to the root of the `x-pack/protocol` directory to line up with the language in the root `LICENSE.txt` file. All files in this directory will have the Apache 2 license header as well. We don't want there to be any confusion. Even though the files are under the `x-pack` directory, they are Apache 2 licensed. We chose this particular directory layout because it keeps the X-Pack stuff together and easier to think about. = Location of the API in the REST client We've been following the layout of the rest-api-spec files for other APIs and we plan to do this for the X-Pack APIs with one exception: we're dropping the `xpack` from the name of most of the APIs. So `xpack.graph.explore` will become `graph().explore()` and `xpack.license.get` will become `license().get()`. `xpack.info` and `xpack.usage` are special here though because they don't belong to any proper category. For now I'm just calling `xpack.info` `xPackInfo()` and intend to call usage `xPackUsage` though I'm not convinced that this is the final name for them. But it does get us started. = Jars, jars everywhere! This change makes the `xpack:protocol` project a `compile` scoped dependency of the `x-pack:plugin:core` and `client:rest-high-level` projects. I intend to keep it a compile scoped dependency of `x-pack:plugin:core` but I intend to bundle the contents of the protocol jar into the `client:rest-high-level` jar in a follow up. This change has grown large enough at this point. In that followup I'll address javadoc issues as well. = Breaking-Java This breaks the transport client by moving a few classes around. We've traditionally been ok with doing this to the transport client.	2018-07-08 11:03:56 -04:00
Dimitris Athanasiou	49ba271bd8	[ML] Fix master node deadlock during ML daily maintenance (#31836 ) This is the implementation for master and 6.x of #31691. Native tests are changed to use multi-node clusters in #31757. Relates #31683	2018-07-07 09:43:28 +01:00
Christoph Büscher	bd1c513422	Reduce more raw types warnings (#31780 ) Similar to #31523.	2018-07-05 15:38:06 +02:00
David Roberts	92de94c237	[ML] Don't treat stale FAILED jobs as OPENING in job allocation (#31800 ) Job persistent tasks with stale allocation IDs used to always be considered as OPENING jobs in the ML job node allocation decision. However, FAILED jobs are not relocated to other nodes, which leads to them blocking up the nodes they failed on after node restarts. FAILED jobs should not restrict how many other jobs can open on a node, regardless of whether they are stale or not. Closes #31794	2018-07-05 13:26:17 +01:00
Dimitris Athanasiou	9c11bf1e12	[ML] Fix calendar and filter updates from non-master nodes (#31804 ) Job updates or changes to calendars or filters may result in updating the job process if it has been running. To preserve the order of updates, process updates are queued through the UpdateJobProcessNotifier which is only running on the master node. All actions performing such updates must run on the master node. However, the CRUD actions for calendars and filters are not master node actions. They have been submitting the updates to the UpdateJobProcessNotifier even though it might not have been running (given the action was run on a non-master node). When that happens, the update never reaches the process. This commit fixes this problem by ensuring the notifier runs on all nodes and by ensuring the process update action gets the resources again before updating the process (instead of having those resources passed in the request). This ensures that even if the order of the updates gets messed up, the latest update will read the latest state of those resources and the process will get back in sync. This leaves us with 2 types of updates: 1. updates to the job config should happen on the master node. This is because we cannot refetch the entire job and update it. We need to know the parts that have been changed. 2. updates to resources the job uses. Those can be handled on non-master nodes but they should be re-fetched by the update process action. Closes #31803	2018-07-05 13:14:12 +01:00
David Roberts	308e37f80e	[ML] Rate limit established model memory updates (#31768 ) There is at most one model size stats document per bucket, but during lookback a job can churn through many buckets very quickly. This can lead to many cluster state updates if established model memory needs to be updated for a given model size stats document. This change rate limits established model memory updates to one per job per 5 seconds. This is done by scheduling the updates 5 seconds in the future, but replacing the value to be written if another model size stats document is received during the waiting period. Updating the values in arrears like this means that the last value received will be the one associated with the job in the long term, whereas alternative approaches such as not updating the value if a new value was close to the old value would not.	2018-07-04 13:56:32 +01:00
Hendrik Muhs	e9f8442bee	[ML] Return statistics about forecasts as part of the jobsstats and usage API (#31647 ) This change adds stats about forecasts, to the jobstats api as well as xpack/_usage. The following information is collected: _xpack/ml/anomaly_detectors/{jobid|_all}/_stats: - total number of forecasts - memory statistics (mean/min/max) - runtime statistics - record statistics - counts by status _xpack/usage - collected by job status as well as overall (_all): - total number of forecasts - number of jobs that have at least 1 forecast - memory, runtime, record statistics - counts by status Fixes #31395	2018-07-04 08:15:45 +02:00

192 Commits