OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tim Vernum	1577a0e617	Validate field permissions when creating a role (#50917 ) When creating a role, we do not check if the exceptions for the field permissions are a subset of granted fields. If such a role is assigned to a user then that user's authentication fails for this reason. We added a check to validate role query in #46275 and on the same lines, this commit adds check if the exceptions for the field permissions is a subset of granted fields when parsing the index privileges from the role descriptor. Backport of: #50212 Co-authored-by: Yogesh Gaikwad <bizybot@users.noreply.github.com>	2020-01-14 12:37:45 +11:00
Tim Vernum	c2acb8830a	Add max_resource_units to enterprise license (#50910 ) The enterprise license type must have "max_resource_units" and may not have "max_nodes". This change adds support for this new field, validation that the field is present if-and-only-if the license is enterprise and bumps the license version number to reflect the new field. Includes a BWC layer to return "max_nodes: ${max_resource_units}" in the GET license API. Backport of: #50735	2020-01-14 12:37:05 +11:00
Przemko Robakowski	a18736b46d	[7.x] ILM action to wait for SLM policy execution (#50454 ) (#50943 ) * ILM action to wait for SLM policy execution (#50454) This change add new ILM action to wait for SLM policy execution to ensure that index has snapshot before deletion. Closes #45067 * Fix flaky TimeSeriesLifecycleActionsIT#testWaitForSnapshot test This change adds some randomness and cleanup step to TimeSeriesLifecycleActionsIT#testWaitForSnapshot and testWaitForSnapshotSlmExecutedBefore tests in attempt to make them stable. Reletes to #50781 * Formatting changes * Longer timeout * Fix Map.of in Java8 * Unused import removed	2020-01-14 01:34:33 +01:00
Lee Hinman	91689e793d	[7.x] Refresh cached phase policy definition if possible on ne… (#50941 ) * Refresh cached phase policy definition if possible on new policy There are some cases when updating a policy does not change the structure in a significant way. In these cases, we can reread the policy definition for any indices using the updated policy. This commit adds this refreshing to the `TransportPutLifecycleAction` to allow this. It allows us to do things like change the configuration values for a particular step, even when on that step (for example, changing the rollover criteria while on the `check-rollover-ready` step). There are more cases where the phase definition can be reread that just the ones checked here (for example, removing an action that has already been passed), and those will be added in subsequent work. Relates to #48431	2020-01-13 14:31:41 -07:00
Bogdan Pintea	f04b4cbee8	SQL: Optimisation fixes for conjunction merges (#50703 ) (#50933 ) * SQL: Optimisation fixes for conjunction merges This commit fixes the following issues around the way comparisions are merged with ranges in conjunctions: * the decision to include the equality of the lower limit is corrected; * the selection of the upper limit is corrected to use the upper bound of the range; * the list of terms in the conjunction is sorted to have the ranges at the bottom; this allows subsequent binary comarisions to find compatible ranges and potentially be merged away. The end guarantee being that the optimisation takes place irrespective of the order of the conjunction terms in the statement. Some comments are also corrected. * adress review observation on anon. comparator Replace anonymous comparator of split AND Expressions with a lambda. (cherry picked from commit 9828cb143a41f1bda1219541f3a8fdc03bf6dd14)	2020-01-13 21:51:29 +01:00
Ioannis Kakavas	ba37e3c4a0	Disable DiagnosticTrustManager in FIPS 140 (#49888 ) This commit changes the default behavior for xpack.security.ssl.diagnose.trust when running in a FIPS 140 JVM. More specifically, when xpack.security.fips_mode.enabled is true: - If xpack.security.ssl.diagnose.trust is not explicitly set, the default value of it becomes false and a log message is printed on info level, notifying of the fact that the TLS/SSL diagnostic messages are not enabled when in a FIPS 140 JVM. - If xpack.security.ssl.diagnose.trust is explicitly set, the value of it is honored, even in FIPS mode. This is relevant only for 7.x where we support Java 8 in which SunJSSE can still be used as a FIPS 140 provider for TLS. SunJSSE in FIPS mode, disallows the use of other TrustManager implementations than the one shipped with SunJSSE.	2020-01-13 17:04:23 +02:00
Larry Gregory	cc8aafcfc2	[7.x] - Adding GET/PUT ILM cluster privileges to `kibana_syste… (#50878 ) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-01-13 08:36:48 -05:00
Benjamin Trent	eb8fd44836	[ML][Inference] minor fixes for created_by, and action permission (#50890 ) (#50911 ) The system created and models we provide now use the `_xpack` user for uniformity with our other features The `PUT` action is now an admin cluster action And XPackClient class now references the action instance.	2020-01-13 07:59:31 -05:00
Albert Zaharovits	4e837599b3	Nit fix test randomInt bound Relates `2b789fa3e6`	2020-01-13 13:28:20 +02:00
Albert Zaharovits	2b789fa3e6	Make .async-search-* a restricted namespace (#50294 ) Hide the `.async-search-*` in Security by making it a restricted index namespace. The namespace is hard-coded. To grant privileges on restricted indices, one must explicitly toggle the `allow_restricted_indices` flag in the indices permission in the role definition. As is the case with any other index, if a certain user lacks all permissions for an index, that index is effectively nonexistent for that user.	2020-01-13 12:20:54 +02:00
Tim Vernum	985c95dcca	Populate OpenIDConnect metadata collections (#50893 ) The OpenIdConnectRealm had a bug which would cause it not to populate User metadata for collections contained in the user JWT claims. This commit fixes that bug. Backport of: #50521	2020-01-13 18:02:22 +11:00
Benjamin Trent	fa116a6d26	[7.x] [ML][Inference] PUT API (#50852 ) (#50887 ) * [ML][Inference] PUT API (#50852) This adds the `PUT` API for creating trained models that support our format. This includes * HLRC change for the API * API creation * Validations of model format and call * fixing backport	2020-01-12 10:59:11 -05:00
Lee Hinman	63472d30c7	[7.x] Fix SLM check for restore in progress (#50868 ) (#50876 ) * Fix SLM check for restore in progress (#50868) * Fix SLM check for restore in progress This commit fixes the check in SLM where the `RestoreInProgress` metadata was checked for existence. Rather than check existence we should instead check the `isEmpty` method. Prior to this, a successful restore for a repository that used SLM retention would prevent SLM retention from running in subsequent invocations, due to SLM thinking that a restore was still running. * Fix 7.x-isms	2020-01-10 14:27:55 -07:00
Julie Tibshirani	3bac1dc414	Adjust the skip version in flattened field telemetry tests. We forgot to adjust the version when backporting the commit to 7.x.	2020-01-10 10:36:41 -08:00
Benjamin Trent	5afa0b71e9	[ML][Inference] Unify top_classes object field names with analytics (#50858 ) (#50861 )	2020-01-10 12:00:37 -05:00
Dimitris Athanasiou	422422a2bc	[7.x][ML] Reuse SourceDestValidator for data frame analytics (#50841 ) (#50850 ) This commit removes validation logic of source and dest indices for data frame analytics and replaces it with using the common `SourceDestValidator` class which is already used by transforms. This way the validations and their messages become consistent while we reduce code. This means that where these validations fail the error messages will be slightly different for data frame analytics. Backport of #50841	2020-01-10 14:24:13 +02:00
Nik Everett	ae40e22452	Drop "funny" functions building parsers (#50715 ) (#50814 ) Replaces the "funny" `Function<String, ConstructingObjectParser<T, Void>>` with a much simpler `ConstructingObjectParser<T, String>`. This makes pretty much all of our object parsers static.	2020-01-09 15:53:03 -05:00
Jake Landis	de6f132887	[7.x] Foreach processor - fork recursive call (#50514 ) (#50773 ) A very large number of recursive calls can cause a stack overflow exception. This commit forks the recursive calls for non-async processors. Once forked, each thread will handle at most 10 recursive calls to help keep the stack size and thread count down to a reasonable size.	2020-01-09 13:21:18 -06:00
Sean Story	c51303d051	Typo of ' instead of ` (#50767 )	2020-01-09 09:41:41 -08:00
Benjamin Trent	cc0e64572a	[ML][Inference][HLRC] Add necessary lang ident classes (#50705 ) (#50794 ) This adds the necessary named XContent classes to the HLRC for the lang ident model. This is so the HLRC can call `GET _ml/inference/lang_ident_model_1?include_definition=true` without XContent parsing errors. The constructors are package private as since this classes are used exclusively within the pre-packaged model (and require the specific weights, etc. to be of any use).	2020-01-09 10:33:38 -05:00
Benjamin Trent	3e014d39c2	[Transform] fail to start/put on missing pipeline (#50701 ) (#50795 ) If a pipeline referenced by a transform does not exist, we should not allow the transform to be created. We do allow the pipeline existence check to be skipped with defer_validations, but if the pipeline still does not exist on `_start`, the pipeline will fail to start. relates: #50135	2020-01-09 10:33:22 -05:00
Martijn van Groningen	f75d99149b	Wrap triggering of a watch inside an assertBusy(...) invocation This test replaces the watch index after watcher got started. This triggers watches being reloaded and while this happens the trigger engine is paused, which disallows watches from being triggered. At this time there are no watches in the .watches index and I think this is just unlucky timing. Reloading of watches happens in the background and the watch state can be started when that happens. For normal schedule trigger engines this is not an issue, because watches that are meant to be triggered are triggered when the engine triggers the next time. However for the mock scheduled trigger engine this is different, because watches are triggered programatically and there is no retry in this test. I think just adding `timeWarp().trigger("mywatch");` inside a `assertBusy(...)`` is the right fix here. If it fails because the mock schedule trigger engine is paused then the test will try again. In the mean time the the watches can be reloaded, which then resumes the mock scheduled trigger engine. Closes #50658	2020-01-09 09:05:20 +01:00
Ioannis Kakavas	d2189b9d80	Mute SamlAuthenticatorTests in Azulu Zulu (#50779 ) See #49742	2020-01-09 09:41:04 +02:00
Christoph Büscher	b1b4282273	Make Multiplexer inherit filter chains analysis mode (#50662 ) Currently, if an updateable synonym filter is included in a multiplexer filter, it is not reloaded via the _reload_search_analyzers because the multiplexer itself doesn't pass on the analysis mode of the filters it contains, so its not recognized as "updateable" in itself. Instead we can check and merge the AnalysisMode settings of all filters in the multiplexer and use the resulting mode (e.g. search-time only) for the multiplexer itself, thus making any synonym filters contained in it reloadable. This, of course, will also make the analyzers using the multiplexer be usable at search-time only. Closes #50554	2020-01-08 22:12:01 +01:00
Lee Hinman	8dc6e98819	[7.x] Make InitializePolicyContextStep retryable (#50685 ) (#50760 ) This commits makes the "init" ILM step retryable. It also adds a test where an index is created with a non-parsable index name and then fails. Related to #48183	2020-01-08 13:13:57 -07:00
Nhat Nguyen	90e66a7b97	Mute testPolicyCRUD Tracked at #44997	2020-01-08 13:25:40 -05:00
Adrien Grand	4f2299c714	Upgrade to Lucene 8.4.0. (#50518 ) (#50750 )	2020-01-08 18:53:59 +01:00
Lee Hinman	615532b4f8	Mute TimeSeriesLifecycleActionsIT.testHistoryIsWritten* (#50755 ) Related to #50353	2020-01-08 10:35:44 -07:00
Armin Braun	a725896c92	Fix and Reenable SnapshotTool Minio Tests (#50736 ) (#50745 ) This solves half of the problem in #46813 by moving the S3 tests to using the shared minio fixture so we at least have some non-3rd-party, constantly running coverage on these tests.	2020-01-08 16:33:36 +01:00
Adrien Grand	31158ab3d5	Add per-field metadata. (#50333 ) This PR adds per-field metadata that can be set in the mappings and is later returned by the field capabilities API. This metadata is completely opaque to Elasticsearch but may be used by tools that index data in Elasticsearch to communicate metadata about fields with tools that then search this data. A typical example that has been requested in the past is the ability to attach a unit to a numeric field. In order to not bloat the cluster state, Elasticsearch requires that this metadata be small: - keys can't be longer than 20 chars, - values can only be numbers or strings of no more than 50 chars - no inner arrays or objects, - the metadata can't have more than 5 keys in total. Given that metadata is opaque to Elasticsearch, field capabilities don't try to do anything smart when merging metadata about multiple indices, the union of all field metadatas is returned. Here is how the meta might look like in mappings: ```json { "properties": { "latency": { "type": "long", "meta": { "unit": "ms" } } } } ``` And then in the field capabilities response: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms" ] } } } } ``` When there are no conflicts, values are arrays of size 1, but when there are conflicts, Elasticsearch includes all unique values in this array, without giving ways to know which index has which metadata value: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms", "ns" ] } } } } ``` Closes #33267	2020-01-08 16:21:18 +01:00
Andrei Dan	3915d4c055	Make the UpdateRolloverLifecycleDateStep retryable (#50702 ) (#50730 ) This makes the "update-rollover-lifecycle-date" step, which is part of the rollover action, retryable. It also adds an integration test to check the step is retried and it eventually succeeds. (cherry picked from commit 5bf068522deb2b6cd2563bcf80f34fdbf459c9f2) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-08 11:45:26 +01:00
Christoph Büscher	d8c907d648	Remove _reload_search_analyzer experimental status (#50696 ) Removing the experimental status in the docs and the rest specs.	2020-01-08 10:35:19 +01:00
Tim Vernum	293661d62c	Security should not reload files that haven't changed (#50724 ) In security we currently monitor a set of files for changes: - config/role_mapping.yml (or alternative configured path) - config/roles.yml - config/users - config/users_roles This commit prevents unnecessary reloading when the file change actually doesn't change the internal structure. Backport of: #50207 Co-authored-by: Anton Shuvaev <anton.shuvaev91@gmail.com>	2020-01-08 15:13:47 +11:00
Mayya Sharipova	c1c0b47d5e	Specify the indexname in searches (#50717 ) vector REST tests occasionally fail on 7.x because we don't receive the expected response headers with deprecation warnings. This happens as searchers were executed against all indices including internal indices, whose shards did not produce expected warnings. This PR ensures that searchers are executed only against expected indices. Closes #50716	2020-01-07 17:06:52 -05:00
Benjamin Trent	060e0a6277	[ML][Inference] Add support for models shipped as resources (#50680 ) (#50700 ) This adds support for models that are shipped as resources in the ML plugin. The first of which is the `lang_ident` model.	2020-01-07 09:21:59 -05:00
Hendrik Muhs	98ca9500e8	implement a workaround for remote cluster validation (#50460 ) In 7.x an internal API used for validating remote cluster does not throw, see #50420 for the details. This change implements a workaround for remote cluster validation, only for 7.x branches. fixes #50420	2020-01-07 13:51:51 +01:00
Przemysław Witek	4116452d90	Implement testStopAndRestart for ClassificationIT (#50585 ) (#50698 )	2020-01-07 13:41:37 +01:00
David Roberts	35453e2b0e	[ML] Improve uniqueness of result document IDs (#50644 ) Switch from a 32 bit Java hash to a 128 bit Murmur hash for creating document IDs from by/over/partition field values. The 32 bit Java hash was not sufficiently unique, and could produce identical numbers for relatively common combinations of by/partition field values such as L018/128 and L017/228. Fixes #50613	2020-01-07 10:24:45 +00:00
David Roberts	46d600c446	[ML] Fix off-by-one error in ml_classic tokenizer end offset (#50655 ) The end offset of a tokenizer is supposed to point one past the end of the input, not to the end character of the input. The ml_classic tokenizer was erroneously doing the latter.	2020-01-07 10:14:59 +00:00
Lee Hinman	552edd862e	[7.x] Add aditional logging for ILM history store tests (#5062… (#50678 ) * Add aditional logging for ILM history store tests (#50624) These tests use the same index name, making it hard to read logs when diagnosing the failures. Additionally more information about the current state of the index could be retrieved when failing. This changes these two things in the hope of capturing more data about why this fails on some CI nodes but not others. Relates to #50353	2020-01-06 15:24:24 -07:00
Nik Everett	7fd84a03a0	Drop references to deprecated logger (#50474 ) (#50681 ) This drops all remaining references to `BaseRestHandler.logger` which has been deprecated for something like a year now. I replaced all of the references with locally declared loggers which is so much less spooky action at a distance to me.	2020-01-06 16:34:07 -05:00
Benjamin Trent	06cea5136e	[ML] construct new random generator on each persistence call (#50657 ) (#50684 ) Sharing a random generator may cause test failures as non-threadsafe random generators are periodically utilized in tests (see: https://github.com/elastic/elasticsearch/issues/50651) This change constructs a calls `Randomness.get()` within the `bulkIndexWithRetry` method so that the returned `Random` object is only used in a single thread. Before, the member variable could have been used between threads, which caused test failures.	2020-01-06 16:26:29 -05:00
Benjamin Trent	5ab9e75e28	[7.x] [ML][Inference] lang_ident model (#50292 ) (#50675 ) * [ML][Inference] lang_ident model (#50292) This PR contains a java port of Google's CLD3 compact NN model https://github.com/google/cld3 The ported model is formatted to fit within our inference model formatting and stored as a resource in the `:xpack:ml:` plugin and is under basic license. The model is broken up into two major parts: - Preprocessing through the custom embedding (based on CLD3's embedding layer) - Pushing the embedded text through the two layers of fully connected shallow NN. Main differences between this port and CLD3: - We take advantage of Java's internal Unicode handling where possible (i.e. codepoints, characters, decoders, etc.) - We do not trim down input text by removing duplicated tokens - We do not encode doubles/floats as longs/integers.	2020-01-06 16:24:03 -05:00
Benjamin Trent	f52af7977d	[ML][Inference] minor cleanup for inference (#50444 ) (#50676 )	2020-01-06 14:05:04 -05:00
Nik Everett	1b28af489f	Fix bare warnings on RollupJobTests (#50633 ) (#50677 ) Silences some ugly warnings.	2020-01-06 14:03:30 -05:00
Albert Zaharovits	9ae3cd2a78	Add 'monitor_snapshot' cluster privilege (#50489 ) (#50647 ) This adds a new cluster privilege `monitor_snapshot` which is a restricted version of `create_snapshot`, granting the same privileges to view snapshot and repository info and status but not granting the actual privilege to create a snapshot. Co-authored-by: j-bean <anton.shuvaev91@gmail.com>	2020-01-06 13:15:55 +02:00
Martijn van Groningen	0f2d26bdca	Unmute 'Test url escaping with url mustache function' webhook watcher test (#50439 ) Some changes had to be made in order to make the test pass due to the removal or types. Added some more assertions. The failure description in this comment [0] indicates that the rest handler couldn't be found. The test passes now. I plan to merge this into master and see how CI reacts, if it handles this change well then I will also unmute this test in 7 dot x branch. Also check watch count after stopping watcher in test teardown and disabled slm in smoke test watcher qa test. Relates to #41172 0: https://github.com/elastic/elasticsearch/issues/41172#issuecomment-496993976	2020-01-06 10:43:55 +01:00
Nik Everett	2362c430cd	Clean up wire test case a bit (#50627 ) (#50632 ) * Adds JavaDoc to `AbstractWireTestCase` and `AbstractWireSerializingTestCase` so it is more obvious you should prefer the latter if you have a choice * Moves the `instanceReader` method out of `AbstractWireTestCase` becaue it is no longer used. * Marks a bunch of methods final so it is more obvious which classes are for what. * Cleans up the side effects of the above.	2020-01-05 16:20:38 -05:00
Nik Everett	45663ac1a8	Use Void context on parsers where possible (#50573 ) (#50617 ) Most of our parsing can be done without passing any extra context into the parser that isn't already part of the xcontent stream. While I was looking around at the places that do need a context I found a few places that were declared to need a context but don't actually need it.	2020-01-03 13:28:55 -05:00
Christoph Büscher	6c8868e955	Mute TimeSeriesLifecycleActionsIT.testHistoryIsWrittenWithSuccess Also muting TimeSeriesLifecycleActionsIT.testHistoryIsWrittenWithFailure. Tracked in #50353	2020-01-03 18:32:03 +01:00
Andrei Dan	3c971f2911	ILM retryable async action steps (#50522 ) (#50591 ) This adds support for retrying AsyncActionSteps by triggering the async step after ILM was moved back on the failed step (the async step we'll be attempting to run after the cluster state reflects ILM being moved back on the failed step). This also marks the RolloverStep as retryable and adds an integration test where the RolloverStep is failing to execute as the rolled over index already exists to test that the async action RolloverStep is retried until the rolled over index is deleted. (cherry picked from commit 8bee5f4cb58a1242cc2ef4bc0317dae6c8be49d3) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-03 16:19:58 +02:00
Dimitris Athanasiou	ca0828ba07	[7.x][ML] Implement force deleting a data frame analytics job (#50553 ) (#50589 ) Adds a `force` parameter to the delete data frame analytics request. When `force` is `true`, the action force-stops the jobs and then proceeds to the deletion. This can be used in order to delete a non-stopped job with a single request. Closes #48124 Backport of #50553	2020-01-03 13:46:02 +02:00
Przemysław Witek	8917c05df8	[7.x] Synchronize processInStream.close() call (#50581 )	2020-01-03 10:23:51 +01:00
Lee Hinman	0d78aa2708	Don't dump a stacktrace for invalid patterns when executing elasticsearch-croneval (#49744 ) (#50578 ) Co-authored-by: bellengao <gbl_long@163.com>	2020-01-02 16:57:51 -07:00
Nik Everett	b36a8ab141	Make some ObjectParsers final (#50471 ) (#50556 ) We have about 800 `ObjectParsers` in Elasticsearch, about 700 of which are final. This is probably the right way to declare them because in practice we never mutate them after they are built. And we certainly don't change the static reference. Anyway, this adds `final` to a bunch of these parsers, mostly the ones in xpack and their "paired" parsers in the high level rest client. I picked these just to have somewhere to break the up the change so it wouldn't be huge. I found the non-final parsers with this: ``` diff \ <(find . -type f -name '.java' -exec grep -iHe 'static.PARSER\s=' {} \+ \| sort) \ <(find . -type f -name '.java' -exec grep -iHe 'static.final.PARSER\s*=' {} \+ \| sort) \ 2>&1 \| grep '^<' ```	2020-01-02 10:47:38 -05:00
Przemysław Witek	4ecabe496f	Mute testStopAndRestart test case (#50551 )	2020-01-02 15:28:20 +01:00
Christoph Büscher	1599af8428	Fix type conversion problem in Eclipse (#50549 ) Eclipse 4.13 shows a type mismatch error in the affected line because it cannot correctly infer the boolean return type for the method call. Assigning return value to a local variable resolves this problem.	2020-01-02 14:29:20 +01:00
Lisa Cawley	8869f2b9b2	[DOCS] Adds intro for OIDC realm (#50485 )	2019-12-30 07:05:28 -08:00
Tim Vernum	cad0f6bf28	Do not load SSLService in plugin contructor (#50519 ) XPackPlugin created an SSLService within the plugin contructor. This has 2 negative consequences: 1. The service may be constructed based on a partial view of settings. Other plugins are free to add setting values via the additionalSettings() method, but this (necessarily) happens after plugins have been constructed. 2. Any exceptions thrown during the plugin construction are handled differently than exceptions thrown during "createComponents". Since SSL configurations exceptions are relatively common, it is far preferable for them to be thrown and handled as part of the createComponents flow. This commit moves the creation of the SSLService to XPackPlugin.createComponents, and alters the sequence of some other steps to accommodate this change. Backport of: #49667	2019-12-30 14:42:32 +11:00
James Rodewig	3f7f31b6b0	[DOCS] Fix search request body links (#50500 ) PR #44238 changed several links related to the Elasticsearch search request body API. This updates several places still using outdated links or anchors. This will ultimately let us remove some redirects related to those link changes.	2019-12-26 14:31:09 -05:00
James Rodewig	ef467cc6f5	[DOCS] Remove unneeded redirects (#50476 ) The docs/reference/redirects.asciidoc file stores a list of relocated or deleted pages for the Elasticsearch Reference documentation. This prunes several older redirects that are no longer needed and don't require work to fix broken links in other repositories.	2019-12-26 08:29:28 -05:00
Orhan Toy	6a3d1a077e	[DOCS] Fixes "enables you to" typos (#50225 )	2019-12-23 14:39:14 -05:00
Armin Braun	cec02da0ac	Fix Source Only Snapshot REST Test Failure (#50456 ) (#50459 ) We are matching on the exact number of shards in this test, but may run into snapshotting more than the single index created in it due to auto-created indices like `.watcher`. Fixed by making the test only take a snapshot of the single index used by this test. Closes #50450	2019-12-23 12:24:08 +01:00
Igor Motov	339d10c16f	Geo: Switch generated GeoJson type names to camel case (#50400 ) Switches generated GeoJson type names to camel case to conform to the standard. Closes #49568	2019-12-20 15:37:22 -05:00
Lee Hinman	c3c9ccf61f	[7.x] Add ILM histore store index (#50287 ) (#50345 ) * Add ILM histore store index (#50287) * Add ILM histore store index This commit adds an ILM history store that tracks the lifecycle execution state as an index progresses through its ILM policy. ILM history documents store output similar to what the ILM explain API returns. An example document with ALL fields (not all documents will have all fields) would look like: ```json { "@timestamp": 1203012389, "policy": "my-ilm-policy", "index": "index-2019.1.1-000023", "index_age":123120, "success": true, "state": { "phase": "warm", "action": "allocate", "step": "ERROR", "failed_step": "update-settings", "is_auto-retryable_error": true, "creation_date": 12389012039, "phase_time": 12908389120, "action_time": 1283901209, "step_time": 123904107140, "phase_definition": "{\"policy\":\"ilm-history-ilm-policy\",\"phase_definition\":{\"min_age\":\"0ms\",\"actions\":{\"rollover\":{\"max_size\":\"50gb\",\"max_age\":\"30d\"}}},\"version\":1,\"modified_date_in_millis\":1576517253463}", "step_info": "{... etc step info here as json ...}" }, "error_details": "java.lang.RuntimeException: etc\n\tcaused by:etc etc etc full stacktrace" } ``` These documents go into the `ilm-history-1-00000N` index to provide an audit trail of the operations ILM has performed. This history storage is enabled by default but can be disabled by setting `index.lifecycle.history_index_enabled` to `false.` Resolves #49180 * Make ILMHistoryStore.putAsync truly async (#50403) This moves the `putAsync` method in `ILMHistoryStore` never to block. Previously due to the way that the `BulkProcessor` works, it was possible for `BulkProcessor#add` to block executing a bulk request. This was bad as we may be adding things to the history store in cluster state update threads. This also moves the index creation to be done prior to the bulk request execution, rather than being checked every time an operation was added to the queue. This lessens the chance of the index being created, then deleted (by some external force), and then recreated via a bulk indexing request. Resolves #50353	2019-12-20 12:33:36 -07:00
Lisa Cawley	2106a7b02a	[7.x][DOCS] Updates ML links (#50387 ) (#50409 )	2019-12-20 10:01:19 -08:00
Benjamin Trent	71ff330c4e	[ML][Inference] updates specs with new params + docs (#50373 ) (#50441 )	2019-12-20 12:13:45 -05:00
Martijn van Groningen	9646f3abad	Disable slm in AbstractWatcherIntegrationTestCase (#50422 ) SLM isn't required tests extending from this base class and only add noise during test suite teardown. Closes #50302	2019-12-20 15:51:46 +01:00
Przemysław Witek	3e3a93002f	[7.x] Fix accuracy metric (#50310 ) (#50433 )	2019-12-20 15:34:38 +01:00
Przemysław Witek	14d95aae46	[7.x] Make each analysis report desired field mappings to be copied (#50219 ) (#50428 )	2019-12-20 15:10:33 +01:00
Przemysław Witek	5bb668b866	[7.x] Get rid of maxClassesCardinality internal parameter (#50418 ) (#50423 )	2019-12-20 14:24:23 +01:00
Hendrik Muhs	40bce49a7f	mute SourceDestValidatorTests.testRemoteSourceDoesNotExist	2019-12-20 11:25:43 +01:00
Hendrik Muhs	7c10e9b0e7	[Transform] improve checkpoint reporting (#50369 ) fixes empty checkpoints, re-factors checkpoint info creation (moves builder) and always reports last change detection relates #43201 relates #50018	2019-12-20 10:49:53 +01:00
Hendrik Muhs	de14092ad2	[Transform] refactor source and dest validation to support CCS (#50018 ) refactors source and dest validation, adds support for CCS, makes resolve work like reindex/search, allow aliased dest index with a single write index. fixes #49988 fixes #49851 relates #43201	2019-12-20 10:49:53 +01:00
Marios Trivyzas	f1a6b675f7	SQL: Fix issue with CAST and NULL checking. (#50371 ) Previously, during expression optimisation, CAST would be considered nullable if the casted expression resulted to a NULL literal, and would be always non-nullable otherwise. As a result if CASE was wrapped by a null check function like IS NULL or IS NOT NULL it was simplified to TRUE/FALSE, eliminating the actual casting operation. So in case of an expression with an erroneous casting like CAST('foo' AS DATETIME) IS NULL it would be simplified to FALSE instead of throwing an Exception signifying the attempt to cast 'foo' to a DATETIME type. CAST now always returns Nullability.UKNOWN except from the case that its result evaluated to a constant NULL, where it returns Nullability.TRUE. This way the IS NULL/IS NOT NULL don't get simplified to FALSE/TRUE and the CAST actually gets evaluated resulting to a thrown Exception. Fixes: #50191 (cherry picked from commit 671e07a931cd828661e226cba22a5d38804a17a5)	2019-12-20 10:24:35 +02:00
Tim Brooks	cb73fb0f9b	Backport remote proxy mode stats and naming (#50402 ) * Update remote cluster stats to support simple mode (#49961) Remote cluster stats API currently only returns useful information if the strategy in use is the SNIFF mode. This PR modifies the API to provide relevant information if the user is in the SIMPLE mode. This information is the configured addresses, max socket connections, and open socket connections. * Send hostname in SNI header in simple remote mode (#50247) Currently an intermediate proxy must route conncctions to the appropriate remote cluster when using simple mode. This commit offers a additional mechanism for the proxy to route the connections by including the hostname in the TLS SNI header. * Rename the remote connection mode simple to proxy (#50291) This commit renames the simple connection mode to the proxy connection mode for remote cluster connections. In order to do this, the mode specific settings which we namespaced by their mode (ex: sniff.seed and proxy.addresses) have been reverted. * Modify proxy mode to support a single address (#50391) Currently, the remote proxy connection mode uses a list setting for the proxy address. This commit modifies this so that the setting is proxy_address and only supports a single remote proxy address.	2019-12-19 18:02:48 -07:00
Stuart Tettemer	689df1f28f	Scripting: ScriptFactory not required by compile (#50344 ) (#50392 ) Avoid backwards incompatible changes for 8.x and 7.6 by removing type restriction on compile and Factory. Factories may optionally implement ScriptFactory. If so, then they can indicate determinism and thus cacheability. Backport Relates: #49466	2019-12-19 12:50:25 -07:00
Przemysław Witek	cc4bc797f9	[7.x] Implement `precision` and `recall` metrics for classification evaluation (#49671 ) (#50378 )	2019-12-19 18:55:05 +01:00
Igor Motov	c77ca98928	Geo: Switch generated WKT to upper case (#50285 ) Switches generated WKT to upper case to conform to the standard recommendation. Relates #49568	2019-12-18 17:29:08 -05:00
Dimitris Athanasiou	d3c83cd55a	[7.x][ML] Refresh state index before completing data frame analytics job (#50322 ) (#50324 ) In order to ensure any persisted model state is searchable by the moment the job reports itself as `stopped`, we need to refresh the state index before completing. This should fix the occasional failures we see in #50168 and #50313 where the model state appears missing. Closes #50168 Closes #50313 Backport of #50322	2019-12-18 22:19:59 +00:00
Benjamin Trent	4396a1f78b	[ML][Inference] fix support for nested fields (#50258 ) (#50335 ) This fixes support for nested fields We now support fully nested, fully collapsed, or a mix of both on inference docs. ES mappings allow the `_source` to be any combination of nested objects + dot delimited fields. So, we should do our best to find the best path down the Map for the desired field.	2019-12-18 15:47:06 -05:00
Jason Tedor	7c5a3bcf6d	Always consume the body in has privileges (#50298 ) Our REST infrastructure will reject requests that have a body where the body of the request is never consumed. This ensures that we reject requests on endpoints that do not support having a body. This requires cooperation from the REST handlers though, to actually consume the body, otherwise the REST infrastructure will proceed with rejecting the request. This commit addresses an issue in the has privileges API where we would prematurely try to reject a request for not having a username, before consuming the body. Since the body was not consumed, the REST infrastructure would instead reject the request as a bad request.	2019-12-18 08:30:53 -05:00
Dimitris Athanasiou	447bac27d2	[7.x][ML] Delete unused data frame analytics state (#50243 ) (#50280 ) This commit adds removal of unused data frame analytics state from the _delete_expired_data API (and in extend th ML daily maintenance task). At the moment the potential state docs include the progress document and state for regression and classification analyses. Backport of #50243	2019-12-18 12:30:11 +00:00
Yannick Welsch	82086929d7	Increase timeout on FollowIndexSecurityIT.testAutoFollowPatterns (#50282 ) This test was causing test failures on slow CI runs. Closes #50279	2019-12-18 10:37:11 +01:00
Przemysław Witek	ac974c35c0	Pass processConnectTimeout to the method that fetches C++ process' PID (#50276 ) (#50290 )	2019-12-17 21:32:37 +01:00
Florian Kelbert	afe9ee3fa5	[DOCS] Fix typo in Create API key docs (#50233 )	2019-12-17 11:19:13 -05:00
David Kyle	098f540f9d	[ML] Remove usage of base action logger in ml actions (#50074 ) (#50236 )	2019-12-17 13:03:27 +00:00
Martijn van Groningen	2079f1cbeb	Backport: Fix ingest simulate response document order if processor executes async (#50269 ) Backport #50244 to 7.x branch. If a processor executes asynchronously and the ingest simulate api simulates with multiple documents then the order of the documents in the response may not match the order of the documents in the request. Alexander Reelsen discovered this issue with the enrich processor with the following reproduction: ``` PUT cities/_doc/munich {"zip":"80331","city":"Munich"} PUT cities/_doc/berlin {"zip":"10965","city":"Berlin"} PUT /_enrich/policy/zip-policy { "match": { "indices": "cities", "match_field": "zip", "enrich_fields": [ "city" ] } } POST /_enrich/policy/zip-policy/_execute GET _cat/indices/.enrich-* POST /_ingest/pipeline/_simulate { "pipeline": { "processors" : [ { "enrich" : { "policy_name": "zip-policy", "field" : "zip", "target_field": "city", "max_matches": "1" } } ] }, "docs": [ { "_id": "first", "_source" : { "zip" : "80331" } } , { "_id": "second", "_source" : { "zip" : "50667" } } ] } ``` * fixed test compile error	2019-12-17 12:27:07 +01:00
Armin Braun	2e7b1ab375	Use ClusterState as Consistency Source for Snapshot Repositories (#49060 ) (#50267 ) Follow up to #49729 This change removes falling back to listing out the repository contents to find the latest `index-N` in write-mounted blob store repositories. This saves 2-3 list operations on each snapshot create and delete operation. Also it makes all the snapshot status APIs cheaper (and faster) by saving one list operation there as well in many cases. This removes the resiliency to concurrent modifications of the repository as a result and puts a repository in a `corrupted` state in case loading `RepositoryData` failed from the assumed generation.	2019-12-17 10:55:13 +01:00
Andrei Stefan	c6fdf9ed8a	Handle NULL in ResultSet's getDate() method (#50184 ) (cherry picked from commit 08214eb1338fef5c8082c3f8b84c24dd53224ebe)	2019-12-17 10:03:23 +02:00
Tim Vernum	ce2aab3f2f	Add setting to restrict license types (#50252 ) This adds a new "xpack.license.upload.types" setting that restricts which license types may be uploaded to a cluster. By default all types are allowed (excluding basic, which can only be generated and never uploaded). This setting does not restrict APIs that generate licenses such as the start trial API. This setting is not documented as it is intended to be set by orchestrators and not end users. Backport of: #49418	2019-12-17 14:58:58 +11:00
Julie Tibshirani	463cd414aa	Bump the scroll keep-alive time in cluster upgrade tests. (#50195 ) In the yaml cluster upgrade tests, we start a scroll in a mixed-version cluster, then attempt to continue the scroll after the upgrade is complete. This test occasionally fails because the scroll can expire before the cluster is done upgrading. The current scroll keep-alive time 5m. This PR bumps it to 10m, which gives a good buffer since in failing tests the time was only exceeded by ~30 seconds. Addresses #46529.	2019-12-16 10:58:31 -08:00
Rory Hunter	2bd3a05892	Refactor environment variable processing for Docker (#50221 ) Backport of #49612. The current Docker entrypoint script picks up environment variables and translates them into -E command line arguments. However, since any tool executes via `docker exec` doesn't run the entrypoint, it results in a poorer user experience. Therefore, refactor the env var handling so that the -E options are generated in `elasticsearch-env`. These have to be appended to any existing command arguments, since some CLI tools have subcommands and -E arguments must come after the subcommand. Also extract the support for `_FILE` env vars into a separate script, so that it can be called from more than once place (the behaviour is idempotent). Finally, add noop -E handling to CronEvalTool for parity, and support `-E` in MultiCommand before subcommands.	2019-12-16 15:39:28 +00:00
David Kyle	5542686283	[ML] Wait for green after opening job in NetworkDisruptionIT (#50232 ) Closes #49908	2019-12-16 14:55:58 +00:00
David Roberts	32b2445744	Change process kill order for testclusters shutdown (#50215 ) The testclusters shutdown code was killing child processes of the ES JVM before the ES JVM. This causes any running ML jobs to be recorded as failed, as the ES JVM notices that they have disconnected from it without being told to stop, as they would if they crashed. In many test suites this doesn't matter because the test cluster will never be restarted, but in the case of upgrade tests it makes it impossible to test what happens when an ML job is running at the time of the upgrade. This change reverses the order of killing the ES process tree such that the parent processes are killed before their children. A list of children is stored before killing the parent so that they can subsequently be killed (if they don't exit by themselves as a side effect of the parent dying). Backport of #50175	2019-12-16 14:12:36 +00:00
Dimitris Athanasiou	73add726d7	[7.x][ML] Fix exception when field is not included and excluded at the same time (#50192 ) (#50223 ) Executing the data frame analytics _explain API with a config that contains a field that is not in the includes list but at the same time is the excludes list results to trying to remove the field twice from the iterator. That causes an `IllegalStateException`. This commit fixes this issue and adds a test that captures the scenario. Backport of #50192	2019-12-16 11:30:06 +00:00
Armin Braun	761d6e8e4b	Remove BlobContainer Tests against Mocks (#50194 ) (#50220 ) * Remove BlobContainer Tests against Mocks Removing all these weird mocks as asked for by #30424. All these tests are now part of real repository ITs and otherwise left unchanged if they had independent tests that didn't call the `createBlobStore` method previously. The HDFS tests also get added coverage as a side-effect because they did not have an implementation of the abstract repository ITs. Closes #30424	2019-12-16 11:37:09 +01:00
Ignacio Vera	3717c733ff	"CONTAINS" support for BKD-backed geo_shape and shape fields (#50141 ) (#50213 ) Lucene 8.4 added support for "CONTAINS", therefore in this commit those changes are integrated in Elasticsearch. This commit contains as well a bug fix when querying with a geometry collection with "DISJOINT" relation.	2019-12-16 09:17:51 +01:00
Tim Vernum	a9d16ee895	Skip enterprise license tests in release build (#50182 ) The release builds use a production license key, and our rest test load licenses that are signed by the dev license key. This change adds the new enterprise license Rest tests to the blacklist on release builds. Backport of: #50163	2019-12-16 10:11:21 +11:00
Nhat Nguyen	df46848fb0	Migrate peer recovery from translog to retention lease (#49448 ) Since 7.4, we switch from translog to Lucene as the source of history for peer recoveries. However, we reduce the likelihood of operation-based recoveries when performing a full cluster restart from pre-7.4 because existing copies do not have PPRL. To remedy this issue, we fallback using translog in peer recoveries if the recovering replica does not have a peer recovery retention lease, and the replication group hasn't fully migrated to PRRL. Relates #45136	2019-12-15 10:24:39 -05:00
Nhat Nguyen	c151a75dfe	Use retention lease in peer recovery of closed indices (#48430 ) Today we do not use retention leases in peer recovery for closed indices because we can't sync retention leases on closed indices. This change allows that ability and adjusts peer recovery to use retention leases for all indices with soft-deletes enabled. Relates #45136 Co-authored-by: David Turner <david.turner@elastic.co>	2019-12-15 10:24:34 -05:00
Benjamin Trent	4805d8ac7d	[ML][Inference] Adding a warning_field for warning msgs. (#49838 ) (#50183 ) This adds a new field for the inference processor. `warning_field` is a place for us to write warnings provided from the inference call. When there are warnings we are not going to write an inference result. The goal of this is to indicate that the data provided was too poor or too different for the model to make an accurate prediction. The user could optionally include the `warning_field`. When it is not provided, it is assumed no warnings were desired to be written. The first of these warnings is when ALL of the input fields are missing. If none of the trained fields are present, we don't bother inferencing against the model and instead provide a warning stating that the fields were missing. Also, this adds checks to not allow duplicated fields during processor creation.	2019-12-13 10:39:51 -05:00
Benjamin Trent	41736dd6c3	[ML] retry bulk indexing of state docs (#50149 ) (#50185 ) This exchanges the direct use of the `Client` for `ResultsPersisterService`. State doc persistence will now retry. Failures to persist state will still not throw, but will be audited and logged.	2019-12-13 10:39:34 -05:00
Dimitris Athanasiou	fe3c9e71d1	[7.x][ML] Fix DFA explain API timeout when source index is missing (#50176 ) (#50180 ) This commit fixes a bug that caused the data frame analytics _explain API to time out in a multi-node setup when the source index was missing. When we try to create the extracted fields detector, we check the index settings. If the index is missing that responds with a failure that could be wrapped as a remote exception. While we unwrapped correctly to check if the cause was an `IndexNotFoundException`, we then proceeded to cast the original exception instead of the cause. Backport of #50176	2019-12-13 17:00:55 +02:00
Ioannis Kakavas	46376100b1	Fix testMalformedToken (#50164 ) (#50170 ) This test was fixed as part of #49736 so that it used a TokenService mock instance that was enabled, so that token verification fails because the token is invalid and not because the token service is not enabled. When the randomly generated token we send, decodes to being of version > 7.2 , we need to have mocked a GetResponse for the call that TokenService#getUserTokenFromId will make, otherwise this hangs and times out.	2019-12-13 13:46:44 +02:00
Dimitris Athanasiou	e6cbcf7f7c	[7.x] [ML] Persist/restore state for DFA classification (#50040 ) (#50147 ) This commit adds state persist/restore for data frame analytics classification jobs. Backport of #50040	2019-12-13 10:33:19 +02:00
Hendrik Muhs	1c3ce110bd	[Transform] add actual timeout in message (#50140 ) add the timeout to the message if stopping a transform times out	2019-12-13 08:10:25 +01:00
Jason Tedor	29526d0dfe	Validate exporter type is HTTP for HTTP exporter (#49992 ) Today the HTTP exporter settings without the exporter type having been configured to HTTP. When it is time to initialize the exporter, we can blow up. Since this initialization happens on the cluster state applier thread, it is quite problematic that we do not reject settings updates where the type is not configured to HTTP, but there are HTTP exporter settings configured. This commit addresses this by validating that the exporter type is not only set, but is set to HTTP.	2019-12-12 20:01:04 -05:00
Tim Vernum	2811b97b76	Remove reserved roles for code search (#50115 ) The "code_user" and "code_admin" reserved roles existed to support code search which is no longer included in Kibana. The "kibana_system" role included privileges to read/write from the code search indices, but no longer needs that access. Backport of: #50068	2019-12-13 10:22:55 +11:00
Julie Tibshirani	73c412063b	Reenable the 'continue scroll' cluster upgrade test.	2019-12-12 12:34:49 -08:00
Benjamin Trent	d7ffa7f8f7	[7.x][ML] Add graceful retry for anomaly detector result indexing failures(#49508 ) (#50145 ) * [ML] Add graceful retry for anomaly detector result indexing failures (#49508) All results indexing now retry the amount of times configured in `xpack.ml.persist_results_max_retries`. The retries are done in a semi-random, exponential backoff. * fixing test	2019-12-12 12:24:58 -05:00
Benjamin Trent	c043aa887f	[ML][Inference] Simplify inference processor options (#50105 ) (#50146 ) * [ML][Inference] Simplify inference processor options * addressing pr comments	2019-12-12 11:13:55 -05:00
David Roberts	13e47df97d	[TEST] Increase timeout for ML internal cluster cleanup (#50142 ) Closes #48511	2019-12-12 15:38:22 +00:00
Ignacio Vera	b5ec227de8	upgrade to lucene 8.4.0-snapshot-08b8d116f8f (#50129 ) (#50132 )	2019-12-12 13:13:37 +01:00
David Kyle	7d4118dc4e	Enable trace logging in failing ml NetworkDisruptionIT https://github.com/elastic/elasticsearch/issues/49908	2019-12-12 11:16:01 +00:00
Tim Vernum	47e5e34f42	Support "enterprise" license types (#49474 ) This adds "enterprise" as an acceptable type for a license loaded through the PUT _license API. Internally an enterprise license is treated as having a "platinum" operating mode. The handling of License types was refactored to have a new explicit "LicenseType" enum in addition to the existing "OperatingMode" enum. By default (in 7.x) the GET license API will return "platinum" when an enterprise license is active in order to be compatible with existing consumers of that API. A new "accept_enterprise" flag has been introduced to allow clients to opt-in to receive the correct "enterprise" type. Backport of: #49223	2019-12-12 14:37:44 +11:00
Andrei Stefan	e9e2e5fc71	Have COUNT DISTINCT return 0 instead of NULL for no documents matching. (#50037 ) (cherry picked from commit cb94731e6f41bc51c23e4aab495b64eea731a061)	2019-12-12 00:34:04 +02:00
Julie Tibshirani	277880bb4f	In sparse vector REST tests, specify the index name in searches. (#50061 ) The `sparse_vector` REST tests occasionally fail on 7.x because we don't receive the expected response headers with deprecation warnings. One theory as to what is happening is that there is an extra empty index present in addition to the test index. Since the search doesn't specify an index name, it hits both the test index and this extra empty index and shard responses from the extra index don't produce deprecation warnings. If not all shard responses contain the warning headers, then certain deprecation warnings can be lost (due to the bug described in #33936). This PR tries to harden the `sparse_vector` tests by always specifying the index name during a search. This doesn't fix the root causes of the issue, but is good practice and can help avoid intermittent failures. Addresses #49383.	2019-12-11 10:33:47 -08:00
Dimitris Athanasiou	03ecaae221	[7.x][ML] Avoid classification integ test training on single class (#50072 ) (#50078 ) The `ClassificationIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet` test was previously set up to just have 10 rows. With `training_percent` of 50%, only 5 rows will be used for training. There is a good chance that all 5 rows will be of one class which results to failure. This commit increases the rows to 100. Now 50 rows should be used for training and the chance of failure should be very small. Backport of #50072	2019-12-11 18:50:26 +02:00
Henning Andersen	9cdabbd363	Log attachment generation failures (#50080 ) Watcher logs when actions fail in ActionWrapper, but failures to generate an email attachment are not logged and we thus only know the type of the exception and not where/how it occurred.	2019-12-11 17:20:22 +01:00
David Turner	285eacd267	Use more specific loggers in subclasses of TMNA (#50076 ) Adjusts the subclasses of `TransportMasterNodeAction` to use their own loggers instead of the one for the base class. Relates #50056. Partial backport of #46431 to 7.x.	2019-12-11 15:07:47 +00:00
Przemysław Witek	9b116c8fef	A few improvements to AnalyticsProcessManager class that make the code more readable. (#50026 ) (#50069 )	2019-12-11 09:35:05 +01:00
Ioannis Kakavas	3b613c36f4	Always return 401 for not valid tokens (#49736 ) (#50042 ) Return a 401 in all cases when a request is submitted with an access token that we can't consume. Before this change, we would throw a 500 when a request came in with an access token that we had generated but was then invalidated/expired and deleted from the tokens index. Resolves: #38866 Backport of #49736	2019-12-11 09:14:50 +02:00
Stuart Cam	44cd2f444c	Add the REST API specifications for SLM Status / Start / Stop endpoints. (#49759 ) Was originally missed in PR #47710 (cherry picked from commit 133b34c8355639ae0f699a86ffd9f37d19f73bca)	2019-12-11 13:34:13 +11:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
Dimitris Athanasiou	8891f4db88	[7.x][ML] Introduce randomize_seed setting for regression and classification (#49990 ) (#50023 ) This adds a new `randomize_seed` for regression and classification. When not explicitly set, the seed is randomly generated. One can reuse the seed in a similar job in order to ensure the same docs are picked for training. Backport of #49990	2019-12-10 15:29:19 +02:00
Yannick Welsch	a16abf921f	Make elasticsearch-node tools custom metadata-aware (#48390 ) The elasticsearch-node tools allow manipulating the on-disk cluster state. The tool is currently unaware of plugins and will therefore drop custom metadata from the cluster state once the state is written out again (as it skips over the custom metadata that it can't read). This commit preserves unknown customs when editing on-disk metadata through the elasticsearch-node command-line tools.	2019-12-10 09:58:11 +01:00
Jason Tedor	bfb2dc1353	Enable dependent settings values to be validated (#49942 ) Today settings can declare dependencies on another setting. This declaration is implemented so that if the declared setting is not set when the declaring setting is, settings validation fails. Yet, in some cases we want not only that the setting is set, but that it also has a specific value. For example, with the monitoring exporter settings, if xpack.monitoring.exporters.my_exporter.host is set, we not only want that xpack.monitoring.exporters.my_exporter.type is set, but that it is also set to local. This commit extends the settings infrastructure so that this declaration is possible. The use of this in the monitoring exporter settings will be implemented in a follow-up.	2019-12-09 12:45:50 -05:00
Marios Trivyzas	48e7420307	SQL: [Tests] Unmute Pivot from NodeSublassTests (#49925 ) The `testReplaceChildren()` has been fixed for Pivot as part of #49693. Reverting: #49045 (cherry picked from commit 4b9b9edbcf2041a8619b65580bbe192bf424cebc)	2019-12-09 17:20:20 +01:00
Benjamin Trent	0b6ce9683c	[ML] Use query in cardinality check (#49939 ) (#49984 ) When checking the cardinality of a field, the query should be take into account. The user might know about some bad data in their index and want to filter down to the target_field values they care about.	2019-12-09 10:14:41 -05:00
Przemysław Witek	0965a10468	[7.x] Pass `prediction_field_type` to C++ analytics process (#49861 ) (#49981 )	2019-12-09 14:43:01 +01:00
Benjamin Trent	049d854360	[ML][Inference] adjust so target_field always has inference result and optionally allow new top classes field in the classification config (#49923 ) (#49982 )	2019-12-09 08:29:45 -05:00
Dimitris Athanasiou	e4f838e764	[7.x][ML] Update expected mem estimate in explain API integ test (#49924 ) (#49979 ) Work in progress in the c++ side is increasing memory estimates a bit and this test fails. At the time of this commit the mem estimate when there is no source query is a about 2Mb. So I am relaxing the test to assert memory estimate is less than 1Mb instead of 500Kb. Backport of #49924	2019-12-09 11:52:06 +02:00
cachedout	549b103458	[7.x] APM system_user (#47668 ) (#49912 ) * Add test for APM beats index perms * Grant monitoring index privs to apm_system user * Review feedback * Fix compilation problem	2019-12-09 08:25:03 +00:00
Armin Braun	ac2774c9fa	Use Cluster State to Track Repository Generation (#49729 ) (#49976 ) Step on the road to #49060. This commit adds the logic to keep track of a repository's generation across repository operations. See changes to package level Javadoc for the concrete changes in the distributed state machine. It updates the write side of new repository generations to be fully consistent via the cluster state. With this change, no `index-N` will be overwritten for the same repository ever. So eventual consistency issues around conflicting updates to the same `index-N` are not a possibility any longer. With this change the read side will still use listing of repository contents instead of relying solely on the cluster state contents. The logic for that will be introduced in #49060. This retains the ability to externally delete the contents of a repository and continue using it afterwards for the time being. In #49060 the use of listing to determine the repository generation will be removed in all cases (except for full-cluster restart) as the last step in this effort.	2019-12-09 09:02:57 +01:00
Yannick Welsch	01d36afa4b	Randomly run CCR tests with _source disabled (#49922 ) Makes sure that CCR also properly works with _source disabled. Changes one exception in LuceneChangesSnapshot as the case of missing _recovery_source because of a missing lease was not properly properly bubbled up to CCR (testIndexFallBehind was failing).	2019-12-09 08:33:40 +01:00
Costin Leau	5b896c5bb5	SQL: Refactor usage of NamedExpression (#49693 ) To recap, Attributes form the properties of a derived table. Each LogicalPlan has Attributes as output since each one can be part of a query and as such its result are sent to its consumer. This change essentially removes the name id comparison so any changes applied to existing expressions should work as long as the said expressions are semantically equivalent. This change enforces the hashCode and equals which has the side-effect of using hashCode as identifiers for each expression. By removing any property from an Attribute, the various components need to look the original source for comparison which, while annoying, should prevent a reference from getting out of sync with its source due to optimizations. Essentially going forward there are only 3 types of NamedExpressions: Alias - user define (implicit or explicit) name FieldAttribute - field backed by Elasticsearch ReferenceAttribute - a reference to another source acting as an Attribute. Typically the Attribute of an Alias. * Remove the usage of NamedExpression as basis for all Expressions. Instead, restrict their use only for named context, such as projections by using Aliasing instead. * Remove different types of Attributes and allow only FieldAttribute, UnresolvedAttribute and ReferenceAttribute. To avoid issues with rewrites, resolve the references inside the QueryContainer so the information always stays on the source. * Side-effect, simplify the rules as the state for InnerAggs doesn't have to be contained anymore. * Improve ResolveMissingRef rule to handle references to named non-singular expression tree against the same expression used up the tree. #49693 backport to 7.x (cherry picked from commit 5d095e2173bcbf120f534a6f2a584185a7879b57)	2019-12-07 11:02:14 +02:00
Stuart Tettemer	17cda5b2c0	Scripting: Groundwork for caching script results (#49895 ) (#49944 ) In order to cache script results in the query shard cache, we need to check if scripts are deterministic. This change adds a default method to the script factories, `isResultDeterministic() -> false` which is used by the `QueryShardContext`. Script results were never cached and that does not change here. Future changes will implement this method based on whether the results of the scripts are deterministic or not and therefore cacheable. Refs: #49466 Backport	2019-12-06 15:08:05 -07:00
Lee Hinman	8205cdd423	[7.x] Refactor IndexLifecycleRunner to split state modificatio… (#49936 ) This commit refactors the `IndexLifecycleRunner` to split out and consolidate the number of methods that change state from within ILM. It adds a new class `IndexLifecycleTransition` that contains a number of static methods used to modify ILM's state. These methods all return new cluster states rather than making changes themselves (they can be thought of as helpers for modifying ILM state). Rather than having multiple ways to move an index to a particular step (like `moveClusterStateToStep`, `moveClusterStateToNextStep`, `moveClusterStateToPreviouslyFailedStep`, etc (there are others)) this now consolidates those into three with (hopefully) useful names: - `moveClusterStateToStep` - `moveClusterStateToErrorStep` - `moveClusterStateToPreviouslyFailedStep` In the move, I was also able to consolidate duplicate or redundant arguments to these functions. Prior to this commit there were many calls that provided duplicate information (both `IndexMetaData` and `LifecycleExecutionState` for example) where the duplicate argument could be derived from a previous argument with no problems. With this split, `IndexLifecycleRunner` now contains the methods used to actually run steps as well as the methods that kick off cluster state updates for state transitions. `IndexLifecycleTransition` contains only the helpers for constructing new states from given scenarios. This also adds Javadocs to all methods in both `IndexLifecycleRunner` and `IndexLifecycleTransition` (this accounts for almost all of the increase in code lines for this commit). It also makes all methods be as restrictive in visibility, to limit the scope of where they are used. This refactoring is part of work towards capturing actions and transitions that ILM makes, by consolidating and simplifying the places we make state changes, it will make adding operation auditing easier.	2019-12-06 12:55:16 -07:00
Jake Landis	1c5a139968	Update jackson-databind to 2.8.11.4 (#49347 ) (#49937 )	2019-12-06 13:39:33 -06:00
Przemysław Witek	e60837aa3b	[7.x] Log whole analytics stats when the state assertion fails (#49906 ) (#49911 )	2019-12-06 14:31:17 +01:00
Zachary Tong	fec882a457	Decouple pipeline reductions from final agg reduction (#45796 ) Historically only two things happened in the final reduction: empty buckets were filled, and pipeline aggs were reduced (since it was the final reduction, this was safe). Usage of the final reduction is growing however. Auto-date-histo might need to perform many reductions on final-reduce to merge down buckets, CCS may need to side-step the final reduction if sending to a different cluster, etc Having pipelines generate their output in the final reduce was convenient, but is becoming increasingly difficult to manage as the rest of the agg framework advances. This commit decouples pipeline aggs from the final reduction by introducing a new "top level" reduce, which should be called at the beginning of the reduce cycle (e.g. from the SearchPhaseController). This will only reduce pipeline aggs on the final reduce after the non-pipeline agg tree has been fully reduced. By separating pipeline reduction into their own set of methods, aggregations are free to use the final reduction for whatever purpose without worrying about generating pipeline results which are non-reducible	2019-12-05 16:11:54 -05:00
Ignacio Vera	44e94555ee	Add reusable HistogramValue object (#49799 ) (#49823 ) Adds a reusable implementation of HistogramValue so we do not create an object per document.	2019-12-04 11:51:53 +01:00
Armin Braun	996cddd98b	Stop Copying Every Http Request in Message Handler (#44564 ) (#49809 ) * Copying the request is not necessary here. We can simply release it once the response has been generated and a lot of `Unpooled` allocations that way * Relates #32228 * I think the issue that preventet that PR that PR from being merged was solved by #39634 that moved the bulk index marker search to ByteBuf bulk access so the composite buffer shouldn't require many additional bounds checks (I'd argue the bounds checks we add, we save when copying the composite buffer) * I couldn't neccessarily reproduce much of a speedup from this change, but I could reproduce a very measureable reduction in GC time with e.g. Rally's PMC (4g heap node and bulk requests of size 5k saw a reduction in young GC time by ~10% for me)	2019-12-04 08:41:42 +01:00
Hendrik Muhs	c33be29dc7	[Transform] automatic deletion of old checkpoints (#49496 ) add automatic deletion of old checkpoints based on count and time	2019-12-04 07:55:57 +01:00
Hendrik Muhs	d5eb9379c9	remove flaky test: might fail due to async execution	2019-12-03 18:28:41 +01:00
Hendrik Muhs	7aae212287	[Transform] Fix possible audit logging disappearance after rolling upgrade (#49731 ) (#49767 ) ensure audit index template is available during a rolling upgrade before a transform task can write to it. fixes #49730	2019-12-03 18:05:06 +01:00
Przemysław Witek	a3f88595d7	A few cleanups in evaluation tests (#49791 ) (#49794 )	2019-12-03 15:48:39 +01:00
Yannick Welsch	fbb92f527a	Replicate write actions before fsyncing them (#49746 ) This commit fixes a number of issues with data replication: - Local and global checkpoints are not updated after the new operations have been fsynced, but might capture a state before the fsync. The reason why this probably went undetected for so long is that AsyncIOProcessor is synchronous if you index one item at a time, and hence working as intended unless you have a high enough level of concurrent indexing. As we rely in other places on the assumption that we have an up-to-date local checkpoint in case of synchronous translog durability, there's a risk for the local and global checkpoints not to be up-to-date after replication completes, and that this won't be corrected by the periodic global checkpoint sync. - AsyncIOProcessor also has another "bad" side effect here: if you index one bulk at a time, the bulk is always first fsynced on the primary before being sent to the replica. Further, if one thread is tasked by AsyncIOProcessor to drain the processing queue and fsync, other threads can easily pile more bulk requests on top of that thread. Things are not very fair here, and the thread might continue doing a lot more fsyncs before returning (as the other threads pile more and more on top), which blocks it from returning as a replication request (e.g. if this thread is on the primary, it blocks the replication requests to the replicas from going out, and delaying checkpoint advancement). This commit fixes all these issues, and also simplifies the code that coordinates all the after write actions.	2019-12-03 12:22:46 +01:00
Przemysław Witek	1d8e3d69d7	Make only a part of `stop()` method a critical section. (#49756 ) (#49788 )	2019-12-03 09:54:16 +01:00

1 2 3 4 5 ...

4561 Commits