OpenSearch

Commit Graph

Author	SHA1	Message	Date
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
Yannick Welsch	f6686345c9	Avoid unnecessary setup and teardown in docs tests (#51430 ) The docs tests have recently been running much slower than before (see #49753). The gist here is that with ILM/SLM we do a lot of unnecessary setup / teardown work on each test. Compounded with the slightly slower cluster state storage mechanism, this causes the tests to run much slower. In particular, on RAMDisk, docs:check is taking ES 7.4: 6:55 minutes ES master: 16:09 minutes ES with this commit: 6:52 minutes on SSD, docs:check is taking ES 7.4: ??? minutes ES master: 32:20 minutes ES with this commit: 11:21 minutes	2020-01-28 16:52:23 +01:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but by parsing CSV in the ingest pipeline it makes the Kibana file upload functionality more easily interchangeable with Filebeat such that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
Aleksandr Maus	a8bd4d08e3	Merge branch 'feature/eql_backport' into 7.x	2020-01-28 09:19:39 -05:00
Hendrik Muhs	53e4d1ef07	[Transform] fix TransformRobustnessIT intermittent test failures part 2 (#51523 ) add wait for completion in transform robustness test to avoid occasional test failures during cleanup fixes #51347	2020-01-28 13:37:01 +01:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123) (#51510) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores. When reading a keystore, a user is prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase. Passphrase can be set to be empty when changing/setting it for an existing keystore. Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool.
There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. 
This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. 
* Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file for what gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null. * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail.
It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
Hendrik Muhs	2239ba8c6e	[Transform] avoid mapping problems with index templates (#51368) (#51519) insert explicit mappings for objects in nested output to avoid clashes with index templates fixes #51321	2020-01-28 11:31:07 +01:00
Hendrik Muhs	61663b495e	add an integration test using date_nanos as timestamp (#51477 ) add a test for using date_nanos as timestamp field in a continuous transform	2020-01-28 10:10:23 +01:00
Hendrik Muhs	bebce4b190	audit index creation after the index has been created (#51479) moves audit message for index creation after the index has been successfully created. This has been confusing for a user where index creation failed but audit reported index creation.	2020-01-28 10:06:46 +01:00
Ioannis Kakavas	4f3548fbd7	Disable diagnostic trust manager in tests (#51501 ) This commit sets `xpack.security.ssl.diagnose.trust` to false in all of our tests when running in FIPS 140 mode and when settings objects are used to create an instance of the SSLService. This is needed in 7.x because setting xpack.security.ssl.diagnose.trust to true wraps SunJSSE TrustManager with our own DiagnosticTrustManager and this is not allowed when SunJSSE is in FIPS mode. An alternative would be to set xpack.security.fips.enabled to true which would also implicitly disable xpack.security.ssl.diagnose.trust but would have additional effects (would require that we set PBKDF2 for password hashing algorithm in all test clusters, would prohibit using JKS keystores in nodes even if relevant tests have been muted in FIPS mode etc.) Relates: #49900 Resolves: #51268	2020-01-28 10:17:35 +02:00
Przemko Robakowski	919083decd	Don't overwrite target field with SetSecurityUserProcessor (#51454) (#51506) * Don't overwrite target field with SetSecurityUserProcessor This change fixes a problem with `SetSecurityUserProcessor`, which was overwriting the whole target field and not only the fields actually filled by the processor. Closes #51428 * Unused imports removed	2020-01-28 02:12:09 +01:00
Jason Tedor	92b611ece1	Formalize build snapshot (#51484 ) Today we are repeatedly checking if the current build is a snapshot build or not by reading the system property build.snapshot. This commit formalizes this by adding a build parameter to indicate whether or not the current build is a snapshot build.	2020-01-27 16:56:31 -05:00
Aleksandr Maus	eb1ed2a35f	Compilation fixes for 7.x	2020-01-27 16:23:36 -05:00
Aleksandr Maus	d8f1735e39	Add xpack.eql.enabled feature flag, disabled by default. Enabled only for integration tests. (#51370 ) Related to https://github.com/elastic/elasticsearch/issues/49581	2020-01-27 15:15:22 -05:00
Costin Leau	d049de5b72	EQL: import QL into EQL (#50904 ) Link QL into the new build file Remove duplicate classes and use the new ql package Update Exception hierarchy on top of QlException	2020-01-27 15:13:22 -05:00
Igor Motov	c184411456	EQL: Replace EqlSearchResponse.Hits parser with ObjectParser (#50925 ) Replaces the existing hand-build Hits parser with a ConstructingObjectParser version. Relates to #49581	2020-01-27 15:13:09 -05:00
Igor Motov	88cc30c0d8	EQL: Remove list classes from EqlSearchResponse (#50870 ) Removes unnecessary classes from EqlSearchResponse that just represent lists of other elements. Relates to #49581	2020-01-27 15:13:00 -05:00
Aleksandr Maus	d715176c00	Add more Eql REST API validation integration tests, clean up request implementation (#50822 )	2020-01-27 15:12:48 -05:00
Igor Motov	628083183f	EQL: Make EqlSearchResponse immutable (#50810 ) Refactors EqlSearchResponse to make it immutable Relates to #49581	2020-01-27 15:12:07 -05:00
Aleksandr Maus	31d2d01e25	Correct search_after handling (#50629 )	2020-01-27 15:11:51 -05:00
Aleksandr Maus	79875ce4d9	Initial EQL rest API implementation (#49768 )	2020-01-27 15:11:41 -05:00
Costin Leau	10a16d15d1	Add draft EQL grammar and expression tree	2020-01-27 15:11:18 -05:00
Costin Leau	e22f501018	QL: Backport project to 7.x (#51497 ) * Introduce reusable QL plugin for SQL and EQL (#50815) Extract reusable functionality from SQL into its own dedicated project QL. Implemented as a plugin, it provides common components across SQL and the upcoming EQL. While this commit is fairly large, for the most part it's just a big file move from sql package to the newly introduced ql. (cherry picked from commit ec1ac0d463bfa12a02c8174afbcdd6984345e8b4) * SQL: Fix incomplete registration of geo NamedWritables (cherry picked from commit e295763686f9592976e551e504fdad1d2a3a566d) * QL: Extend NodeSubclass to read classes from jars (#50866) As the test classes are spread across more than one project, the Gradle classpath contains not just folders but also jars. This commit allows the test class to explore the archive content and load matching classes from said source. (cherry picked from commit 25ad74928afcbf286dc58f7d430491b0af662f04) * QL: Remove implicit conversion inside Literal (#50962) Literal constructor makes an implicit conversion for each value given which turns out has some subtle side-effects. 
Improve MathProcessors to preserve numeric type where possible Fix bug on issue compatibility between date and intervals Preserve the source when folding inside the Optimizer (cherry picked from commit 9b73e225b0aa07a23859550fb117bae571a2b672) * QL: Refactor DataType for pluggability (#51328) Change DataType from enum to class Break DataType enums into QL (default) and SQL types Make data type conversion pluggable so that new types can be introduced As part of the process: - static type conversion in QL package (such as Literal) has been removed - several utility classes have been broken into base (QL) and extended (SQL) parts based on type awareness - operators (+,-,/,) are - due to extensibility, serialization of arithmetic operation has been slightly changed and pushed down to the operator executor itself (cherry picked from commit aebda81b30e1563b877a8896309fd50633e0b663) Compilation fixes for 7.x	2020-01-27 22:03:58 +02:00
Ryan Ernst	6ee1baf2ed	Migrate cron eval bats test to java (#50940 ) (#51007 ) This commit migrates the simple test of the cron eval tool from bats to java packaging tests. relates #46005	2020-01-27 10:49:01 -08:00
Nik Everett	4ff314a9d5	Begin moving date_histogram to offset rounding (take two) (#51271 ) (#51495 ) We added a new rounding in #50609 that handles offsets to the start and end of the rounding so that we could support `offset` in the `composite` aggregation. This starts moving `date_histogram` to that new offset. This is a redo of #50873 with more integration tests. This reverts commit d114c9db3e1d1a766f9f48f846eed0466125ce83.	2020-01-27 13:40:54 -05:00
David Roberts	3c223ceea1	[ML] Fix 2 digit year regex in find_file_structure (#51469 ) The DATE and DATESTAMP Grok patterns match 2 digit years as well as 4 digit years. The pattern determination in find_file_structure worked correctly in this case, but the regex used to create a multi-line start pattern was assuming a 4 digit year. Also, the quick rule-out patterns did not always correctly consider 2 digit years, meaning that detection was inconsistent. This change fixes both problems, and also extends the tests for DATE and DATESTAMP to check both 2 and 4 digit years.	2020-01-27 17:23:18 +00:00
Benjamin Trent	8559ff7cee	[ML][Inference] fixing pattern compilation + unnecessary string copy (#51483 ) (#51487 )	2020-01-27 12:12:34 -05:00
Martijn van Groningen	8b851bfc33	Removed more unchecked suppress warnings. See #48381	2020-01-27 14:51:49 +01:00
Martijn van Groningen	716904fab7	Unmuted test with more logging and removed unchecked suppress warnings. See #48381	2020-01-27 14:10:43 +01:00
Hendrik Muhs	b233e93014	[Transform] refactor naming leftovers and apply code formatting (#51465) (#51470) refactor renaming leftovers: "data frame transform" to "transforms", touch only internals (variable names, non-public APIs, doc strings, ...) and apply code-formatting (spotless). No logical changes.	2020-01-27 14:04:57 +01:00
Andrei Dan	977cce002e	Preserve slm-history-ilm-policy between test runs (#51442 ) (#51468 ) (cherry picked from commit 4e95c8a94fa700d44ac31ef17547512748ab1885) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-27 10:40:40 +00:00
Andrei Dan	d872db278a	Fix TimeSeriesLifecycleActionsIT.testShrinkAction (#51431 ) (#51467 ) * Fix TimeSeriesLifecycleActionsIT.testShrinkAction Shrinking a 6 shard index to 3 shards can be quite time consuming and assertBusy probes the conditions at exponentially growing intervals. This separates the one assertion that was used for all the conditions into multiple assertBusy statements and increases the timeout for waiting for the shrink to complete. * Allow more time for shrink to complete This commit allows more time for the shrink operation to complete in testRetryFailedShrinkAction (separating the assertBusy calls too) and testMoveToRolloverStep. * Shrink to no more than 2 shards in tests (cherry picked from commit 5fe780148fa3536915d61475b087896a5b9ace82) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-27 10:40:29 +00:00
Martijn van Groningen	d289c1d5f1	Wrong bug url in @AwaitsFix See #48381	2020-01-27 10:38:03 +01:00
Martijn van Groningen	e253b7e73d	Retry response exceptions in the test. Relates to #30777	2020-01-27 10:32:38 +01:00
Martijn van Groningen	7e0f73e035	Muted watcher bwc restart test #30777	2020-01-27 10:32:37 +01:00
Ioannis Kakavas	ee202a642f	Enable tests in FIPS 140 in JDK 11 (#49485 ) This change changes the way to run our test suites in JVMs configured in FIPS 140 approved mode. It does so by: - Configuring any given runtime Java in FIPS mode with the bundled policy and security properties files, setting the system properties java.security.properties and java.security.policy with the == operator that overrides the default JVM properties and policy. - When runtime java is 11 and higher, using BouncyCastle FIPS Cryptographic provider and BCJSSE in FIPS mode. These are used as testRuntime dependencies for unit tests and internal clusters, and copied (relevant jars) explicitly to the lib directory for testclusters used in REST tests - When runtime java is 8, using BouncyCastle FIPS Cryptographic provider and SunJSSE in FIPS mode. Running the tests in FIPS 140 approved mode doesn't require an additional configuration either in CI workers or locally and is controlled by specifying -Dtests.fips.enabled=true	2020-01-27 11:14:52 +02:00
Przemysław Witek	dd3e2f1e18	[7.x] Update quantiles document in the index the document belongs to (#51135 ) (#51415 )	2020-01-27 10:13:02 +01:00
Przemko Robakowski	fbec19c022	Centralize mocks initialization in ILM steps tests (#51384) (#51453) * Centralize mocks initialization in ILM steps tests This change centralizes initialization of `Client`, `AdminClient` and `IndicesAdminClient` for all classes extending `AbstractStepTestCase`. This removes a lot of code duplication and makes it easier to write tests. This also removes the need for `AsyncActionStep#setClient` * Unused imports removed * Added missed tests * Fix OpenFollowerIndexStepTests	2020-01-25 01:19:55 +01:00
Lee Hinman	8560847dd9	[7.x] Check all snapshots in SnapshotLifecycleRestIT.testFullP… (#51448 ) * Check all snapshots in SnapshotLifecycleRestIT.testFullPolicy Rather than check the first returned snapshot for a snapshot starting with `snap-` in SnapshotLifecycleRestIT.testFullPolicy, this commit changes the test to find any snapshots starting with `snap-`. In the event that there are no snapshots (the failure case), this also exposes the full results map so we can diagnose why a failure occurred. Relates to #50358 * Use a more imperative style for checking	2020-01-24 14:30:42 -07:00
Lee Hinman	bdb8b6aa0d	[7.x] Separate aliases used for tests in TimeSeriesLifecycleAc… (#51432 ) * Separate aliases used for tests in TimeSeriesLifecycleActionsIT This is related to #51375 and hopes to help illuminate why some of those tests are failing. This commit switches the aliases used in the test to use a random alias name every time (since there were some complaints in the tests about aliases having more than one write index). With this we hope to determine the actual cause of the failure in the test. This also adds additional information to the exception returned when calling move-to-step with the incorrect current step. * Fix rest test	2020-01-24 11:05:19 -07:00
Benjamin Trent	bf53ca3380	[7.x] [ML] Add _cat/ml/anomaly_detectors API (#51364 ) (#51408 ) [ML] Add _cat/ml/anomaly_detectors API (#51364)	2020-01-24 11:54:22 -05:00
Benjamin Trent	fc994d9ce1	[ML][Inference] Adds validations for model PUT (#51376 ) (#51409 ) Adds validations making sure that * `input.field_names` is not empty * `ensemble.trained_models` is not empty * `tree.feature_names` is not empty closes https://github.com/elastic/elasticsearch/issues/51354	2020-01-24 09:29:12 -05:00
Hendrik Muhs	d177747f66	fix TransformRobustnessIT intermittent test failures ensure the cluster is not in some intermediate state when cleaning up. fixes #51347	2020-01-24 15:19:11 +01:00
Martijn van Groningen	36b460060c	Unmuted watcher security smoke tests on 7 dot x branch. Also removed the usage of types in watcher's index action and added more logging in case this test fails again. Relates to #30777	2020-01-24 14:51:07 +01:00
Martijn van Groningen	7af0474101	Add more logging when failing watch history entry fails. (#50931 ) Relates to #30777	2020-01-24 14:49:57 +01:00
Benjamin Trent	76660a5a4f	[7.x] [ML][Inference] add tags url param to GET (#51330 ) (#51404 ) * [ML][Inference] add tags url param to GET (#51330) Adds a new URL parameter, `tags` to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those who contain all the provided tags.	2020-01-24 08:26:58 -05:00
Martijn van Groningen	d3078c5b40	Re-enable FullClusterRestartIT#testWatcher test (#50463 ) Previously this test failed waiting for yellow: https://gradle-enterprise.elastic.co/s/fv55holsa36tg/console-log#L2676 Oddly cluster health returned red status, but there were no unassigned, relocating or initializing shards. Placed the waiting for green in a try-catch block, so that when this fails again then cluster state gets printed. Relates to #48381	2020-01-24 14:07:09 +01:00
Martijn van Groningen	53ac28e398	Update smoke test watcher test suite with the changes in master branch. Relates to #32299	2020-01-24 14:02:55 +01:00
Hendrik Muhs	ded7407b4d	[Transform] Adapt tests for error message to 7.x format adapt messages to 7.x format (#51398) fixes #51360	2020-01-24 12:17:32 +01:00
Andrei Dan	2f7c240184	[7.x] Use ESSingleNodeTestCase instead of ESIntegTestCase (#51345 ) (#51346 ) * Use ESSingleNodeTestCase instead of ESIntegTestCase (#51345) (cherry picked from commit abcf1c41faf05a0b0196fb06e57c3de8c3d67688) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-24 10:53:37 +00:00
Przemysław Witek	8703b885c2	Move TupleMatchers class to org.elasticsearch.test.hamcrest package (#51359 ) (#51395 )	2020-01-24 11:10:54 +01:00
Tim Vernum	0981a469ae	Preserve ApiKey credentials for async verification (#51389) The ApiKeyService would aggressively "close" ApiKeyCredentials objects during processing. However, under rare circumstances, the verification of the secret key would be performed asynchronously and may need access to the SecureString after it had been closed by the caller. The trigger for this would be if the cache already held a Future for that ApiKey, but the future was not yet complete. In this case the verification of the secret key would take place asynchronously on the generic thread pool. This commit moves the "close" of the credentials to the body of the listener so that it only occurs after key verification is complete. Backport of: #51244	2020-01-24 19:35:07 +11:00
Hendrik Muhs	d46e8c3f7f	[Transform] disallow dotted fieldnames (#51369 ) adds field validation to disallow output field names starting and/or ending with a '.'. Avoids indexing/mapping problems when starting the transform.	2020-01-24 09:05:44 +01:00
Dimitris Athanasiou	3443d69883	[7.x][ML] Rename DataFrameAnalyticsIndex to DestinationIndex (#51353) (#51356) As we prepare to introduce a new index for storing additional information about data frame analytics jobs (e.g. instrumentation), renaming this class to `DestinationIndex` better captures what it does and leaves its prior name available for a more suitable use. Backport of #51353 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-01-24 09:51:48 +02:00
Lee Hinman	31747de2a2	Fix testHashcodeAndEquals mutation for WaitForSnapshotStep (#51379 ) This fixes the test failure, it was randomness returning the same policy rather than a new one. Switched to use `randomValueOtherThan`. Resolves #51377	2020-01-23 16:19:50 -07:00
Nhat Nguyen	072203cba8	Clean soft-deletes setting in ccr tests (#51113 ) (#51372 ) We no longer need to explicitly enable soft-deletes in CCR tests. Relates #50775 Backport of #51113	2020-01-23 16:31:47 -05:00
Zachary Tong	2e314a133f	Mute TransformTaskFailedStateIT#testStartFailedTransform Tracking issue: https://github.com/elastic/elasticsearch/issues/51360	2020-01-23 12:53:02 -05:00
Zachary Tong	feb2a25761	Mute TransformTaskFailedStateIT#testForceStopFailedTransform Tracking issue: https://github.com/elastic/elasticsearch/issues/51360	2020-01-23 11:58:36 -05:00
Ioannis Kakavas	d279b462f7	Fix flaky testCreateApiKey test (#51223) (#51349) API Key expiration value has millisecond precision as we use {@link Instant#toEpochMilli()} when creating the API key document. It could often happen that the `Instant.now()` Instant in testCreateApiKey was close enough to the ApiKeyService's `clock.instant()` Instant that, when the nanos were removed from the latter (due to the call to `toEpochMilli()`), the result of comparing these two Instants was a few nanos short of 7 days. Resolves: #47958	2020-01-23 17:45:55 +02:00
Hendrik Muhs	3553f68f5a	[Transform] Handle permanent bulk indexing errors (#51307 ) check bulk indexing error for permanent problems and ensure the state goes into failed instead of retry. Corrects the stats API to show the real error and avoids excessive audit logging. fixes #50122	2020-01-23 16:17:26 +01:00
Przemko Robakowski	84664e8d60	Expose master timeout for ILM actions (#51130 ) (#51348 ) This change exposes master timeout to ILM steps through global dynamic setting. All currently implemented steps make use of this setting as well. Closes #44136	2020-01-23 15:28:13 +01:00
Nhat Nguyen	acf84b68cb	Do not wrap soft-deletes reader for segment stats (#51331 ) IndexWriter might not filter out fully deleted segments if retention leases exist or the number of the retaining operations is non-zero. SoftDeletesDirectoryReaderWrapper, however, always filters out fully deleted segments. This change uses the original directory reader when calculating segment stats instead. Relates #51192 Closes #51303	2020-01-23 08:43:06 -05:00
David Kyle	0ac03ac5e7	[ML] Add parsers for inference configuration classes (#51300 )	2020-01-22 17:03:01 +00:00
David Kyle	ca4b90a001	[ML] Calculate results and snapshot retention using latest bucket timestamps (#51061 ) (#51301 ) The retention period is calculated relative to the last bucket result or snapshot time rather than wall clock	2020-01-22 14:52:33 +00:00
Dimitris Athanasiou	59687a9384	[7.x][ML] Validate classification dependent_variable cardinality is at lea… (#51232 ) (#51309 ) Data frame analytics classification currently only supports 2 classes for the dependent variable. We were checking that the field's cardinality is not higher than 2 but we should also check it is not less than that as otherwise the process fails. Backport of #51232	2020-01-22 16:51:16 +02:00
Benjamin Trent	2a73e849d6	[ML][Inference] fixing ingest IT tests (#51267 ) (#51311 ) Converts InferenceIngestIT into a `ESRestTestCase`. closes #51201	2020-01-22 09:50:17 -05:00
David Roberts	932c63297f	[ML] Fix possible race condition when starting datafeed (#51302 ) The ID of the datafeed's associated job was being obtained frequently by looking up the datafeed task in a map that was being modified in other threads. This could lead to NPEs if the datafeed stopped running at an unexpected time. This change reduces the number of places where a datafeed's associated job ID is looked up to avoid the possibility of failures when the datafeed's task is removed from the map of running tasks during multi-step operations in other threads. Fixes #51285	2020-01-22 11:40:39 +00:00
Przemysław Witek	bfcfcdee33	[7.x] Do not copy mapping from dependent variable to prediction field in regression analysis (#51227 ) (#51288 )	2020-01-22 12:36:24 +01:00
Andrei Dan	421aa14972	ILM: Make UpdateSettingsStep retryable (#51235 ) (#51298 ) This makes the UpdateSettingsStep retryable. This step updates settings needed during the execution of ILM actions (mark indexes as read-only, change allocation configurations, mark indexing complete, etc) As the index updates are idempotent in nature (PUT requests and are applied only if the values have changed) and the settings values are seldom user-configurable (aside from the allocate action) the testing for this change goes along the lines of artificially simulating a setting update failure on a particular value update, which is followed by a successful step execution (a retry) in an environment outside of ILM (the step executions are triggered manually). (cherry picked from commit 8391b0aba469f39532bfc2796b76148167dc0289) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-22 11:02:26 +00:00
Andrei Dan	123266714b	ILM wait for active shards on rolled index in a separate step (#50718 ) (#51296 ) After we rollover the index we wait for the configured number of shards for the rolled index to become active (based on the index.write.wait_for_active_shards setting which might be present in a template, or otherwise in the default case, for the primaries to become active). This wait might be long due to disk watermarks being tripped, replicas not being able to spring to life due to cluster nodes reconfiguration and others and, the RolloverStep might not complete successfully due to this inherent transient situation, albeit the rolled index having been created. (cherry picked from commit 457a92fb4c68c55976cc3c3e2f00a053dd2eac70) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-22 11:01:52 +00:00
Hendrik Muhs	af76ae4ab9	[Transform] Add yml test suite for testing remote clusters (CCS) (#51033 ) add a test suite for remote clusters features and add test cases for transform	2020-01-22 11:19:02 +01:00
Ioannis Kakavas	a76321437c	Truncate SAML Response in trace log (#51237 ) (#51283 ) When not truncated, a long SAML response XML document can fill max line length and mask the actual exception message that the trace statement is meant to inform about. The same XML Document is also printed in full on trace level in SamlRequestHandler#parseSamlMessage() so there is no loss of information	2020-01-22 09:56:39 +02:00
Nhat Nguyen	5d4bbdcc50	Use conditional doc type in testFrozenIndexAfterRestarted	2020-01-21 12:57:58 -05:00
Nik Everett	ca15a3f5a8	Add "did you mean" to unknown queries (#51177 ) (#51254 ) This replaces the message we return for unknown queries with the standard one that we use for unknown fields from `ObjectParser`. This is nice because it includes "did you mean". One day we might convert parsing queries to using object parser, but that looks complex. This change is much smaller and seems useful.	2020-01-21 12:45:52 -05:00
Benjamin Trent	a9b2bc525e	[ML] address two edge cases for categorization.GrokPatternCreator#findBestGrokMatchFromExamples (#51168 ) (#51255 ) There are two edge cases that can be run into when example input is matched in a weird way. 1. Recursion depth could continue many many times, resulting in a HUGE runtime cost. I put a limit of 10 recursions (could be adjusted I suppose). 2. If there are no "fixed regex bits", exploring the grok space would result in a fence-post error during runtime (with assertions turned off)	2020-01-21 10:29:29 -05:00
Martijn van Groningen	6b5b26a595	Protects against NPE: 2> REPRODUCE WITH: ./gradlew ':x-pack:plugin:watcher:test' --tests "org.elasticsearch.xpack.watcher.history.HistoryTemplateTransformMappingsTests.testTransformFields" -Dtests.seed=26754396AB9C1A30 -Dtests.security.manager=true -Dtests.locale=lv-LV -Dtests.timezone=America/Dominica -Dcompiler.java=13 -Druntime.java=8 2> java.lang.NullPointerException at __randomizedtesting.SeedInfo.seed([26754396AB9C1A30:B2A3CA27E260803B]:0) at org.elasticsearch.xpack.watcher.history.HistoryTemplateTransformMappingsTests.lambda$testTransformFields$1(HistoryTemplateTransformMappingsTests.java:85) at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193) at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193) at java.util.HashMap$ValueSpliterator.forEachRemaining(HashMap.java:1628) at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:472) at java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:708) at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) at java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:499) at org.elasticsearch.xpack.watcher.history.HistoryTemplateTransformMappingsTests.lambda$testTransformFields$2(HistoryTemplateTransformMappingsTests.java:88) at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:892) at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:877) at org.elasticsearch.xpack.watcher.history.HistoryTemplateTransformMappingsTests.testTransformFields(HistoryTemplateTransformMappingsTests.java:74)	2020-01-21 15:42:22 +01:00
Nik Everett	788836ea3f	Revert "Begin moving date_histogram to offset rounding (backport of #50873 ) (#50978 )" (#51239 ) This reverts commit `9a3d4db840`. It was subtly broken in ways we didn't have tests for.	2020-01-21 08:50:02 -05:00
David Roberts	0fa7db9a95	[ML] Make datafeeds work with nanosecond time fields (#51180 ) Allows ML datafeeds to work with time fields that have the "date_nanos" type _and make use of the extra precision_. (Previously datafeeds only worked with time fields that were exact multiples of milliseconds. So datafeeds would work with "date_nanos" only if the extra precision over "date" was not used.) Relates #49889	2020-01-21 09:59:50 +00:00
Nhat Nguyen	43ed244a04	Account soft-deletes in FrozenEngine (#51192 ) (#51229 ) Currently, we do not exclude soft-deleted documents when opening index reader in the FrozenEngine. Backport of #51192	2020-01-20 17:07:29 -05:00
Adrien Grand	1a73d8329c	Disable xpack/15_basic/Usage stats for mappings. Relates #51127	2020-01-20 18:05:26 +01:00
Andrei Stefan	2908b7e5fc	SQL: add support for passing query parameters in REST API calls (#51029 ) (#51222 ) * REST PreparedStatement-like query parameters are now supported in the form of an array of non-object, non-array values where ES SQL parser will try to infer the data type of the value being passed as parameter. (cherry picked from commit 45b8bf619aecb1c03d7bc0cf06928dcc36005a66)	2020-01-20 16:40:19 +02:00
Andrei Stefan	543cc85b78	Add trace logging for responses coming from server (#50530 ) (#51221 ) (cherry picked from commit 38eb485deffa175c7eb0b55a42a3e309f8a9802d)	2020-01-20 16:39:46 +02:00
Andrei Stefan	df36169220	SQL: change the way unsupported data types fields are handled (#50823 ) (#51220 ) The hierarchy of fields/sub-fields under a field that is of an unsupported data type will be marked as unsupported as well. Until this change, the behavior was to set the unsupported data type field's hierarchy as empty. Example, considering the following hierarchy of fields/sub-fields a -> b -> c -> d, if b would be of type "foo", then b, c and d will be marked as unsupported. (cherry picked from commit 7adb286c4c485b9e781f88b0a2f98cab9ec5b7e2)	2020-01-20 16:23:43 +02:00
Hendrik Muhs	51134d9738	check custom meta data to avoid NPE (#51163 ) check custom meta data to avoid NPE, fixes a problem introduced in #51072 fixes #51153	2020-01-20 13:53:42 +01:00
Tim Vernum	a0ca82422c	Mute TimeSeriesLifecycleActionsIT.waitForSnapshot (#51208 ) This test was recently un-muted, but is still failing Relates: #50781 Backport of: #51203	2020-01-20 20:19:29 +11:00
Nik Everett	977b53ab91	Fix flaky usage tracking test (#51169 ) (#51179 ) We added tracking of index feature usage in #51031 but due to some copy and paste errors the test fails on some seeds. This fixes those errors.	2020-01-17 16:53:13 -05:00
Jason Tedor	9ce4d2b901	Initial autoscaling commit (#51161 ) This commit merely adds the skeleton for the autoscaling project, adding the basics to include the autoscaling module in the default distribution, opt-in to code formatting, and a placeholder for the docs.	2020-01-17 15:31:12 -05:00
Lee Hinman	731c96b507	[7.x] Use separate policies for tests in SnapshotLifecycleRest… (#51181 ) These policies store statistics, but since stats updating is asynchronous, it's possible for the update from one test to bleed into a separate one. This change switches the tests to use separate policy ids so that their stats are tracked independently. It also relaxes the checking constraint in one of the tests. Hopefully this: Resolves #48531 Resolves #48017	2020-01-17 13:26:40 -07:00
Jay Modi	107989df3e	Introduce hidden indices (#51164 ) This change introduces a new feature for indices so that they can be hidden from wildcard expansion. The feature is referred to as hidden indices. An index can be marked hidden through the use of an index setting, `index.hidden`, at creation time. One primary use case for this feature is to have a construct that fits indices that are created by the stack that contain data used for display to the user and/or intended for querying by the user. The desire to keep them hidden is to avoid confusing users when searching all of the data they have indexed and getting results returned from indices created by the system. Hidden indices have the following properties: * API calls for all indices (empty indices array, _all, or *) will not return hidden indices by default. Wildcard expansion will not return hidden indices by default unless the wildcard pattern begins with a `.`. This behavior is similar to shell expansion of wildcards. * REST API calls can enable the expansion of wildcards to hidden indices with the `expand_wildcards` parameter. To expand wildcards to hidden indices, use the value `hidden` in conjunction with `open` and/or `closed`. * Creation of a hidden index will ignore global index templates. A global index template is one with a match-all pattern. * Index templates can make an index hidden, with the exception of a global index template. * Accessing a hidden index directly requires no additional parameters. Backport of #50452	2020-01-17 10:09:01 -07:00
Jay Modi	96e8f67425	Upgrade to the latest OWASP HTML sanitizer (#50765 ) (#51166 ) This commit upgrades the OWASP HTML sanitizer used by watcher to the latest version and also upgrades guava, which it depends on. The guava upgrade also requires the addition of a new dependency that guava itself requires as of version 27.0. The sanitizer's behavior has changed to re-write these templated values with a comment that results in this output `{<!-- -->{ctx.metadata.name}}`. This would be an issue if we attempted to sanitize the template, but the code that uses the sanitizer runs the rendered string through the sanitizer, which means that the templated values have been replaced already. Relates #50395	2020-01-17 10:00:33 -07:00
Ioannis Kakavas	4fc865e579	Don't fallback to anonymous for tokens/apikeys (#51042 ) (#51159 ) This commit changes our behavior so that when we receive a request with an invalid/expired/wrong access token or API Key we do not fallback to authenticating as the anonymous user even if anonymous access is enabled for Elasticsearch.	2020-01-17 18:56:02 +02:00
David Roberts	295665b1ea	[ML] Add audit warning for 1000 categories found early in job (#51146 ) If 1000 different category definitions are created for a job in the first 100 buckets it processes then an audit warning will now be created. (This will cause a yellow warning triangle in the ML UI's jobs list.) Such a large number of categories suggests that the field that categorization is working on is not well suited to the ML categorization functionality.	2020-01-17 16:28:45 +00:00
Przemysław Witek	da73c9104e	[ML] Fix tests randomly failing on CI (#51142 ) (#51150 )	2020-01-17 14:58:58 +01:00
Dimitris Athanasiou	b70ebdeb96	[7.x][ML] DF Analytics _explain API should skip object fields (#51115 ) (#51147 ) Object fields cannot be used as features. At the moment the _explain API includes them and, even worse, it does not error when an object field is excluded. This creates the expectation to the user that all children fields will also be excluded while it's not the case. This commit omits object fields from the _explain API and also adds an error if an object field is included or excluded. Backport of #51115	2020-01-17 14:02:59 +02:00
Przemysław Witek	b1a526d5e9	[7.x] [ML] Update DFA progress document in the index the document belongs to (#51111 ) (#51117 )	2020-01-17 08:12:54 +01:00
Hendrik Muhs	13343b15c9	[Transform] Improve force stop robustness in case of an error (#51072 ) If a transform config got lost (e.g. because the internal index disappeared) tasks could not be stopped using transform API. This change makes it possible to stop transforms without a config, meaning to remove the background task. In order to do so force must be set to true.	2020-01-17 07:42:21 +01:00
Ioannis Kakavas	d0554fd317	Fail gracefully on invalid token strings (#51014 ) (#51096 ) When we receive a request with an Authorization header that contains a Bearer token that is not generated by us or that is malformed in some way, attempting to decode it as one of our own might cause a number of exceptions that are not IOExceptions. This commit ensures that we catch and log these too and call onResponse with `null`, so that we can return 401 instead of 500. Resolves: #50497	2020-01-16 17:00:17 +02:00
Florian Kelbert	584cb0d926	[DOCS] Correctly read total hits inside watcher config (#50614 ) With elastic/elasticsearch#35848, users can now retrieve total hits as an integer when the `rest_total_hits_as_int` query parameter is `true`. This is the default value. This updates several snippet examples in the Watcher docs that used a workaround to get a total hits integer.	2020-01-16 09:43:25 -05:00
Bogdan Pintea	fb65ef3f2d	SQL: Extend the optimisations for equalities (#50792 ) (#51098 ) * Extend the optimizations for equalities This commit supplements the optimisations of equalities in conjunctions and disjunctions: * for conjunctions, the existing optimizations with ranges are extended with not-equalities and inequalities; these lead to a fast resolution, the conjunction either being evaluated to FALSE, or the non-equality conditions being dropped as superfluous; * optimisations for disjunctions are added to be applied against ranges, inequalities and not-equalities; these lead to disjunction either becoming TRUE or the equality being dropped, either as superfluous or merged into a range/inequality. * Address review notes * Fix the bug around wrongly optimizing 'a=2 OR a!=?', which only yields TRUE for same values in equality and inequality. * Var renamings, code style adjustments, comments corrections. * Address further review comments. Extend optim. - fix a few code comments; - extend the Equals OR NotEquals optimisation (a=2 OR a!=5 -> a!=5); - extend the Equals OR Range optimisation on limits equality (a=2 OR 2<=a<5 -> 2<=a<5); - in case an equality is being removed in a conjunction, the rest of possible optimisations to test is now skipped. * rename one var for better legibility - s/rmEqual/removeEquals (cherry picked from commit 62e7c6a010f10cd7893ee5c99bad8b8d2a693436)	2020-01-16 14:32:34 +01:00
Tom Veasey	32ec934b15	[7.x][ML] Assert top classes are ordered by score (#51028 ) Backport #51003.	2020-01-16 12:23:15 +00:00

4642 Commits