OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	dffcd021df	[7.x] [ML] Add bwc serialization unit test scaffold (#51889 ) (#52061 ) * [ML] Add bwc serialization unit test scaffold (#51889) Adds new `AbstractBWCSerializationTestCase` which provides easy scaffolding for BWC serialization unit tests. These are no replacement for true BWC tests (which execute actual old code). These tests do provide some good coverage for the current code when serializing to/from old versions. * removing unnecessary override for 7.series branch * adding necessary import Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-07 17:17:11 -05:00
Benjamin Trent	c6111eb90e	[ML][Inference] adding number_samples to TreeNode (#51937 ) (#52060 ) in preparation for feature importance and split information gain, adding `number_samples` field to `TreeNode` definition. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-07 17:04:58 -05:00
Julie Tibshirani	337d73a7c6	Rename MapperService#fullName to fieldType. The new name more accurately describes what the method returns.	2020-02-07 10:35:53 -08:00
Emanuele Sabellico	282e919607	SQL: [Tests] Add integ tests for selecting a literal and an aggregate (#42121 ) The related issue regarding aggregation queries where some literals are also selected together with aggregate function has been fixed with #49570. Add integration tests to verify the behavior. Relates to: #41411 (cherry picked from commit 9f414a8d05c75e1a9f8250084f6dcd634d5d78d8)	2020-02-07 19:00:15 +01:00
Albert Zaharovits	4add82d966	Mute CoreFullClusterRestartIT testRecovery (#52038 ) Relates #52031	2020-02-07 13:35:43 +02:00
David Kyle	8f10a7c6ca	[ML] Make Ensemble feature names optional (#51996 ) The featureNames field is requisite in individual models but is not required by the Ensemble.	2020-02-07 10:08:37 +00:00
Armin Braun	91e938ead8	Add Trace Logging of REST Requests (#51684 ) (#52015 ) Being able to trace log all REST requests to a node would make debugging a number of issues a lot easier.	2020-02-07 09:03:20 +01:00
Jason Tedor	25daf5f1e1	Add autoscaling API skeleton (#51564 ) The main purpose of this commit is to add a single autoscaling REST endpoint skeleton, for the purpose of starting to build out the build and testing infrastructure that will surround it. For example, rather than committing a fully-functioning autoscaling API, we introduce here the skeleton so that we can start wiring up the build and testing infrastructure, establish security roles/permissions, and so on. This way, in a forthcoming PR that introduces actual functionality, that PR will be smaller and have fewer distractions around that sort of infrastructure.	2020-02-06 21:55:01 -05:00
Andrei Stefan	488944f4a1	SQL: Handle uberjar scenario where the ES jdbc driver file is bundled in another jar (#51856 ) (#52024 ) (cherry picked from commit 6247b0793c9db19a8a9fa6f0164cc14d0debed6e)	2020-02-07 04:15:59 +02:00
Benjamin Trent	846f87a26e	[ML] allow close/stop for jobs/datafeeds with missing configs (#51888 ) (#51997 ) If the configs are removed (by some horrific means), we should still allow tasks to be cleaned up easily. Datafeeds and jobs with missing configs are now visible in their respective _stats calls and can be stopped/closed.	2020-02-06 12:10:18 -05:00
Hendrik Muhs	03fb5cdaae	fallback to float if source type is scaled_float for mapping deduction (#51990 ) fallback to float if source type is scaled_float for mapping deduction of min/max aggregation fixes #51780	2020-02-06 17:27:26 +01:00
Martijn Laarman	898dd0b9cc	Cat.ml.* introduces an additional depth to namespaced APIs (#51981 ) Not all clients support this, e.g. if the Java high level REST client were to map this, it would look like `client.cat().ml().api()`, which hinders discoverability. (cherry picked from commit 21cdabf09dc8305ce2f5e3b6cb193f67137d8bdb)	2020-02-06 13:16:59 +01:00
Jim Ferenczi	0f333c89b9	Always rewrite search shard request outside of the search thread pool (#51708 ) (#51979 ) This change ensures that the rewrite of the shard request is executed in the network thread or in the refresh listener when waiting for an active shard. This allows queries that rewrite to match_no_docs to bypass the search thread pool entirely even if the can_match phase was skipped (pre_filter_shard_size > number of shards). Coordinating nodes don't have the ability to create empty responses so this change also ensures that at least one shard creates a full empty response while the others can return null ones. This is needed since creating true empty responses on shards requires creating concrete aggregators, which would be too costly to build on a network thread. We should move this functionality to aggregation builders in a follow up but that would be a much bigger change. This change is also important for #49601 since we want to add the ability to use the result of other shards to rewrite the request of subsequent ones. For instance if the first M shards have their top N computed, the top worst document in the global queue can be passed to subsequent shards that can then rewrite to match_no_docs if they can guarantee that they don't have any document better than the provided one.	2020-02-06 10:53:11 +01:00
Lisa Cawley	53bd88ea8c	[DOCS] Adds tip for elastic built-in user (#51891 )	2020-02-05 18:56:23 -08:00
Jason Tedor	12473c2bcb	Log failure when cleaning shard follow task (#51971 ) When cleaning a shard follow task after an index has been deleted, an exception can occur submitting the complete persistent task action. However, this exception message is not logged. This commit addresses this by including the exception that led to the failure in the log message.	2020-02-05 20:48:00 -05:00
Tanguy Leroux	d86a7ad6d2	Give more time to AutoFollowIT tests (#51938 ) AutoFollowIT tests are regularly failing on CI because they rely on how cluster state updates are processed within the integration clusters. We tried to limit this in #49141 by moving to latches instead of waiting for assertions to pass but there are still some places where it still needs to wait for the cluster state updates to be processed and auto-follow stats to be updated. This commit gives more time to assertBusy() that verifies the AutoFollowStats (up to 60 seconds) and also always logs the auto-follow stats in case the assertions failed. Closes #48982	2020-02-05 15:57:27 +01:00
Costin Leau	bd6d9e063c	EQL: Add missing commit messages for #51940 * EQL: Plug query params into the AstBuilder (#51886) As the eventType is customizable, plug that into the parser based on the given request. (cherry picked from commit 5b4a3a3c07eacbc339cbd4c05a3621d056cc8d60) * EQL: Add field resolution and verification (#51872) Add basic field resolution inside the Analyzer and a basic Verifier to check for any unresolved fields. (cherry picked from commit 7087358ae2fb212811d480ec8641a46167946c82) * EQL: Introduce basic execution pipeline (#51809) The main classes that form the 'execution' pipeline are added - most of them have no functionality; the purpose of this PR is to flesh out the contract between the various moving parts so that work can start on them independently. (cherry picked from commit 9a1bae50a49af7fe8467b74b154c0d82c6bb9a19) * EQL: Add AstBuilder to convert to QL tree (#51558) * EQL: Add AstBuilder visitors * EQL: Add tests for wildcards and sets * EQL: Fix licensing * EQL: Fix ExpressionTests.java license * EQL: Cleanup imports * EQL: PR feedback and remove LiteralBuilder * EQL: Split off logical plan from expressions * EQL: Remove stray import * EQL: Add predicate handling for set checks * EQL: Remove commented out dead code * EQL: Remove wildcard test, wait until analyzer (cherry picked from commit a462700f9c8e1fb977d62d42eb0077403b8fa98b) * EQL grammar updates and tests (#49658) * EQL: Additional tests and grammar updates * EQL: Add backtick escaped identifiers * EQL: Adding keywords to language * EQL: Add checks for unsupported syntax * EQL: Testing updates and PR feedback * EQL: Add string escapes * EQL: Cleanup grammar for identifier * EQL: Remove tabs from .eql tests (cherry picked from commit 6f1890bf2d52cabdfd1e7848fb481cf54b895f25)	2020-02-05 16:53:42 +02:00
Costin Leau	6ff0e411a8	EQL: backport updates to 7.x (#51940 )	2020-02-05 16:45:58 +02:00
Benjamin Trent	79f143907a	[7.x] [ML] add _cat/ml/trained_models API (#51529 ) (#51936 ) * [ML] add _cat/ml/trained_models API (#51529) This adds _cat/ml/trained_models.	2020-02-05 08:26:44 -05:00
Marios Trivyzas	64f9a2089b	SQL: [Tests] add tests for literals and GROUP BY (#51878 ) Add unit and integration tests where literals are SELECTed in combination with GROUP BY and possibly aggregate functions. Relates to #41411 and #34583 which have been fixed. (cherry picked from commit b97f1ca12675d6ea4772c60578922fe1cc2409ee)	2020-02-05 12:55:56 +01:00
Ignacio Vera	ababd730f6	Histogram field: Use #name() instead of #simpleName() when generating doc values (#51920 ) (#51927 )	2020-02-05 12:35:49 +01:00
Yannick Welsch	60c93b6df5	Increase scroll timeout for upgrade test (#51912 ) Bumps the timeout already bumped in #50195, which was insufficient.	2020-02-05 11:13:58 +01:00
Adrien Grand	ad9d2f1922	Move analysis/mappings stats to cluster-stats. (#51875 ) Closes #51138	2020-02-05 11:02:25 +01:00
debadair	c0156cbb5d	Backporting updates to ILM org, overview, & GS (#51898 ) * [DOCS] Align with ILM API docs (#48705) * [DOCS] Reconciled with Snapshot/Restore reorg * [DOCS] Split off ILM overview to a separate topic. (#51287) * [DOCS] Split off overview to a separate topic. * [DOCS] Incorporated feedback from @jrodewig. * [DOCS] Edit ILM GS tutorial (#51513) * [DOCS] Edit ILM GS tutorial * [DOCS] Incorporated review feedback from @andreidan. * [DOCS] Removed test link & fixed anchor & title. * Update docs/reference/ilm/getting-started-ilm.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Fixed glossary merge error. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-02-04 16:45:18 -08:00
Lee Hinman	0be61a3662	[7.x] Adding best_compression (#49974 ) (763480ee) (#51819 ) * Adding best_compression (#49974) This commit adds a `codec` parameter to the ILM `forcemerge` action. When setting the codec to `best_compression` ILM will close the index, then update the codec setting, re-open the index, and finally perform a force merge. * Fix ForceMergeAction toSteps construction (#51825) There was a duplicate force merge step and the test continued to fail. This commit clarifies the `toStep` method and changes the `assertBestCompression` method for better readability. Resolves #51822 * Update version constants Co-authored-by: Sivagurunathan Velayutham <sivadeva.93@gmail.com>	2020-02-04 14:15:43 -07:00
Julie Tibshirani	38ce428831	Create a class to hold field capabilities for one index. (#51844 ) Currently, the same class `FieldCapabilities` is used both to represent the capabilities for one index, and also the merged capabilities across indices. To help clarify the logic, this PR proposes to create a separate class `IndexFieldCapabilities` for the capabilities in one index. The refactor will also help when adding `source_path` information in #49264, since the merged source path field will have a different structure from the field for a single index. Individual changes: * Add a new class IndexFieldCapabilities. * Remove extra constructor from FieldCapabilities. * Combine the add and merge methods in FieldCapabilities.Builder.	2020-02-04 11:24:57 -08:00
Hendrik Muhs	b7aace44f3	mark transform API's stable (#51862 ) mark transform API's stable, meaning making transform GA for the next minor release	2020-02-04 16:13:47 +01:00
David Roberts	9d55c45b5a	[ML] Improve multiline_start_pattern for CSV in find_file_structure (#51737 ) The work to switch file upload over to treating delimited files like semi-structured text and using the ingest pipeline for CSV parsing makes the multi-line start pattern used for delimited files much more critical than it used to be. Previously it was always based on the time field, even if that was towards the end of the columns, and no multi-line pattern was created if no timestamp was detected. This change improves the multi-line start pattern by: 1. Never creating a multi-line pattern if the sample contained only single line records. This improves the import efficiency in a common case. 2. Choosing the leftmost field that has a well-defined pattern, whether that be the time field or a boolean/numeric field. This reduces the risk of a field with newlines occurring earlier, and also means the algorithm doesn't automatically fail for data without a timestamp.	2020-02-04 12:37:48 +00:00
Hendrik Muhs	c2b08bb72f	[Transform] add support for percentile aggs (#51808 ) make transform ready for multi value aggregations and add support for percentile fixes #51663	2020-02-04 12:02:20 +01:00
Hendrik Muhs	5d5f3ce256	[Transform] improve irrecoverable error detection treat resource not found and illegal argument exceptions as irrecoverable error relates #50135	2020-02-04 10:36:35 +01:00
Benjamin Trent	d293980a09	[7.x] [ML] add GET _cat/ml/datafeeds (#51500 ) (#51829 ) * [ML] add GET _cat/ml/datafeeds (#51500) This adds GET _cat/ml/datafeeds && _cat/ml/datafeeds/{datafeed_id} * fixing for java8 compilation	2020-02-03 17:16:33 -05:00
Jonathan Budzenski	8fa4a40bdf	[rest spec] fill in documentation links for security.{put,delete}_privileges (#48482 ) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-03 10:53:50 -06:00
Jochem Wichers Hoeth	8aaca45922	Fix section header in Get API key information API doc (#51807 )	2020-02-03 07:36:18 -08:00
James Rodewig	4ea7297e1e	[DOCS] Change http://elastic.co -> https (#48479 ) (#51812 ) Co-authored-by: Jonathan Budzenski <jon@budzenski.me>	2020-02-03 09:50:11 -05:00
Dan Hermann	4083eae0b7	[7.x] Secure password for monitoring HTTP exporter (#51775 ) Adds a secure and reloadable SECURE_AUTH_PASSWORD setting to allow keystore entries in the form "xpack.monitoring.exporters.*.auth.secure_password" to securely supply passwords for monitoring HTTP exporters. Also deprecates the insecure `AUTH_PASSWORD` setting.	2020-02-03 07:42:30 -06:00
Andrei Dan	81388051d8	Reenable testWhenUserLimitedByOnlyAliasOfIndexCanWriteToIndexWhichWasRolledoverByILMPolicy (#51768 ) (#51801 ) We suspect the flakiness could’ve come from the fact that the rollover step used to create the new index and roll the write alias to the new index in separate cluster state updates. So the assertion that the rolled index exists could’ve passed in the test but, before the alias was rolled over to the new index, the subsequent write we execute in the test (namely `indexDocs("test_user", "x-pack-test-password", "foo_alias", 1)`) would’ve sent the new document to the source index (ie. foo-logs-000001) This would see the source index containing 3 documents and the rolled index (foo-logs-000002) 0 documents. However, we fixed this and the rollover step executes the “create index and roll alias” in one single cluster update, so this situation should not occur anymore. (cherry picked from commit 834261c4fe7dd93f437eeec43c00d01ff2279f86) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-02-03 11:54:00 +00:00
David Roberts	d5d8fb26fa	[TEST] Remove obsolete test trace logging from NetworkDisruptionIT (#51746 ) The issue this logging was added to fix (#49908) was closed in December and the problem has not recurred so this logging is no longer needed.	2020-02-03 11:25:53 +00:00
Karel Minarik	050c4d4c89	Fixes for the REST specification (#51791 ) * REST: Test: Fix the `accept_enterprise` parameter for Get License API (#51527) The Get License API specifies the `accept_enterprise` parameter as a `boolean`: `0ca5cb8cb6/x-pack/plugin/src/test/resources/rest-api-spec/api/license.get.json (L22-L27)` In the test, a `string` is passed however, which makes the test compilation fail in the Go client. (cherry picked from commit e2a2169b3d44592057c143253bb56375ed3e4268) * Fix the SQL API documentation in REST specification (#51534) This patch fixes the SQL REST API documentation to conform to the current schema. (cherry picked from commit c8b6a849852699883086a6ada42279f2f68d7e07) * Fix the "slices" parameter for the Delete By Query API in the REST specification (#51535) This patch updates the `type` parameter in the Delete By Query API: according to [the documentation](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-delete-by-query.html#docs-delete-by-query-slice), it can be set to "auto", but the type in the documentation allows only numerical values. This prevents people from setting the parameter to "auto" eg. in the Go client, which generates source from the specification, and sets the corresponding Go type as number. The patch uses the `\|` notation, which we have discussed previously for encoding a "polymorphic" parameter like this. Related: https://github.com/elastic/go-elasticsearch/issues/77 * Fix the Enrich API documentation in REST specification (#51528) This patch fixes the REST API documentation for the Enrich APIs to conform to the current schema. (cherry picked from commit 59f28f4f2feeba3f6d2f0b632410577eacb28121)	2020-02-02 15:28:08 +01:00
Hendrik Muhs	ed170cc548	[Transform] Fix stats can return old state information if security is enabled (#51732 ) (#51738 ) do index refresh of the internal transform index with the system user instead of using the calling user which does not have sufficient rights if security is enabled fixes #51728	2020-02-01 19:34:58 +01:00
Ryan Ernst	21224caeaf	Remove comparison to true for booleans (#51723 ) While we use `== false` as a more visible form of boolean negation (instead of `!`), the true case is implied and the true value does not need to explicitly checked. This commit converts cases that have slipped into the code checking for `== true`.	2020-01-31 16:35:43 -08:00
Lee Hinman	4594a210bf	[7.x] Fix SnapshotLifecycleRestIT.testFullPolicySnapshot (#517… (#51778 ) * Fix SnapshotLifecycleRestIT.testFullPolicySnapshot This previously was missing some key information in the output of the failure. This captures that information and adds logging at each step so we can determine the cause if it fails again. Resolves #50358	2020-01-31 15:38:28 -07:00
Aleksandr Maus	d4f6f38150	EQL: Fix #51541 : [CI] unknown setting [xpack.eql.enabled] in release-tests (#51699 ) (#51770 ) Fixes #51541 Co-authored-by: Igor Motov <igor@motovs.org>	2020-01-31 15:14:27 -05:00
Dimitris Athanasiou	55b5c8f703	[7.x][ML] Remove index.unassigned.node_left.delayed_timeout setting from M… (#51740 ) (#51764 ) This setting was introduced with the purpose of reducing the time taken by tests that shut nodes down, such as `MlDistributedFailureIT` and `NetworkDisruptionIT`. However, it is unfortunate to have to set the value to an explicit value in production. In addition, and most importantly, dynamically choosing the value for this setting makes it impossible to adopt static index template configs that we register via `IndexTemplateRegistry`, which we need to use in order to start registering ILM policies for the ML indices. This commit removes this setting from our templates. I ran the tests a few times and could not see execution time differing significantly. Backport of #51740	2020-01-31 20:28:29 +02:00
Andrei Dan	5ca51562ec	Fix testThatNonExistingTemplatesAreAddedImmediately (#51668 ) (#51752 ) This addresses another race condition that could yield this test flaky. (cherry picked from commit d20d90aceb2b687239654d6f013f61f7f4cc1512) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-31 18:18:00 +00:00
Andrei Dan	20f47b14b0	Fix SnapshotLifecycleServiceTests.testPolicyCRUD (#51653 ) (#51755 ) (cherry picked from commit 8f9a87fa576a8a1c6ea3efb29bf1296d50d89ace) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-01-31 18:17:38 +00:00
Przemko Robakowski	227621dd13	Change index.lifecycle.step.master_timeout to indices.lifecycle.step.master_timeout (#51744 ) (#51761 ) * Change index.lifecycle.step.master_timeout to indices.lifecycle.step.master_timeout This changes setting name from `index.lifecycle.step.master_timeout` to `indices.lifecycle.step.master_timeout` to avoid confusion about its scope. `index.*` settings are recognized as index level settings, this one is node level. Relates to #51698	2020-01-31 18:56:50 +01:00
Lee Hinman	deefc85d60	[7.x] Stop policy on last PhaseCompleteStep instead of Termina… (#51758 ) Currently when an ILM policy finishes its execution, the index moves into the `TerminalPolicyStep`, denoted by a completed/completed/completed phase/action/step lifecycle execution state. This commit changes the behavior so that the index lifecycle execution state halts at the last configured phase's `PhaseCompleteStep`, so for instance, if an index were configured with a policy containing a `hot` and `cold` phase, the index would stop at the `cold/complete/complete` `PhaseCompleteStep`. This allows an ILM user to update the policy to add any later phases and have indices configured to use that policy pick up execution at the newly added "later" phase. For example, if a `delete` phase were added to the policy specified above, the index would then move from `cold/complete/complete` into the `delete` phase. Relates to #48431	2020-01-31 10:36:41 -07:00
Mayya Sharipova	42b885f050	Upgrade to lucene-8.5.0-snapshot-3333ce7da6d (#51749 ) Backport for #51327	2020-01-31 11:20:15 -05:00
Benjamin Trent	e372854d43	[ML][Inference] Fix model pagination with models as resources (#51573 ) (#51736 ) This adds logic to handle paging problems when the ID pattern + tags reference models stored as resources. Most of the complexity comes from the issue where a model stored as a resource could be at the start, or the end of a page or when we are on the last page.	2020-01-31 07:52:19 -05:00
Yang Wang	77b00fc0c0	Add warnings for invalid realm order config (#51195 ) (#51515 ) The changes are to help users prepare for migration to the next major release (v8.0.0) regarding the breaking change to realm order config. Warnings are added for when: * A realm does not have an order config * Multiple realms have the same order config The warning messages are added to both the deprecation API and logging. The main reasons for doing this are: 1) there is currently no automatic relay between the two; 2) the deprecation API is under basic and we need logging for OSS.	2020-01-31 12:32:37 +11:00
Gordon Brown	10c8179351	Use exclusions list instead of fake system indices (#51586 ) This commit switches the strategy for managing dot-prefixed indices that should be hidden indices from using "fake" system indices to an explicit exclusions list that must be updated when those indices are converted to hidden indices.	2020-01-30 16:31:27 -07:00
Mark Vieira	8d2370bf00	Always use bundled JDK for external cluster nodes when BWC testing (#51505 ) (#51701 )	2020-01-30 14:35:43 -08:00
Bogdan Pintea	f1173aaa48	SQL: Add optimisations for not-equalities (#51088 ) (#51700 ) * Optimize not-equalities in con-/disjunctions This commit adds optimisations of not-equalities in conjunctions and disjunctions: * for conjunctions, the not-equality can be optimized away when applied together with a range or inequality, in case the not-equality point falls outside the domain of the latter condition; if it's on the border, it will modify the bound, to simply exclude the equality, if present; otherwise no optimisation can be applied; * for disjunctions, the not-equals could filter away the ranges and inequalities, unless these include an equality on the bound, in which case the entire condition becomes always true, but this would influence the score() function, so it's been omitted; * fix aggregations of inequalities in ranges This commit fixes the loop that aggregates inequalities into ranges: - it won't advance the outer loop index in case of a merge, since the current element is removed; - it will break the inner loop, since comparison against the element selected in the outer loop can't continue, as it had been removed. (cherry picked from commit 789724ac2cc726de603849b4eeb8194da7528bcc)	2020-01-30 23:29:39 +01:00
Lee Hinman	b9faa0733d	[7.x] Rename ILM history index enablement setting (#51698 ) (#51705 ) * Rename ILM history index enablement setting The previous setting was `index.lifecycle.history_index_enabled`, this commit changes it to `indices.lifecycle.history_index_enabled` to indicate this is not an index-level setting (it's node level).	2020-01-30 15:27:44 -07:00
Benjamin Trent	1380dd439a	[7.x] [ML][Inference] Fix weighted mode definition (#51648 ) (#51695 ) * [ML][Inference] Fix weighted mode definition (#51648) Weighted mode inaccurately assumed that the "max value" of the input values would be the maximum class value. This does not make sense. Weighted Mode should know how many classes there are. Hence the new parameter `num_classes`. This indicates the maximum class value to be expected.	2020-01-30 15:33:25 -05:00
Nhat Nguyen	1cba5d7c4b	Force flush in FrozenEngine#testSearchers (#51635 ) We need to force flush to make the last commit safe; otherwise, we might fail to open FrozenEngine. Note that we force flush before closing a shard. Closes #51620	2020-01-30 14:48:45 -05:00
Benjamin Trent	2a2a0941af	[ML][Inference] stream inflate to parser + throw when byte limit is reached (#51644 ) (#51679 ) Three fixes for when the `compressed_definition` is utilized on PUT * Update the inflate byte limit to be the minimum of 10% of the max heap, or 1GB (what it was previously) * Stream data directly to the JSON parser, so if it is invalid, we don't have to inflate the whole stream to find out * Throw when the maximum bytes are reached, indicating that is why the request was rejected	2020-01-30 10:16:14 -05:00
Marios Trivyzas	f373020349	SQL: Fix ORDER BY YEAR() function (#51562 ) Previously, if YEAR() was used as an ORDER BY argument without being wrapped with another scalar (e.g. YEAR(birth_date) + 10), no script ordering was used; the underlying field (e.g. birth_date) was used instead as a performance optimisation. This works correctly if YEAR() is the only ORDER BY arg, but if further args are used as tie breakers for the ordering, wrong results are produced. This is because 2 rows with different birth_date values but in the same year are not tied, as the underlying ordering is on birth_date and not on YEAR(birth_date), and the following ORDER BY args are ignored. Remove this optimisation for YEAR() to avoid incorrect results in such cases. As a consequence another bug is revealed: scalar functions on top of nested fields produce scripted sorting/filtering which is not yet supported. In such cases no error was thrown but instead all values for such nested fields were null and were passed to the script implementing the sorting/filtering, producing incorrect results. Detect such cases and throw a validation exception. Fixes: #51224 (cherry picked from commit f41efd6753dc3650a7eabb3e07b02b3b32c5704c)	2020-01-30 15:29:36 +01:00
Martijn van Groningen	f7e2082378	Backport: unmute rolling upgrade watcher tests and (#51664 ) set watcher logger to debug level. These tests haven't run in such a long time, we first need to get a better picture how/if these tests fail today. Backport of #51478 See #33185	2020-01-30 14:01:30 +01:00
Marios Trivyzas	285a167c34	SQL: Verify Full-Text Search functions not allowed in SELECT (#51568 ) Add a verification that full-text search functions `MATCH()` and `QUERY()` are not allowed in the SELECT clause, so that a nice error message is returned to the user early instead of an "ugly" exception. Fixes: #47446	2020-01-30 13:14:38 +01:00
Albert Zaharovits	f25b6cc2eb	Add new 'maintenance' index privilege #50643 This commit creates a new index privilege named `maintenance`. The privilege grants the following actions: `refresh`, `flush` (also synced-`flush`), and `force-merge`. Previously the actions were only under the `manage` privilege which in some situations was too permissive. Co-authored-by: Amir H Movahed <arhd83@gmail.com>	2020-01-30 11:59:11 +02:00
Henning Andersen	149b68d850	[ML] Fix possible race condition starting datafeed (#51646 ) Datafeeds being closed while starting could result in an NPE. This was handled as any other failure, masking out the NPE. However, this conflicts with the changes in #50886. Related to #50886 and #51302	2020-01-30 08:23:45 +01:00
Lisa Cawley	28f2f3dd02	[DOCS] Minor fixes in transform documentation (#51633 )	2020-01-29 16:58:18 -08:00
Aleksandr Maus	0d21d9e2c5	EQL: Enable QA/rest integration tests for snapshot builds only (#51624 ) (#51645 ) * Related to #51541: [CI] unknown setting [xpack.eql.enabled] in release-tests	2020-01-29 16:38:52 -05:00
Julie Tibshirani	9dcc3ef7e6	Always use one shard in vector REST tests. (#51643 ) This PR tries to address the intermittent vector test failures on 7.x by making sure we create indices with one shard. The fix is based on this theory as to what's happening: * On 7.x, the default number of shards is 1, but in REST tests we randomly use 2 in order to cover the multiple shards case. In the failing test run, we use 2 shards and all documents end up on only one shard. * During a search, the response from the empty shard doesn't produce deprecation warnings because we never try to execute the script. If not all shard responses contain the warning headers, then certain deprecation warnings can be lost (due to the bug described in #33936). Addresses #50716. Relates to #50061.	2020-01-29 12:24:41 -08:00
Przemysław Witek	683170b007	Increase the number of indexed documents to increase a chance that there are at least 2 training rows. (#51607 ) (#51615 )	2020-01-29 17:17:19 +01:00
Martijn van Groningen	8c78404814	Only query for watcher history docs with the expected state, otherwise other history docs can be returned, which may not have the expected state, and this test only checks the first hit. Relates to #30777	2020-01-29 13:51:48 +01:00
David Roberts	e0e35b7feb	[TEST] Mute TimeSeriesLifecycleActionsIT.testWaitForSnapshotSlmExecutedBefore Due to https://github.com/elastic/elasticsearch/issues/50781	2020-01-29 13:08:55 +01:00
Martijn van Groningen	b253af36f3	The watcher indexing listener didn't handle document level exceptions. (#51466 ) Prior to the change the watcher index listener didn't implement the `postIndex(ShardId, Engine.Index, Engine.IndexResult)` method. This caused document level exceptions like VersionConflictEngineException to be ignored. This commit fixes this. The watcher indexing listener did implement the `postIndex(ShardId, Engine.Index, Exception)` method, but that only handles engine level exceptions. This change also unmutes the SmokeTestWatcherTestSuiteIT#testMonitorClusterHealth test again. Relates to #32299	2020-01-29 12:55:02 +01:00
Martijn van Groningen	206c8ac3e9	Muted test, see #48381	2020-01-29 12:17:38 +01:00
Rory Hunter	d8bd736f8a	Formatting: keep simple if / else on the same line (#51544 ) Backport of #51526. Previously the formatter was breaking simple if/else statements (i.e. without braces) onto separate lines, which could be fragile because the formatter cannot also introduce braces. Instead, keep such expressions on the same line.	2020-01-29 10:42:04 +00:00
Albert Zaharovits	90285ee907	Deprecate timeout.tcp_read AD/LDAP realm setting (#47305 ) The timeout.tcp_read AD/LDAP realm setting, despite the low-level allusion, controls the time interval the realms wait for a response for a query (search or bind). If the connection to the server is synchronous (un-pooled) the response timeout is analogous to the tcp read timeout. But the tcp read timeout is irrelevant in the common case of a pooled connection (when a Bind DN is specified). The timeout.tcp_read qualifier is hereby deprecated in favor of timeout.response. In addition, the default value for both timeout.tcp_read and timeout.response is that of timeout.ldap_search, instead of the 5s (but the default for timeout.ldap_search is still 5s). The timeout.ldap_search defines the server-controlled timeout of a search request. There is no practical use case to have a smaller tcp_read timeout compared to ldap_search (in this case the request would time-out on the client but continue to be processed on the server). The proposed change aims to simplify configuration so that the more common configuration change, adjusting timeout.ldap_search up, has the expected result (no timeout during searches) without any additional modifications. Closes #46028	2020-01-29 10:48:26 +02:00
Jason Tedor	3a7192966a	Check if interface is up for loopback devices only (#51583 ) In the SQL with SSL tests, we need to find the interfaces that are up, are loopback devices, or have a loopback address. If we check if the device is up first, we can run into situations where the device is a virtual ethernet device that might have disappeared between us seeing the device, and checking if it is up. By first checking if the device is a loopback device or it has a loopback address, then we can avoid checking if the device is up except for loopback devices and therefore we can avoid the disappearing virtual ethernet device problem.	2020-01-28 18:38:46 -05:00
Armin Braun	aae93a7578	Allow Repository Plugins to Filter Metadata on Create (#51472 ) (#51542 ) * Allow Repository Plugins to Filter Metadata on Create Add a hook that allows repository plugins to filter the repository metadata before it gets written to the cluster state.	2020-01-28 18:33:26 +01:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
Yannick Welsch	f6686345c9	Avoid unnecessary setup and teardown in docs tests (#51430 ) The docs tests have recently been running much slower than before (see #49753). The gist here is that with ILM/SLM we do a lot of unnecessary setup / teardown work on each test. Compounded with the slightly slower cluster state storage mechanism, this causes the tests to run much slower. In particular, on RAMDisk, docs:check is taking: ES 7.4: 6:55 minutes; ES master: 16:09 minutes; ES with this commit: 6:52 minutes. On SSD, docs:check is taking: ES 7.4: ??? minutes; ES master: 32:20 minutes; ES with this commit: 11:21 minutes.	2020-01-28 16:52:23 +01:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492 ) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but by parsing CSV in the ingest pipeline it makes the Kibana file upload functionality more easily interchangeable with Filebeat such that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
Aleksandr Maus	a8bd4d08e3	Merge branch 'feature/eql_backport' into 7.x	2020-01-28 09:19:39 -05:00
Hendrik Muhs	53e4d1ef07	[Transform] fix TransformRobustnessIT intermittent test failures part 2 (#51523 ) add wait for completion in transform robustness test to avoid occasional test failures during cleanup fixes #51347	2020-01-28 13:37:01 +01:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123 ) (#51510 ) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores When reading a keystore, a user is prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase Passphrase can be set to be empty when changing/setting it for an existing keystore Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool. 
There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. 
This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. 
* Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file for what gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail. 
It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
Hendrik Muhs	2239ba8c6e	[Transform] avoid mapping problems with index templates (#51368 ) (#51519 ) insert explicit mappings for objects in nested output to avoid clashes with index templates fixes #51321	2020-01-28 11:31:07 +01:00
Hendrik Muhs	61663b495e	add an integration test using date_nanos as timestamp (#51477 ) add a test for using date_nanos as timestamp field in a continuous transform	2020-01-28 10:10:23 +01:00
Hendrik Muhs	bebce4b190	audit index creation after the index has been created (#51479 ) moves audit message for index creation after the index has been successfully created. This has been confusing for a user where index creation failed but audit reported index creation.	2020-01-28 10:06:46 +01:00
Ioannis Kakavas	4f3548fbd7	Disable diagnostic trust manager in tests (#51501 ) This commit sets `xpack.security.ssl.diagnose.trust` to false in all of our tests when running in FIPS 140 mode and when settings objects are used to create an instance of the SSLService. This is needed in 7.x because setting xpack.security.ssl.diagnose.trust to true wraps SunJSSE TrustManager with our own DiagnosticTrustManager and this is not allowed when SunJSSE is in FIPS mode. An alternative would be to set xpack.security.fips.enabled to true which would also implicitly disable xpack.security.ssl.diagnose.trust but would have additional effects (would require that we set PBKDF2 for password hashing algorithm in all test clusters, would prohibit using JKS keystores in nodes even if relevant tests have been muted in FIPS mode etc.) Relates: #49900 Resolves: #51268	2020-01-28 10:17:35 +02:00
Przemko Robakowski	919083decd	Don't overwrite target field with SetSecurityUserProcessor (#51454 ) (#51506 ) * Don't overwrite target field with SetSecurityUserProcessor This change fixes a problem with `SetSecurityUserProcessor`, which was overwriting the whole target field and not only the fields actually filled by the processor. Closes #51428 * Unused imports removed	2020-01-28 02:12:09 +01:00
Jason Tedor	92b611ece1	Formalize build snapshot (#51484 ) Today we are repeatedly checking if the current build is a snapshot build or not by reading the system property build.snapshot. This commit formalizes this by adding a build parameter to indicate whether or not the current build is a snapshot build.	2020-01-27 16:56:31 -05:00
Aleksandr Maus	eb1ed2a35f	Compilation fixes for 7.x	2020-01-27 16:23:36 -05:00
Aleksandr Maus	d8f1735e39	Add xpack.eql.enabled feature flag, disabled by default. Enabled only for integration tests. (#51370 ) Related to https://github.com/elastic/elasticsearch/issues/49581	2020-01-27 15:15:22 -05:00
Costin Leau	d049de5b72	EQL: import QL into EQL (#50904 ) Link QL into the new build file Remove duplicate classes and use the new ql package Update Exception hierarchy on top of QlException	2020-01-27 15:13:22 -05:00
Igor Motov	c184411456	EQL: Replace EqlSearchResponse.Hits parser with ObjectParser (#50925 ) Replaces the existing hand-built Hits parser with a ConstructingObjectParser version. Relates to #49581	2020-01-27 15:13:09 -05:00
Igor Motov	88cc30c0d8	EQL: Remove list classes from EqlSearchResponse (#50870 ) Removes unnecessary classes from EqlSearchResponse that just represent lists of other elements. Relates to #49581	2020-01-27 15:13:00 -05:00
Aleksandr Maus	d715176c00	Add more Eql REST API validation integration tests, clean up request implementation (#50822 )	2020-01-27 15:12:48 -05:00
Igor Motov	628083183f	EQL: Make EqlSearchResponse immutable (#50810 ) Refactors EqlSearchResponse to make it immutable Relates to #49581	2020-01-27 15:12:07 -05:00
Aleksandr Maus	31d2d01e25	Correct search_after handling (#50629 )	2020-01-27 15:11:51 -05:00
Aleksandr Maus	79875ce4d9	Initial EQL rest API implementation (#49768 )	2020-01-27 15:11:41 -05:00
Costin Leau	10a16d15d1	Add draft EQL grammar and expression tree	2020-01-27 15:11:18 -05:00
Costin Leau	e22f501018	QL: Backport project to 7.x (#51497 ) * Introduce reusable QL plugin for SQL and EQL (#50815) Extract reusable functionality from SQL into its own dedicated project QL. Implemented as a plugin, it provides common components across SQL and the upcoming EQL. While this commit is fairly large, for the most part it's just a big file move from sql package to the newly introduced ql. (cherry picked from commit ec1ac0d463bfa12a02c8174afbcdd6984345e8b4) * SQL: Fix incomplete registration of geo NamedWritables (cherry picked from commit e295763686f9592976e551e504fdad1d2a3a566d) * QL: Extend NodeSubclass to read classes from jars (#50866) As the test classes are spread across more than one project, the Gradle classpath contains not just folders but also jars. This commit allows the test class to explore the archive content and load matching classes from said source. (cherry picked from commit 25ad74928afcbf286dc58f7d430491b0af662f04) * QL: Remove implicit conversion inside Literal (#50962) Literal constructor makes an implicit conversion for each value given which turns out has some subtle side-effects. 
Improve MathProcessors to preserve numeric type where possible Fix bug on issue compatibility between date and intervals Preserve the source when folding inside the Optimizer (cherry picked from commit 9b73e225b0aa07a23859550fb117bae571a2b672) * QL: Refactor DataType for pluggability (#51328) Change DataType from enum to class Break DataType enums into QL (default) and SQL types Make data type conversion pluggable so that new types can be introduced As part of the process: - static type conversion in QL package (such as Literal) has been removed - several utility classes have been broken into base (QL) and extended (SQL) parts based on type awareness - operators (+,-,/,) are - due to extensibility, serialization of arithmetic operation has been slightly changed and pushed down to the operator executor itself (cherry picked from commit aebda81b30e1563b877a8896309fd50633e0b663) Compilation fixes for 7.x	2020-01-27 22:03:58 +02:00
Ryan Ernst	6ee1baf2ed	Migrate cron eval bats test to java (#50940 ) (#51007 ) This commit migrates the simple test of the cron eval tool from bats to java packaging tests. relates #46005	2020-01-27 10:49:01 -08:00
Nik Everett	4ff314a9d5	Begin moving date_histogram to offset rounding (take two) (#51271 ) (#51495 ) We added a new rounding in #50609 that handles offsets to the start and end of the rounding so that we could support `offset` in the `composite` aggregation. This starts moving `date_histogram` to that new offset. This is a redo of #50873 with more integration tests. This reverts commit d114c9db3e1d1a766f9f48f846eed0466125ce83.	2020-01-27 13:40:54 -05:00
David Roberts	3c223ceea1	[ML] Fix 2 digit year regex in find_file_structure (#51469 ) The DATE and DATESTAMP Grok patterns match 2 digit years as well as 4 digit years. The pattern determination in find_file_structure worked correctly in this case, but the regex used to create a multi-line start pattern was assuming a 4 digit year. Also, the quick rule-out patterns did not always correctly consider 2 digit years, meaning that detection was inconsistent. This change fixes both problems, and also extends the tests for DATE and DATESTAMP to check both 2 and 4 digit years.	2020-01-27 17:23:18 +00:00

4716 Commits