OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	9cf9c64ec2	[7.x] [ML][Transforms] remove `force` flag from _start (#46414 ) (#46748 ) * [ML][Transforms] remove `force` flag from _start (#46414) * [ML][Transforms] remove `force` flag from _start * fixing expected error message * adjusting bwc version	2019-09-18 10:06:05 -04:00
Dimitris Athanasiou	cebe8da617	[7.x][ML] MlMemoryTracker should ignore analytics tasks without config (#46789 ) (#46811 ) It is possible for a running analytics job that its config is removed from the '.ml-config' index (perhaps the user deleted the entire index, etc.). In that case the task remains without a matching config. I have raised #46781 to discuss how to deal with this issue. This commit focuses on `MlMemoryTracker` and changes it so that when we get the configs for the running tasks we leniently ignore missing ones. This at least means memory tracking will keep working for other jobs if one or more are missing. In addition, this commit makes the cleanup code for native analytics tests more robust by explicitly stopping all jobs and force-stopping if an error occurs. This helps so that a single failing test does not cause other tests fail due to pending tasks. Backport of #46789	2019-09-18 16:35:25 +03:00
Alpar Torok	f3e67bdd17	Add resolution rule to allow resolving all deps (#46768 ) Since the `resolveAllDependencies` task resolves all the congfigurations it can find, this was not caught by our testing, but it's required to be configuraed specifically. We should probably cut-over to the new configurations at some point to avoid problems like this. Closes elastic/infra#14580	2019-09-18 11:09:43 +03:00
Lee Hinman	b85468d6ea	Add node setting for disabling SLM (#46794 ) (#46796 ) This adds the `xpack.slm.enabled` setting to allow disabling of SLM functionality as well as its HTTP API endpoints. Relates to #38461	2019-09-17 17:39:41 -06:00
Oliver Gupte	cbd58d3b78	Give kibana user privileges to create APM agent config index (#46765 ) (#46792 ) * Give kibana user reserved role privileges on .apm-* to create APM agent configuration index. * fixed test to include checking all .apm-* permissions * changed pattern from ".apm-*" to the more specific ".apm-agent-configuration"	2019-09-17 15:01:42 -07:00
Costin Leau	92e518e789	SQL: Properly handle indices with no/empty mapping (#46775 ) When encountering only indices with empty mapping, the IndexResolver throws an exception as it expects to find at least one entry. This commit fixes this case so that an empty mapping is returned. Fix #46757 (cherry picked from commit 5f4f5807acb93b5fab36718c092c328977a396b6)	2019-09-17 16:01:22 +03:00
Armin Braun	b0f09b279f	Make Snapshot Logic Write Metadata after Segments (#45689 ) (#46764 ) * Write metadata during snapshot finalization after segment files to prevent outdated metadata in case of dynamic mapping updates as explained in #41581 * Keep the old behavior of writing the metadata beforehand in the case of mixed version clusters for BwC reasons * Still overwrite the metadata in the end, so even a mixed version cluster is fixed by this change if a newer version master does the finalization * Fixes #41581	2019-09-17 13:09:39 +02:00
Tomas Della Vedova	e1cf103980	Fixes for API specification (#46522 ) (#46736 ) Follow-up of #42346	2019-09-17 11:49:24 +02:00
Costin Leau	683b5fdeca	SQL: Support queries with HAVING over SELECT (#46709 ) Handle queries with implicit GROUP BY where the aggregation is not in the projection/SELECT but inside the filter/HAVING such as: SELECT 1 FROM x HAVING COUNT(*) > 0 The engine now properly identifies the case and handles it accordingly. Fix #37051 (cherry picked from commit fa53ca05d8219c27079b50b4a5b7aeb220c7cde2)	2019-09-17 11:14:39 +03:00
Costin Leau	90f4c2379b	SQL: improve ResultSet behavior when no rows are available (#46753 ) Improve the defensive behavior of ResultSet when dealing with incorrect API usage. In particular handle the case of dealing with no row available (either because the cursor is before the first entry or after the last). Fix #46750 (cherry picked from commit 58fa38e4606625962e879265d35eacb0960c6cdb)	2019-09-17 11:14:38 +03:00
Przemysław Witek	e49be611ad	[7.x] Add audit messages for Data Frame Analytics (#46521 ) (#46738 )	2019-09-16 21:21:38 +02:00
Benjamin Trent	92acc732de	[ML][Transform] Use field caps for mapping deductino (#46703 ) (#46742 )	2019-09-16 10:05:55 -04:00
Andrei Stefan	40e9353947	SQL: use the correct data type for types conversion (#46574 ) (cherry picked from commit 3e25db2f302c3aafe27e4d8d4fb1743401d85e6d)	2019-09-16 15:36:17 +03:00
Hendrik Muhs	c8f52ec4ff	[Transform] Rename data frame plugin to transform: classes in xpack.core (#46644 ) (#46734 ) rename classes in xpack.core of transform plugin from "data frame transform" to "transform"	2019-09-16 13:39:22 +02:00
Andrei Dan	c57cca98b2	[ILM] Add date setting to calculate index age (#46561 ) (#46697 ) * [ILM] Add date setting to calculate index age Add the `index.lifecycle.origination_date` to allow users to configure a custom date that'll be used to calculate the index age for the phase transmissions (as opposed to the default index creation date). This could be useful for users to create an index with an "older" origination date when indexing old data. Relates to #42449. * [ILM] Don't override creation date on policy init The initial approach we took was to override the lifecycle creation date if the `index.lifecycle.origination_date` setting was set. This had the disadvantage of the user not being able to update the `origination_date` anymore once set. This commit changes the way we makes use of the `index.lifecycle.origination_date` setting by checking its value when we calculate the index age (ie. at "read time") and, in case it's not set, default to the index creation date. * Make origination date setting index scope dynamic * Document orignation date setting in ilm settings (cherry picked from commit d5bd2bb77ee28c1978ab6679f941d7c02e389d32) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-09-16 08:50:28 +01:00
Dimitris Athanasiou	63eb0d9081	[7.x][ML] Avoid marking data frame analytics task completed twice (#46721 ) (#46724 ) When the stop API is called while the task is running there is a chance the task gets marked completed twice. This may cause undesired side effects, like indexing the progress document a second time after the stop API has returned (the cause for #46705). This commit adds a check that the task has not been completed before proceeding to mark it so. In addition, when we update the task's state we could get some warnings that the task was missing if the stop API has been called in the meantime. We now check the errors are `ResourceNotFoundException` and ignore them if so. Closes #46705 Backports #46721	2019-09-15 17:25:26 +03:00
Nhat Nguyen	cabff5a7cd	Handle lower retaining seqno retention lease error (#46420 ) We renew the CCR retention lease at a fixed interval, therefore it's possible to have more than one in-flight renewal requests at the same time. If requests arrive out of order, then the assertion is violated. Closes #46416 Closes #46013	2019-09-13 08:50:19 -04:00
Dimitris Athanasiou	0bc8acaf5b	[7.x][ML] Create state index and alias before starting an analytics job (#46602 ) (#46648 ) This is fixing a bug where if an analytics job is started before any anomaly detection job is opened, we create an index after the state write alias. Instead, we should create the state index and alias before starting an analytics job and this commit makes sure this is the case. Backport of #46602	2019-09-13 10:34:12 +03:00
Luca Cavanna	e57756492a	Update http-core and http-client dependencies (#46549 ) Relates to #45808 Closes #45577	2019-09-12 09:45:29 +02:00
Zachary Tong	6dc8ed5d57	[7.x Backport] Refactor AllocatedPersistentTask#init(), move rollup ctor logic (#46406 ) This makes the AllocatedPersistentTask#init() method protected so that implementing classes can perform their initialization logic there, instead of the constructor. Rollup's task is adjusted to use this init method. It also slightly refactors the methods to se a static logger in the AllocatedTask instead of passing it in via an argument. This is simpler, logged messages come from the task instead of the service, and is easier for tests	2019-09-11 17:00:28 -04:00
James Rodewig	f9bf10f2b6	[DOCS] Change "a SSL" to "an SSL" in the Java docs (#46524 ) (#46618 )	2019-09-11 15:55:57 -04:00
Marios Trivyzas	d956509394	SQL: Implement DATE_TRUNC function (#46473 ) DATE_TRUNC(<truncate field>, <date/datetime>) is a function that allows the user to truncate a timestamp to the specified field by zeroing out the rest of the fields. The function is implemented according to the spec from PostgreSQL: https://www.postgresql.org/docs/current/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes: #46319 (cherry picked from commit b37e96712db1aace09f17b574eb02ff6b942a297)	2019-09-11 21:41:02 +03:00
Ryan Ernst	86290cb3d9	Make reuse of sql test code explicit (#45884 ) The sql project uses a common set of security tests, which are run in subprojects. Currently these are shared through a shared directory, but this is not setup correctly to ensure it is built before tests run. This commit changes the test classes to be an artifact of the sql/qa/security project and makes the test runner use the built artifact (a directory of classes) for tests. closes #45866	2019-09-11 10:56:07 -07:00
Lee Hinman	09a9cefaa0	Handle partial failure retrieving segments in SegmentCountStep (#46556 ) Since the `IndicesSegmentsRequest` scatters to all shards for the index, it's possible that some of the shards may fail. This adds failure handling and logging (since this is a best-effort step in the first place) for this case.	2019-09-11 10:29:31 -06:00
Marios Trivyzas	0963e78164	SQL: Fix issue with common type resolution (#46565 ) Many scalar functions try to find out the common type between their arguments in order to set it as their return time, e.g.: for `float + double` the common type which is set as the return type of the + operation is `double`. Previously, for data types TEXT and KEYWORD (string data types) there was no common data type found and null was returned causing NPEs when the function was trying to resolve the return data type. Fixes: #46551 (cherry picked from commit 291017d69dfc810707c3c7c692f5a50af431b790)	2019-09-11 19:10:15 +03:00
Lee Hinman	52d7b03b49	Wait for no snapshots in state in testRetentionWhileSnapshotIn… (#46573 ) This commit adds a wait/check for all running snapshots to be cleared before taking another snapshot. The previous snapshot was successful but had not yet been cleared from the cluster state, so the second snapshot failed due to a `ConcurrentSnapshotException`. Resolves #46508	2019-09-11 09:47:01 -06:00
David Roberts	461de5b58e	[TEST] Remove incorrect data frame analytics state assertion (#46597 ) After starting the analytics job and checking its state the state can be any of "started", "reindexing" or "analyzing" depending on how quickly the work is done.	2019-09-11 16:33:14 +01:00
David Roberts	07a0140260	[ML-DataFrame] Ensure latest index template exists before indexing docs (#46595 ) When upgrading data nodes to a newer version before master nodes there was a risk that a transform running on an upgraded data node would index a document into the new transforms internal index before its index template was created. This would cause the index to be created with entirely dynamic mappings. This change introduces a check before indexing any internal transforms document to ensure that the required index template exists and create it if it doesn't. Backport of #46553	2019-09-11 16:27:26 +01:00
Jim Ferenczi	23bf310c84	Replace the SearchContext with QueryShardContext when building aggregator factories (#46527 ) This commit replaces the `SearchContext` with the `QueryShardContext` when building aggregator factories. Aggregator factories are part of the `SearchContext` so they shouldn't require a `SearchContext` to create them. The main changes here are the signatures of `AggregationBuilder#build` that now takes a `QueryShardContext` and `AggregatorFactory#createInternal` that passes the `SearchContext` to build the `Aggregator`. Relates #46523	2019-09-11 16:43:30 +02:00
Hendrik Muhs	efea581dcc	[7.x][Transform]Rename data frame plugin to transform: plugin and package names (#46583 ) rename data frame transform plugin to transform: - rename plugin data-frame to transform - change all package names from o.e..dataframe. to o.e..transform. - necessary changes to fix loading/testing	2019-09-11 14:50:08 +02:00
Armin Braun	41633cb9b5	More Efficient Ordering of Shard Upload Execution (#42791 ) (#46588 ) * More Efficient Ordering of Shard Upload Execution (#42791) * Change the upload order of of snapshots to work file by file in parallel on the snapshot pool instead of merely shard-by-shard * Inspired by #39657 * Cleanup BlobStoreRepository Abort and Failure Handling (#46208)	2019-09-11 13:59:20 +02:00
Jim Ferenczi	425b1a77e8	Add more context to QueryShardContext (#46584 ) This change adds an IndexSearcher and the node's BigArrays in the QueryShardContext. It's a spin off of #46527 as this change is required to allow aggregation builder to solely use the query shard context. Relates #46523	2019-09-11 12:24:51 +02:00
Dimitris Athanasiou	579af626f5	[7.x][ML] No error when datafeed stops during updating to started (#46495 ) (#46542 ) Investigating the test failure reported in #45518 it appears that the datafeed task was not found during a tast state update. There are only two places where such an update is performed: when we set the state to `started` and when we set it to `stopping`. We handle `ResourceNotFoundException` in the latter but not in the former. Thus the test reveals a rare race condition where the datafeed gets requested to stop before we managed to update its state to `started`. I could not reproduce this scenario but it would be my best guess. This commit catches `ResourceNotFoundException` while updating the state to `started` and lets the task terminate smoothly. Closes #45518 Backport of #46495	2019-09-11 13:18:42 +03:00
Przemysław Witek	e38e631dac	[7.x] Implement DataFrameAnalyticsAuditMessage and DataFrameAnalyticsAuditor (#45967 ) (#46519 )	2019-09-11 12:17:26 +02:00
Ioannis Kakavas	35810bd2ae	Enforce realm name uniqueness (#46580 ) We depend on file realms being unique in a number of places. Pre 7.0 this was enforced by the fact that the multiple realm types with different name would mean identical configuration keys and cause configuration parsing errors. Since we intoduced affix settings for realms this is not the case any more as the realm type is part of the configuration key. This change adds a check when building realms which will explicitly fail if multiple realms are defined with the same name. Backport of #46253	2019-09-11 13:13:59 +03:00
Tim Vernum	80064652f8	Fallback to realm authc if ApiKey fails (#46552 ) This changes API-Key authentication to always fallback to the realm chain if the API key is not valid. The previous behaviour was inconsistent and would terminate on some failures, but continue to the realm chain for others. Backport of: #46538	2019-09-11 14:33:17 +10:00
Gordon Brown	7a2878b29b	Fix class used to initialize logger in Watcher (#46467 ) This class has been using a logger configured for a different class for quite a while. While the circumstance in which it logs is rare, it should still use the correct logger.	2019-09-10 12:41:36 -06:00
Lee Hinman	cdc3a260af	Add retention to Snapshot Lifecycle Management (backport of #4… (#46506 ) * Add retention to Snapshot Lifecycle Management (#46407) This commit adds retention to the existing Snapshot Lifecycle Management feature (#38461) as described in #43663. This allows a user to configure SLM to automatically delete older snapshots based on a number of criteria. An example policy would look like: ``` PUT /_slm/policy/snapshot-every-day { "schedule": "0 30 2 * * ?", "name": "<production-snap-{now/d}>", "repository": "my-s3-repository", "config": { "indices": ["foo-", "important"] }, // Newly configured retention options "retention": { // Snapshots should be deleted after 14 days "expire_after": "14d", // Keep a maximum of thirty snapshots "max_count": 30, // Keep a minimum of the four most recent snapshots "min_count": 4 } } ``` SLM Retention is run on a scheduled configurable with the `slm.retention_schedule` setting, which supports cron expressions. Deletions are run for a configurable time bounded by the `slm.retention_duration` setting, which defaults to 1 hour. Included in this work is a new SLM stats API endpoint available through ``` json GET /_slm/stats ``` That returns statistics about snapshot taken and deleted, as well as successful retention runs, failures, and the time spent deleting snapshots. #45362 has more information as well as an example of the output. These stats are also included when retrieving SLM policies via the API. Add base framework for snapshot retention (#43605) * Add base framework for snapshot retention This adds a basic `SnapshotRetentionService` and `SnapshotRetentionTask` to start as the basis for SLM's retention implementation. Relates to #38461 * Remove extraneous 'public' * Use a local var instead of reading class var repeatedly * Add SnapshotRetentionConfiguration for retention configuration (#43777) * Add SnapshotRetentionConfiguration for retention configuration This commit adds the `SnapshotRetentionConfiguration` class and its HLRC counterpart to encapsulate the configuration for SLM retention. Currently only a single parameter is supported as an example (we still need to discuss the different options we want to support and their names) to keep the size of the PR down. It also does not yet include version serialization checks since the original SLM branch has not yet been merged. Relates to #43663 * Fix REST tests * Fix more documentation * Use Objects.equals to avoid NPE * Put `randomSnapshotLifecyclePolicy` in only one place * Occasionally return retention with no configuration * Implement SnapshotRetentionTask's snapshot filtering and delet… (#44764) * Implement SnapshotRetentionTask's snapshot filtering and deletion This commit implements the snapshot filtering and deletion for `SnapshotRetentionTask`. Currently only the expire-after age is used for determining whether a snapshot is eligible for deletion. Relates to #43663 * Fix deletes running on the wrong thread * Handle missing or null policy in snap metadata differently * Convert Tuple<String, List<SnapshotInfo>> to Map<String, List<SnapshotInfo>> * Use the `OriginSettingClient` to work with security, enhance logging * Prevent NPE in test by mocking Client * Allow empty/missing SLM retention configuration (#45018) Semi-related to #44465, this allows the `"retention"` configuration map to be missing. Relates to #43663 * Add min_count and max_count as SLM retention predicates (#44926) This adds the configuration options for `min_count` and `max_count` as well as the logic for determining whether a snapshot meets this criteria to SLM's retention feature. These options are optional and one, two, or all three can be specified in an SLM policy. Relates to #43663 * Time-bound deletion of snapshots in retention delete function (#45065) * Time-bound deletion of snapshots in retention delete function With a cluster that has a large number of snapshots, it's possible that snapshot deletion can take a very long time (especially since deletes currently have to happen in a serial fashion). To prevent snapshot deletion from taking forever in a cluster and blocking other operations, this commit adds a setting to allow configuring a maximum time to spend deletion snapshots during retention. This dynamic setting defaults to 1 hour and is best-effort, meaning that it doesn't hard stop a deletion at an hour mark, but ensures that once the time has passed, all subsequent deletions are deferred until the next retention cycle. Relates to #43663 * Wow snapshots suuuure can take a long time. * Use a LongSupplier instead of actually sleeping * Remove TestLogging annotation * Remove rate limiting * Add SLM metrics gathering and endpoint (#45362) * Add SLM metrics gathering and endpoint This commit adds the infrastructure to gather metrics about the different SLM actions that a cluster takes. These actions are stored in `SnapshotLifecycleStats` and perpetuated in cluster state. The stats stored include the number of snapshots taken, failed, deleted, the number of retention runs, as well as per-policy counts for snapshots taken, failed, and deleted. It also includes the amount of time spent deleting snapshots from SLM retention. This commit also adds an endpoint for retrieving all stats (further commits will expose this in the SLM get-policy API) that looks like: ``` GET /_slm/stats { "retention_runs" : 13, "retention_failed" : 0, "retention_timed_out" : 0, "retention_deletion_time" : "1.4s", "retention_deletion_time_millis" : 1404, "policy_metrics" : { "daily-snapshots2" : { "snapshots_taken" : 7, "snapshots_failed" : 0, "snapshots_deleted" : 6, "snapshot_deletion_failures" : 0 }, "daily-snapshots" : { "snapshots_taken" : 12, "snapshots_failed" : 0, "snapshots_deleted" : 12, "snapshot_deletion_failures" : 6 } }, "total_snapshots_taken" : 19, "total_snapshots_failed" : 0, "total_snapshots_deleted" : 18, "total_snapshot_deletion_failures" : 6 } ``` This does not yet include HLRC for this, as this commit is quite large on its own. That will be added in a subsequent commit. Relates to #43663 * Version qualify serialization * Initialize counters outside constructor * Use computeIfAbsent instead of being too verbose * Move part of XContent generation into subclass * Fix REST action for master merge * Unused import * Record history of SLM retention actions (#45513) This commit records the deletion of snapshots by the retention component of SLM into the SLM history index for the purposes of reviewing operations taken by SLM and alerting. * Retry SLM retention after currently running snapshot completes (#45802) * Retry SLM retention after currently running snapshot completes This commit adds a ClusterStateObserver to wait until the currently running snapshot is complete before proceeding with snapshot deletion. SLM retention waits for the maximum allowed deletion time for the snapshot to complete, however, the waiting time is not factored into the limit on actual deletions. Relates to #43663 * Increase timeout waiting for snapshot completion * Apply patch From `2374316f0d`.patch * Rename test variables * [TEST] Be less strict for stats checking * Skip SLM retention if ILM is STOPPING or STOPPED (#45869) This adds a check to ensure we take no action during SLM retention if ILM is currently stopped or in the process of stopping. Relates to #43663 * Check all actions preventing snapshot delete during retention (#45992) * Check all actions preventing snapshot delete during retention run Previously we only checked to see if a snapshot was currently running, but it turns out that more things can block snapshot deletion. This changes the check to be a check for: - a snapshot currently running - a deletion already in progress - a repo cleanup in progress - a restore currently running This was found by CI where a third party delete in a test caused SLM retention deletion to throw an exception. Relates to #43663 * Add unit test for okayToDeleteSnapshots * Fix bug where SLM retention task would be scheduled on every node * Enhance test logging * Ignore if snapshot is already deleted * Missing import * Fix SnapshotRetentionServiceTests * Expose SLM policy stats in get SLM policy API (#45989) This also adds support for the SLM stats endpoint to the high level rest client. Retrieving a policy now looks like: ```json { "daily-snapshots" : { "version": 1, "modified_date": "2019-04-23T01:30:00.000Z", "modified_date_millis": 1556048137314, "policy" : { "schedule": "0 30 1 * * ?", "name": "<daily-snap-{now/d}>", "repository": "my_repository", "config": { "indices": ["data-", "important"], "ignore_unavailable": false, "include_global_state": false }, "retention": {} }, "stats": { "snapshots_taken": 0, "snapshots_failed": 0, "snapshots_deleted": 0, "snapshot_deletion_failures": 0 }, "next_execution": "2019-04-24T01:30:00.000Z", "next_execution_millis": 1556048160000 } } ``` Relates to #43663 Rewrite SnapshotLifecycleIT as as ESIntegTestCase (#46356) * Rewrite SnapshotLifecycleIT as as ESIntegTestCase This commit splits `SnapshotLifecycleIT` into two different tests. `SnapshotLifecycleRestIT` which includes the tests that do not require slow repositories, and `SLMSnapshotBlockingIntegTests` which is now an integration test using `MockRepository` to simulate a snapshot being in progress. Relates to #43663 Resolves #46205 * Add error logging when exceptions are thrown * Update serialization versions * Fix type inference * Use non-Cancellable HLRC return value * Fix Client mocking in test * Fix SLMSnapshotBlockingIntegTests for 7.x branch * Update SnapshotRetentionTask for non-multi-repo snapshot retrieval * Add serialization guards for SnapshotLifecyclePolicy	2019-09-10 09:08:09 -06:00
Ioannis Kakavas	690164d0be	Change EmailSslTest for FIPS 140 JVMs (#46278 ) This commit changes the SSLContext for the email server we use in the tests so that it loads its key material from an in memory keystore (that is in turn built from a pair of PEM encoded private key and certificate) instead of a PKCS#12 one. This is done so that when we run our tests in FIPS 140-2 JVMs, the keystore is of a type that the Security Provider actually supports. This also mutes testCanSendMessageToSmtpServerByDisablingVerification as we can't run tests with verification set to `none` in FIPS 140 JVMs.	2019-09-10 14:39:40 +03:00
Alpar Torok	b40ac6dee7	mute on 7.x fo windows Tracking #44942	2019-09-10 12:34:16 +03:00
Przemysław Witek	e21deae535	Disallow persisting any documents when datafeed is isolated (#46485 ) (#46490 )	2019-09-09 21:01:27 +02:00
Andrei Stefan	7b26a8c041	Use `null` schema response for `SYS TABLES` command. (#46386 ) (cherry picked from commit a6152f42a47a1ccd668e5892778c8bd2d3a78c4c)	2019-09-07 09:24:54 +03:00
Andrei Stefan	7cf100ba07	SQL: fix scripting for grouped by datetime functions (#46421 ) * Fix issue with painless scripting not being correctly generated when datetime functions are used for GROUPing of an INTERVAL operation. (cherry picked from commit cb92828e8ec9d9d241bd6189e5835fd99f8b9a44)	2019-09-07 09:24:53 +03:00
David Roberts	7c7fb7e32d	[ML] Tolerate total_search_time_ms not mapped in get datafeed stats (#46432 ) ML users who upgrade from versions prior to 7.4 to 7.4 or later will have ML results indices that do not have mappings for the total_search_time_ms field. Therefore, when searching these indices we must tolerate this field not having a mapping. Fixes #46437	2019-09-06 14:31:15 +01:00
Hendrik Muhs	c2194aa7e1	[Transform] simplify class structure of indexer (#46306 ) simplify transform task and indexer - remove redundant transform id - moving client data frame indexer (and builder) into a separate file	2019-09-06 15:24:26 +02:00
Dimitris Athanasiou	a6834068e3	[7.x][ML] Extract DataFrameAnalyticsTask into its own class (#46402 ) (#46426 ) This refactors `DataFrameAnalyticsTask` into its own class. The task has quite a lot of functionality now and I believe it would make code more readable to have it live as its own class rather than an inner class of the start action class. Backport of #46402	2019-09-06 14:13:46 +03:00
Hendrik Muhs	42ada88c5d	cleanup static member	2019-09-06 08:55:57 +02:00
Hendrik Muhs	ab5f09d29b	[ML-DataFrame] improve error message for timeout case in stop (#46131 ) improve error message if stopping of transform times out. related #45610	2019-09-06 08:36:01 +02:00
Hendrik Muhs	78824cce81	reuse mock client to avoid probles with thread context closed errors (#46398 )	2019-09-06 08:17:25 +02:00
Benjamin Trent	457ff3e2fb	7.x/ml fix instance serialization bwc (#46404 ) * [ML] Fixing instance serialization version for bwc * fixing CppLogMessage	2019-09-05 13:23:26 -05:00
Benjamin Trent	caf3e4d654	[7.x] [ML][Transforms] fixing rolling upgrade continuous transform test (#45823 ) (#46347 ) (#46337 ) * [ML][Transforms] fixing rolling upgrade continuous transform test (#45823) * [ML][Transforms] fixing rolling upgrade continuous transform test * adjusting wait assert logic * adjusting wait conditions * [ML][Transforms] allow executor to call start on started task (#46347) * making sure we only upgrade from 7.4.0 in test	2019-09-05 10:31:11 -05:00
Benjamin Trent	5201386232	[ML] testFullClusterRestart waiting for stable cluster (#46280 ) (#46335 ) * [ML] waiting for ml indices before waiting task assignment testFullClusterRestart * waiting for a stable cluster after fullrestart * removing unused imports	2019-09-05 06:57:33 -05:00
Yogesh Gaikwad	d5acb15a71	[Backport] Initialize document subset bit set cache used for DLS (#46211 ) (#46359 ) This commit initializes DocumentSubsetBitsetCache even if DLS is disabled. Previously it would throw null pointer when querying usage stats if we explicitly disabled DLS as there would be no instance of DocumentSubsetBitsetCache to query. It is okay to initialize DocumentSubsetBitsetCache which will be empty as the license enforcement would prevent usage of DLS feature and it will not fail when accessing usage stats. Closes #45147	2019-09-05 14:34:19 +10:00
Julie Tibshirani	40c3225d26	First round of optimizations for vector functions. (#46294 ) This PR merges the `vectors-optimize-brute-force` feature branch, which makes the following changes to how vector functions are computed: * Precompute the L2 norm of each vector at indexing time. (#45390) * Switch to ByteBuffer for vector encoding. (#45936) * Decode vectors and while computing the vector function. (#46103) * Use an array instead of a List for the query vector. (#46155) * Precompute the normalized query vector when using cosine similarity. (#46190) Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>	2019-09-04 14:45:57 -07:00
Andrey Ershov	ece9eb4acd	Remove stack trace logging in Security(Transport\|Http)ExceptionHandler (#45966 ) As per #45852 comment we no longer need to log stack-traces in SecurityTransportExceptionHandler and SecurityHttpExceptionHandler even if trace logging is enabled. (cherry picked from commit c99224a32d26db985053b7b36e2049036e438f97)	2019-09-04 11:50:35 +03:00
Dimitris Athanasiou	8fca5b5204	[7.x][ML] Unmute testStopOutlierDetectionWithEnoughDocumentsToScroll (#46271 ) (#46282 ) The test seems to have been failing due to a race condition between stopping the task and refreshing the destination index. In particular, we were going forward with refreshing the destination index even though the task stopped in the meantime. This was fixed in request. Closes #43960 Backport of #46271	2019-09-04 10:57:01 +03:00
Marios Trivyzas	fd0affb503	SQL: Fix issue with IIF function when condition folds (#46290 ) Previously, when the condition (1st argument) of the IIF function could be evaluated (folded) to false, the `IfConditional` was eliminated which caused `IndexOutOfBoundsException` to be thrown when `info()` and `resolveType()` methods where called. Fixes: #46268 (cherry picked from commit 9a885a3ac47bc8f52c07770d1d8d670ce0af1e59)	2019-09-04 10:32:49 +03:00
Benjamin Trent	8a4ff7d57d	[ML][Transforms] protecting doSaveState with optimistic concurrency (#46156 ) (#46281 ) * [ML][Transforms] protecting doSaveState with optimistic concurrency * task code cleanup	2019-09-03 15:27:24 -05:00
Benjamin Trent	c3c1dcd2ac	[ML][Transforms] fixing listener being called twice (#46284 ) (#46292 )	2019-09-03 15:26:56 -05:00
Benjamin Trent	53df54c703	[ML][Transforms] fixing stop on changes check bug (#46162 ) (#46273 ) * [ML][Transforms] fixing stop on changes check bug * Adding new method finishAndCheckState to cover race conditions in early terminations * changing stopping conditions in `onStart` * allow indexer to finish when exiting early	2019-09-03 11:04:18 -05:00
Lee Hinman	3d4b8e01c7	Validate SLM policy ids strictly (#45998 ) (#46145 ) This uses strict validation for SLM policy ids, similar to what we use for index names. Resolves #45997	2019-09-03 09:20:02 -06:00
David Roberts	ab045744ac	[ML-DataFrame] Fix off-by-one error in checkpoint operations_behind (#46235 ) Fixes a problem where operations_behind would be one less than expected per shard in a new index matched by the data frame transform source pattern. For example, if a data frame transform had a source of foo* and a new index foo-new was created with 2 shards and 7 documents indexed in it then operations_behind would be 5 prior to this change. The problem was that an empty index has a global checkpoint number of -1 and the sequence number of the first document that is indexed into an index is 0, not 1. This doesn't matter for indices included in both the last and next checkpoints, as the off-by-one errors cancelled, but for a new index it affected the observed result.	2019-09-03 12:45:02 +01:00
Ignacio Vera	59c474e675	reset queryGeometry in ShapeQueryTests (#45974 ) (#46251 )	2019-09-03 10:24:46 +02:00
markharwood	a9e3871fc7	Test fix for PinnedQueryBuilderIT (#46187 ) (#46227 ) Fix test issue to stabilise scoring through use of DFS search mode. Randomised index-then-delete docs introduced by the test framework likely caused an imbalance in IDF scores across shards. Also made number of shards used in test a random number for added test coverage. Closes #46174	2019-09-02 13:31:22 +01:00
Marios Trivyzas	3bee647e5b	SQL: Fix issue with DataType for CASE with NULL (#46173 ) Previously, if the DataType of all the WHEN conditions of a CASE statement is NULL, then it was set to NULL even if the ELSE clause has a non-NULL data type, e.g.: ``` CASE WHEN a = 1 THEN NULL WHEN a = 5 THEN NULL ELSE 'foo' ``` Fixes: #46032 (cherry picked from commit 8c1012efbbd3a300afd0dfb9b18250f15ea753f9)	2019-09-02 11:17:24 +03:00
Benjamin Trent	d0c5573a51	[ML] Throw an error when a datafeed needs CCS but it is not enabled for the node (#46044 ) (#46096 ) Though we allow CCS within datafeeds, users could prevent nodes from accessing remote clusters. This can cause mysterious errors and difficult to troubleshoot. This commit adds a check to verify that `cluster.remote.connect` is enabled on the current node when a datafeed is configured with a remote index pattern.	2019-08-30 09:27:07 -05:00
Alexander Reelsen	98c32c7846	Fix wrong URL encoding in watcher HTTP client (#45894 ) The test assumption was calling the wrong method resulting in a URL encoding before returning the data. Closes #44970	2019-08-30 14:02:49 +02:00
Dimitris Athanasiou	5921ae53d8	[7.x][ML] Regression dependent variable must be numeric (#46072 ) (#46136 ) * [ML] Regression dependent variable must be numeric This adds a validation that the dependent variable of a regression analysis must be numeric. * Address review comments and fix some problems In addition to addressing the review comments, this commit fixes a few issues I found during testing. In particular: - if there were mappings for required fields but they were not included we were not reporting the error - if explicitly included fields had unsupported types we were not reporting the error Unfortunately, I couldn't get those fixed without refactoring the code in `ExtractedFieldsDetector`.	2019-08-30 09:57:43 +03:00
Zachary Tong	cf8a4171e1	Rename `data-science` plugin to `analytics` (#46133 ) Rename `data-science` plugin to `analytics`. Also removes enabled flag. Backport of #46092	2019-08-29 12:45:39 -04:00
Simon Willnauer	9b2ea07b17	Flush engine after big merge (#46066 ) (#46111 ) Today we might carry on a big merge uncommitted and therefore occupy a significant amount of diskspace for quite a long time if for instance indexing load goes down and we are not quickly reaching the translog size threshold. This change will cause a flush if we hit a significant merge (512MB by default) which frees diskspace sooner.	2019-08-29 17:54:15 +02:00
Nhat Nguyen	028e792e1d	Remove already exist assertion while renew ccr lease (#46009 ) If a CCR lease is disappeared while we are renewing it, then we will issue asyncAddRetentionLease to add that lease. And if asyncAddRetentionLease takes longer than retentionLeaseRenewInterval, then we can issue another asyncAddRetentionLease request. One of asyncAddRetentionLease requests will fail with RetentionLeaseAlreadyExistsException, hence trip the assertion. Closes #45192	2019-08-29 09:44:40 -04:00
Przemysław Witek	b8a0379057	Refactor auditor-related classes (#45893 ) (#46120 )	2019-08-29 14:21:03 +02:00
Przemysław Witek	fbe9e8a530	Do not throw an exception if the process finished quickly but without any error. (#46073 ) (#46113 )	2019-08-29 10:47:17 +02:00
Gordon Brown	47bbd9d9a9	[7.x] Fix rollover alias in SLM history index template (#46001 ) This commit adds the `rollover_alias` setting required for ILM to work correctly to the SLM history index template and adds assertions to the SLM integration tests to ensure that it works correctly.	2019-08-28 14:50:22 -07:00
Tal Levy	a356bcff41	Add Circle Processor (#43851 ) (#46097 ) add circle-processor that translates circles to polygons	2019-08-28 14:44:08 -07:00
Julie Tibshirani	d94c4dcffb	Use float instead of double for query vectors. (#46004 ) Currently, when using script_score functions like cosineSimilarity, the query vector is treated as an array of doubles. Since the stored document vectors use floats, it seems like the least surprising behavior for the query vectors to also be float arrays. In addition to improving consistency, this change may help with some optimizations we have been considering around vector dot product.	2019-08-28 11:03:14 -07:00
Mark Tozzi	9ac85a4a2b	Fix compilation in CumulativeCardinalityAggregatorTests	2019-08-28 09:31:48 -04:00
Dimitris Athanasiou	25d64508f6	[7.x][ML] Support boolean fields for DF analytics (#46037 ) (#46054 ) This commit adds support for `boolean` fields in data frame analytics (and currently both outlier detection and regression). The analytics process expects `boolean` fields to be encoded as integers with 0 or 1 value.	2019-08-28 12:02:29 +03:00
Jake Landis	154d1dd962	Watcher max_iterations with foreach action execution (#45715 ) (#46039 ) Prior to this commit the foreach action execution had a hard coded limit to 100 iterations. This commit allows the max number of iterations to be a configuration ('max_iterations') on the foreach action. The default remains 100.	2019-08-27 16:57:20 -05:00
Armin Braun	fdef293c81	Fix RegressionTests#fromXContent (#46029 ) * The `trainingPercent` must be between `1` and `100`, not `0` and `100` which is causing test failures	2019-08-27 18:24:26 +03:00
Dimitris Athanasiou	873ad3f942	[7.x][ML] Add option to regression to randomize training set (#45969 ) (#46017 ) Adds a parameter `training_percent` to regression. The default value is `100`. When the parameter is set to a value less than `100`, from the rows that can be used for training (ie. those that have a value for the dependent variable) we randomly choose whether to actually use for training. This enables splitting the data into a training set and the rest, usually called testing, validation or holdout set, which allows for validating the model on data that have not been used for training. Technically, the analytics process considers as training the data that have a value for the dependent variable. Thus, when we decide a training row is not going to be used for training, we simply clear the row's dependent variable.	2019-08-27 17:53:11 +03:00
Yogesh Gaikwad	7b6246ec67	Add `manage_own_api_key` cluster privilege (#45897 ) (#46023 ) The existing privilege model for API keys with privileges like `manage_api_key`, `manage_security` etc. are too permissive and we would want finer-grained control over the cluster privileges for API keys. Previously APIs created would also need these privileges to get its own information. This commit adds support for `manage_own_api_key` cluster privilege which only allows api key cluster actions on API keys owned by the currently authenticated user. Also adds support for retrieval of the API key self-information when authenticating via API key without the need for the additional API key privileges. To support this privilege, we are introducing additional authentication context along with the request context such that it can be used to authorize cluster actions based on the current user authentication. The API key get and invalidate APIs introduce an `owner` flag that can be set to true if the API key request (Get or Invalidate) is for the API keys owned by the currently authenticated user only. In that case, `realm` and `username` cannot be set as they are assumed to be the currently authenticated ones. The changes cover HLRC changes, documentation for the API changes. Closes #40031	2019-08-28 00:44:23 +10:00
Dimitris Athanasiou	dd6c13fdf9	[ML] Add description to DF analytics (#45774 ) (#46019 )	2019-08-27 15:48:59 +03:00
Albert Zaharovits	1ebee5bf9b	PKI realm authentication delegation (#45906 ) This commit introduces PKI realm delegation. This feature supports the PKI authentication feature in Kibana. In essence, this creates a new API endpoint which Kibana must call to authenticate clients that use certificates in their TLS connection to Kibana. The API call passes to Elasticsearch the client's certificate chain. The response contains an access token to be further used to authenticate as the client. The client's certificates are validated by the PKI realms that have been explicitly configured to permit certificates from the proxy (Kibana). The user calling the delegation API must have the delegate_pki privilege. Closes #34396	2019-08-27 14:42:46 +03:00
Zachary Tong	943a016bb2	Add Cumulative Cardinality agg (and Data Science plugin) (#45990 ) This adds a pipeline aggregation that calculates the cumulative cardinality of a field. It does this by iteratively merging in the HLL sketch from consecutive buckets and emitting the cardinality up to that point. This is useful for things like finding the total "new" users that have visited a website (as opposed to "repeat" visitors). This is a Basic+ aggregation and adds a new Data Science plugin to house it and future advanced analytics/data science aggregations.	2019-08-26 16:19:55 -04:00
Benjamin Trent	a3a4ae0ac2	[ML] fixing bug where analytics process starts with 0 rows (#45879 ) (#45988 ) The native process requires that there be a non-zero number of rows to analyze. If the flag --rows 0 is passed to the executable, it throws and does not start. When building the configuration for the process we should not start the native process if there are no rows. Adding some logging to indicate what is occurring.	2019-08-26 14:18:17 -05:00
Benjamin Trent	d64018f8e1	[ML] add supported types to no fields error message (#45926 ) (#45987 ) * [ML] add supported types to no fields error message * adding supported types to logger debug	2019-08-26 14:18:00 -05:00
Jake Landis	767f648f8e	Watcher add email warning if CSV attachment contains formulas (#44460 ) (#45557 ) * Watcher add email warning if CSV attachment contains formulas (#44460) This commit introduces a Warning message to the emails generated by Watcher's reporting action. This change complements Kibana's CSV formula notifications (see elastic/kibana#37930). This is implemented by reading a header (kbn-csv-contains-formulas) provided by Kibana to notify to attach the Warning to the email. The wording of the warning is borrowed from Kibana's UI and may be overridden by a dynamic setting xpack.notification.reporting.warning.kbn-csv-contains-formulas.text. This warning is enabled by default, but may be disabled via a dynamic setting xpack.notification.reporting.warning.enabled.	2019-08-26 08:35:33 -05:00
Jake Landis	f2241a152f	watcher tests - increase stop timeout to 60s (#45679 ) (#45934 ) As of #43939 Watcher tests now correctly block until all Watch executions kicked off by that test are finished. Prior we allowed tests to finish with outstanding watch executions. It was known that this would increase the time needed to finish a test. However, running the tests on CI can be slow and on at least 1 occasion it took 60s to actually finish. This PR simply increases the max allowable timeout for Watcher tests to clean up after themselves.	2019-08-26 08:34:54 -05:00
Andrey Ershov	479ab9b8db	Fix plaintext on TLS port logging (#45852 ) Today if non-TLS record is received on TLS port generic exception will be logged with the stack-trace. SSLExceptionHelper.isNotSslRecordException method does not work because it's assuming that NonSslRecordException would be top-level. This commit addresses the issue and the log would be more concise. (cherry picked from commit 6b83527bf0c23d4d5b97fab7f290c43432945d4f)	2019-08-26 12:32:35 +02:00
Ioannis Kakavas	2bee27dd54	Allow Transport Actions to indicate authN realm (#45946 ) This commit allows the Transport Actions for the SSO realms to indicate the realm that should be used to authenticate the constructed AuthenticationToken. This is useful in the case that many authentication realms of the same type have been configured and where the caller of the API(Kibana or a custom web app) already know which realm should be used so there is no need to iterate all the realms of the same type. The realm parameter is added in the relevant REST APIs as optional so as not to introduce any breaking change.	2019-08-25 19:36:41 +03:00
Jason Tedor	040a810b3c	Add deprecation check for pidfile setting (#45939 ) The pidfile setting is deprecated. This commit adds a deprecation check for usage of this setting.	2019-08-24 17:19:20 -04:00
Jason Tedor	43ca652d11	Add deprecation check for processors (#45925 ) The processors setting is deprecated. This commit adds a deprecation check for the use of the processors setting.	2019-08-23 20:16:40 -04:00
Jason Tedor	6b116a48f3	Skip feature aware check on JDK 14 (#45928 ) ASM can not currently handle classes compiled with JDK 14. This commit skips these checks on JDK 14, for now.	2019-08-23 17:38:15 -04:00
Dimitris Athanasiou	be554fe5f0	[7.x][ML] Improve progress reportings for DF analytics (#45856 ) (#45910 ) Previously, the stats API reports a progress percentage for DF analytics tasks that are running and are in the `reindexing` or `analyzing` state. This means that when the task is `stopped` there is no progress reported. Thus, one cannot distinguish between a task that never run to one that completed. In addition, there are blind spots in the progress reporting. In particular, we do not account for when data is loaded into the process. We also do not account for when results are written. This commit addresses the above issues. It changes progress to being a list of objects, each one describing the phase and its progress as a percentage. We currently have 4 phases: reindexing, loading_data, analyzing, writing_results. When the task stops, progress is persisted as a document in the state index. The stats API now reports progress from in-memory if the task is running, or returns the persisted document (if there is one).	2019-08-23 23:04:39 +03:00
Benjamin Trent	b756e1b9be	[ML][Transforms] adjusting when and what to audit (#45876 ) (#45916 ) * [ML][Transforms] adjusting when and what to audit * Update DataFrameTransformTask.java * removing unnecessary audit message	2019-08-23 13:53:02 -05:00
Benjamin Trent	94c2de65b9	[ML][Transforms] fix doSaveState check (#45882 ) (#45902 ) * [ML][Transforms] fix doSaveState check * removing unnecessary log statement	2019-08-23 09:38:52 -05:00
Alexander Reelsen	ecafe4f4ad	Update joda to 2.10.3 (#45495 )	2019-08-23 10:39:39 +02:00
markharwood	217e41ab6c	Search - added HLRC support for PinnedQueryBuilder (#45779 ) (#45853 ) Added HLRC support for PinnedQueryBuilder Related #44074	2019-08-23 09:22:17 +01:00
Przemysław Witek	85d55e30d0	Add test that proves _timing_stats document is deleted when the job is deleted (#45840 ) (#45854 )	2019-08-23 07:03:09 +02:00
Przemysław Witek	2ed19b2c81	Put error message from inside the process into the exception that is thrown when the process doesn't start correctly. (#45846 ) (#45875 )	2019-08-23 07:02:50 +02:00
Tim Vernum	f94e4a9151	Set security index refresh interval to 1s (#45888 ) The security indices were being created without specifying the refresh interval, which means it would inherit a value from any templates that exists. However, certain security functionality depends on being able to wait_for refresh, and causes errors (e.g. in Kibana) if that time exceeds 30s. This commit changes the security indices configuration to always be created with a 1s refresh interval. This prevents any templates from inadvertantly interfering with the proper functioning of security. It is possible for an administrator to explicitly change the refresh interval after the indices have been created. Backport of: #45434	2019-08-23 12:41:37 +10:00
Tim Vernum	029725fc35	Add SSL/TLS settings for watcher email (#45836 ) This change adds a new SSL context xpack.notification.email.ssl.* that supports the standard SSL configuration settings (truststore, verification_mode, etc). This SSL context is used when configuring outbound SMTP properties for watcher email notifications. Backport of: #45272	2019-08-23 10:13:51 +10:00
Nhat Nguyen	3393f9599e	Ignore translog retention policy if soft-deletes enabled (#45473 ) Since #45136, we use soft-deletes instead of translog in peer recovery. There's no need to retain extra translog to increase a chance of operation-based recoveries. This commit ignores the translog retention policy if soft-deletes is enabled so we can discard translog more quickly. Backport of #45473 Relates #45136	2019-08-22 16:40:06 -04:00
Benjamin Trent	8e3c54fff7	[7.x] [ML] Adding data frame analytics stats to _usage API (#45820 ) (#45872 ) * [ML] Adding data frame analytics stats to _usage API (#45820) * [ML] Adding data frame analytics stats to _usage API * making the size of analytics stats 10k * adjusting backport	2019-08-22 15:15:41 -05:00
Benjamin Trent	dff3e636c2	[ML][Transforms] unifying logging, adding some more logging (#45788 ) (#45859 ) * [ML][Transforms] unifying logging, adding some more logging * using parameterizedMessage instead of string concat * fixing bracket closure	2019-08-22 13:15:07 -05:00
Benjamin Trent	e50a78cf50	[ML-DataFrame] version data frame transform internal index (#45375 ) (#45837 ) Adds index versioning for the internal data frame transform index. Allows for new indices to be created and referenced, `GET` requests now query over the index pattern and takes the latest doc (based on INDEX name).	2019-08-22 11:46:30 -05:00
Jake Landis	1dab73929f	Watcher add stopped listener (#43939 ) (#45670 ) When Watcher is stopped and there are still outstanding watches running Watcher will report it self as stopped. In normal cases, this is not problematic. However, for integration tests Watcher is started and stopped between each test to help ensure a clean slate for each test. The tests are blocking only on the stopped state and make an implicit assumption that all watches are finished if the Watcher is stopped. This is an incorrect assumption since Stopped really means, "I will not accept any more watches". This can lead to un-predictable behavior in the tests such as message : "Watch is already queued in thread pool" and state: "not_executed_already_queued". This can also change the .watcher-history if watches linger between tests. This commit changes the semantics of a manual stopping watcher to now mean: "I will not accept any more watches AND all running watches are complete". There is now an intermediary step "Stopping" and callback to allow transition to a "Stopped" state when all Watches have completed. Additionally since this impacts how long the tests will block waiting for a "Stopped" state, the timeout has been increased. Related: #42409	2019-08-22 10:54:29 -05:00
Armin Braun	bfddaaa2ae	Acknowledge Indices Were Wiped Successfully in REST Tests (#45832 ) (#45842 ) In internal test clusters tests we check that wiping all indices was acknowledged but in REST tests we didn't. This aligns the behavior in both kinds of tests. Relates #45605 which might be caused by unacked deletes that were just slow.	2019-08-22 17:19:51 +02:00
Przemysław Witek	7512337922	[7.x] Allow the user to specify 'query' in Evaluate Data Frame request (#45775 ) (#45825 )	2019-08-22 11:14:26 +02:00
Benjamin Trent	3ebeaa2557	Fixing rollup state tests after onFailure ordering change (#45784 ) (#45814 ) After the PR #45676 onFailure is now called before the indexer state has transitioned out of indexing. To fix these tests, I added a new check to make sure that we don't mark it as failed until AFTER doSaveState is called with a STARTED indexer.	2019-08-21 14:46:09 -05:00
Gordon Brown	47b1e2b3d0	[7.x] Use rollover for SLM's history indices (#45686 ) Following our own guidelines, SLM should use rollover instead of purely time-based indices to keep shard counts low. This commit implements lazy index creation for SLM's history indices, indexing via an alias, and rollover in the built-in ILM policy.	2019-08-21 13:42:11 -06:00
William Brafford	2b549e7342	CLI tools: write errors to stderr instead of stdout (#45586 ) Most of our CLI tools use the Terminal class, which previously did not provide methods for writing to standard output. When all output goes to standard out, there are two basic problems. First, errors and warnings are "swallowed" in pipelines, making it hard for a user to know when something's gone wrong. Second, errors and warnings are intermingled with legitimate output, making it difficult to pass the results of interactive scripts to other tools. This commit adds a second set of print commands to Terminal for printing to standard error, with errorPrint corresponding to print and errorPrintln corresponding to println. This leaves it to developers to decide which output should go where. It also adjusts existing commands to send errors and warnings to stderr. Usage is printed to standard output when it's correctly requested (e.g., bin/elasticsearch-keystore --help) but goes to standard error when a command is invoked incorrectly (e.g. bin/elasticsearch-keystore list-with-a-typo \| sort).	2019-08-21 14:46:07 -04:00
Przemysław Witek	bf701b83d2	Shorten field names in EstimateMemoryUsageResponse (#45719 ) (#45772 )	2019-08-21 12:45:09 +02:00
Zachary Tong	6b391cd0d5	Mute ShapeQueryTests#testFieldAlias() Tracking issue: https://github.com/elastic/elasticsearch/issues/45628	2019-08-21 10:31:13 +01:00
David Kyle	982560afeb	Mute RollupIndexerStateTests See #45770	2019-08-21 10:05:15 +01:00
Przemysław Witek	c6709f0979	Mute tests affected by renaming fields in Estimate memory usage response (#45743 ) (#45766 )	2019-08-21 09:57:23 +02:00
Dimitris Athanasiou	d5c3d9b50f	[7.x][ML] Do not skip rows with missing values for regression (#45751 ) (#45754 ) Regression analysis support missing fields. Even more, it is expected that the dependent variable has missing fields to the part of the data frame that is not for training. This commit allows to declare that an analysis supports missing values. For such analysis, rows with missing values are not skipped. Instead, they are written as normal with empty strings used for the missing values. This also contains a fix to the integration test. Closes #45425	2019-08-21 08:15:38 +03:00
Benjamin Trent	ba7b677618	[ML] better handle empty results when evaluating regression (#45745 ) (#45759 ) * [ML] better handle empty results when evaluating regression * adding new failure test to ml_security black list * fixing equality check for regression results	2019-08-20 17:37:04 -05:00
Armin Braun	a01bd6c5a3	Stop Executing SLM Policy Transport Action on Snapshot Pool (#45727 ) (#45748 ) * Executing SLM policies on the snapshot thread will block until a snapshot finishes if the pool is completely busy executing that snapshot * Fixes #45594	2019-08-20 19:15:36 +02:00
Nhat Nguyen	99b21d50b8	Include leases in ccr errmsg when ops no longer available (#45681 ) The setting index.soft_deletes.retention.operations is no longer needed nor recommended in CCR. We, therefore, should hint users about the retention leases period setting instead when operations are no longer available for replicating.	2019-08-20 10:40:12 -04:00
Benjamin Trent	43bb5924e6	[ML][Data Frame] fixing _start?force=true bug (#45660 ) (#45734 ) * [ML][Data Frame] fixing _start?force=true bug * removing unused import * removing old TODO	2019-08-20 09:23:07 -05:00
Dimitris Athanasiou	49edf9e5b5	[7.x][ML] Remove timeout on waiting for DF analytics result processor to complete (#45724 ) (#45733 ) We cannot know how long the analysis will take to complete thus we should not have a timeout. Note that if the process crashes, the result processor will pick the exception due to the stream closing. Closes #45723	2019-08-20 17:21:40 +03:00
Przemysław Witek	b37ebd1adf	Prepare the codebase for new Auditor subclasses (#45716 ) (#45731 )	2019-08-20 16:03:50 +02:00
Przemysław Witek	80dd0a0948	Get rid of EstimateMemoryUsageRequest and EstimateMemoryUsageAction.Request. (#45718 ) (#45725 )	2019-08-20 15:49:17 +02:00
Benjamin Trent	88641a08af	[ML][Data frame] fixing failure state transitions and race condition (#45627 ) (#45656 ) * [ML][Data frame] fixing failure state transitions and race condition (#45627) There is a small window for a race condition while we are flagging a task as failed. Here are the steps where the race condition occurs: 1. A failure occurs 2. Before `AsyncTwoPhaseIndexer` calls the `onFailure` handler it does the following: a. `finishAndSetState()` which sets the IndexerState to STARTED b. `doSaveState(...)` which attempts to save the current state of the indexer 3. Another trigger is fired BEFORE `onFailure` can fire, but AFTER `finishAndSetState()` occurs. The trick here is that we will eventually set the indexer to failed, but possibly not before another trigger had the opportunity to fire. This could obviously cause some weird state interactions. To combat this, I have put in some predicates to verify the state before taking actions. This is so if state is indeed marked failed, the "second trigger" stops ASAP. Additionally, I move the task state checks INTO the `start` and `stop` methods, which will now require a `force` parameter. `start`, `stop`, `trigger` and `markAsFailed` are all `synchronized`. This should gives us some guarantees that one will not switch states out from underneath another. I also flag the task as `failed` BEFORE we successfully write it to cluster state, this is to allow us to make the task fail more quickly. But, this does add the behavior where the task is "failed" but the cluster state does not indicate as much. Adding the checks in `start` and `stop` will handle this "real state vs cluster state" race condition. This has always been a problem for `_stop` as it is not a master node action and doesn’t always have the latest cluster state. closes #45609 Relates to #45562 * [ML][Data Frame] moves failure state transition for MT safety (#45676) * [ML][Data Frame] moves failure state transition for MT safety * removing unused imports	2019-08-20 07:30:17 -05:00
markharwood	7d5ab17bb2	Search enhancement: pinned queries (#44345 ) (#45657 ) * Search enhancement: pinned queries (#44345) Search enhancement: - new query type allows selected documents to be promoted above any "organic” search results. This is the first feature in a new module `search-business-rules` which will house licensed (non OSS) logic for rewriting queries according to business rules. The PinnedQueryBuilder class offers a new `pinned` query in the DSL that takes an array of promoted IDs and an “organic” query and ensures the documents with the promoted IDs rank higher than the organic matches. Closes #44074	2019-08-20 11:38:22 +01:00
Costin Leau	0f51dd69cb	SQL: Improve serialization of SQL processors (#45678 ) Encapsulate the serialization/deserialization of SQL client classes. Make configuration specific parameters (such as ZoneId) generic just like the version and remove the need for consumer classes to manage them individually. This is not only consistent but also provides significant savings in the cursor. Fix #40216 (cherry picked from commit 5c844798045d7baa0d932289d2e3d1607ba6a9a4)	2019-08-20 11:50:47 +03:00
Przemysław Witek	7bc8400222	Call the new _estimate_memory_usage API endpoint on df analytics _start (#45536 ) (#45701 )	2019-08-19 21:37:55 +02:00
Costin Leau	1cd58c8ea8	SQL: Break TextFormatter/Cursor dependency (#45613 ) Improve the initialization and state passing of TextFormatter in CLI and TEXT mode by leveraging the Page listener hook. Additionally simplify the code inside RestSqlQueryAction. (cherry picked from commit a56db2fa119cf9e8748723e19f1fc9f6a8afe5fc)	2019-08-17 00:16:08 +03:00
Costin Leau	96883dd028	SQL: Refactor away the cycle between Rowset and Cursor (#45516 ) Improve encapsulation of pagination of rowsets by breaking the cycle between cursor and associated rowset implementation, all logic now residing inside each cursor implementation. (cherry picked from commit be8fe0a0ce562fe732fae12a0b236b5731e4638c)	2019-08-17 00:16:05 +03:00
Gordon Brown	ecb3ebd796	Clean SLM and ongoing snapshots in test framework (#45564 ) Adjusts the cluster cleanup routine in ESRestTestCase to clean up SLM test cases, and optionally wait for all snapshots to be deleted. Waiting for all snapshots to be deleted, rather than failing if any are in progress, is necessary for tests which use SLM policies because SLM policies may be in the process of executing when the test ends.	2019-08-16 14:17:34 -06:00
Igor Motov	98c850c08b	Geo: Change order of parameter in Geometries to lon, lat 7.x (#45618 ) Changes the order of parameters in Geometries from lat, lon to lon, lat and moves all Geometry classes are moved to the org.elasticsearch.geomtery package. Backport of #45332 Closes #45048	2019-08-16 14:42:02 -04:00
Luca Cavanna	c31cddf27e	Update the schema for the REST API specification (#42346 ) * Update the REST API specification This patch updates the REST API spefication in JSON files to better encode deprecated entities, to improve specification of URL paths, and to open up the schema for future extensions. Notably, it changes the `paths` from a list of strings to a list of objects, where each particular object encodes all the information for this particular path: the `parts` and the `methods`. Among the benefits of this approach is eg. encoding the difference between using the `PUT` and `POST` methods in the Index API, to either use a specific document ID, or let Elasticsearch generate one. Also `documentation` becomes an object that supports an `url` and also a `description` which is a new field. * Adapt YAML runner to new REST API specification format The logic for choosing the path to use when running tests has been simplified, as a consequence of the path parts being listed under each path in the spec. The special case for create and index has been removed. Also the parsing code has been hardened so that errors are thrown earlier when the structure of the spec differs from what expected, and their error messages should be more helpful.	2019-08-16 14:40:00 +02:00
Andrei Stefan	30a0711777	Remove deprecated use of "interval" method, in favor of "fixedInterval". (#45501 ) (cherry picked from commit 3fef65160f9e61883e9f8f7f345b814f945e2f4b)	2019-08-16 15:03:43 +03:00
Alpar Torok	7119e54be5	Mute data frame tests on 7.x Tracking in #45610 #45609	2019-08-15 17:07:53 +03:00
David Roberts	d40f3718f2	[ML] Muting 5 SSLErrorMessageTests tests on Windows (#45602 ) Due to https://github.com/elastic/elasticsearch/issues/45598	2019-08-15 11:05:00 +01:00
Benjamin Trent	fde5dae387	[ML][Data Frame] adjusting change detection workflow (#45511 ) (#45580 ) * [ML][Data Frame] adjusting change detection workflow * adjusting for PR comment * disallowing null as an argument value	2019-08-14 17:26:24 -05:00
Nick Knize	647a8308c3	[SPATIAL] Backport new ShapeFieldMapper and ShapeQueryBuilder to 7x (#45363 ) * Introduce Spatial Plugin (#44389) Introduce a skeleton Spatial plugin that holds new licensed features coming to Geo/Spatial land! * [GEO] Refactor DeprecatedParameters in AbstractGeometryFieldMapper (#44923) Refactor DeprecatedParameters specific to legacy geo_shape out of AbstractGeometryFieldMapper.TypeParser#parse. * [SPATIAL] New ShapeFieldMapper for indexing cartesian geometries (#44980) Add a new ShapeFieldMapper to the xpack spatial module for indexing arbitrary cartesian geometries using a new field type called shape. The indexing approach leverages lucene's new XYShape field type which is backed by BKD in the same manner as LatLonShape but without the WGS84 latitude longitude restrictions. The new field mapper builds on and extends the refactoring effort in AbstractGeometryFieldMapper and accepts shapes in either GeoJSON or WKT format (both of which support non geospatial geometries). Tests are provided in the ShapeFieldMapperTest class in the same manner as GeoShapeFieldMapperTests and LegacyGeoShapeFieldMapperTests. Documentation for how to use the new field type and what parameters are accepted is included. The QueryBuilder for searching indexed shapes is provided in a separate commit. * [SPATIAL] New ShapeQueryBuilder for querying indexed cartesian geometry (#45108) Add a new ShapeQueryBuilder to the xpack spatial module for querying arbitrary Cartesian geometries indexed using the new shape field type. The query builder extends AbstractGeometryQueryBuilder and leverages the ShapeQueryProcessor added in the previous field mapper commit. Tests are provided in ShapeQueryTests in the same manner as GeoShapeQueryTests and docs are updated to explain how the query works.	2019-08-14 16:35:10 -05:00
Benjamin Trent	0c343d8443	[7.x] [ML][Transforms] adjusting stats.progress for cont. transforms (#45361 ) (#45551 ) * [ML][Transforms] adjusting stats.progress for cont. transforms (#45361) * [ML][Transforms] adjusting stats.progress for cont. transforms * addressing PR comments * rename fix * Adjusting bwc serialization versions	2019-08-14 13:08:27 -05:00
Przemysław Witek	df574e5168	[7.x] Implement ml/data_frame/analytics/_estimate_memory_usage API endpoint (#45188 ) (#45510 )	2019-08-14 08:26:03 +02:00
Gordon Brown	3f5dab99c3	Properly set origin for SLM history store client (#45515 ) The origin was not set properly for the SnapshotHistoryStore client, resulting in errors when SLM was used when security was enabled.	2019-08-13 18:23:20 -06:00
Andrei Stefan	adf8e20021	SQL: adds format parameter to range queries for constant date comparisons (#45503 ) * Add format parameter to the range queries built for CURRENT_* functions used in comparison conditions * Use range queries for date fields equality/non-equality as well. (cherry picked from commit c1e81e90f937ee5a002524d632bfce74d76962f9)	2019-08-13 23:04:30 +03:00
Armin Braun	90803a5caf	Reenable Integ Tests in native-multi-node-tests (#45482 ) (#45496 ) * Reenable Integ Tests in native-multi-node-tests * The tests broken here were likely fixed by #45463 => let's reenable them and see if things run fine again * Relates #45405, #45455	2019-08-13 15:55:54 +02:00
Alexander Reelsen	dd527b4e91	Fix watcher HttpClient URL creation (#45207 ) The http client could end up creating URLs, that did not resemble the original one, when encoding. This fixes a couple of corner cases, where too much or too few slashes were added to an URI. Closes #44970	2019-08-13 12:15:54 +02:00
Przemysław Witek	1aed388a24	Add view_index_metadata to roles.yml and remove as many df analytics test cases from build.gradle blacklist as possible. (#45451 ) (#45465 )	2019-08-13 08:31:58 +02:00
Yogesh Gaikwad	471d940c44	Refactor cluster privileges and cluster permission (#45265 ) (#45442 ) The current implementations make it difficult for adding new privileges (example: a cluster privilege which is more than cluster action-based and not exposed to the security administrator). On the high level, we would like our cluster privilege either: - a named cluster privilege This corresponds to `cluster` field from the role descriptor - or a configurable cluster privilege This corresponds to the `global` field from the role-descriptor and allows a security administrator to configure them. Some of the responsibilities like the merging of action based cluster privileges are now pushed at cluster permission level. How to implement the predicate (using Automaton) is being now enforced by cluster permission. `ClusterPermission` helps in enforcing the cluster level access either by performing checks against cluster action and optionally against a request. It is a collection of one or more permission checks where if any of the checks allow access then the permission allows access to a cluster action. Implementations of cluster privilege must be able to provide information regarding the predicates to the cluster permission so that can be enforced. This is enforced by making implementations of cluster privilege aware of cluster permission builder and provide a way to specify how the permission is to be built for a given privilege. This commit renames `ConditionalClusterPrivilege` to `ConfigurableClusterPrivilege`. `ConfigurableClusterPrivilege` is a renderable cluster privilege exposed as a `global` field in role descriptor. Other than this there is a requirement where we would want to know if a cluster permission is implied by another cluster-permission (`has-privileges`). This is helpful in addressing queries related to privileges for a user. This is not just simply checking of cluster permissions since we do not have access to runtime information (like request object). This refactoring does not try to address those scenarios. Relates #44048	2019-08-13 09:06:18 +10:00
Mark Vieira	7e3379444b	Fix build failure due to unknown task and disable test conventions (cherry picked from commit 8ed84bc5cef9bcfae6c817059f764d97e4451a4a)	2019-08-12 09:18:39 -07:00
Przemyslaw Gomulka	421e9b8e8b	Mute integ tests in native-multi-node-tests (#45457 ) Tracked at #45405	2019-08-12 17:42:24 +02:00
Przemyslaw Gomulka	d11ae08467	Muting ForecastIT.testOverflowToDisk (#45435 ) (#45438 ) awaits #45405	2019-08-12 11:01:32 +02:00

1 2 3 4 5 ...

3374 Commits