OpenSearch

Commit Graph

Author	SHA1	Message	Date
Marios Trivyzas	3233bce8cb	SQL: Fix issue with negative literels and parentheses (#48113 ) Previously when a numeric literal was enclosed in parentheses and then negated, the negation was lost and the number was considered positive, e.g.: `-(5)` was considered as `5` instead of `-5` `- ( (1.28) )` was considered as `1.28` instead of `-1.28` Fixes: #48009 (cherry picked from commit 4dee4bf3b34081062ba2e28ab8524a066812a180)	2019-10-16 12:56:35 +02:00
Przemysław Witek	8f815240b3	[7.x] Allow integer types for classification's dependent variable (#47902 ) (#48080 )	2019-10-16 11:09:56 +02:00
Alex Pang	09604dbaea	[DOCS] Fix truststores typo (#47738 )	2019-10-15 15:50:54 -04:00
David Roberts	d9c7e3847e	[TEST] Don't assert order of data frame analytics audit messages (#48065 ) Audit messages are stored with millisecond timestamps. If two messages have the same millisecond timestamp then asserting on their order is impossible given the information available. This PR changes the assertion on audit messages in the native data frame analytics tests to assert that the expected audit messages exist in any order. Fixes #48035	2019-10-15 19:59:52 +01:00
Przemysław Witek	eaa56344b5	Verify that the failure reason of analytics process is empty (#48042 ) (#48071 )	2019-10-15 18:33:20 +02:00
Martijn van Groningen	aff0c9babc	This commits merges (#48040 ) the enrich-7.x feature branch, which is backport merge and adds a new ingest processor, named enrich processor, that allows document being ingested to be enriched with data from other indices. Besides a new enrich processor, this PR adds several APIs to manage an enrich policy. An enrich policy is in charge of making the data from other indices available to the enrich processor in an efficient manner. Related to #32789	2019-10-15 17:31:45 +02:00
Hendrik Muhs	b2ce72850b	[7.5][Transform] prevent assignment if any node is older than 7.4 (#48055 ) disable task assignment of transforms if any node uses version 7.2 or 7.2 (mixed cluster). fixes #48019	2019-10-15 16:14:39 +02:00
Marios Trivyzas	7fddf198b7	SQL: Implement DATEDIFF function (#47920 ) Implement DATEDIFF/TIMESTAMPDIFF function as per the MS-SQL spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/datediff-transact-sql?view=sql-server-2017 which allows a user to substract two date/datetime fields and return the difference in the date/time unit specified. Closes: #47919 (cherry picked from commit 745699f38dc8222670ffd65b66df33b5da39040b)	2019-10-15 15:12:11 +02:00
Hendrik Muhs	4aa7c7bad6	[Transform] add alias for backwards compatibility with 7.4 (#48049 ) add alias for backwards compatibility with 7.4 relates #47943	2019-10-15 15:04:09 +02:00
Przemysław Witek	620bd9d224	Enable test testSingleNumericFeatureAndMixedTrainingAndNonTrainingRows_TopClassesRequested now that top classes are correctly reported by C++. (#48043 ) (#48053 )	2019-10-15 14:49:16 +02:00
Benjamin Trent	361e7ad0ef	[ML][Transforms] fix bwc serialization with 7.3 (#48021 ) (#48048 )	2019-10-15 07:52:13 -04:00
David Roberts	83321b0e5e	[ML] Fix isNoop() for datafeed update (#48046 ) max_empty_searches = -1 in a datafeed update implies max_empty_searches will be unset on the datafeed when the update is applied. The isNoop() method needs to take this -1 to null equivalence into account.	2019-10-15 12:28:53 +01:00
Marios Trivyzas	6589617a51	SQL: Fix arg verification for DateAddProcessor (#48041 ) Previously, the safety check for the 2nd argument of the DateAddProcessor was restricting it to Integer which was wrong since we allow all non-rational numbers, so it's changed to a Number check as it's done in other cases. Enhanced some tests regarding the check for an integer (non-rational argument). (cherry picked from commit 0516b6eaf5eb98fa5bd087c3fece80139a6b118e)	2019-10-15 12:52:11 +02:00
David Roberts	984323783e	[ML][7.x] Add lazy assignment job config option (#47993 ) This change adds: - A new option, allow_lazy_open, to anomaly detection jobs - A new option, allow_lazy_start, to data frame analytics jobs Both work in the same way: they allow a job to be opened/started even if no ML node exists that can accommodate the job immediately. In this situation the job waits in the opening/starting state until ML node capacity is available. (The starting state for data frame analytics jobs is new in this change.) Additionally, the ML nightly maintenance tasks now creates audit warnings for ML jobs that are unassigned. This means that jobs that cannot be assigned to an ML node for a very long time will show a yellow warning triangle in the UI. A final change is that it is now possible to close a job that is not assigned to a node without using force. This is because previously jobs that were open but not assigned to a node were an aberration, whereas after this change they'll be relatively common.	2019-10-15 06:55:11 +01:00
Martijn van Groningen	77164e9017	adjusted minimal supported version	2019-10-15 07:45:00 +02:00
Martijn van Groningen	cc4b6c43b3	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-10-15 07:23:47 +02:00
Martijn van Groningen	51c33f3edf	remove eclipse conditional	2019-10-15 07:18:32 +02:00
Martijn van Groningen	c4b1a3045a	Fixed test, take into account that Map can be the result if max_matches is 1.	2019-10-15 07:03:01 +02:00
James Baiera	18d7e32b7d	Add wait for completion for Enrich policy execution (#47886 ) This PR adds the ability to run the enrich policy execution task in the background, returning a task id instead of waiting for the completed operation.	2019-10-14 16:05:28 -04:00
Martijn van Groningen	7fc9198d46	Change how `max_matches` affects `target_field` option. (#47982 ) Prior to this change the `target_field` would always be a json array field in the document being ingested. This to take into account that multiple enrich documents could be inserted into the `target_field`. However the default `max_matches` is `1`. Meaning that by default only a single enrich document would be added to `target_field` json array field. This commit changes this; if `max_matches` is set to `1` then the single document would be added as a json object to the `target_field` and if it is configured to a higher value then the enrich documents will be added as a json array (even if a single enrich document happens to be enriched).	2019-10-14 21:09:48 +02:00
Jake Landis	5a4745ae69	Re-enable Watcher full cluster restart test (#47950 ) (#48000 ) This test is believed to be fixed by #43939 closes #40178	2019-10-14 13:40:28 -05:00
Hendrik Muhs	17d8ee9a9c	[Transform] wait for deprecated index shards to get active (#47997 ) wait for deprecated index shards to get active	2019-10-14 20:14:30 +02:00
Gordon Brown	699d4d4c6f	Manage retention of partial snapshots in SLM (#47833 ) Currently, partial snapshots will eventually build up unless they are manually deleted. Partial snapshots may be useful if there is not a more recent successful snapshot, but should eventually be deleted if they are no longer useful. With this change, partial snapshots are deleted using the following strategy: PARTIAL snapshots will be kept until the configured expire_after period has passed, if present, and then be deleted. If there is no configured expire_after in the retention policy, then they will be deleted if there is at least one more recent successful snapshot from this policy (as they may otherwise be useful for troubleshooting purposes). Partial snapshots are not counted towards either min_count or max_count.	2019-10-14 10:19:57 -06:00
David Roberts	1ca25bed38	[ML][7.x] Add option to stop datafeed that finds no data (#47995 ) Adds a new datafeed config option, max_empty_searches, that tells a datafeed that has never found any data to stop itself and close its associated job after a certain number of real-time searches have returned no data. Backport of #47922	2019-10-14 17:19:13 +01:00
Benjamin Trent	508db4589b	[ML][Transforms] signal listener early on stop failure (#47954 ) (#48002 )	2019-10-14 11:17:11 -04:00
Ioannis Kakavas	2b1372adfd	File based role mappings vs the role mapping APIs (#47015 ) (#47978 ) Make clear in the docs that the role mapping APIs is the preferred way to manage role mappings and that the role mappings that are defined in files cannot be viewed or managed with the APIs	2019-10-14 17:55:46 +03:00
Tanguy Leroux	c2a3e83427	Remove unused transport action from TransportFreezeIndexAction (#47992 ) Removes unnecessary TransportCloseIndexAction from TransportFreezeIndexAction	2019-10-14 16:20:37 +02:00
Martijn van Groningen	d4901a71d7	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-10-14 10:27:17 +02:00
Ioannis Kakavas	9ee7b3743e	Add FIPS 140 mode to XPack Usage API (#47278 ) (#47976 ) This change adds support for the FIPS 140 mode feature to be retrieved via the XPack Usage API.	2019-10-14 10:40:24 +03:00
David Roberts	46ae86ac31	[ML] Fix detection of syslog-like timestamp in find_file_structure (#47970 ) Usually syslog timestamps have two spaces before a single digit day-of-month. However, in some non-syslog cases where syslog-like timestamps are used there is only one space. The grok pattern supports this, so the timestamp parser should too. This change makes the find_file_structure endpoint do this. Also fixes another problem that the same test case exposed in the find_file_structure endpoint, which was that the exclude_lines_pattern for delimited files was always created on the assumption the delimiter was a comma. Now it is based on the actual delimiter.	2019-10-13 20:07:54 +01:00
Tanguy Leroux	742fa818b8	Add Pause/Resume Auto Follower APIs (#47510 ) (#47904 ) This commit adds two APIs that allow to pause and resume CCR auto-follower patterns: // pause auto-follower POST /_ccr/auto_follow/my_pattern/pause // resume auto-follower POST /_ccr/auto_follow/my_pattern/resume The ability to pause and resume auto-follow patterns can be useful in some situations, including the rolling upgrades of cluster using a bi-directional cross-cluster replication scheme (see #46665). This commit adds a new active flag to the AutoFollowPattern and adapts the AutoCoordinator and AutoFollower classes so that it stops to fetch remote's cluster state when all auto-follow patterns associate to the remote cluster are paused. When an auto-follower is paused, remote indices that match the pattern are just ignored: they are not added to the pattern's followed indices uids list that is maintained in the local cluster state. This way, when the auto-follow pattern is resumed the indices created in the remote cluster in the meantime will be picked up again and added as new following indices. Indices created and then deleted in the remote cluster will be ignored as they won't be seen at all by the auto-follower pattern at resume time. Backport of #47510 for 7.x	2019-10-13 09:22:51 +02:00
Marios Trivyzas	65717f6f42	SQL: Fix Nullability of DATEADD (#47921 ) Previously, Nullability was set to UNKNOWN instead of TRUE which resulted on QueryFolder not correctly folding to NULL if any of the args was null. Remove the overriding nullable() also for DatePart/DateTrunc to allow delegation the parent class. (cherry picked from commit 05a7108e133b5ae7bec2257db5ae2d30ad926ee2)	2019-10-12 13:25:08 +02:00
Yogesh Gaikwad	ac209c142c	Remove uniqueness constraint for API key name and make it optional (#47549 ) (#47959 ) Since we cannot guarantee the uniqueness of the API key `name` this commit removes the constraint and makes this field optional. Closes #46646	2019-10-12 22:22:16 +11:00
Przemyslaw Gomulka	6ab58de7ef	[7.x] Enable ResolverStyle.STRICT for java formatters backport(#46675 ) (#47913 ) Joda was using ResolverStyle.STRICT when parsing. This means that date will be validated to be a correct year, year-of-month, day-of-month However, we also want to make it works with Year-Of-Era as Joda used to, hence custom temporalquery.localdate in DateFormatters.from Within DateFormatters we use the correct uuuu year instead of yyyy year of era worth noting: if yyyy(without an era) is used in code, the parsing result will be a TemporalAccessor which will fail to be converted into LocalDate. We mostly use DateFormatters.from so this takes care of this. If possible the uuuu format should be used.	2019-10-11 21:19:56 +02:00
Benjamin Trent	627faf1850	[7.x] [ML][Analytics] fix bug where regression deleted early does not delete state (#47885 ) (#47914 ) * [ML][Analytics] fix bug where regression deleted early does not delete state (#47885) * [ML][Analytics] fix bug where regression deleted early does not delete state * Fixing ml with security test failure * fixing for older java	2019-10-11 15:11:16 -04:00
Nick Knize	68eaa21d77	Mute testBasicFailureRetention (#47940 )	2019-10-11 14:03:46 -05:00
Chris Roberson	c57191b163	[Monitoring] Add new cluster privilege now necessary for the stack monitoring ui (#47871 ) (#47915 ) * Add new cluster privilege now necessary for the stack monitoring ui * PR feedback, and add test	2019-10-11 14:54:59 -04:00
Benjamin Trent	1636fa5f15	[ML][Transforms] Muting tests in 7.x (#47946 )	2019-10-11 14:49:20 -04:00
James Baiera	73263c654a	Add basic task support for executing enrich policies (#47523 ) Changes the execution logic to create a new task using the execute request, and attaches the new task to the policy runner to be updated. Also, a new response is now returned from the execute api, which contains either the task id of the execution, or the completed status of the run. The fields are mutually exclusive to make it easier to discern what type of response it is.	2019-10-11 13:32:06 -04:00
Hendrik Muhs	0ca53bd80e	add BWC alias for internal index create an alias for old nodes to retrieve new documents in the internal index as they do not know the new index pattern	2019-10-11 17:15:01 +02:00
Ioannis Kakavas	33705c4b95	Document SAML APIs (#45105 ) (#47909 ) This change adds documentation for the SAML APIs in Elasticsearch and adds simple instructions on how these APIs can be used to authenticate a user with SAML by a custom web application other than Kibana. Resolves: #40352	2019-10-11 16:34:11 +03:00
Przemysław Witek	c62fe8c344	Require that the dependent variable column has at most 2 distinct values in classfication analysis. (#47858 ) (#47906 )	2019-10-11 14:57:08 +02:00
Hendrik Muhs	3da91d5f7a	[Transform] Rename internal indexes for transform plugin (#47788 ) (#47900 ) rename internal indexes of transform plugin - rename audit index and create an alias for accessing it, BWC: add an alias for old indexes to keep them working, kibana UI will switch to use the read alias - rename config index and provide BWC to read from old and new ones	2019-10-11 14:16:17 +02:00
Hendrik Muhs	5dd6bd6f49	do not assert on state in mixed cluster due to endpoint differences (#47898 ) do not assert on state in mixed cluster due to endpoint differences between 7.3 and 7.4 regression #46452 fixes #47693	2019-10-11 12:27:54 +02:00
Hendrik Muhs	fd1c4c198a	[Transform] fixes tests which might fail due to auto-stop (#47867 ) Batch transforms automatically stop after all data has processed, therefore tests can not reliable test the state. This change rewrites tests to remove the unreliable tests or use continuous transforms instead as they do not auto-stop. fixes #47441	2019-10-11 11:10:38 +02:00
Alexander Reelsen	e60221d2bd	Update jakarta mail dependency to 1.6.4 (#47810 ) This one contains a few small bugfixes, see https://eclipse-ee4j.github.io/mail/docs/CHANGES.txt	2019-10-11 09:24:11 +02:00
Armin Braun	48823b1112	Fix SLMSnapshotBlockingIntegTests (#47841 ) (#47863 ) One of the tests in this suit stops a master node, plus we're doing other node starts in this suit. => the internal test cluster should be TEST and not `SUITE` scoped to avoid random failures like the one in #47834 Closes #47834	2019-10-10 18:41:57 +02:00
Marios Trivyzas	59b3294bc9	SQL: Implement DATEADD function (#47747 ) Implement DATEADD/TIMESTAMPADD function as per the MS-SQL spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/dateadd-transact-sql?view=sql-server-2017 which allows a user to add/subtract specified number of specified units to/from a date/datetime field/expression. Closes: #47746 (cherry picked from commit e624bc281bebb4bbe0b0c2e0a8cbc712e50097a8)	2019-10-10 16:22:13 +02:00
Igor Motov	b5afa95fd8	Fix Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart Tracked by #47612	2019-10-10 18:17:01 +04:00
Igor Motov	17433e79d8	Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart Tracked by #47612	2019-10-10 17:56:23 +04:00
Costin Leau	dc6f0f9dc7	SQL: Re-enable muted test Close #47080 (cherry picked from commit 63a0aa7b392f565ea01ac478fec1dd91a80202e5)	2019-10-10 15:47:47 +03:00
Martijn van Groningen	102016d571	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-10-10 14:44:05 +02:00
Christoph Büscher	f07de06cdd	Ensure random timestamps are within search boundary (#38753 ) (#47787 ) The random timestamps were landing too close to the current time, so an unlucky rollup interval would round such that the doc wasn't included in the search range (and thus not "rolled up") which would then fail the test. The fix is to make sure the timestamp of all docs is sufficiently behind 'now' that the possible rounding intervals will always include them. Backport of #38753 to 7.x where the test was still muted.	2019-10-10 14:38:01 +02:00
Marios Trivyzas	c1f30e34ff	SQL: Refactor binary date time functions (#47786 ) Refactor DateTrunc and DatePart to use separate Pipe classes which allows the removal of the BinaryDateOperation enum. (cherry picked from commit a6075e7718dff94a90dbc0795dd924dcb7641092)	2019-10-10 13:52:41 +02:00
Andrei Stefan	6a4bf5de2c	SQL: make date/datetime and interval types compatible in conditional functions (#47595 ) (cherry picked from commit 6ff953e6396d7cc90640419aee5d036954e2eae3)	2019-10-10 13:58:35 +03:00
Hendrik Muhs	0e7869128a	[7.5][Transform] introduce new roles and deprecate old ones (#47780 ) (#47819 ) deprecate data_frame_transforms_{user,admin} roles and introduce transform_{user,admin} roles as replacement	2019-10-10 10:31:24 +02:00
Martijn van Groningen	aace42d38d	Add HLRC support for enrich stats API (#47306 ) This PR also includes HLRC docs for the enrich stats api. Relates to #32789	2019-10-10 09:08:29 +02:00
Martijn van Groningen	19393fc5a7	match processor should handler values other than string properly (#47419 ) Currently if the document being ingested contains another field value than a string then the processor fails with an error. This commit changes the match processor to handle number values and array values correctly. If a json array is detected then the `terms` query is used instead of the `term` query.	2019-10-10 08:49:17 +02:00
Mark Vieira	0360a18f61	Mute test SLMSnapshotBlockingIntegTests.testRetentionWhileSnapshotInProgress Signed-off-by: Mark Vieira <portugee@gmail.com> (cherry picked from commit a8a7477c396554926f260d210364f009d85ae5f2)	2019-10-09 15:38:24 -07:00
Armin Braun	302e09decf	Simplify some Common ActionRunnable Uses (#47799 ) (#47828 ) Especially in the snapshot code there's a lot of logic chaining `ActionRunnables` in tricky ways now and the code is getting hard to follow. This change introduces two convinience methods that make it clear that a wrapped listener is invoked with certainty in some trickier spots and shortens the code a bit.	2019-10-09 23:29:50 +02:00
Gordon Brown	9b3790d4f2	Mute "Test All Indexes Lifecycle Explain" (#47317 )	2019-10-09 21:32:58 +04:00
Tanguy Leroux	8f86469d3f	Do not auto-follow closed indices (#47721 ) (#47800 ) Backport of (#47721) for 7.x. Similarly to #47582, Auto-follow patterns creates following indices as long as the remote index matches the pattern and the remote primary shards are all started. But since 7.2 closed indices are also replicated, and it does not play well with CCR auto-follow patterns as they create following indices for closed leader indices too. This commit changes the getLeaderIndicesToFollow() so that closed indices are excluded from auto-follow patterns.	2019-10-09 19:16:23 +02:00
Jim Ferenczi	d96977202d	Disable SLMSnapshotBlockingIntegTests#testSnapshotInProgress (#47775 ) This test fails constantly in master and prs. Relates #47689	2019-10-09 17:49:13 +02:00
Jake Landis	43dc72f1a5	Fix cluster alert for watcher/monitoring IndexOutOfBoundsExcep… (#47756 ) If a cluster sending monitoring data is unhealthy and triggers an alert, then stops sending data the following exception [1] can occur. This exception stops the current Watch and the behavior is actually correct in part due to the exception. Simply fixing the exception introduces some incorrect behavior. Now that the Watch does not error in the this case, it will result in an incorrectly "resolved" alert. The fix here is two parts a) fix the exception b) fix the following incorrect behavior. a) fixing the exception is as easy as checking the size of the array before accessing it. b) fixing the following incorrect behavior is a bit more intrusive - Note - the UI depends on the success/met state for each condition to determine an "OK" or "FIRING" In this scenario, where an unhealthy cluster triggers an alert and then goes silent, it should keep "FIRING" until it hears back that the cluster is green. To keep the Watch "FIRING" either the index action or the email action needs to fire. Since the Watch is neither a "new" alert or a "resolved" alert, we do not want to keep sending an email (that would be non-passive too). Without completely changing the logic of how an alert is resolved allowing the index action to take place would result in the alert being resolved. Since we can not keep "FIRING" either the email or index action (since we don't want to resolve the alert nor re-write the logic for alert resolution), we will introduce a 3rd action. A logging action that WILL fire when the cluster is unhealthy. Specifically will fire when there is an unresolved alert and it can not find the cluster state. This logging action is logged at debug, so it should be noticed much. This logging action serves as an 'anchor' for the UI to keep the state in an a "FIRING" status until the alert is resolved. This presents a possible scenario where a cluster starts firing, then goes completely silent forever, the Watch will be "FIRING" forever. This is an edge case that already exists in some scenarios and requires manual intervention to remove that Watch. This changes changes to use a template-like method to populate the version_created for the default monitoring watches. The version is set to 7.5 since that is where this is first introduced. Fixes #43184	2019-10-09 10:47:21 -05:00
Martijn van Groningen	f8ebb75fcf	Reuse OperationRouting#searchShards(...) to select local enrich shard (#47359 ) The currently logic shard selecting logic selects a random shard copy instead of selecting the local shard copy and if local copy is not available then selecting a random shard copy. The latter is desired behaviour for enrich. By reusing `OperationRouting#searchShards(...)` we get the desired behaviour and reuse the same logic that the search api is using.	2019-10-09 17:31:43 +02:00
Yogesh Gaikwad	1139cce9a3	[DOCS] Add docs for `create_doc` index privilege (#47584 ) (#47778 ) This commit adds documentation for new index privilege create_doc which only allows indexing of new documents but no updates to existing documents via Index or Bulk APIs. Relates: #45806	2019-10-09 21:22:36 +11:00
Andrei Stefan	75a7daae73	SQL: use calendar interval of 1y instead of fixed interval for grouping by YEAR and HISTOGRAMs (#47558 ) (cherry picked from commit 55f5463eee4ecea3537df4b34645f1d87472a802)	2019-10-09 11:51:35 +03:00
Martijn van Groningen	be0e17770c	required change after merging in 7 dot x branch	2019-10-09 09:16:23 +02:00
Martijn van Groningen	da1e2ea461	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-10-09 09:06:13 +02:00
Lee Hinman	fb7abe9fa4	Separate SLM stop/start/status API from ILM (#47710 ) * Separate SLM stop/start/status API from ILM This separates a start/stop/status API for SLM from being tied to ILM's operation mode. These APIs look like: ``` POST /_slm/stop POST /_slm/start GET /_slm/status ``` This allows administrators to have fine-grained control over preventing periodic snapshots and deletions while performing cluster maintenance. Relates to #43663 * Allow going from RUNNING to STOPPED * Align with the OperationMode rules * Fix slmStopping method * Make OperationModeUpdateTask constructor private * Wipe snapshots better in test	2019-10-08 17:21:38 -06:00
Gordon Brown	a492864a9d	Manage retention of failed snapshots in SLM (#47617 ) Failed snapshots will eventually build up unless they are deleted. While failures may not take up much space, they add noise to the list of snapshots and it's desirable to remove them when they are no longer useful. With this change, failed snapshots are deleted using the following strategy: `FAILED` snapshots will be kept until the configured `expire_after` period has passed, if present, and then be deleted. If there is no configured `expire_after` in the retention policy, then they will be deleted if there is at least one more recent successful snapshot from this policy (as they may otherwise be useful for troubleshooting purposes). Failed snapshots are not counted towards either `min_count` or `max_count`.	2019-10-08 17:07:08 -06:00
James Baiera	b9fb354618	Add retry to force merge operation in EnrichPolicyRunner (#47178 ) Adds a check when running an Enrich policy to make sure that an Enrich index is force merged down to one segment, and if it was not fully merged, attempts the merge again, up to a configurable number of times.	2019-10-08 11:23:02 -04:00
Martijn van Groningen	8b7100eb1f	Don't remove indices to avoid monitoring from intermittently failing to index monitoring docs.	2019-10-08 17:10:42 +02:00
Jake Landis	b578059c90	Re-enable Watcher rest test (#47699 ) (#47705 ) This test is believed to be fixed by #43939 closes #43988	2019-10-08 09:45:27 -05:00
Dimitris Athanasiou	c1b0bfd74a	[7.x][ML] Unwrap exception causes before calling instanceof (#47676 ) (#47724 ) When exceptions could be returned from another node, the exception might be wrapped in a `RemoteTransportException`. In places where we handled specific exceptions using `instanceof` we ought to unwrap the cause first. This commit attempts to fix this issue after searching code in the ML plugin. Backport of #47676	2019-10-08 16:02:47 +03:00
Alpar Torok	36d018c909	Convert RunTask to use testclusers, remove ClusterFormationTasks (#47572 ) * Convert RunTask to use testclusers, remove ClusterFormationTasks This PR adds a new RunTask and a way for it to start a testclusters cluster out of band and block on it to replace the old RunTask that used ClusterFormationTasks. With this we can now remove ClusterFormationTasks.	2019-10-08 14:43:29 +03:00
Benjamin Trent	d33dbf82d4	[7.x] [ML][Inference] adjusting definition object schema and validation (#47447 ) (#47673 ) * [ML][Inference] adjusting definition object schema and validation (#47447) * [ML][Inference] adjusting definition object schema and validation * finalizing schema and fixing inference npe * addressing PR comments * fixing for backport	2019-10-08 07:11:05 -04:00
Hendrik Muhs	5e0e54f455	[Transform] move root endpoint to _transform with BWC layer (#47127 ) (#47682 ) move the main endpoint to /_transform/ from /_data_frame/transforms/ with providing backwards compatibility and deprecation warnings	2019-10-08 08:59:01 +02:00
Lee Hinman	91988c7c26	Throw error retrieving non-existent SLM policy (#47679 ) Previously when retrieving an SLM policy it would always return a 200 with `{}` in the body, even if the policy did not exist. This changes that behavior to throw an error (similar to our other APIs) if a policy doesn't exist. This also adds a basic CRUD yml test for the behavior. Resolves #47664	2019-10-07 19:54:04 -06:00
Lee Hinman	906be45209	Add a test for SLM retention with security enabled (#47608 ) This enhances the existing SLM test using users/roles/etc to also test that SLM retention works when security is enabled. Relates to #43663	2019-10-07 19:52:09 -06:00
Lisa Cawley	39ef795085	[DOCS] Cleans up links to security content (#47610 ) (#47703 )	2019-10-07 15:23:19 -07:00
Tal Levy	a17f394e27	Geo-Match Enrich Processor (#47243 ) (#47701 ) this commit introduces a geo-match enrich processor that looks up a specific `geo_point` field in the enrich-index for all entries that have a geo_shape match field that meets some specific relation criteria with the input field. For example, the enrich index may contain documents with zipcodes and their respective geo_shape. Ingesting documents with a geo_point field can be enriched with which zipcode they associate according to which shape they are contained within. this commit also refactors some of the MatchProcessor by moving a lot of the shared code to AbstractEnrichProcessor. Closes #42639.	2019-10-07 15:03:46 -07:00
Jake Landis	74876811c2	Watcher - catch uncaught exception. (#47680 ) (#47695 ) If a thread pool rejection exception happens, an alternative code path is chosen to write history and delete the trigger. If an exception happens during deletion of the trigger an exception may be thrown and not caught. This commit catches the exception and provides a meaning error message. fixes #47008	2019-10-07 15:45:45 -05:00
Jake Landis	a49a1b6994	Watcher remove assertion that is susceptible to a race conditi… (#47667 ) When deactivating a watch, there is a chance that it is fully deactivated and reporting as not running but the history is not fully written yet. There is not a tight coupling between the associated watcher history index and the deactivation. This test assumes that once a watch is deactivated that all history is fully written in a very short time period. If the Watch is deactivated, but the history is slow to write it can result in a failing test. This change removes an assertion that assumes that the deactivation of a watch ensured the all of the watch history was written. There is still a minor race condition with respect to the remaining history assertions. However, if the history is slow to be written, it will allow the test to still passing. fixes #47503	2019-10-07 12:07:10 -05:00
Dimitris Athanasiou	7667ea5f6f	[7.x][ML] Additional outlier detection parameters (#47600 ) (#47669 ) Adds the following parameters to `outlier_detection`: - `compute_feature_influence` (boolean): whether to compute or not feature influence scores - `outlier_fraction` (double): the proportion of the data set assumed to be outlying prior to running outlier detection - `standardization_enabled` (boolean): whether to apply standardization to the feature values Backport of #47600	2019-10-07 18:21:33 +03:00
Marios Trivyzas	e698e68f06	SQL: Allow whitespaces in escape patterns (#47577 ) Previously, we supported only the format `{fn <FUNCTION_NAME>()}` but other DBs like MSSQL, DB2, MariaDB/MySQL alos allow whitespaces between `{` and `fn`. Furhermore, also some applications - like PowerBI - generate escape sequences with spaces: `select { fn name(params) } etc.` Add support for white spaces between `{` and the escape pattern definition like `fn`, `ts`, `d`, `guid` etc. Closes: #47401 (cherry picked from commit 08a22d0b393f4a76c52dabc5e7b9cafcc19c30ca)	2019-10-07 15:05:02 +02:00
Yogesh Gaikwad	b6d1d2e6ec	Add 'create_doc' index privilege (#45806 ) (#47645 ) Use case: User with `create_doc` index privilege will be allowed to only index new documents either via Index API or Bulk API. There are two cases that we need to think: - User indexing a new document without specifying an Id. For this ES auto generates an Id and now ES version 7.5.0 onwards defaults to `op_type` `create` we just need to authorize on the `op_type`. - User indexing a new document with an Id. This is problematic as we do not know whether a document with Id exists or not. If the `op_type` is `create` then we can assume the user is trying to add a document, if it exists it is going to throw an error from the index engine. Given these both cases, we can safely authorize based on the `op_type` value. If the value is `create` then the user with `create_doc` privilege is authorized to index new documents. In the `AuthorizationService` when authorizing a bulk request, we check the implied action. This code changes that to append the `:op_type/index` or `:op_type/create` to indicate the implied index action.	2019-10-07 23:58:44 +11:00
Yogesh Gaikwad	7c862fe71f	Add support to retrieve all API keys if user has privilege (#47274 ) (#47641 ) This commit adds support to retrieve all API keys if the authenticated user is authorized to do so. This removes the restriction of specifying one of the parameters (like id, name, username and/or realm name) when the `owner` is set to `false`. Closes #46887	2019-10-07 23:58:21 +11:00
Tanguy Leroux	b5ac0204d2	Fail earlier Put Follow requests for closed leader indices (#47637 ) Backport of (#47582) Today when following a new leader index, we fetch the remote cluster state, check the remote cluster license, check the user privileges, retrieve the index shard stats before initiating a CCR restore session. But if the leader index to follow is closed, we're executing a bunch of operations that would inevitability fail at some point (on retrieving the index shard stats, because this type of request forbid closed indices when resolving indices). We could fail a Put Follow request at the first step by checking the leader index state directly from the remote cluster state. This also helps the Resume Follow API to fail a bit earlier.	2019-10-07 13:59:04 +02:00
Alpar Torok	bc85b22c1f	Complete testclusters backport (#47623 ) * Use versions specific distribution folders so we don't need to clean up (#46539) * Retry deleting distro dir on windows When retarting the cluster we clean up old distribution files that might still be in use by the OS. Windows closes resources of ded processes async, so we do a couple of retries to get arround it. Closes #46014 * Avoid having to delete the distro folder. * Remove the use of ClusterFormationTasks form RestTestTask (#47022) This PR removes a use-case of the ClusterFormationTasks and converts a project that flew under the radar so far. There's probably more clean-up possible here, but for now the goal is to be able to remove that code after `RunTask` is also updated. * Migrate some 7.x only projects	2019-10-07 11:43:57 +03:00
Martijn van Groningen	f2f2304c75	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-10-07 10:07:56 +02:00
Andrei Dan	4506b37ed5	ILM: Skip rolling indexes that are already rolled (#47324 ) (#47592 ) An index with an ILM policy that has a rollover action in one of the phases was rolled over when the ILM conditions dictated regardless if it was already rolled over (eg. manually after modifying an index template in order to force the creation of a new index that uses the new mappings). This changes this behaviour and has ILM check if the index it's about to roll has not been rolled over in the meantime. (cherry picked from commit 37d6106feeb9f9369519117c88a9e7e30f3ac797) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-10-07 07:47:47 +01:00
Ioannis Kakavas	36cabbae80	NameID mapping and Single Logout (#47288 ) (#47561 ) Clarify in the documentation that for SAML Single Logout to be functional, the Identity Provider needs to release a NameID.	2019-10-07 09:19:32 +03:00
Dimitris Athanasiou	ffacfc642c	[7.x][ML] Mute RegressionIT.testStopAndRestart (#47624 ) (#47625 ) Relates #47612	2019-10-05 23:58:32 +03:00
Jason Tedor	35ca3d68d7	Validating monitoring hosts setting while parsing (#47571 ) This commit lifts the validation of the monitoring hosts setting into the setting itself, rather than when the setting is used. This prevents a scenario where an invalid value for the setting is accepted, but then later fails while applying a cluster state with the invalid setting.	2019-10-04 17:32:49 -04:00
Lee Hinman	79376b7219	Set default SLM retention invocation time (#47604 ) This adds a default for the `slm.retention_schedule` setting, setting it to `0 30 1 * * ?` which is 1:30am every day. Having retention unset meant that it would never be invoked and clean up snapshots. We determined it would be better to have a default than never to be run. When coming to a decision, we weighed the option of an absolute time (such as 1:30am) versus a periodic invocation (like every 12 hours). In the end we decided on the absolute time because it has better predictability and consistency than a periodic invocation, which would rely on when the master node were elected or restarted. Relates to #43663	2019-10-04 15:00:20 -06:00
Lisa Cawley	f35fcf7204	[DOCS] Adds security content in the Elasticsearch Reference (#47596 )	2019-10-04 13:11:05 -07:00
James Baiera	a66c0dcd95	Add pipeline to ensure unique Enrich index documents (#46348 ) Adds a pipeline that removes ids and routing from documents before indexing them into enrich indices. Enrich documents may come from multiple indices, and thus have id collisions on them. This pipeline ensures that documents with colliding id fields do not clobber one another during the reindex operation while executing an enrich policy.	2019-10-04 12:20:52 -04:00
Przemysław Witek	ee952da2e2	[7.x] Implement evaluation API for multiclass classification problem (#47126 ) (#47343 )	2019-10-04 17:54:51 +02:00
Lisa Cawley	9b3e5409c1	[7.x][DOCS] Copies security source files from stack-docs (#47534 )	2019-10-04 08:19:10 -07:00
Andrei Stefan	a46f312ded	SQL: fix multi full-text functions usage with aggregate functions (#47444 ) * Skip functions involving full-text predicates when replacing multiple aggregate functions with "stats" or "matrix_stats" aggregations. (cherry picked from commit bb14ba83128dfb7a70f825ea08b1524072fb9ad0)	2019-10-04 16:27:22 +03:00
Alpar Torok	2b16d7bcf8	Backport testclusters all (#47565 ) * Bwc testclusters all (#46265) Convert all bwc projects to testclusters * Fix bwc versions config * WIP fix rolling upgrade * Fix bwc tests on old versions * Fix rolling upgrade	2019-10-04 16:12:53 +03:00
Przemysław Witek	8c180a77f0	[7.x] Fix serialization of evaluation response. (#47557 ) (#47566 )	2019-10-04 15:12:18 +02:00
Przemysław Witek	ec9b77deaa	[7.x] Implement new analysis type: classification (#46537 ) (#47559 )	2019-10-04 13:47:19 +02:00
David Roberts	31a5e1c7ee	[ML] More accurate job memory overhead (#47516 ) When an ML job runs the memory required can be broken down into: 1. Memory required to load the executable code 2. Instrumented model memory 3. Other memory used by the job's main process or ancilliary processes that is not instrumented Previously we added a simple fixed overhead to account for 1 and 3. This was 100MB for anomaly detection jobs (large because of the completely uninstrumented categorization function and normalize process), and 20MB for data frame analytics jobs. However, this was an oversimplification because the executable code only needs to be loaded once per machine. Also the 100MB overhead for anomaly detection jobs was probably too high in most cases because categorization and normalization don't use _that_ much memory. This PR therefore changes the calculation of memory requirements as follows: 1. A per-node overhead of 30MB for _only_ the first job of any type to be run on a given node - this is to account for loading the executable code 2. The established model memory (if applicable) or model memory limit of the job 3. A per-job overhead of 10MB for anomaly detection jobs and 5MB for data frame analytics jobs, to account for the uninstrumented memory usage This change will enable more jobs to be run on the same node. It will be particularly beneficial when there are a large number of small jobs. It will have less of an effect when there are a small number of large jobs.	2019-10-04 09:57:31 +01:00
Yogesh Gaikwad	d371f9d44d	Fix for ApiKeyIntegTests related to Expired API keys remover (#43477 ) (#47546 ) When API key is invalidated we do two things first it tries to trigger `ExpiredApiKeysRemover` task and second, we do index the invalidation for the API key. The index invalidation may happen before the `ExpiredApiKeysRemover` task is run and in that case, the API key invalidated will also get deleted. If the `ExpiredApiKeysRemover` runs before the API key invalidation is indexed then the API key is not deleted and will be deleted in the future run. This behavior was not captured in the tests related to `ExpiredApiKeysRemover` causing intermittent failures. This commit fixes those tests by checking if the API key invalidated is reported back when we get API keys after invalidation and perform the checks based on that. Closes #41747	2019-10-04 13:17:52 +10:00
Lisa Cawley	9c7b58900c	[DOCS] Fixes missing link title (#47481 )	2019-10-03 08:06:31 -07:00
Ioannis Kakavas	fd6a585009	Fix ADRealmTests in FIPS 140 JVMs (#47437 ) (#47506 ) The changes introduced in #47179 made it so that we could try to build an SSLContext with verification mode set to None, which is not allowed in FIPS 140 JVMs. This commit address that	2019-10-03 17:14:26 +03:00
Alpar Torok	0a14bb174f	Remove eclipse conditionals (#44075 ) * Remove eclipse conditionals We used to have some meta projects with a `-test` prefix because historically eclipse could not distinguish between test and main source-sets and could only use a single classpath. This is no longer the case for the past few Eclipse versions. This PR adds the necessary configuration to correctly categorize source folders and libraries. With this change eclipse can import projects, and the visibility rules are correct e.x. auto compete doesn't offer classes from test code or `testCompile` dependencies when editing classes in `main`. Unfortunately the cyclic dependency detection in Eclipse doesn't seem to take the difference between test and non test source sets into account, but since we are checking this in Gradle anyhow, it's safe to set to `warning` in the settings. Unfortunately there is no setting to ignore it. This might cause problems when building since Eclipse will probably not know the right order to build things in so more wirk might be necesarry.	2019-10-03 11:55:00 +03:00
Lee Hinman	2e3eb4b24e	Add API to execute SLM retention on-demand (#47405 ) (#47463 ) * Add API to execute SLM retention on-demand (#47405) This is a backport of #47405 This commit adds the `/_slm/_execute_retention` API endpoint. This endpoint kicks off SLM retention and then returns immediately. This in particular allows us to run retention without scheduling it (for entirely manual invocation) or perform a one-off cleanup. This commit also includes HLRC for the new API, and fixes an issue in SLMSnapshotBlockingIntegTests where retention invoked prior to the test completing could resurrect an index the internal test cluster cleanup had already deleted. Resolves #46508 Relates to #43663	2019-10-02 12:29:04 -06:00
Lee Hinman	013d87d716	Fix AllocationRoutedStepTests.testConditionMetOnlyOneCopyAlloc… (#47313 ) * Fix AllocationRoutedStepTests.testConditionMetOnlyOneCopyAllocated These tests were using randomly generated includes/excludes/requires for routing, however, it was possible to generate mutually exclusive allocation settings (about 1 out of 50,000 times for my runs). This splits the test into three different tests, and removes the randomization (it doesn't add anything to the testing here) to fix the issue. Resolves #47142	2019-10-02 10:01:23 -06:00
Ioannis Kakavas	4f722f0f53	Fix Active Directory tests (#47358 ) (#47440 ) Fixes multiple Active Directory related tests that run against the samba fixture. Some were failing since we changed the realm settings format in 7.0 and a few were slightly broken in other ways. We can move to cleanup the tests in a follow up but this work fits better to be done with or after we move the tests from a Samba based fixture to a real(-ish) Microsoft Active Directory based fixture. Resolves: #33425, #35738	2019-10-02 17:18:12 +03:00
Benjamin Trent	2228a7dd8d	[ML][Inference] adding ensemble model objects (#47241 ) (#47438 ) * [ML][Inference] adding ensemble model objects * addressing PR comments * Update TreeTests.java * addressing PR comments * fixing test	2019-10-02 09:49:46 -04:00
Dimitris Athanasiou	b9541eb3af	[7.x][ML] Make PUT data frame analytics action a master node action (… (#47433 ) While it seemed like the PUT data frame analytics action did not have to be a master node action as the config is stored in an index rather than the cluster state, there are other subtle nuances which make it worthwhile to convert it. In particular, it helps maintain order of execution for put actions which are anyhow user driven and are expected to have low volume. This commit converts `TransportPutDataFrameAnalyticsAction` from a handled transport action to a master node action. Note this means that the action might fail in a mixed cluster but as the API is still experimental and not widely used there will be few moments more suitable to make this change than now.	2019-10-02 16:24:21 +03:00
Yannick Welsch	7b2613db55	Allow optype CREATE for append-only indexing operations (#47169 ) Bulk requests currently do not allow adding "create" actions with auto-generated IDs. This commit allows using the optype CREATE for append-only indexing operations. This is mainly the user facing aspect of it.	2019-10-02 14:16:52 +02:00
Henning Andersen	42453aec96	Fix XPackPlugin usages in tests (#47252 ) XPackPlugin holds data in statics and can only be initialized once. This caused tests to fail primarily when running with a low max-workers. Replaced usages with the LocalStateCompositeXPackPlugin, which handles this properly for testing.	2019-10-02 12:36:02 +02:00
David Roberts	4379a3c52b	[ML] Throttle the delete-by-query of expired results (#47177 ) Due to #47003 many clusters will have built up a large backlog of expired results. On upgrading to a version where that bug is fixed users could find that the first ML daily maintenance task deletes a very large amount of documents. This change introduces throttling to the delete-by-query that the ML daily maintenance uses to delete expired results to limit it to deleting an average 200 documents per second. (There is no throttling for state/forecast documents as these are expected to be lower volume.) Additionally a rough time limit of 8 hours is applied to the whole delete expired data action. (This is only rough as it won't stop part way through a single operation - it only checks the timeout between operations.) Relates #47103	2019-10-02 11:16:34 +01:00
Dimitris Athanasiou	36884a3c32	[7.x][ML] Restore analytics state if available (#47128 ) (#47393 ) This commit restores the model state if available in data frame analytics jobs. In addition, this changes the start API so that a stopped job can be restarted. As we now store the progress in the state index when the task is stopped, we can use it to determine what state the job was in when it got stopped. Note that in order to be able to distinguish between a job that runs for the first time and another that is restarting, we ensure reindexing progress is reported to be at least 1 for a running task.	2019-10-02 10:24:05 +03:00
Benjamin Trent	f5fe5e7cd6	[7.x] [ML][Inference] Adding preprocessors to definition object (#47320 ) (#47370 ) * [ML][Inference] Adding preprocessors to definition object (#47320) * [ML][Inference] Adding preprocessors to definition object * Update TrainedModelConfig.java * adjusting for backport	2019-10-01 13:31:25 -04:00
Michael Basnight	0e1b77568a	Add enable checks to missing enrich plugin methods (#47187 ) Some of the server side objects that do not need to be created unless enrich is enabled were still being created. This commit fixes that.	2019-10-01 12:04:46 -05:00
Albert Zaharovits	78558a7b2f	Fix AD realm additional metadata (#47179 ) Due to a regression bug the metadata Active Directory realm setting is ignored (it works correctly for the LDAP realm type). This commit redresses it. Closes #45848	2019-10-01 17:05:25 +03:00
Marios Trivyzas	f792dbf239	SQL: Implement DATE_PART function (#47206 ) DATE_PART(<datetime unit>, <date/datetime>) is a function that allows the user to extract the specified unit from a date/datetime field similar to the EXTRACT (<datetime unit> FROM <date/datetime>) but with different names and aliases for the units and it also provides more options like `DATE_PART('tzoffset', datetimeField)`. Implemented following the SQL server's spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/datepart-transact-sql?view=sql-server-2017 with the difference that the <datetime unit> argument is either a literal single quoted string or gets a value from a table field, whereas in SQL server keywords are used (unquoted identifiers) and it's not possible to use a value coming for a table column. Closes: #46372 (cherry picked from commit ead743d3579eb753fd314d4a58fae205e465d72e)	2019-10-01 16:28:27 +03:00
Benjamin Trent	4335e07716	[7.x] [ML][Inference] adding .ml-inference* index and storage (#47267 ) (#47310 ) * [ML][Inference] adding .ml-inference* index and storage (#47267) * [ML][Inference] adding .ml-inference* index and storage * Addressing PR comments * Allowing null definition, adding validation tests for model config * fixing line length * adjusting for backport	2019-10-01 08:20:33 -04:00
Ioannis Kakavas	3b06916fcd	Revert "Fix Active Directory tests (#47266 )" This reverts commit `7d9c064218`.	2019-10-01 13:32:31 +03:00
Ioannis Kakavas	7d9c064218	Fix Active Directory tests (#47266 ) Fixes multiple Active Directory related tests that run against the samba fixture. Some were failing since we changed the realm settings format in 7.0 and a few were slightly broken in other ways. We can move to cleanup the tests in a follow up but this work fits better to be done with or after we move the tests from a Samba based fixture to a real(-ish) Microsoft Active Directory based fixture. Resolves: #33425, #35738	2019-10-01 10:52:07 +03:00
Ioannis Kakavas	33c5e5b09d	Fix SSLErrorMessageTests in Windows (#47315 ) - Build paths with PathUtils#get instead of hard-coding a string with forward slashes. - Do not try to match the whole message that includes paths. The file separator is `\\` in windows but when we throw an Elasticsearch Exception, the message is formatted with LoggerMessageFormat#format which replaces `\\` with `\` in Path names. That means that in Windows the Exception message will contain paths with single backslashes while the expected string that comes from Path#toString on filename and env.configFile will contain double backslashes. There is no point in attempting to match the whole message string for the purpose of this test. Resolves: #45598	2019-10-01 09:14:36 +03:00
Marios Trivyzas	fa0b1b641a	SQL: Add examples fo muting sql/csv integ tests (#47291 ) Add examples of failures for both sql and csv integeration tests and instructions on how to mute them. (cherry picked from commit 591bba46516d770f5fc95a4c536dd7448b74dd49)	2019-10-01 09:12:20 +03:00
Armin Braun	3d23cb44a3	Speed up Snapshot Finalization (#47283 ) (#47309 ) As a result of #45689 snapshot finalization started to take significantly longer than before. This may be a little unfortunate since it increases the likelihood of failing to finalize after having written out all the segment blobs. This change parallelizes all the metadata writes that can safely run in parallel in the finalization step to speed the finalization step up again. Also, this will generally speed up the snapshot process overall in case of large number of indices. This is also a nice to have for #46250 since we add yet another step (deleting of old index- blobs in the shards to the finalization.	2019-09-30 23:28:59 +02:00
Marios Trivyzas	bd2abeef40	SQL: [TESTS] Improve error messages on failures (#47308 ) When an integration test fails before the assertion of the results it's missing information, like the file name and the line in the file where the test resides. (cherry picked from commit 683dc7213311d13c81e06829e08f3f9f80ebf73a)	2019-09-30 22:18:39 +03:00
Lisa Cawley	0c3ee0b15c	[DOCS] Moves Watcher content into Elasticsearch book (#47147 ) (#47255 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-30 10:18:50 -07:00
David Roberts	24b3703005	[TEST] Only wait for 6.6 prerequisites if BWC version is 6.6 or higher (#47289 ) With this change the test setup for ML config upgrade tests only waits for v6.6+ ML index templates to be installed if the old cluster is running version 6.6.0 or higher. Previously it was always waiting, but timing out without failing the test if the templates were not installed within 10 seconds, effectively just adding a pointless 10 second sleep to BWC tests against versions earlier than 6.6.0. This problem was exposed by #47112. Fixes #47286	2019-09-30 14:55:50 +01:00
emasab	87156ad93b	SQL: Fix issue with duplicate columns in SELECT (#42122 ) Previously, if a column (field, scalar, alias) appeared more than once in the SELECT list, the value was returned only once (1st appearance) in each row. Fixes: #41811 (cherry picked from commit 097ea36581a751605fc4f2088319d954ce35b5d1)	2019-09-30 15:56:29 +03:00
Martijn van Groningen	fe937ea4b8	Add config namespace in get policy api response (#47162 ) Currently the policy config is placed directly in the json object of the toplevel `policies` array field. For example: ``` { "policies": [ { "match": { "name" : "my-policy", "indices" : ["users"], "match_field" : "email", "enrich_fields" : [ "first_name", "last_name", "city", "zip", "state" ] } } ] } ``` This change adds a `config` field in each policy json object: ``` { "policies": [ { "config": { "match": { "name" : "my-policy", "indices" : ["users"], "match_field" : "email", "enrich_fields" : [ "first_name", "last_name", "city", "zip", "state" ] } } } ] } ``` This allows us in the future to add other information about policies in the get policy api response. The UI will consume this API to build an overview of all policies. The UI may in the future include additional information about a policy and the plan is to include that in the get policy api, so that this information can be gathered in a single api call. An example of the information that is likely to be added is: * Last policy execution time * The status of a policy (executing, executed, unexecuted) * Information about the last failure if exists	2019-09-30 14:37:23 +02:00
David Roberts	0807d409bf	[ML] Reinstate ML daily maintenance actions (#47103 ) A refactoring in 6.6 meant that the ML daily maintenance actions have not been run at all since then. This change installs the local master listener that schedules the ML daily maintenance, and also defends against some subtle race conditions that could occur in the future if a node flipped very quickly between master and non-master. Fixes #47003	2019-09-30 13:12:32 +01:00
Jason Tedor	2cba323b4e	Remove use of get raw in token/API key settings (#47260 ) These settings were using get raw to fallback to whether or not SSL is enabled. Yet, we have a formal mechanism for falling back to a setting. This commit cuts over to that formal mechanism.	2019-09-30 06:35:58 -04:00
David Roberts	a1d3711b52	[TEST] Mute MlConfigIndexMappingsFullClusterRestartIT.testMlConfigIndexMappingsAfterMigratio Due to https://github.com/elastic/elasticsearch/issues/47286	2019-09-30 11:24:34 +01:00
Yannick Welsch	9dc90e41fc	Remove "force" version type (#47228 ) It's been deprecated long ago and can be removed. Relates to #20377 Closes #19769	2019-09-30 11:58:34 +02:00
Martijn van Groningen	bb3e9cb908	fixed checkstyle violation	2019-09-30 08:42:51 +02:00
Martijn van Groningen	66f72bcdbc	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-09-30 08:12:28 +02:00
Martijn van Groningen	1c3d5b77b5	give monitoring more time	2019-09-30 08:04:29 +02:00
Yogesh Gaikwad	2be351c5d0	Use 'should' clause instead of 'filter' when querying native privileges (#47019 ) (#47271 ) When we added support for wildcard application names, we started to build the prefix query along with the term query but we used 'filter' clause instead of 'should', so this would not fetch the correct application privilege descriptor thereby failing the _has_privilege checks. This commit changes the clause to use should and with minimum_should_match as 1.	2019-09-30 14:14:52 +10:00
Yogesh Gaikwad	cec2ff5ef4	Enhance docs for create api keys created when role descriptor not specified (#46897 ) This commit adds the documentation to point the user that when one creates API keys with no role descriptor specified then that API key will have a point in time snapshot of user permissions. Closes#46876	2019-09-30 12:15:29 +10:00
Rory Hunter	53a4d2176f	Convert most awaitBusy calls to assertBusy (#45794 ) (#47112 ) Backport of #45794 to 7.x. Convert most `awaitBusy` calls to `assertBusy`, and use asserts where possible. Follows on from #28548 by @liketic. There were a small number of places where it didn't make sense to me to call `assertBusy`, so I kept the existing calls but renamed the method to `waitUntil`. This was partly to better reflect its usage, and partly so that anyone trying to add a new call to awaitBusy wouldn't be able to find it. I also didn't change the usage in `TransportStopRollupAction` as the comments state that the local awaitBusy method is a temporary copy-and-paste. Other changes: * Rework `waitForDocs` to scale its timeout. Instead of calling `assertBusy` in a loop, work out a reasonable overall timeout and await just once. * Some tests failed after switching to `assertBusy` and had to be fixed. * Correct the expect templates in AbstractUpgradeTestCase. The ES Security team confirmed that they don't use templates any more, so remove this from the expected templates. Also rewrite how the setup code checks for templates, in order to give more information. * Remove an expected ML template from XPackRestTestConstants The ML team advised that the ML tests shouldn't be waiting for any `.ml-notifications` templates, since such checks should happen in the production code instead. Also rework the template checking code in `XPackRestTestHelper` to give more helpful failure messages. * Fix issue in `DataFrameSurvivesUpgradeIT` when upgrading from < 7.4	2019-09-29 12:21:46 +01:00
Nhat Nguyen	444b47ce88	Relax maxSeqNoOfUpdates assertion in FollowingEngine (#47188 ) We disable MSU optimization if the local checkpoint is smaller than max_seq_no_of_updates. Hence, we need to relax the MSU assertion in FollowingEngine for that scenario. Suppose the leader has three operations: index-0, delete-1, and index-2 for the same doc Id. MSU on the leader is 1 as index-2 is an append. If the follower applies index-0 then index-2, then the assertion is violated. Closes #47137	2019-09-27 14:00:20 -04:00
James Rodewig	b159305274	[DOCS] Add redirect for SLM API docs (#46838 ) (#46865 )	2019-09-27 11:05:55 -04:00
Martijn van Groningen	7ffe2e7e63	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-09-27 14:42:11 +02:00
Marios Trivyzas	01623f9f1c	SQL: Add alias DATETRUNC to DATE_TRUNC function (#47173 ) To be on the safe side in terms of use cases also add the alias DATETRUNC to the DATE_TRUNC function. Follows: #46473 (cherry picked from commit 9ac223cb1fc66486f86e218fa785a32b61e9bacc)	2019-09-27 15:38:51 +03:00
Andrei Dan	4c909438dd	Fix OriginationDate parsing tests. (#47170 ) (#47200 ) Drop the usage of `SimpleDateFormat` and use the `DateFormatter` instead (cherry picked from commit 7cf509a7a11ecf6c40c44c18e8f03b8e81fcd1c2) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-09-27 13:16:45 +01:00
Przemysław Witek	3fbd58d156	[7.x] Allow evaluation to consist of multiple steps. (#46653 ) (#47194 )	2019-09-27 13:01:51 +02:00
Costin Leau	b29a2cb360	SQL: Check case where the pivot limit is reached (#47121 ) In some cases, the fetch size affects the way the groups are returned causing the last page to go beyond the limit. Add dedicated check to prevent extra data from being returned. Fix #47002 (cherry picked from commit f4c29646f097bbd29855300342823ef4cef61c05)	2019-09-26 22:32:42 +03:00
Igor Motov	ae202fda21	SQL: Add support for shape type (#46464 ) Enables support for Cartesian geometries shape type. We still need to decide how to handle the distance function since it is currently using the haversine distance formula and returns results in meters, which doesn't make any sense for Cartesian geometries. Closes #46412 Relates to #43644	2019-09-26 09:47:42 -04:00
David Roberts	77cc6d5bad	[TEST] Work around _cat/indices bug with security enabled (#47160 ) When the ML native multi-node tests use _cat/indices/_all and the request goes to a non-master node, _all is translated to a list of concrete indices by the authz layer on the coordinating node before the request is forwarded to the master node. Then it is possible for the master node to return an index_not_found_exception if one of the concrete indices that was expanded on the coordinating node has been deleted in the meantime. (#47159 has been opened to track the underlying problem.) It has been observed that the index that gets deleted when the problem affects the ML native multi-node tests is always the ML notifications index. The tests that fail are only interested in the presence or absense of ML results indices. Therefore the workaround is to only _cat indices that match the ML results index pattern. Fixes #45652	2019-09-26 13:29:40 +01:00
Dimitris Athanasiou	0765bd4bf7	[7.x][ML] Ensure data frame analytics task is only marked completed once (#47119 ) (#47157 ) Closes #46907	2019-09-26 15:26:06 +03:00
Tanguy Leroux	95e2ca741e	Remove unused private methods and fields (#47154 ) This commit removes a bunch of unused private fields and unused private methods from the code base. Backport of (#47115)	2019-09-26 12:49:21 +02:00
Martijn van Groningen	8a4eefdd83	Expose enrich stats api to monitoring. (#46708 ) This change also slightly modifies the stats response, so that is can easier consumer by monitoring and other users. (coordinators stats are now in a list instead of a map and has an additional field for the node id) Relates to #32789	2019-09-26 11:04:33 +02:00
Yogesh Gaikwad	9a64b7a888	[Backport] Validate `query` field when creating roles (#46275 ) (#47094 ) In the current implementation, the validation of the role query occurs at runtime when the query is being executed. This commit adds validation for the role query when creating a role but not for the template query as we do not have the runtime information required for evaluating the template query (eg. authenticated user's information). This is similar to the scripts that we store but do not evaluate or parse if they are valid queries or not. For validation, the query is evaluated (if not a template), parsed to build the QueryBuilder and verify if the query type is allowed. Closes #34252	2019-09-26 17:57:36 +10:00
Jim Ferenczi	04972baffa	Merge ShardSearchTransportRequest and ShardSearchLocalRequest (#46996 ) (#47081 ) This change merges the `ShardSearchTransportRequest` and `ShardSearchLocalRequest` into a single `ShardSearchRequest` that can be used to create a SearchContext. Relates #46523	2019-09-26 09:20:53 +02:00
Benjamin Trent	fcddaa90de	[7.x] [ML][Inference] adding tree model (#47044 ) (#47141 ) * [ML][Inference] adding tree model (#47044) * [ML][Inference] adding tree model * renaming features for updated schema * fixing 7.x compilation	2019-09-25 19:11:15 -04:00
Gordon Brown	7ac647c365	Add support for POST requests to SLM Execute API (#47061 ) This commit adds support for POST requests to the SLM `_execute` API, because POST is a more appropriate HTTP verb for this action as it is not idempotent. The docs are also changed to favor POST over PUT, although PUT is not removed or officially deprecated.	2019-09-25 16:15:10 -06:00
Andrei Dan	27520cac3b	ILM: parse origination date from index name (#46755 ) (#47124 ) * ILM: parse origination date from index name (#46755) Introduce the `index.lifecycle.parse_origination_date` setting that indicates if the origination date should be parsed from the index name. If set to true an index which doesn't match the expected format (namely `indexName-{dateFormat}-optional_digits` will fail before being created. The origination date will be parsed when initialising a lifecycle for an index and it will be set as the `index.lifecycle.origination_date` for that index. A user set value for `index.lifecycle.origination_date` will always override a possible parsable date from the index name. (cherry picked from commit c363d27f0210733dad0c307d54fa224a92ddb569) Signed-off-by: Andrei Dan <andrei.dan@elastic.co> * Drop usage of Map.of to be java 8 compliant	2019-09-25 21:44:16 +01:00
Lee Hinman	a267df30fa	Wait for snapshot completion in SLM snapshot invocation (#47051 ) * Wait for snapshot completion in SLM snapshot invocation This changes the snapshots internally invoked by SLM to wait for completion. This allows us to capture more snapshotting failure scenarios. For example, previously a snapshot would be created and then registered as a "success", however, the snapshot may have been aborted, or it may have had a subset of its shards fail. These cases are now handled by inspecting the response to the `CreateSnapshotRequest` and ensuring that there are no failures. If any failures are present, the history store now stores the action as a failure instead of a success. Relates to #38461 and #43663	2019-09-25 14:25:22 -06:00
Gordon Brown	a46eef9634	Change SLM stats format (#46991 ) Using arrays of objects with embedded IDs is preferred for new APIs over using entity IDs as JSON keys. This commit changes the SLM stats API to use the preferred format.	2019-09-25 11:32:08 -06:00
Yannick Welsch	9e17b78fee	Mute second test in monitoring/bulk/10_basic Relates #30101	2019-09-25 14:17:01 +02:00
Benjamin Trent	05fb7be571	[7.x] [ML][Inference] Feature pre-processing objects and functions (#46777 ) (#47040 ) * [ML][Inference] Feature pre-processing objects and functions (#46777) To support inference on pre-trained machine learning models, some basic feature encoding will be necessary. I am using a named object serialization approach so new encodings/pre-processing steps could be added in the future. This PR lays down the ground work for 3 basic encodings: * HotOne * Target Mean * Frequency More feature encodings or pre-processings could be added in the future: * Handling missing columns * Standardization * Label encoding * etc.... * fixing compilation for namedxcontent tests	2019-09-25 08:16:24 -04:00
Yannick Welsch	a4cecc54ab	Mute monitoring/bulk/20_privileges Relates #30101	2019-09-25 14:03:08 +02:00
Yannick Welsch	eb86d71edd	Mute MlJobIT.testDeleteJob Relates #45652	2019-09-25 12:53:09 +02:00
Yannick Welsch	7a5b5af171	Mute MlJobIT.testDeleteJobAsync Relates #45652	2019-09-25 12:53:05 +02:00
Ioannis Kakavas	f785c31531	File based role definition documentation additions (#46304 ) (#47085 ) This commit clarifies and points out that the Role management UI and the Role management API cannot be used to manage roles that are defined in roles.yml and that file based role management is intended to have a small administrative scope and not handle all possible RBAC use cases.	2019-09-25 13:52:05 +03:00
Ioannis Kakavas	23bceaadf8	Handle RelayState in preparing a SAMLAuthN Request (#46534 ) (#47092 ) This change allows for the caller of the `saml/prepare` API to pass a `relay_state` parameter that will then be part of the redirect URL in the response as the `RelayState` query parameter. The SAML IdP is required to reflect back the value of that relay state when sending a SAML Response. The caller of the APIs can then, when receiving the SAML Response, read and consume the value as it see fit.	2019-09-25 13:23:46 +03:00
Yogesh Gaikwad	6f453aa6b2	Validate index and cluster privilege names when creating a role (#46361 ) (#47063 ) This commit adds validation so a role cannot be created with invalid index or cluster privilege name. Closes #29703	2019-09-25 18:57:11 +10:00
Yannick Welsch	056ac32738	Mute JdbcCsvSpecIT.testAverageWithOneValueAndLimit Relates to #47080	2019-09-25 10:36:53 +02:00
Christoph Büscher	0c187e0a10	Add migration tool checks for `_field_names` disabling (#46972 ) This change adds a check to the migration tool that warns about the deprecated `enabled` setting for the `_field_names` field on 7.x indices and issues a warning for templates containing this setting, which has been removed with 8.0. Relates to #42854, #46681	2019-09-25 10:15:10 +02:00
Hendrik Muhs	7377ac4637	[Transform] Replace transforms with transform, index constants (#47023 ) - replace "transforms" with "transform" for consistency - use constants for internal index naming wherever possible and document required changes	2019-09-25 08:31:43 +02:00
Hendrik Muhs	e974f178b5	[Transform] rename data frame transform to transform for hlrc client (#46933 ) rename data frame transform to transform for hlrc	2019-09-25 08:31:43 +02:00
Benjamin Trent	00c1c0132b	[ML] fix two datafeed flush lockup bugs (#46982 ) (#47024 ) * [ML] fix two flush lockup bugs * Addressing PR comments * moving debug logging line so it is only written on success	2019-09-24 13:03:20 -04:00
James Baiera	9967aff714	Add notice to Enrich index mapping metadata (#45996 )	2019-09-24 12:55:11 -04:00
Albert Zaharovits	3a82e0f7f4	Do not rewrite aliases on remove-index from aliases requests (#46989 ) (#47018 ) When we rewrite alias requests, after filtering down to only those that the user is authorized to see, it can be that there are no aliases remaining in the request. However, core Elasticsearch interprets this as _all so the user would see more than they are authorized for. To address this, we previously rewrote all such requests to have aliases `""`, `"-"`, which would be interpreted when aliases are resolved as nome. Yet, this is only needed for get aliases requests and we were applying it to all alias requests, including remove index requests. If such a request was sent to a coordinating node that is not the master node, the request would be rewritten to include `""` and `"-"`, and then the master would authorize the user for these. If the user had limited permissions, the request would fail, even if they were authorized on the index that the remove index action was over. This commit addresses this by rewriting for get aliases and remove aliases request types but not for the remove index. Co-authored-by: Albert Zaharovits <albert.zaharovits@elastic.co> Co-authored-by: Tim Vernum <tim@adjective.org>	2019-09-24 19:07:55 +03:00
Dimitris Athanasiou	64bf1b56fe	[7.x] SQL: Mute pivot testAverageWithOneValueAndOrder and testSumWithoutSubquery (#47030 ) (#47033 ) Relates #47002	2019-09-24 19:04:52 +03:00
Armin Braun	00f2e7f627	Update AWS SDK for repository-s3 plugin to support IAM Roles for Service Accounts (#46969 ) (#47004 ) * Update AWS SDK for repository-s3 and discovery-ec2 plugins	2019-09-24 17:15:11 +02:00
Ioannis Kakavas	98e6bb4d01	Workaround JDK-8213202 in SSLClientAuthTests (#46995 ) This change works around JDK-8213202, which is a bug related to TLSv1.3 session resumption before JDK 11.0.3 that occurs when there are multiple concurrent sessions being established. Nodes connecting to each other will trigger this bug when client authentication is disabled, which is the case for SSLClientAuthTests. Backport of #46680	2019-09-24 12:47:56 +03:00
Lee Hinman	5ca37db60c	Mute SLMSnapshotBlockingIntegTests.testRetentionWhileSnapshotInProgress Relates to #46508	2019-09-23 17:08:09 -06:00
James Baiera	a349b22273	Add the cluster version to enrich policies (#45021 ) Adds the Elasticsearch version as a field on the EnrichPolicy object	2019-09-23 18:44:45 -04:00
Julie Tibshirani	9124c94a6c	Add support for aliases in queries on _index. (#46944 ) Previously, queries on the _index field were not able to specify index aliases. This was a regression in functionality compared to the 'indices' query that was deprecated and removed in 6.0. Now queries on _index can specify an alias, which is resolved to the concrete index names when we check whether an index matches. To match a remote shard target, the pattern needs to be of the form 'cluster:index' to match the fully-qualified index name. Index aliases can be specified in the following query types: term, terms, prefix, and wildcard.	2019-09-23 13:21:37 -07:00
Jim Ferenczi	08f28e642b	Replace SearchContext with QueryShardContext in query builder tests (#46978 ) This commit replaces the SearchContext used in AbstractQueryTestCase with a QueryShardContext in order to reduce the visibility of search contexts. Relates #46523	2019-09-23 20:24:02 +02:00
Costin Leau	a610503783	SQL: Add PIVOT support (#46489 ) Add initial PIVOT support for transforming a regular table into a statistics table around an arbitrary pivoting column: SELECT * FROM (SELECT languages, country, salary, FROM mp) PIVOT (AVG(salary) FOR countries IN ('NL', 'DE', 'ES', 'RO', 'US')) In the current implementation PIVOT allows only one aggregation however this restriction is likely to be lifted in the future. Also not all aggregations are working, in particular MatrixStats are not yet supported. (cherry picked from commit d91263746a222915c570d4a662ec48c1d6b4f583)	2019-09-23 21:04:13 +03:00
Alpar Torok	5fd7505efc	Testfixtures allow a single service only (#46780 ) This PR adds some restrictions around testfixtures to make sure the same service ( as defiend in docker-compose.yml ) is not shared between multiple projects. Sharing would break running with --parallel. Projects can still share fixtures as long as each has it;s own service within. This is still useful to share some of the setup and configuration code of the fixture. Project now also have to specify a service name when calling useCluster to refer to a specific service. If this is not the case all services will be claimed and the fixture can't be shared. For this reason fixtures have to explicitly specify if they are using themselves ( fixture and tests in the same project ).	2019-09-23 14:13:49 +03:00
Martijn van Groningen	33bbc4798b	fixed compile errors after merging	2019-09-23 09:46:14 +02:00
Martijn van Groningen	0cfddca61d	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-09-23 09:46:05 +02:00
Martijn van Groningen	bf42789eb6	fixed compile error	2019-09-22 21:20:12 +02:00
Lisa Cawley	875d864be6	[DOCS] Update data frame transform URLs (#46940 ) (#46946 )	2019-09-20 15:57:43 -07:00
Michael Basnight	f1c7ed647b	Allow comma separated ids in get enrich policy API (#46351 ) This commit changes the GET REST api so it will accept an optional comma separated list of enrich policy ids. This change also modifies the behavior of the GET API in that it will not error if it is passed a bad enrich id anymore, but will instead just return an empty list.	2019-09-20 10:06:58 -05:00
Hendrik Muhs	4a2cb05162	add message about transform disabled if license is missing (#46901 ) adds a message for transform about what happens if no license has been activated	2019-09-20 13:47:40 +02:00
Hendrik Muhs	abe889af75	[7.5][Transform] rename classes in transform plugin (#46867 ) rename classes and settings in transform plugin, provide BWC for old settings	2019-09-20 10:43:00 +02:00
Jason Tedor	bd77626177	Add the ability to require an ingest pipeline (#46847 ) This commit adds the ability to require an ingest pipeline on an index. Today we can have a default pipeline, but that could be overridden by a request pipeline parameter. This commit introduces a new index setting index.required_pipeline that acts similarly to index.default_pipeline, except that it can not be overridden by a request pipeline parameter. Additionally, a default pipeline and a request pipeline can not both be set. The required pipeline can be set to _none to ensure that no pipeline ever runs for index requests on that index.	2019-09-19 16:37:45 -04:00
Yannick Welsch	9638ca20b0	Allow dropping documents with auto-generated ID (#46773 ) When using auto-generated IDs + the ingest drop processor (which looks to be used by filebeat as well) + coordinating nodes that do not have the ingest processor functionality, this can lead to a NullPointerException. The issue is that markCurrentItemAsDropped() is creating an UpdateResponse with no id when the request contains auto-generated IDs. The response serialization is lenient for our REST/XContent format (i.e. we will send "id" : null) but the internal transport format (used for communication between nodes) assumes for this field to be non-null, which means that it can't be serialized between nodes. Bulk requests with ingest functionality are processed on the coordinating node if the node has the ingest capability, and only otherwise sent to a different node. This means that, in order to reproduce this, one needs two nodes, with the coordinating node not having the ingest functionality. Closes #46678	2019-09-19 16:46:33 +02:00
Armin Braun	6b09c2cdbb	Limit Netty Workers in NativeRealmIntegTestCase (#46816 ) (#46850 ) The fact that this test randomly uses a relatively large number of nodes and hence Netty worker threads created a problem with running out of direct memory on CI. Tests run with 512M heap (and hence 512M direct memory) by default. On a CI worker with 16 cores, this means Netty will by default set up 32 transport workers. If we get unlucky and a lot of them actually do work (and thus instantiate a `CopyBytesSocketChannel` which costs 1M per thread for the thread-local IO buffer) we would run out of memory. This specific failure was only seen with `NativeRealmIntegTests` so I only added the constraint on the Netty worker count here. We can add it to other tests (or `SecurityIntegTestCase`) if need be but for now it doesn't seem necessary so I opted for least impact. Closes #46803	2019-09-19 13:07:42 +02:00
Dimitris Athanasiou	02a5e153dc	[7.x][ML] Parse and index data frame analytics state (#46804 ) (#46820 ) This commit reuses the same state processor that is used for autodetect to parse state output from data frame analytics jobs. We then index the state document into the state index. Backport of #46804	2019-09-18 20:37:40 +03:00
Benjamin Trent	9cf9c64ec2	[7.x] [ML][Transforms] remove `force` flag from _start (#46414 ) (#46748 ) * [ML][Transforms] remove `force` flag from _start (#46414) * [ML][Transforms] remove `force` flag from _start * fixing expected error message * adjusting bwc version	2019-09-18 10:06:05 -04:00
Dimitris Athanasiou	cebe8da617	[7.x][ML] MlMemoryTracker should ignore analytics tasks without config (#46789 ) (#46811 ) It is possible for a running analytics job that its config is removed from the '.ml-config' index (perhaps the user deleted the entire index, etc.). In that case the task remains without a matching config. I have raised #46781 to discuss how to deal with this issue. This commit focuses on `MlMemoryTracker` and changes it so that when we get the configs for the running tasks we leniently ignore missing ones. This at least means memory tracking will keep working for other jobs if one or more are missing. In addition, this commit makes the cleanup code for native analytics tests more robust by explicitly stopping all jobs and force-stopping if an error occurs. This helps so that a single failing test does not cause other tests fail due to pending tasks. Backport of #46789	2019-09-18 16:35:25 +03:00
Alpar Torok	f3e67bdd17	Add resolution rule to allow resolving all deps (#46768 ) Since the `resolveAllDependencies` task resolves all the congfigurations it can find, this was not caught by our testing, but it's required to be configuraed specifically. We should probably cut-over to the new configurations at some point to avoid problems like this. Closes elastic/infra#14580	2019-09-18 11:09:43 +03:00
Lee Hinman	b85468d6ea	Add node setting for disabling SLM (#46794 ) (#46796 ) This adds the `xpack.slm.enabled` setting to allow disabling of SLM functionality as well as its HTTP API endpoints. Relates to #38461	2019-09-17 17:39:41 -06:00
Oliver Gupte	cbd58d3b78	Give kibana user privileges to create APM agent config index (#46765 ) (#46792 ) * Give kibana user reserved role privileges on .apm-* to create APM agent configuration index. * fixed test to include checking all .apm-* permissions * changed pattern from ".apm-*" to the more specific ".apm-agent-configuration"	2019-09-17 15:01:42 -07:00
Costin Leau	92e518e789	SQL: Properly handle indices with no/empty mapping (#46775 ) When encountering only indices with empty mapping, the IndexResolver throws an exception as it expects to find at least one entry. This commit fixes this case so that an empty mapping is returned. Fix #46757 (cherry picked from commit 5f4f5807acb93b5fab36718c092c328977a396b6)	2019-09-17 16:01:22 +03:00
Armin Braun	b0f09b279f	Make Snapshot Logic Write Metadata after Segments (#45689 ) (#46764 ) * Write metadata during snapshot finalization after segment files to prevent outdated metadata in case of dynamic mapping updates as explained in #41581 * Keep the old behavior of writing the metadata beforehand in the case of mixed version clusters for BwC reasons * Still overwrite the metadata in the end, so even a mixed version cluster is fixed by this change if a newer version master does the finalization * Fixes #41581	2019-09-17 13:09:39 +02:00
Tomas Della Vedova	e1cf103980	Fixes for API specification (#46522 ) (#46736 ) Follow-up of #42346	2019-09-17 11:49:24 +02:00
Costin Leau	683b5fdeca	SQL: Support queries with HAVING over SELECT (#46709 ) Handle queries with implicit GROUP BY where the aggregation is not in the projection/SELECT but inside the filter/HAVING such as: SELECT 1 FROM x HAVING COUNT(*) > 0 The engine now properly identifies the case and handles it accordingly. Fix #37051 (cherry picked from commit fa53ca05d8219c27079b50b4a5b7aeb220c7cde2)	2019-09-17 11:14:39 +03:00
Costin Leau	90f4c2379b	SQL: improve ResultSet behavior when no rows are available (#46753 ) Improve the defensive behavior of ResultSet when dealing with incorrect API usage. In particular handle the case of dealing with no row available (either because the cursor is before the first entry or after the last). Fix #46750 (cherry picked from commit 58fa38e4606625962e879265d35eacb0960c6cdb)	2019-09-17 11:14:38 +03:00
Przemysław Witek	e49be611ad	[7.x] Add audit messages for Data Frame Analytics (#46521 ) (#46738 )	2019-09-16 21:21:38 +02:00
Benjamin Trent	92acc732de	[ML][Transform] Use field caps for mapping deductino (#46703 ) (#46742 )	2019-09-16 10:05:55 -04:00
Andrei Stefan	40e9353947	SQL: use the correct data type for types conversion (#46574 ) (cherry picked from commit 3e25db2f302c3aafe27e4d8d4fb1743401d85e6d)	2019-09-16 15:36:17 +03:00
Hendrik Muhs	c8f52ec4ff	[Transform] Rename data frame plugin to transform: classes in xpack.core (#46644 ) (#46734 ) rename classes in xpack.core of transform plugin from "data frame transform" to "transform"	2019-09-16 13:39:22 +02:00
Andrei Dan	c57cca98b2	[ILM] Add date setting to calculate index age (#46561 ) (#46697 ) * [ILM] Add date setting to calculate index age Add the `index.lifecycle.origination_date` to allow users to configure a custom date that'll be used to calculate the index age for the phase transmissions (as opposed to the default index creation date). This could be useful for users to create an index with an "older" origination date when indexing old data. Relates to #42449. * [ILM] Don't override creation date on policy init The initial approach we took was to override the lifecycle creation date if the `index.lifecycle.origination_date` setting was set. This had the disadvantage of the user not being able to update the `origination_date` anymore once set. This commit changes the way we makes use of the `index.lifecycle.origination_date` setting by checking its value when we calculate the index age (ie. at "read time") and, in case it's not set, default to the index creation date. * Make origination date setting index scope dynamic * Document orignation date setting in ilm settings (cherry picked from commit d5bd2bb77ee28c1978ab6679f941d7c02e389d32) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-09-16 08:50:28 +01:00
Dimitris Athanasiou	63eb0d9081	[7.x][ML] Avoid marking data frame analytics task completed twice (#46721 ) (#46724 ) When the stop API is called while the task is running there is a chance the task gets marked completed twice. This may cause undesired side effects, like indexing the progress document a second time after the stop API has returned (the cause for #46705). This commit adds a check that the task has not been completed before proceeding to mark it so. In addition, when we update the task's state we could get some warnings that the task was missing if the stop API has been called in the meantime. We now check the errors are `ResourceNotFoundException` and ignore them if so. Closes #46705 Backports #46721	2019-09-15 17:25:26 +03:00
Hendrik Muhs	e1842c0e5a	[7.x][Transforms] backport BWC tests for transforms crud (#46452 ) backport 8.0 transform tests to 7.x	2019-09-14 13:06:48 +02:00
Lisa Cawley	c0a16047fa	[DOCS] Updates links to reporting content (#46717 )	2019-09-13 11:40:07 -07:00
James Rodewig	2831535cf9	[DOCS] Replace "// CONSOLE" comments with [source,console] (#46679 )	2019-09-13 11:44:54 -04:00
Nhat Nguyen	cabff5a7cd	Handle lower retaining seqno retention lease error (#46420 ) We renew the CCR retention lease at a fixed interval, therefore it's possible to have more than one in-flight renewal requests at the same time. If requests arrive out of order, then the assertion is violated. Closes #46416 Closes #46013	2019-09-13 08:50:19 -04:00
Dimitris Athanasiou	0bc8acaf5b	[7.x][ML] Create state index and alias before starting an analytics job (#46602 ) (#46648 ) This is fixing a bug where if an analytics job is started before any anomaly detection job is opened, we create an index after the state write alias. Instead, we should create the state index and alias before starting an analytics job and this commit makes sure this is the case. Backport of #46602	2019-09-13 10:34:12 +03:00
Lisa Cawley	dae5b22bf8	[DOCS] Fixes link to Kibana security (#46690 )	2019-09-12 16:30:43 -07:00
Przemysław Witek	5b1f6669ff	Do not wait for the old notifications index (".ml-notifications"). It is no longer used. (#46657 ) (#46666 )	2019-09-12 21:47:25 +02:00
Luca Cavanna	e57756492a	Update http-core and http-client dependencies (#46549 ) Relates to #45808 Closes #45577	2019-09-12 09:45:29 +02:00
Lisa Cawley	ec5592ed76	[DOCS] Adds missing icons to Watcher HLRC APIs (#46626 )	2019-09-11 16:35:15 -07:00
Zachary Tong	6dc8ed5d57	[7.x Backport] Refactor AllocatedPersistentTask#init(), move rollup ctor logic (#46406 ) This makes the AllocatedPersistentTask#init() method protected so that implementing classes can perform their initialization logic there, instead of the constructor. Rollup's task is adjusted to use this init method. It also slightly refactors the methods to se a static logger in the AllocatedTask instead of passing it in via an argument. This is simpler, logged messages come from the task instead of the service, and is easier for tests	2019-09-11 17:00:28 -04:00
James Rodewig	f9bf10f2b6	[DOCS] Change "a SSL" to "an SSL" in the Java docs (#46524 ) (#46618 )	2019-09-11 15:55:57 -04:00
Marios Trivyzas	d956509394	SQL: Implement DATE_TRUNC function (#46473 ) DATE_TRUNC(<truncate field>, <date/datetime>) is a function that allows the user to truncate a timestamp to the specified field by zeroing out the rest of the fields. The function is implemented according to the spec from PostgreSQL: https://www.postgresql.org/docs/current/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes: #46319 (cherry picked from commit b37e96712db1aace09f17b574eb02ff6b942a297)	2019-09-11 21:41:02 +03:00
Ryan Ernst	86290cb3d9	Make reuse of sql test code explicit (#45884 ) The sql project uses a common set of security tests, which are run in subprojects. Currently these are shared through a shared directory, but this is not setup correctly to ensure it is built before tests run. This commit changes the test classes to be an artifact of the sql/qa/security project and makes the test runner use the built artifact (a directory of classes) for tests. closes #45866	2019-09-11 10:56:07 -07:00
Lee Hinman	09a9cefaa0	Handle partial failure retrieving segments in SegmentCountStep (#46556 ) Since the `IndicesSegmentsRequest` scatters to all shards for the index, it's possible that some of the shards may fail. This adds failure handling and logging (since this is a best-effort step in the first place) for this case.	2019-09-11 10:29:31 -06:00
Marios Trivyzas	0963e78164	SQL: Fix issue with common type resolution (#46565 ) Many scalar functions try to find out the common type between their arguments in order to set it as their return time, e.g.: for `float + double` the common type which is set as the return type of the + operation is `double`. Previously, for data types TEXT and KEYWORD (string data types) there was no common data type found and null was returned causing NPEs when the function was trying to resolve the return data type. Fixes: #46551 (cherry picked from commit 291017d69dfc810707c3c7c692f5a50af431b790)	2019-09-11 19:10:15 +03:00
Lee Hinman	52d7b03b49	Wait for no snapshots in state in testRetentionWhileSnapshotIn… (#46573 ) This commit adds a wait/check for all running snapshots to be cleared before taking another snapshot. The previous snapshot was successful but had not yet been cleared from the cluster state, so the second snapshot failed due to a `ConcurrentSnapshotException`. Resolves #46508	2019-09-11 09:47:01 -06:00
David Roberts	461de5b58e	[TEST] Remove incorrect data frame analytics state assertion (#46597 ) After starting the analytics job and checking its state the state can be any of "started", "reindexing" or "analyzing" depending on how quickly the work is done.	2019-09-11 16:33:14 +01:00
David Roberts	07a0140260	[ML-DataFrame] Ensure latest index template exists before indexing docs (#46595 ) When upgrading data nodes to a newer version before master nodes there was a risk that a transform running on an upgraded data node would index a document into the new transforms internal index before its index template was created. This would cause the index to be created with entirely dynamic mappings. This change introduces a check before indexing any internal transforms document to ensure that the required index template exists and create it if it doesn't. Backport of #46553	2019-09-11 16:27:26 +01:00
Jim Ferenczi	23bf310c84	Replace the SearchContext with QueryShardContext when building aggregator factories (#46527 ) This commit replaces the `SearchContext` with the `QueryShardContext` when building aggregator factories. Aggregator factories are part of the `SearchContext` so they shouldn't require a `SearchContext` to create them. The main changes here are the signatures of `AggregationBuilder#build` that now takes a `QueryShardContext` and `AggregatorFactory#createInternal` that passes the `SearchContext` to build the `Aggregator`. Relates #46523	2019-09-11 16:43:30 +02:00
Hendrik Muhs	efea581dcc	[7.x][Transform]Rename data frame plugin to transform: plugin and package names (#46583 ) rename data frame transform plugin to transform: - rename plugin data-frame to transform - change all package names from o.e..dataframe. to o.e..transform. - necessary changes to fix loading/testing	2019-09-11 14:50:08 +02:00
Armin Braun	41633cb9b5	More Efficient Ordering of Shard Upload Execution (#42791 ) (#46588 ) * More Efficient Ordering of Shard Upload Execution (#42791) * Change the upload order of of snapshots to work file by file in parallel on the snapshot pool instead of merely shard-by-shard * Inspired by #39657 * Cleanup BlobStoreRepository Abort and Failure Handling (#46208)	2019-09-11 13:59:20 +02:00
Martijn van Groningen	a4b0f66919	Add enrich stats api (#46462 ) The enrich api returns enrich coordinator stats and information about currently executing enrich policies. The coordinator stats include per ingest node: * The current number of search requests in the queue. * The total number of outstanding remote requests that have been executed since node startup. Each remote request is likely to include multiple search requests. This depends on how much search requests are in the queue at the time when the remote request is performed. * The number of current outstanding remote requests. * The total number of search requests that `enrich` processors have executed since node startup. The current execution policies stats include: * The name of policy that is executing * A full blow task info object that is executing the policy. Relates to #32789	2019-09-11 13:40:24 +02:00
Jim Ferenczi	425b1a77e8	Add more context to QueryShardContext (#46584 ) This change adds an IndexSearcher and the node's BigArrays in the QueryShardContext. It's a spin off of #46527 as this change is required to allow aggregation builder to solely use the query shard context. Relates #46523	2019-09-11 12:24:51 +02:00
Dimitris Athanasiou	579af626f5	[7.x][ML] No error when datafeed stops during updating to started (#46495 ) (#46542 ) Investigating the test failure reported in #45518 it appears that the datafeed task was not found during a tast state update. There are only two places where such an update is performed: when we set the state to `started` and when we set it to `stopping`. We handle `ResourceNotFoundException` in the latter but not in the former. Thus the test reveals a rare race condition where the datafeed gets requested to stop before we managed to update its state to `started`. I could not reproduce this scenario but it would be my best guess. This commit catches `ResourceNotFoundException` while updating the state to `started` and lets the task terminate smoothly. Closes #45518 Backport of #46495	2019-09-11 13:18:42 +03:00
Przemysław Witek	e38e631dac	[7.x] Implement DataFrameAnalyticsAuditMessage and DataFrameAnalyticsAuditor (#45967 ) (#46519 )	2019-09-11 12:17:26 +02:00
Ioannis Kakavas	35810bd2ae	Enforce realm name uniqueness (#46580 ) We depend on file realms being unique in a number of places. Pre 7.0 this was enforced by the fact that the multiple realm types with different name would mean identical configuration keys and cause configuration parsing errors. Since we intoduced affix settings for realms this is not the case any more as the realm type is part of the configuration key. This change adds a check when building realms which will explicitly fail if multiple realms are defined with the same name. Backport of #46253	2019-09-11 13:13:59 +03:00
Martijn van Groningen	c79a8e448d	Convert enrich qa modules to use testclusters.	2019-09-11 11:40:18 +02:00
Martijn van Groningen	8a48ef2a06	fixed typo	2019-09-11 09:52:25 +02:00
Martijn van Groningen	ef33a99e6e	Disable default features that are not needed for enrich indices. (#46525 ) Relates to #32789	2019-09-11 09:20:38 +02:00
Tim Vernum	80064652f8	Fallback to realm authc if ApiKey fails (#46552 ) This changes API-Key authentication to always fallback to the realm chain if the API key is not valid. The previous behaviour was inconsistent and would terminate on some failures, but continue to the realm chain for others. Backport of: #46538	2019-09-11 14:33:17 +10:00
Gordon Brown	7a2878b29b	Fix class used to initialize logger in Watcher (#46467 ) This class has been using a logger configured for a different class for quite a while. While the circumstance in which it logs is rare, it should still use the correct logger.	2019-09-10 12:41:36 -06:00
Lisa Cawley	70c00621db	[DOCS] Add missing xpack role attributes (#46468 )	2019-09-10 10:46:14 -07:00
Lee Hinman	cdc3a260af	Add retention to Snapshot Lifecycle Management (backport of #4… (#46506 ) * Add retention to Snapshot Lifecycle Management (#46407) This commit adds retention to the existing Snapshot Lifecycle Management feature (#38461) as described in #43663. This allows a user to configure SLM to automatically delete older snapshots based on a number of criteria. An example policy would look like: ``` PUT /_slm/policy/snapshot-every-day { "schedule": "0 30 2 * * ?", "name": "<production-snap-{now/d}>", "repository": "my-s3-repository", "config": { "indices": ["foo-", "important"] }, // Newly configured retention options "retention": { // Snapshots should be deleted after 14 days "expire_after": "14d", // Keep a maximum of thirty snapshots "max_count": 30, // Keep a minimum of the four most recent snapshots "min_count": 4 } } ``` SLM Retention is run on a scheduled configurable with the `slm.retention_schedule` setting, which supports cron expressions. Deletions are run for a configurable time bounded by the `slm.retention_duration` setting, which defaults to 1 hour. Included in this work is a new SLM stats API endpoint available through ``` json GET /_slm/stats ``` That returns statistics about snapshot taken and deleted, as well as successful retention runs, failures, and the time spent deleting snapshots. #45362 has more information as well as an example of the output. These stats are also included when retrieving SLM policies via the API. Add base framework for snapshot retention (#43605) * Add base framework for snapshot retention This adds a basic `SnapshotRetentionService` and `SnapshotRetentionTask` to start as the basis for SLM's retention implementation. Relates to #38461 * Remove extraneous 'public' * Use a local var instead of reading class var repeatedly * Add SnapshotRetentionConfiguration for retention configuration (#43777) * Add SnapshotRetentionConfiguration for retention configuration This commit adds the `SnapshotRetentionConfiguration` class and its HLRC counterpart to encapsulate the configuration for SLM retention. Currently only a single parameter is supported as an example (we still need to discuss the different options we want to support and their names) to keep the size of the PR down. It also does not yet include version serialization checks since the original SLM branch has not yet been merged. Relates to #43663 * Fix REST tests * Fix more documentation * Use Objects.equals to avoid NPE * Put `randomSnapshotLifecyclePolicy` in only one place * Occasionally return retention with no configuration * Implement SnapshotRetentionTask's snapshot filtering and delet… (#44764) * Implement SnapshotRetentionTask's snapshot filtering and deletion This commit implements the snapshot filtering and deletion for `SnapshotRetentionTask`. Currently only the expire-after age is used for determining whether a snapshot is eligible for deletion. Relates to #43663 * Fix deletes running on the wrong thread * Handle missing or null policy in snap metadata differently * Convert Tuple<String, List<SnapshotInfo>> to Map<String, List<SnapshotInfo>> * Use the `OriginSettingClient` to work with security, enhance logging * Prevent NPE in test by mocking Client * Allow empty/missing SLM retention configuration (#45018) Semi-related to #44465, this allows the `"retention"` configuration map to be missing. Relates to #43663 * Add min_count and max_count as SLM retention predicates (#44926) This adds the configuration options for `min_count` and `max_count` as well as the logic for determining whether a snapshot meets this criteria to SLM's retention feature. These options are optional and one, two, or all three can be specified in an SLM policy. Relates to #43663 * Time-bound deletion of snapshots in retention delete function (#45065) * Time-bound deletion of snapshots in retention delete function With a cluster that has a large number of snapshots, it's possible that snapshot deletion can take a very long time (especially since deletes currently have to happen in a serial fashion). To prevent snapshot deletion from taking forever in a cluster and blocking other operations, this commit adds a setting to allow configuring a maximum time to spend deletion snapshots during retention. This dynamic setting defaults to 1 hour and is best-effort, meaning that it doesn't hard stop a deletion at an hour mark, but ensures that once the time has passed, all subsequent deletions are deferred until the next retention cycle. Relates to #43663 * Wow snapshots suuuure can take a long time. * Use a LongSupplier instead of actually sleeping * Remove TestLogging annotation * Remove rate limiting * Add SLM metrics gathering and endpoint (#45362) * Add SLM metrics gathering and endpoint This commit adds the infrastructure to gather metrics about the different SLM actions that a cluster takes. These actions are stored in `SnapshotLifecycleStats` and perpetuated in cluster state. The stats stored include the number of snapshots taken, failed, deleted, the number of retention runs, as well as per-policy counts for snapshots taken, failed, and deleted. It also includes the amount of time spent deleting snapshots from SLM retention. This commit also adds an endpoint for retrieving all stats (further commits will expose this in the SLM get-policy API) that looks like: ``` GET /_slm/stats { "retention_runs" : 13, "retention_failed" : 0, "retention_timed_out" : 0, "retention_deletion_time" : "1.4s", "retention_deletion_time_millis" : 1404, "policy_metrics" : { "daily-snapshots2" : { "snapshots_taken" : 7, "snapshots_failed" : 0, "snapshots_deleted" : 6, "snapshot_deletion_failures" : 0 }, "daily-snapshots" : { "snapshots_taken" : 12, "snapshots_failed" : 0, "snapshots_deleted" : 12, "snapshot_deletion_failures" : 6 } }, "total_snapshots_taken" : 19, "total_snapshots_failed" : 0, "total_snapshots_deleted" : 18, "total_snapshot_deletion_failures" : 6 } ``` This does not yet include HLRC for this, as this commit is quite large on its own. That will be added in a subsequent commit. Relates to #43663 * Version qualify serialization * Initialize counters outside constructor * Use computeIfAbsent instead of being too verbose * Move part of XContent generation into subclass * Fix REST action for master merge * Unused import * Record history of SLM retention actions (#45513) This commit records the deletion of snapshots by the retention component of SLM into the SLM history index for the purposes of reviewing operations taken by SLM and alerting. * Retry SLM retention after currently running snapshot completes (#45802) * Retry SLM retention after currently running snapshot completes This commit adds a ClusterStateObserver to wait until the currently running snapshot is complete before proceeding with snapshot deletion. SLM retention waits for the maximum allowed deletion time for the snapshot to complete, however, the waiting time is not factored into the limit on actual deletions. Relates to #43663 * Increase timeout waiting for snapshot completion * Apply patch From `2374316f0d`.patch * Rename test variables * [TEST] Be less strict for stats checking * Skip SLM retention if ILM is STOPPING or STOPPED (#45869) This adds a check to ensure we take no action during SLM retention if ILM is currently stopped or in the process of stopping. Relates to #43663 * Check all actions preventing snapshot delete during retention (#45992) * Check all actions preventing snapshot delete during retention run Previously we only checked to see if a snapshot was currently running, but it turns out that more things can block snapshot deletion. This changes the check to be a check for: - a snapshot currently running - a deletion already in progress - a repo cleanup in progress - a restore currently running This was found by CI where a third party delete in a test caused SLM retention deletion to throw an exception. Relates to #43663 * Add unit test for okayToDeleteSnapshots * Fix bug where SLM retention task would be scheduled on every node * Enhance test logging * Ignore if snapshot is already deleted * Missing import * Fix SnapshotRetentionServiceTests * Expose SLM policy stats in get SLM policy API (#45989) This also adds support for the SLM stats endpoint to the high level rest client. Retrieving a policy now looks like: ```json { "daily-snapshots" : { "version": 1, "modified_date": "2019-04-23T01:30:00.000Z", "modified_date_millis": 1556048137314, "policy" : { "schedule": "0 30 1 * * ?", "name": "<daily-snap-{now/d}>", "repository": "my_repository", "config": { "indices": ["data-", "important"], "ignore_unavailable": false, "include_global_state": false }, "retention": {} }, "stats": { "snapshots_taken": 0, "snapshots_failed": 0, "snapshots_deleted": 0, "snapshot_deletion_failures": 0 }, "next_execution": "2019-04-24T01:30:00.000Z", "next_execution_millis": 1556048160000 } } ``` Relates to #43663 Rewrite SnapshotLifecycleIT as as ESIntegTestCase (#46356) * Rewrite SnapshotLifecycleIT as as ESIntegTestCase This commit splits `SnapshotLifecycleIT` into two different tests. `SnapshotLifecycleRestIT` which includes the tests that do not require slow repositories, and `SLMSnapshotBlockingIntegTests` which is now an integration test using `MockRepository` to simulate a snapshot being in progress. Relates to #43663 Resolves #46205 * Add error logging when exceptions are thrown * Update serialization versions * Fix type inference * Use non-Cancellable HLRC return value * Fix Client mocking in test * Fix SLMSnapshotBlockingIntegTests for 7.x branch * Update SnapshotRetentionTask for non-multi-repo snapshot retrieval * Add serialization guards for SnapshotLifecyclePolicy	2019-09-10 09:08:09 -06:00
Michael Basnight	9304f5c889	Ensure enrich executes on master node only (#46448 ) The previous transport action was a read action, which under the right set of circumstances can execute on a coordinating node. This commit ensures that cannot happen.	2019-09-10 09:59:36 -05:00
Ioannis Kakavas	690164d0be	Change EmailSslTest for FIPS 140 JVMs (#46278 ) This commit changes the SSLContext for the email server we use in the tests so that it loads its key material from an in memory keystore (that is in turn built from a pair of PEM encoded private key and certificate) instead of a PKCS#12 one. This is done so that when we run our tests in FIPS 140-2 JVMs, the keystore is of a type that the Security Provider actually supports. This also mutes testCanSendMessageToSmtpServerByDisablingVerification as we can't run tests with verification set to `none` in FIPS 140 JVMs.	2019-09-10 14:39:40 +03:00
Alpar Torok	0ac52d0e72	Mute test in 7.x Tracked in #46529	2019-09-10 13:28:28 +03:00
Alpar Torok	b40ac6dee7	mute on 7.x fo windows Tracking #44942	2019-09-10 12:34:16 +03:00
Przemysław Witek	e21deae535	Disallow persisting any documents when datafeed is isolated (#46485 ) (#46490 )	2019-09-09 21:01:27 +02:00
James Rodewig	e253ee6ba6	[DOCS] Change // CONSOLE comments to [source,console] (#46440 ) (#46494 )	2019-09-09 12:35:50 -04:00
Martijn van Groningen	c057fce978	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-09-09 08:40:54 +02:00
Andrei Stefan	7b26a8c041	Use `null` schema response for `SYS TABLES` command. (#46386 ) (cherry picked from commit a6152f42a47a1ccd668e5892778c8bd2d3a78c4c)	2019-09-07 09:24:54 +03:00
Andrei Stefan	7cf100ba07	SQL: fix scripting for grouped by datetime functions (#46421 ) * Fix issue with painless scripting not being correctly generated when datetime functions are used for GROUPing of an INTERVAL operation. (cherry picked from commit cb92828e8ec9d9d241bd6189e5835fd99f8b9a44)	2019-09-07 09:24:53 +03:00
James Rodewig	f04573f8e8	[DOCS] [5 of 5] Change // TESTRESPONSE comments to [source,console-results] (#46449 ) (#46459 )	2019-09-06 16:09:09 -04:00
David Roberts	7c7fb7e32d	[ML] Tolerate total_search_time_ms not mapped in get datafeed stats (#46432 ) ML users who upgrade from versions prior to 7.4 to 7.4 or later will have ML results indices that do not have mappings for the total_search_time_ms field. Therefore, when searching these indices we must tolerate this field not having a mapping. Fixes #46437	2019-09-06 14:31:15 +01:00
Hendrik Muhs	c2194aa7e1	[Transform] simplify class structure of indexer (#46306 ) simplify transform task and indexer - remove redundant transform id - moving client data frame indexer (and builder) into a separate file	2019-09-06 15:24:26 +02:00
James Rodewig	bb7bff5e30	[DOCS] Replace "// TESTRESPONSE" magic comments with "[source,console-result] (#46295 ) (#46418 )	2019-09-06 09:22:08 -04:00
Dimitris Athanasiou	a6834068e3	[7.x][ML] Extract DataFrameAnalyticsTask into its own class (#46402 ) (#46426 ) This refactors `DataFrameAnalyticsTask` into its own class. The task has quite a lot of functionality now and I believe it would make code more readable to have it live as its own class rather than an inner class of the start action class. Backport of #46402	2019-09-06 14:13:46 +03:00
Hendrik Muhs	42ada88c5d	cleanup static member	2019-09-06 08:55:57 +02:00
Hendrik Muhs	ab5f09d29b	[ML-DataFrame] improve error message for timeout case in stop (#46131 ) improve error message if stopping of transform times out. related #45610	2019-09-06 08:36:01 +02:00
Hendrik Muhs	78824cce81	reuse mock client to avoid probles with thread context closed errors (#46398 )	2019-09-06 08:17:25 +02:00
Lisa Cawley	9f1339d0ce	[DOCS] Reformats Watcher APIs using template (#46152 )	2019-09-05 11:52:23 -07:00
Benjamin Trent	457ff3e2fb	7.x/ml fix instance serialization bwc (#46404 ) * [ML] Fixing instance serialization version for bwc * fixing CppLogMessage	2019-09-05 13:23:26 -05:00
Lisa Cawley	828ff01515	[DOCS] Update snippets in security APIs (#46191 ) (#46401 )	2019-09-05 11:12:39 -07:00
Benjamin Trent	caf3e4d654	[7.x] [ML][Transforms] fixing rolling upgrade continuous transform test (#45823 ) (#46347 ) (#46337 ) * [ML][Transforms] fixing rolling upgrade continuous transform test (#45823) * [ML][Transforms] fixing rolling upgrade continuous transform test * adjusting wait assert logic * adjusting wait conditions * [ML][Transforms] allow executor to call start on started task (#46347) * making sure we only upgrade from 7.4.0 in test	2019-09-05 10:31:11 -05:00
Benjamin Trent	5201386232	[ML] testFullClusterRestart waiting for stable cluster (#46280 ) (#46335 ) * [ML] waiting for ml indices before waiting task assignment testFullClusterRestart * waiting for a stable cluster after fullrestart * removing unused imports	2019-09-05 06:57:33 -05:00
Ioannis Kakavas	999658826f	Mute failing SamlAuthenticationIT tests (#46369 ) see #44410	2019-09-05 12:25:43 +03:00
Yogesh Gaikwad	d5acb15a71	[Backport] Initialize document subset bit set cache used for DLS (#46211 ) (#46359 ) This commit initializes DocumentSubsetBitsetCache even if DLS is disabled. Previously it would throw null pointer when querying usage stats if we explicitly disabled DLS as there would be no instance of DocumentSubsetBitsetCache to query. It is okay to initialize DocumentSubsetBitsetCache which will be empty as the license enforcement would prevent usage of DLS feature and it will not fail when accessing usage stats. Closes #45147	2019-09-05 14:34:19 +10:00
Lisa Cawley	b11968cf4d	[DOCS] Synchs Watcher API titles with better HLRC titles (#46328 )	2019-09-04 17:04:19 -07:00
Julie Tibshirani	40c3225d26	First round of optimizations for vector functions. (#46294 ) This PR merges the `vectors-optimize-brute-force` feature branch, which makes the following changes to how vector functions are computed: * Precompute the L2 norm of each vector at indexing time. (#45390) * Switch to ByteBuffer for vector encoding. (#45936) * Decode vectors and while computing the vector function. (#46103) * Use an array instead of a List for the query vector. (#46155) * Precompute the normalized query vector when using cosine similarity. (#46190) Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>	2019-09-04 14:45:57 -07:00
Aleh Zasypkin	5ee336ff78	[7.x] Document support of OIDC Implicit flow in Kibana. (#46329 )	2019-09-04 20:50:15 +02:00
Martijn van Groningen	ded98e50b7	Change exact match processor to match processor. (#46041 ) Besides a rename, this changes allows to processor to attach multiple enrich docs to the document being ingested. Also in order to control the maximum number of enrich docs to be included in the document being ingested, the `max_matches` setting is added to the enrich processor. Relates #32789	2019-09-04 18:05:12 +02:00
Albert Zaharovits	1a29711b06	DOCS Link to kib reference from es reference on PKI authn (#46260 )	2019-09-04 08:17:17 -07:00
Martijn van Groningen	6bec63fdfa	removed redundant cast	2019-09-04 11:18:31 +02:00
Andrey Ershov	ece9eb4acd	Remove stack trace logging in Security(Transport\|Http)ExceptionHandler (#45966 ) As per #45852 comment we no longer need to log stack-traces in SecurityTransportExceptionHandler and SecurityHttpExceptionHandler even if trace logging is enabled. (cherry picked from commit c99224a32d26db985053b7b36e2049036e438f97)	2019-09-04 11:50:35 +03:00
Dimitris Athanasiou	8fca5b5204	[7.x][ML] Unmute testStopOutlierDetectionWithEnoughDocumentsToScroll (#46271 ) (#46282 ) The test seems to have been failing due to a race condition between stopping the task and refreshing the destination index. In particular, we were going forward with refreshing the destination index even though the task stopped in the meantime. This was fixed in request. Closes #43960 Backport of #46271	2019-09-04 10:57:01 +03:00
Marios Trivyzas	fd0affb503	SQL: Fix issue with IIF function when condition folds (#46290 ) Previously, when the condition (1st argument) of the IIF function could be evaluated (folded) to false, the `IfConditional` was eliminated which caused `IndexOutOfBoundsException` to be thrown when `info()` and `resolveType()` methods where called. Fixes: #46268 (cherry picked from commit 9a885a3ac47bc8f52c07770d1d8d670ce0af1e59)	2019-09-04 10:32:49 +03:00
Benjamin Trent	8a4ff7d57d	[ML][Transforms] protecting doSaveState with optimistic concurrency (#46156 ) (#46281 ) * [ML][Transforms] protecting doSaveState with optimistic concurrency * task code cleanup	2019-09-03 15:27:24 -05:00
Benjamin Trent	c3c1dcd2ac	[ML][Transforms] fixing listener being called twice (#46284 ) (#46292 )	2019-09-03 15:26:56 -05:00
Benjamin Trent	53df54c703	[ML][Transforms] fixing stop on changes check bug (#46162 ) (#46273 ) * [ML][Transforms] fixing stop on changes check bug * Adding new method finishAndCheckState to cover race conditions in early terminations * changing stopping conditions in `onStart` * allow indexer to finish when exiting early	2019-09-03 11:04:18 -05:00
Lee Hinman	3d4b8e01c7	Validate SLM policy ids strictly (#45998 ) (#46145 ) This uses strict validation for SLM policy ids, similar to what we use for index names. Resolves #45997	2019-09-03 09:20:02 -06:00
David Roberts	ab045744ac	[ML-DataFrame] Fix off-by-one error in checkpoint operations_behind (#46235 ) Fixes a problem where operations_behind would be one less than expected per shard in a new index matched by the data frame transform source pattern. For example, if a data frame transform had a source of foo* and a new index foo-new was created with 2 shards and 7 documents indexed in it then operations_behind would be 5 prior to this change. The problem was that an empty index has a global checkpoint number of -1 and the sequence number of the first document that is indexed into an index is 0, not 1. This doesn't matter for indices included in both the last and next checkpoints, as the off-by-one errors cancelled, but for a new index it affected the observed result.	2019-09-03 12:45:02 +01:00
Ignacio Vera	59c474e675	reset queryGeometry in ShapeQueryTests (#45974 ) (#46251 )	2019-09-03 10:24:46 +02:00
markharwood	a9e3871fc7	Test fix for PinnedQueryBuilderIT (#46187 ) (#46227 ) Fix test issue to stabilise scoring through use of DFS search mode. Randomised index-then-delete docs introduced by the test framework likely caused an imbalance in IDF scores across shards. Also made number of shards used in test a random number for added test coverage. Closes #46174	2019-09-02 13:31:22 +01:00
Marios Trivyzas	3bee647e5b	SQL: Fix issue with DataType for CASE with NULL (#46173 ) Previously, if the DataType of all the WHEN conditions of a CASE statement is NULL, then it was set to NULL even if the ELSE clause has a non-NULL data type, e.g.: ``` CASE WHEN a = 1 THEN NULL WHEN a = 5 THEN NULL ELSE 'foo' ``` Fixes: #46032 (cherry picked from commit 8c1012efbbd3a300afd0dfb9b18250f15ea753f9)	2019-09-02 11:17:24 +03:00
Martijn van Groningen	555b630160	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-09-02 09:16:55 +02:00
Benjamin Trent	d0c5573a51	[ML] Throw an error when a datafeed needs CCS but it is not enabled for the node (#46044 ) (#46096 ) Though we allow CCS within datafeeds, users could prevent nodes from accessing remote clusters. This can cause mysterious errors and difficult to troubleshoot. This commit adds a check to verify that `cluster.remote.connect` is enabled on the current node when a datafeed is configured with a remote index pattern.	2019-08-30 09:27:07 -05:00
Alexander Reelsen	98c32c7846	Fix wrong URL encoding in watcher HTTP client (#45894 ) The test assumption was calling the wrong method resulting in a URL encoding before returning the data. Closes #44970	2019-08-30 14:02:49 +02:00
Dimitris Athanasiou	5921ae53d8	[7.x][ML] Regression dependent variable must be numeric (#46072 ) (#46136 ) * [ML] Regression dependent variable must be numeric This adds a validation that the dependent variable of a regression analysis must be numeric. * Address review comments and fix some problems In addition to addressing the review comments, this commit fixes a few issues I found during testing. In particular: - if there were mappings for required fields but they were not included we were not reporting the error - if explicitly included fields had unsupported types we were not reporting the error Unfortunately, I couldn't get those fixed without refactoring the code in `ExtractedFieldsDetector`.	2019-08-30 09:57:43 +03:00
Zachary Tong	cf8a4171e1	Rename `data-science` plugin to `analytics` (#46133 ) Rename `data-science` plugin to `analytics`. Also removes enabled flag. Backport of #46092	2019-08-29 12:45:39 -04:00
Simon Willnauer	9b2ea07b17	Flush engine after big merge (#46066 ) (#46111 ) Today we might carry on a big merge uncommitted and therefore occupy a significant amount of diskspace for quite a long time if for instance indexing load goes down and we are not quickly reaching the translog size threshold. This change will cause a flush if we hit a significant merge (512MB by default) which frees diskspace sooner.	2019-08-29 17:54:15 +02:00
Michael Basnight	51a703da29	Add enrich transport client support (#46002 ) This commit adds an enrich client, as well as a smoke test to validate the client works.	2019-08-29 09:10:07 -05:00
Nhat Nguyen	028e792e1d	Remove already exist assertion while renew ccr lease (#46009 ) If a CCR lease is disappeared while we are renewing it, then we will issue asyncAddRetentionLease to add that lease. And if asyncAddRetentionLease takes longer than retentionLeaseRenewInterval, then we can issue another asyncAddRetentionLease request. One of asyncAddRetentionLease requests will fail with RetentionLeaseAlreadyExistsException, hence trip the assertion. Closes #45192	2019-08-29 09:44:40 -04:00
Przemysław Witek	b8a0379057	Refactor auditor-related classes (#45893 ) (#46120 )	2019-08-29 14:21:03 +02:00
Przemysław Witek	fbe9e8a530	Do not throw an exception if the process finished quickly but without any error. (#46073 ) (#46113 )	2019-08-29 10:47:17 +02:00
Gordon Brown	47bbd9d9a9	[7.x] Fix rollover alias in SLM history index template (#46001 ) This commit adds the `rollover_alias` setting required for ILM to work correctly to the SLM history index template and adds assertions to the SLM integration tests to ensure that it works correctly.	2019-08-28 14:50:22 -07:00
Tal Levy	a356bcff41	Add Circle Processor (#43851 ) (#46097 ) add circle-processor that translates circles to polygons	2019-08-28 14:44:08 -07:00
Julie Tibshirani	d94c4dcffb	Use float instead of double for query vectors. (#46004 ) Currently, when using script_score functions like cosineSimilarity, the query vector is treated as an array of doubles. Since the stored document vectors use floats, it seems like the least surprising behavior for the query vectors to also be float arrays. In addition to improving consistency, this change may help with some optimizations we have been considering around vector dot product.	2019-08-28 11:03:14 -07:00
Mark Tozzi	9ac85a4a2b	Fix compilation in CumulativeCardinalityAggregatorTests	2019-08-28 09:31:48 -04:00
Dimitris Athanasiou	25d64508f6	[7.x][ML] Support boolean fields for DF analytics (#46037 ) (#46054 ) This commit adds support for `boolean` fields in data frame analytics (and currently both outlier detection and regression). The analytics process expects `boolean` fields to be encoded as integers with 0 or 1 value.	2019-08-28 12:02:29 +03:00
Martijn van Groningen	1157224a6b	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-28 10:14:07 +02:00
Jake Landis	154d1dd962	Watcher max_iterations with foreach action execution (#45715 ) (#46039 ) Prior to this commit the foreach action execution had a hard coded limit to 100 iterations. This commit allows the max number of iterations to be a configuration ('max_iterations') on the foreach action. The default remains 100.	2019-08-27 16:57:20 -05:00
Armin Braun	fdef293c81	Fix RegressionTests#fromXContent (#46029 ) * The `trainingPercent` must be between `1` and `100`, not `0` and `100` which is causing test failures	2019-08-27 18:24:26 +03:00
Dimitris Athanasiou	873ad3f942	[7.x][ML] Add option to regression to randomize training set (#45969 ) (#46017 ) Adds a parameter `training_percent` to regression. The default value is `100`. When the parameter is set to a value less than `100`, from the rows that can be used for training (ie. those that have a value for the dependent variable) we randomly choose whether to actually use for training. This enables splitting the data into a training set and the rest, usually called testing, validation or holdout set, which allows for validating the model on data that have not been used for training. Technically, the analytics process considers as training the data that have a value for the dependent variable. Thus, when we decide a training row is not going to be used for training, we simply clear the row's dependent variable.	2019-08-27 17:53:11 +03:00
Yogesh Gaikwad	7b6246ec67	Add `manage_own_api_key` cluster privilege (#45897 ) (#46023 ) The existing privilege model for API keys with privileges like `manage_api_key`, `manage_security` etc. are too permissive and we would want finer-grained control over the cluster privileges for API keys. Previously APIs created would also need these privileges to get its own information. This commit adds support for `manage_own_api_key` cluster privilege which only allows api key cluster actions on API keys owned by the currently authenticated user. Also adds support for retrieval of the API key self-information when authenticating via API key without the need for the additional API key privileges. To support this privilege, we are introducing additional authentication context along with the request context such that it can be used to authorize cluster actions based on the current user authentication. The API key get and invalidate APIs introduce an `owner` flag that can be set to true if the API key request (Get or Invalidate) is for the API keys owned by the currently authenticated user only. In that case, `realm` and `username` cannot be set as they are assumed to be the currently authenticated ones. The changes cover HLRC changes, documentation for the API changes. Closes #40031	2019-08-28 00:44:23 +10:00
Dimitris Athanasiou	dd6c13fdf9	[ML] Add description to DF analytics (#45774 ) (#46019 )	2019-08-27 15:48:59 +03:00
Albert Zaharovits	1ebee5bf9b	PKI realm authentication delegation (#45906 ) This commit introduces PKI realm delegation. This feature supports the PKI authentication feature in Kibana. In essence, this creates a new API endpoint which Kibana must call to authenticate clients that use certificates in their TLS connection to Kibana. The API call passes to Elasticsearch the client's certificate chain. The response contains an access token to be further used to authenticate as the client. The client's certificates are validated by the PKI realms that have been explicitly configured to permit certificates from the proxy (Kibana). The user calling the delegation API must have the delegate_pki privilege. Closes #34396	2019-08-27 14:42:46 +03:00
Ioannis Kakavas	b249e25bb4	Partly revert globalInfo.ready check (#45960 ) This check was introduced in #41392 but had the unwanted side-effect that the keystore settings in such blocks would note be added in the node's keystore. Given that we have a mid-term plan for FIPS testing that would made such checks unnecessary, and that the conditional in these two cases is not really that important, this change removes this conditional logic so that full-cluster-restart and rolling upgrade tests will run with PEM files for key/certificate material no matter if we're in a FIPS JVM or not. Resolves: #45475	2019-08-27 13:01:56 +03:00
Zachary Tong	943a016bb2	Add Cumulative Cardinality agg (and Data Science plugin) (#45990 ) This adds a pipeline aggregation that calculates the cumulative cardinality of a field. It does this by iteratively merging in the HLL sketch from consecutive buckets and emitting the cardinality up to that point. This is useful for things like finding the total "new" users that have visited a website (as opposed to "repeat" visitors). This is a Basic+ aggregation and adds a new Data Science plugin to house it and future advanced analytics/data science aggregations.	2019-08-26 16:19:55 -04:00
Benjamin Trent	a3a4ae0ac2	[ML] fixing bug where analytics process starts with 0 rows (#45879 ) (#45988 ) The native process requires that there be a non-zero number of rows to analyze. If the flag --rows 0 is passed to the executable, it throws and does not start. When building the configuration for the process we should not start the native process if there are no rows. Adding some logging to indicate what is occurring.	2019-08-26 14:18:17 -05:00
Benjamin Trent	d64018f8e1	[ML] add supported types to no fields error message (#45926 ) (#45987 ) * [ML] add supported types to no fields error message * adding supported types to logger debug	2019-08-26 14:18:00 -05:00
Jake Landis	767f648f8e	Watcher add email warning if CSV attachment contains formulas (#44460 ) (#45557 ) * Watcher add email warning if CSV attachment contains formulas (#44460) This commit introduces a Warning message to the emails generated by Watcher's reporting action. This change complements Kibana's CSV formula notifications (see elastic/kibana#37930). This is implemented by reading a header (kbn-csv-contains-formulas) provided by Kibana to notify to attach the Warning to the email. The wording of the warning is borrowed from Kibana's UI and may be overridden by a dynamic setting xpack.notification.reporting.warning.kbn-csv-contains-formulas.text. This warning is enabled by default, but may be disabled via a dynamic setting xpack.notification.reporting.warning.enabled.	2019-08-26 08:35:33 -05:00
Jake Landis	f2241a152f	watcher tests - increase stop timeout to 60s (#45679 ) (#45934 ) As of #43939 Watcher tests now correctly block until all Watch executions kicked off by that test are finished. Prior we allowed tests to finish with outstanding watch executions. It was known that this would increase the time needed to finish a test. However, running the tests on CI can be slow and on at least 1 occasion it took 60s to actually finish. This PR simply increases the max allowable timeout for Watcher tests to clean up after themselves.	2019-08-26 08:34:54 -05:00
Andrey Ershov	479ab9b8db	Fix plaintext on TLS port logging (#45852 ) Today if non-TLS record is received on TLS port generic exception will be logged with the stack-trace. SSLExceptionHelper.isNotSslRecordException method does not work because it's assuming that NonSslRecordException would be top-level. This commit addresses the issue and the log would be more concise. (cherry picked from commit 6b83527bf0c23d4d5b97fab7f290c43432945d4f)	2019-08-26 12:32:35 +02:00
Ioannis Kakavas	2bee27dd54	Allow Transport Actions to indicate authN realm (#45946 ) This commit allows the Transport Actions for the SSO realms to indicate the realm that should be used to authenticate the constructed AuthenticationToken. This is useful in the case that many authentication realms of the same type have been configured and where the caller of the API(Kibana or a custom web app) already know which realm should be used so there is no need to iterate all the realms of the same type. The realm parameter is added in the relevant REST APIs as optional so as not to introduce any breaking change.	2019-08-25 19:36:41 +03:00
Jason Tedor	040a810b3c	Add deprecation check for pidfile setting (#45939 ) The pidfile setting is deprecated. This commit adds a deprecation check for usage of this setting.	2019-08-24 17:19:20 -04:00
Jason Tedor	43ca652d11	Add deprecation check for processors (#45925 ) The processors setting is deprecated. This commit adds a deprecation check for the use of the processors setting.	2019-08-23 20:16:40 -04:00
Jason Tedor	6b116a48f3	Skip feature aware check on JDK 14 (#45928 ) ASM can not currently handle classes compiled with JDK 14. This commit skips these checks on JDK 14, for now.	2019-08-23 17:38:15 -04:00
Michael Basnight	a82d24b3ce	Remove enrich indices on delete policy (#45870 ) When a policy is deleted, the enrich indices that are backing the policy alias should also be deleted. This commit does that work and cleans up the transport action a bit so that the lock release is easier to see, as well as to ensure that any action carried out, regardless of exception, unlocks the policy.	2019-08-23 15:26:43 -05:00
Dimitris Athanasiou	be554fe5f0	[7.x][ML] Improve progress reportings for DF analytics (#45856 ) (#45910 ) Previously, the stats API reports a progress percentage for DF analytics tasks that are running and are in the `reindexing` or `analyzing` state. This means that when the task is `stopped` there is no progress reported. Thus, one cannot distinguish between a task that never run to one that completed. In addition, there are blind spots in the progress reporting. In particular, we do not account for when data is loaded into the process. We also do not account for when results are written. This commit addresses the above issues. It changes progress to being a list of objects, each one describing the phase and its progress as a percentage. We currently have 4 phases: reindexing, loading_data, analyzing, writing_results. When the task stops, progress is persisted as a document in the state index. The stats API now reports progress from in-memory if the task is running, or returns the persisted document (if there is one).	2019-08-23 23:04:39 +03:00
Benjamin Trent	b756e1b9be	[ML][Transforms] adjusting when and what to audit (#45876 ) (#45916 ) * [ML][Transforms] adjusting when and what to audit * Update DataFrameTransformTask.java * removing unnecessary audit message	2019-08-23 13:53:02 -05:00
Benjamin Trent	94c2de65b9	[ML][Transforms] fix doSaveState check (#45882 ) (#45902 ) * [ML][Transforms] fix doSaveState check * removing unnecessary log statement	2019-08-23 09:38:52 -05:00
Martijn van Groningen	a38e6850a5	fixed errors after cherry-picking 2 commits	2019-08-23 13:51:00 +02:00
Martijn van Groningen	6067065ed6	Decouple enrich processor factory from enrich policy (#45826 ) This commit changes the enrich processor factory to read the required configuration from the current enrich index (from meta mapping field) in order to create the processor. Before this change the required config was read from the enrich policy in the cluster state. Enrich policies are going to be stored in an index (instead of the cluster state). In a processor factory there isn't a way to load something from an index, so with this change we read the required config / info from the enrich index (which is derived from the enrich policy), which then allows us to move enrich policies to an index. With this change it is required to execute a policy before creating a pipeline. Otherwise there is no enrich index and then there is no way to validate that a policy exist or retrieve its type and match field. Relates to #32789	2019-08-23 13:46:39 +02:00
Martijn van Groningen	cb42e19a32	Change how type is stored in an enrich policy. (#45789 ) A policy type controls how the enrich index is created and the query executed against the match field. Currently there is a single policy type (`exact_match`). In the near future more policy types will be added and different policy may have different configuration options. For this reason type should be a json object instead of a string field: ``` { "exact_match": { ... } } ``` instead of: ``` { "type": "exact_match", ... } ``` This will make streaming parsing of enrich policies easier as in the new format, the parsing code can know ahead what configuration fields to expect. In the latter format that is not possible if the type field appears not as the first field. Relates to #32789	2019-08-23 13:43:38 +02:00
Martijn van Groningen	837cfa2640	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-23 11:22:27 +02:00
Alexander Reelsen	ecafe4f4ad	Update joda to 2.10.3 (#45495 )	2019-08-23 10:39:39 +02:00
markharwood	217e41ab6c	Search - added HLRC support for PinnedQueryBuilder (#45779 ) (#45853 ) Added HLRC support for PinnedQueryBuilder Related #44074	2019-08-23 09:22:17 +01:00
Przemysław Witek	85d55e30d0	Add test that proves _timing_stats document is deleted when the job is deleted (#45840 ) (#45854 )	2019-08-23 07:03:09 +02:00
Przemysław Witek	2ed19b2c81	Put error message from inside the process into the exception that is thrown when the process doesn't start correctly. (#45846 ) (#45875 )	2019-08-23 07:02:50 +02:00
Tim Vernum	f94e4a9151	Set security index refresh interval to 1s (#45888 ) The security indices were being created without specifying the refresh interval, which means it would inherit a value from any templates that exists. However, certain security functionality depends on being able to wait_for refresh, and causes errors (e.g. in Kibana) if that time exceeds 30s. This commit changes the security indices configuration to always be created with a 1s refresh interval. This prevents any templates from inadvertantly interfering with the proper functioning of security. It is possible for an administrator to explicitly change the refresh interval after the indices have been created. Backport of: #45434	2019-08-23 12:41:37 +10:00
Tim Vernum	029725fc35	Add SSL/TLS settings for watcher email (#45836 ) This change adds a new SSL context xpack.notification.email.ssl.* that supports the standard SSL configuration settings (truststore, verification_mode, etc). This SSL context is used when configuring outbound SMTP properties for watcher email notifications. Backport of: #45272	2019-08-23 10:13:51 +10:00
Nhat Nguyen	3393f9599e	Ignore translog retention policy if soft-deletes enabled (#45473 ) Since #45136, we use soft-deletes instead of translog in peer recovery. There's no need to retain extra translog to increase a chance of operation-based recoveries. This commit ignores the translog retention policy if soft-deletes is enabled so we can discard translog more quickly. Backport of #45473 Relates #45136	2019-08-22 16:40:06 -04:00
Benjamin Trent	8e3c54fff7	[7.x] [ML] Adding data frame analytics stats to _usage API (#45820 ) (#45872 ) * [ML] Adding data frame analytics stats to _usage API (#45820) * [ML] Adding data frame analytics stats to _usage API * making the size of analytics stats 10k * adjusting backport	2019-08-22 15:15:41 -05:00
Benjamin Trent	dff3e636c2	[ML][Transforms] unifying logging, adding some more logging (#45788 ) (#45859 ) * [ML][Transforms] unifying logging, adding some more logging * using parameterizedMessage instead of string concat * fixing bracket closure	2019-08-22 13:15:07 -05:00
Benjamin Trent	e50a78cf50	[ML-DataFrame] version data frame transform internal index (#45375 ) (#45837 ) Adds index versioning for the internal data frame transform index. Allows for new indices to be created and referenced, `GET` requests now query over the index pattern and takes the latest doc (based on INDEX name).	2019-08-22 11:46:30 -05:00
Jake Landis	1dab73929f	Watcher add stopped listener (#43939 ) (#45670 ) When Watcher is stopped and there are still outstanding watches running Watcher will report it self as stopped. In normal cases, this is not problematic. However, for integration tests Watcher is started and stopped between each test to help ensure a clean slate for each test. The tests are blocking only on the stopped state and make an implicit assumption that all watches are finished if the Watcher is stopped. This is an incorrect assumption since Stopped really means, "I will not accept any more watches". This can lead to un-predictable behavior in the tests such as message : "Watch is already queued in thread pool" and state: "not_executed_already_queued". This can also change the .watcher-history if watches linger between tests. This commit changes the semantics of a manual stopping watcher to now mean: "I will not accept any more watches AND all running watches are complete". There is now an intermediary step "Stopping" and callback to allow transition to a "Stopped" state when all Watches have completed. Additionally since this impacts how long the tests will block waiting for a "Stopped" state, the timeout has been increased. Related: #42409	2019-08-22 10:54:29 -05:00
Armin Braun	bfddaaa2ae	Acknowledge Indices Were Wiped Successfully in REST Tests (#45832 ) (#45842 ) In internal test clusters tests we check that wiping all indices was acknowledged but in REST tests we didn't. This aligns the behavior in both kinds of tests. Relates #45605 which might be caused by unacked deletes that were just slow.	2019-08-22 17:19:51 +02:00
Przemysław Witek	7512337922	[7.x] Allow the user to specify 'query' in Evaluate Data Frame request (#45775 ) (#45825 )	2019-08-22 11:14:26 +02:00
Martijn van Groningen	33972423e9	Enrich processor configuration changes (#45466 ) Enrich processor configuration changes: * Renamed `enrich_key` option to `field` option. * Replaced `set_from` and `targets` options with `target_field`. The `target_field` option behaves different to how `set_from` and `targets` worked. The `target_field` is the field that will contain the looked up document. Relates to #32789	2019-08-22 09:49:22 +02:00
Benjamin Trent	3ebeaa2557	Fixing rollup state tests after onFailure ordering change (#45784 ) (#45814 ) After the PR #45676 onFailure is now called before the indexer state has transitioned out of indexing. To fix these tests, I added a new check to make sure that we don't mark it as failed until AFTER doSaveState is called with a STARTED indexer.	2019-08-21 14:46:09 -05:00
Gordon Brown	47b1e2b3d0	[7.x] Use rollover for SLM's history indices (#45686 ) Following our own guidelines, SLM should use rollover instead of purely time-based indices to keep shard counts low. This commit implements lazy index creation for SLM's history indices, indexing via an alias, and rollover in the built-in ILM policy.	2019-08-21 13:42:11 -06:00
Henning Andersen	c3296d3251	Unmute testBiDirectionalIndexFollowing (#45641 ) (#45792 ) Cause is believed to be in build system caching so unmuting.	2019-08-21 20:53:14 +02:00
William Brafford	2b549e7342	CLI tools: write errors to stderr instead of stdout (#45586 ) Most of our CLI tools use the Terminal class, which previously did not provide methods for writing to standard output. When all output goes to standard out, there are two basic problems. First, errors and warnings are "swallowed" in pipelines, making it hard for a user to know when something's gone wrong. Second, errors and warnings are intermingled with legitimate output, making it difficult to pass the results of interactive scripts to other tools. This commit adds a second set of print commands to Terminal for printing to standard error, with errorPrint corresponding to print and errorPrintln corresponding to println. This leaves it to developers to decide which output should go where. It also adjusts existing commands to send errors and warnings to stderr. Usage is printed to standard output when it's correctly requested (e.g., bin/elasticsearch-keystore --help) but goes to standard error when a command is invoked incorrectly (e.g. bin/elasticsearch-keystore list-with-a-typo \| sort).	2019-08-21 14:46:07 -04:00
Martijn van Groningen	7f2ba91360	adjusted enrich rest specs to new format	2019-08-21 14:42:10 +02:00
Martijn van Groningen	2677ac14d2	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-21 14:28:17 +02:00
Przemysław Witek	bf701b83d2	Shorten field names in EstimateMemoryUsageResponse (#45719 ) (#45772 )	2019-08-21 12:45:09 +02:00
Zachary Tong	6b391cd0d5	Mute ShapeQueryTests#testFieldAlias() Tracking issue: https://github.com/elastic/elasticsearch/issues/45628	2019-08-21 10:31:13 +01:00
David Kyle	982560afeb	Mute RollupIndexerStateTests See #45770	2019-08-21 10:05:15 +01:00
Martijn van Groningen	5864f30771	ensure that the items in the bulk response are the same as is in the bulk request	2019-08-21 10:07:02 +02:00
Przemysław Witek	c6709f0979	Mute tests affected by renaming fields in Estimate memory usage response (#45743 ) (#45766 )	2019-08-21 09:57:23 +02:00
Dimitris Athanasiou	d5c3d9b50f	[7.x][ML] Do not skip rows with missing values for regression (#45751 ) (#45754 ) Regression analysis support missing fields. Even more, it is expected that the dependent variable has missing fields to the part of the data frame that is not for training. This commit allows to declare that an analysis supports missing values. For such analysis, rows with missing values are not skipped. Instead, they are written as normal with empty strings used for the missing values. This also contains a fix to the integration test. Closes #45425	2019-08-21 08:15:38 +03:00
Benjamin Trent	ba7b677618	[ML] better handle empty results when evaluating regression (#45745 ) (#45759 ) * [ML] better handle empty results when evaluating regression * adding new failure test to ml_security black list * fixing equality check for regression results	2019-08-20 17:37:04 -05:00
Armin Braun	a01bd6c5a3	Stop Executing SLM Policy Transport Action on Snapshot Pool (#45727 ) (#45748 ) * Executing SLM policies on the snapshot thread will block until a snapshot finishes if the pool is completely busy executing that snapshot * Fixes #45594	2019-08-20 19:15:36 +02:00
Martijn van Groningen	ac7173c0d4	Renamed CoordinatorProxyAction to EnrichCoordinatorProxyAction and (#45663 ) fail if query shard context needs current time (certain queries / scripts use this, but in the enrich context this is not used).	2019-08-20 18:51:47 +02:00
Michael Basnight	e3373d349b	Consolidate enrich list all and get by name APIs (#45705 ) The get and list APIs are a single API in this commit. Whether requesting one named policy or all policies, a list of policies is returened. The list API code has all been removed and the GET api is what remains, which contains much of the list response code.	2019-08-20 10:29:59 -05:00
Nhat Nguyen	99b21d50b8	Include leases in ccr errmsg when ops no longer available (#45681 ) The setting index.soft_deletes.retention.operations is no longer needed nor recommended in CCR. We, therefore, should hint users about the retention leases period setting instead when operations are no longer available for replicating.	2019-08-20 10:40:12 -04:00
Benjamin Trent	43bb5924e6	[ML][Data Frame] fixing _start?force=true bug (#45660 ) (#45734 ) * [ML][Data Frame] fixing _start?force=true bug * removing unused import * removing old TODO	2019-08-20 09:23:07 -05:00
Dimitris Athanasiou	49edf9e5b5	[7.x][ML] Remove timeout on waiting for DF analytics result processor to complete (#45724 ) (#45733 ) We cannot know how long the analysis will take to complete thus we should not have a timeout. Note that if the process crashes, the result processor will pick the exception due to the stream closing. Closes #45723	2019-08-20 17:21:40 +03:00
Przemysław Witek	b37ebd1adf	Prepare the codebase for new Auditor subclasses (#45716 ) (#45731 )	2019-08-20 16:03:50 +02:00
Przemysław Witek	80dd0a0948	Get rid of EstimateMemoryUsageRequest and EstimateMemoryUsageAction.Request. (#45718 ) (#45725 )	2019-08-20 15:49:17 +02:00
Benjamin Trent	88641a08af	[ML][Data frame] fixing failure state transitions and race condition (#45627 ) (#45656 ) * [ML][Data frame] fixing failure state transitions and race condition (#45627) There is a small window for a race condition while we are flagging a task as failed. Here are the steps where the race condition occurs: 1. A failure occurs 2. Before `AsyncTwoPhaseIndexer` calls the `onFailure` handler it does the following: a. `finishAndSetState()` which sets the IndexerState to STARTED b. `doSaveState(...)` which attempts to save the current state of the indexer 3. Another trigger is fired BEFORE `onFailure` can fire, but AFTER `finishAndSetState()` occurs. The trick here is that we will eventually set the indexer to failed, but possibly not before another trigger had the opportunity to fire. This could obviously cause some weird state interactions. To combat this, I have put in some predicates to verify the state before taking actions. This is so if state is indeed marked failed, the "second trigger" stops ASAP. Additionally, I move the task state checks INTO the `start` and `stop` methods, which will now require a `force` parameter. `start`, `stop`, `trigger` and `markAsFailed` are all `synchronized`. This should gives us some guarantees that one will not switch states out from underneath another. I also flag the task as `failed` BEFORE we successfully write it to cluster state, this is to allow us to make the task fail more quickly. But, this does add the behavior where the task is "failed" but the cluster state does not indicate as much. Adding the checks in `start` and `stop` will handle this "real state vs cluster state" race condition. This has always been a problem for `_stop` as it is not a master node action and doesn’t always have the latest cluster state. closes #45609 Relates to #45562 * [ML][Data Frame] moves failure state transition for MT safety (#45676) * [ML][Data Frame] moves failure state transition for MT safety * removing unused imports	2019-08-20 07:30:17 -05:00
markharwood	7d5ab17bb2	Search enhancement: pinned queries (#44345 ) (#45657 ) * Search enhancement: pinned queries (#44345) Search enhancement: - new query type allows selected documents to be promoted above any "organic” search results. This is the first feature in a new module `search-business-rules` which will house licensed (non OSS) logic for rewriting queries according to business rules. The PinnedQueryBuilder class offers a new `pinned` query in the DSL that takes an array of promoted IDs and an “organic” query and ensures the documents with the promoted IDs rank higher than the organic matches. Closes #44074	2019-08-20 11:38:22 +01:00
Costin Leau	0f51dd69cb	SQL: Improve serialization of SQL processors (#45678 ) Encapsulate the serialization/deserialization of SQL client classes. Make configuration specific parameters (such as ZoneId) generic just like the version and remove the need for consumer classes to manage them individually. This is not only consistent but also provides significant savings in the cursor. Fix #40216 (cherry picked from commit 5c844798045d7baa0d932289d2e3d1607ba6a9a4)	2019-08-20 11:50:47 +03:00
Przemysław Witek	7bc8400222	Call the new _estimate_memory_usage API endpoint on df analytics _start (#45536 ) (#45701 )	2019-08-19 21:37:55 +02:00
James Rodewig	4b932519aa	[DOCS] Document `throttle_period_in_millis` for watcher actions (#45607 )	2019-08-19 08:27:52 -04:00
Costin Leau	1cd58c8ea8	SQL: Break TextFormatter/Cursor dependency (#45613 ) Improve the initialization and state passing of TextFormatter in CLI and TEXT mode by leveraging the Page listener hook. Additionally simplify the code inside RestSqlQueryAction. (cherry picked from commit a56db2fa119cf9e8748723e19f1fc9f6a8afe5fc)	2019-08-17 00:16:08 +03:00
Costin Leau	96883dd028	SQL: Refactor away the cycle between Rowset and Cursor (#45516 ) Improve encapsulation of pagination of rowsets by breaking the cycle between cursor and associated rowset implementation, all logic now residing inside each cursor implementation. (cherry picked from commit be8fe0a0ce562fe732fae12a0b236b5731e4638c)	2019-08-17 00:16:05 +03:00
Gordon Brown	ecb3ebd796	Clean SLM and ongoing snapshots in test framework (#45564 ) Adjusts the cluster cleanup routine in ESRestTestCase to clean up SLM test cases, and optionally wait for all snapshots to be deleted. Waiting for all snapshots to be deleted, rather than failing if any are in progress, is necessary for tests which use SLM policies because SLM policies may be in the process of executing when the test ends.	2019-08-16 14:17:34 -06:00
Armin Braun	c321272ae7	Mute testBiDirectionalIndexFollowing for #45641 (#45674 ) * Muting #45641	2019-08-16 22:02:41 +02:00
Igor Motov	98c850c08b	Geo: Change order of parameter in Geometries to lon, lat 7.x (#45618 ) Changes the order of parameters in Geometries from lat, lon to lon, lat and moves all Geometry classes are moved to the org.elasticsearch.geomtery package. Backport of #45332 Closes #45048	2019-08-16 14:42:02 -04:00
Luca Cavanna	c31cddf27e	Update the schema for the REST API specification (#42346 ) * Update the REST API specification This patch updates the REST API spefication in JSON files to better encode deprecated entities, to improve specification of URL paths, and to open up the schema for future extensions. Notably, it changes the `paths` from a list of strings to a list of objects, where each particular object encodes all the information for this particular path: the `parts` and the `methods`. Among the benefits of this approach is eg. encoding the difference between using the `PUT` and `POST` methods in the Index API, to either use a specific document ID, or let Elasticsearch generate one. Also `documentation` becomes an object that supports an `url` and also a `description` which is a new field. * Adapt YAML runner to new REST API specification format The logic for choosing the path to use when running tests has been simplified, as a consequence of the path parts being listed under each path in the spec. The special case for create and index has been removed. Also the parsing code has been hardened so that errors are thrown earlier when the structure of the spec differs from what expected, and their error messages should be more helpful.	2019-08-16 14:40:00 +02:00
Andrei Stefan	30a0711777	Remove deprecated use of "interval" method, in favor of "fixedInterval". (#45501 ) (cherry picked from commit 3fef65160f9e61883e9f8f7f345b814f945e2f4b)	2019-08-16 15:03:43 +03:00
Martijn van Groningen	5ea0985711	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-16 09:47:11 +02:00
Michael Basnight	db57d2206a	Prevent delete policy for active executing policy (#45472 ) This commit adds a lock to the delete policy, in the same way that the locking is done for policy execution. It also creates a test to exercise the delete transport action, and modifies an existing test to provide a common set of functions for saving and deleting policies.	2019-08-15 10:08:11 -05:00
Alpar Torok	7119e54be5	Mute data frame tests on 7.x Tracking in #45610 #45609	2019-08-15 17:07:53 +03:00
Michael Basnight	03f45dad57	Fix policy removal bug in delete policy (#45573 ) The delete policy had a subtle bug in that it would still delete the policy if pipelines were accessing it, after giving the client back an error. This commit fixes that and ensures it does not happen by adding verification in the test.	2019-08-15 13:20:59 +02:00
David Roberts	d40f3718f2	[ML] Muting 5 SSLErrorMessageTests tests on Windows (#45602 ) Due to https://github.com/elastic/elasticsearch/issues/45598	2019-08-15 11:05:00 +01:00
Benjamin Trent	fde5dae387	[ML][Data Frame] adjusting change detection workflow (#45511 ) (#45580 ) * [ML][Data Frame] adjusting change detection workflow * adjusting for PR comment * disallowing null as an argument value	2019-08-14 17:26:24 -05:00
Nick Knize	647a8308c3	[SPATIAL] Backport new ShapeFieldMapper and ShapeQueryBuilder to 7x (#45363 ) * Introduce Spatial Plugin (#44389) Introduce a skeleton Spatial plugin that holds new licensed features coming to Geo/Spatial land! * [GEO] Refactor DeprecatedParameters in AbstractGeometryFieldMapper (#44923) Refactor DeprecatedParameters specific to legacy geo_shape out of AbstractGeometryFieldMapper.TypeParser#parse. * [SPATIAL] New ShapeFieldMapper for indexing cartesian geometries (#44980) Add a new ShapeFieldMapper to the xpack spatial module for indexing arbitrary cartesian geometries using a new field type called shape. The indexing approach leverages lucene's new XYShape field type which is backed by BKD in the same manner as LatLonShape but without the WGS84 latitude longitude restrictions. The new field mapper builds on and extends the refactoring effort in AbstractGeometryFieldMapper and accepts shapes in either GeoJSON or WKT format (both of which support non geospatial geometries). Tests are provided in the ShapeFieldMapperTest class in the same manner as GeoShapeFieldMapperTests and LegacyGeoShapeFieldMapperTests. Documentation for how to use the new field type and what parameters are accepted is included. The QueryBuilder for searching indexed shapes is provided in a separate commit. * [SPATIAL] New ShapeQueryBuilder for querying indexed cartesian geometry (#45108) Add a new ShapeQueryBuilder to the xpack spatial module for querying arbitrary Cartesian geometries indexed using the new shape field type. The query builder extends AbstractGeometryQueryBuilder and leverages the ShapeQueryProcessor added in the previous field mapper commit. Tests are provided in ShapeQueryTests in the same manner as GeoShapeQueryTests and docs are updated to explain how the query works.	2019-08-14 16:35:10 -05:00
Michael Basnight	fd57d3cb29	Fix test broken by policy rename	2019-08-14 13:57:47 -05:00
Michael Basnight	52a094b177	Fail delete policy if pipeline exists (#44438 ) If a pipeline that refrences the policy exists, we should not allow the policy to be deleted. The user will need to remove the processor from the pipeline before deleting the policy. This commit adds a check to ensure that the policy cannot be deleted if it is referenced by any pipeline in the system.	2019-08-14 13:51:10 -05:00
Benjamin Trent	0c343d8443	[7.x] [ML][Transforms] adjusting stats.progress for cont. transforms (#45361 ) (#45551 ) * [ML][Transforms] adjusting stats.progress for cont. transforms (#45361) * [ML][Transforms] adjusting stats.progress for cont. transforms * addressing PR comments * rename fix * Adjusting bwc serialization versions	2019-08-14 13:08:27 -05:00
Martijn van Groningen	43b8ab607d	Improve naming of enrich policy fields. (#45494 ) Renamed `enrich_key` to `match_field` and renamed `enrich_values` to `enrich_fields`. Relates #32789	2019-08-14 11:45:22 +02:00
Przemysław Witek	df574e5168	[7.x] Implement ml/data_frame/analytics/_estimate_memory_usage API endpoint (#45188 ) (#45510 )	2019-08-14 08:26:03 +02:00
Gordon Brown	3f5dab99c3	Properly set origin for SLM history store client (#45515 ) The origin was not set properly for the SnapshotHistoryStore client, resulting in errors when SLM was used when security was enabled.	2019-08-13 18:23:20 -06:00
Andrei Stefan	adf8e20021	SQL: adds format parameter to range queries for constant date comparisons (#45503 ) * Add format parameter to the range queries built for CURRENT_* functions used in comparison conditions * Use range queries for date fields equality/non-equality as well. (cherry picked from commit c1e81e90f937ee5a002524d632bfce74d76962f9)	2019-08-13 23:04:30 +03:00
Martijn van Groningen	452557cf2e	Validate policy name like an index name. (#45452 ) The policy name is used to generate the enrich index name. For this reason, a policy name should be validated in the same way as index names. Relates to #32789	2019-08-13 20:25:17 +02:00
Armin Braun	90803a5caf	Reenable Integ Tests in native-multi-node-tests (#45482 ) (#45496 ) * Reenable Integ Tests in native-multi-node-tests * The tests broken here were likely fixed by #45463 => let's reenable them and see if things run fine again * Relates #45405, #45455	2019-08-13 15:55:54 +02:00
Mayya Sharipova	22ab389531	Clarify that FLS/DLS disable shard request cache (#45462 )	2019-08-13 09:05:57 -04:00
Alexander Reelsen	dd527b4e91	Fix watcher HttpClient URL creation (#45207 ) The http client could end up creating URLs, that did not resemble the original one, when encoding. This fixes a couple of corner cases, where too much or too few slashes were added to an URI. Closes #44970	2019-08-13 12:15:54 +02:00
Martijn van Groningen	0353eb9291	required changes after merging in upstream branch	2019-08-13 09:17:57 +02:00
Martijn van Groningen	1951cdf1cb	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-13 09:12:31 +02:00
Przemysław Witek	1aed388a24	Add view_index_metadata to roles.yml and remove as many df analytics test cases from build.gradle blacklist as possible. (#45451 ) (#45465 )	2019-08-13 08:31:58 +02:00
Yogesh Gaikwad	471d940c44	Refactor cluster privileges and cluster permission (#45265 ) (#45442 ) The current implementations make it difficult for adding new privileges (example: a cluster privilege which is more than cluster action-based and not exposed to the security administrator). On the high level, we would like our cluster privilege either: - a named cluster privilege This corresponds to `cluster` field from the role descriptor - or a configurable cluster privilege This corresponds to the `global` field from the role-descriptor and allows a security administrator to configure them. Some of the responsibilities like the merging of action based cluster privileges are now pushed at cluster permission level. How to implement the predicate (using Automaton) is being now enforced by cluster permission. `ClusterPermission` helps in enforcing the cluster level access either by performing checks against cluster action and optionally against a request. It is a collection of one or more permission checks where if any of the checks allow access then the permission allows access to a cluster action. Implementations of cluster privilege must be able to provide information regarding the predicates to the cluster permission so that can be enforced. This is enforced by making implementations of cluster privilege aware of cluster permission builder and provide a way to specify how the permission is to be built for a given privilege. This commit renames `ConditionalClusterPrivilege` to `ConfigurableClusterPrivilege`. `ConfigurableClusterPrivilege` is a renderable cluster privilege exposed as a `global` field in role descriptor. Other than this there is a requirement where we would want to know if a cluster permission is implied by another cluster-permission (`has-privileges`). This is helpful in addressing queries related to privileges for a user. This is not just simply checking of cluster permissions since we do not have access to runtime information (like request object). This refactoring does not try to address those scenarios. Relates #44048	2019-08-13 09:06:18 +10:00
Ryan Ernst	97efb6a403	Convert vagrant tests to per platform projects (#45064 ) The vagrant based tests currently reside in a single project, creating dozens of tasks to manage starting and stopping the vagrant VM along with running java and bats tests within each image. This all-in-one pattern makes parallelizing packaging tests difficult. This commit rewrites the vagrant testing infrastructure to be independent of the actual test runners, thus allowing each platform to be handled in a separate subproject. Additionally, the java and bats tests are changed to be run through a "destructive" gradle task, which is run inside the VM. The combination of these will allow parallelization both locally (through running several VMs at once) as well as running the destructive tasks in CI machines dedicated to each platform (thus removing the need for vagrant in CI).	2019-08-12 16:01:53 -07:00
Mark Vieira	7e3379444b	Fix build failure due to unknown task and disable test conventions (cherry picked from commit 8ed84bc5cef9bcfae6c817059f764d97e4451a4a)	2019-08-12 09:18:39 -07:00
Przemyslaw Gomulka	421e9b8e8b	Mute integ tests in native-multi-node-tests (#45457 ) Tracked at #45405	2019-08-12 17:42:24 +02:00
Przemyslaw Gomulka	d11ae08467	Muting ForecastIT.testOverflowToDisk (#45435 ) (#45438 ) awaits #45405	2019-08-12 11:01:32 +02:00
Dimitris Athanasiou	d02d6e40c2	[ML] Mute regression integ test Relates #45425	2019-08-12 10:59:24 +03:00
Armin Braun	a9e1402189	Remove Settings from BaseRestRequest Constructor (#45418 ) (#45429 ) * Resolving the todo, cleaning up the unused `settings` parameter * Cleaning up some other minor dead code in affected classes	2019-08-12 05:14:45 +02:00
Benjamin Trent	fac1a6f8e8	[ML][Data Frame] have DataFrameTransformConfigUpdate#apply set Version (#45391 ) (#45400 )	2019-08-09 14:32:49 -05:00
Hendrik Muhs	bf4da6c6ad	[ML-DataFrame] fix starting a batch data frame after stopping at runtime (#45340 ) (#45381 ) fix loading of next checkpoint after data frame transform has been stopped/started within one run closes #45339	2019-08-09 20:30:11 +02:00
Dimitris Athanasiou	27497ff75f	[7.x][ML] Add regression analysis to DF analytics (#45292 ) (#45388 ) This commit adds a first draft of a regression analysis to data frame analytics. There is high probability that the exact syntax might change. This commit adds the new analysis type and its parameters as well as appropriate validation. It also modifies the extractor and the fields detector to be able to handle categorical fields as regression analysis supports them.	2019-08-09 19:31:13 +03:00
Martijn van Groningen	4ac25b23f6	Add support for a more compact enrich values format (#45033 ) In the case that source and target are the same in `enrich_values` then a string array can be specified. For example instead of this: ``` PUT /_ingest/pipeline/my-pipeline { "processors": [ { "enrich" : { "policy_name": "my-policy", "enrich_values": [ { "source": "first_name", "target": "first_name" }, { "source": "last_name", "target": "last_name" }, { "source": "address", "target": "address" }, { "source": "city", "target": "city" }, { "source": "state", "target": "state" }, { "source": "zip", "target": "zip" } ] } } ] } ``` This more compact format can be specified: ``` PUT /_ingest/pipeline/my-pipeline { "processors": [ { "enrich" : { "policy_name": "my-policy", "targets": [ "first_name", "last_name", "address", "city", "state", "zip" ] } } ] } ``` And the `enrich_values` key has been renamed to `set_from`. Relates to #32789	2019-08-09 12:40:58 +02:00
Alpar Torok	634a070430	Restrict which tasks can use testclusters (#45198 ) * Restrict which tasks can use testclusters This PR fixes a problem between the interaction of test-clusters and build cache. Before this any task could have used a cluster without tracking it as input. With this change a new interface is introduced to track the tasks that can use clusters and we do consider the cluster as input for all of them.	2019-08-09 13:38:01 +03:00
Martijn van Groningen	f1ee29f22e	Added a custom api to perform the msearch more efficiently for enrich processor (#43965 ) Currently the msearch api is used to execute buffered search requests; however the msearch api doesn't deal with search requests in an intelligent way. It basically executes each search separately in a concurrent manner. This api reuses the msearch request and response classes and executes the searches as one request in the node holding the enrich index shard. Things like engine.searcher and query shard context are only created once. Also there are less layers than executing a regular msearch request. This results in an interesting speedup. Without this change, in a single node cluster, enriching documents with a bulk size of 5000 items, the ingest time in each bulk response varied from 174ms to 822ms. With this change the ingest time in each bulk response varied from 54ms to 109ms. I think we should add a change like this based on this improvement in ingest time. However I do wonder if instead of doing this change, we should improve the msearch api to execute more efficiently. That would be more complicated then this change, because in this change the custom api can only search enrich index shards and these are special because they always have a single primary shard. If msearch api is to be improved then that should work for any search request to any indices. Making the same optimization for indices with more than 1 primary shard requires much more work. The current change is isolated in the enrich plugin and LOC / complexity is small. So this good enough for now.	2019-08-09 09:11:04 +02:00
Hendrik Muhs	7d0aff0ed5	[ML-DataFrame] fix test failure in checkpoint retrieval (#45297 ) gracefully handle if index response returns null, increase and assert timeout closes #45238	2019-08-09 09:04:53 +02:00
Hendrik Muhs	68f9102550	[ML-DataFrame] audit changes in the source index (#45282 ) add audits when the set of source indexes changes and in a special case runs empty	2019-08-08 23:31:55 +02:00
Andrei Stefan	740d58fd46	SQL: Uniquely named inner_hits sections for each nested field condition (#45341 ) * Name each inner_hits section of nested queries differently and extract and combine the multiple values it generates into a single list. This also introduces a limitation (its origin it's with Elasticsearch though) on the sorting capabilities when the sorting is based on the nested fields filtered: only one of the conditions applied to nested documents will be used in the nested sorting. (cherry picked from commit cfc5cf68f6e83b07bb9006986d0903d6be418ec6)	2019-08-09 00:22:49 +03:00
David Roberts	14545f8958	[ML-DataFrame] Combine task_state and indexer_state in _stats (#45324 ) This commit replaces task_state and indexer_state in the data frame _stats output with a single top level state that combines the two. It is defined as: - failed if what's currently reported as task_state is failed - stopped if there is no persistent task - Otherwise what's currently reported as indexer_state Backport of #45276	2019-08-08 16:24:26 +01:00
Martijn van Groningen	708f856940	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-08 16:52:45 +02:00
Ioannis Kakavas	99ddb8b3d8	Allow empty token endpoint for implicit flow (#45038 ) When using the implicit flow in OpenID Connect, the op.token_endpoint_url should not be mandatory as there is no need to contact the token endpoint of the OP.	2019-08-08 12:50:53 +03:00
Martijn van Groningen	e3fd1e6c7d	Add support for overwrite parameter in the enrich processor. (#45029 ) Similar to how it is supported in the set processor: https://www.elastic.co/guide/en/elasticsearch/reference/current/set-processor.html Relates to #32789	2019-08-08 10:33:19 +02:00
Benjamin Trent	5db9982f71	[7.x] [ML][Data Frame] Add update transform api endpoint (#45154 ) (#45279 ) * [ML][Data Frame] Add update transform api endpoint (#45154) This adds the ability to `_update` stored data frame transforms. All mutable fields are applied when the next checkpoint starts. The exception being `description`. This PR contains all that is necessary for this addition: * HLRC * Docs * Server side	2019-08-07 10:37:35 -05:00
Benjamin Trent	3a71b91dca	[ML][Data Frame] add support for geo_bounds aggregation (#44441 ) (#45281 ) This adds support for `geo_bounds` aggregation inside the `pivot.aggregations` configuration. The two points returned from the `geo_bounds` aggregation are transformed into `geo_shape` whose types are dynamic given the point's similarity. * `point` if the two points are identical * `linestring` if the two points share either a latitude or longitude * `polygon` if the two points are completely different The automatically deduced mapping for the resulting field is a `geo_shape`.	2019-08-07 10:37:09 -05:00
Lee Hinman	c7ec0b8431	Include in-progress snapshot for a policy with get SLM policy… (#45245 ) This commit adds the "in_progress" key to the SLM get policy API, returning a policy that looks like: ```json { "daily-snapshots" : { "version" : 1, "modified_date" : "2019-08-05T18:41:48.778Z", "modified_date_millis" : 1565030508778, "policy" : { "name" : "<production-snap-{now/d}>", "schedule" : "0 30 1 * * ?", "repository" : "repo", "config" : { "indices" : [ "foo-*", "important" ], "ignore_unavailable" : true, "include_global_state" : false }, "retention" : { "expire_after" : "10m" } }, "last_success" : { "snapshot_name" : "production-snap-2019.08.05-oxctmnobqye3luim4uejhg", "time_string" : "2019-08-05T18:42:23.257Z", "time" : 1565030543257 }, "next_execution" : "2019-08-06T01:30:00.000Z", "next_execution_millis" : 1565055000000, "in_progress" : { "name" : "production-snap-2019.08.05-oxctmnobqye3luim4uejhg", "uuid" : "t8Idqt6JQxiZrzp0Vt7z6g", "state" : "STARTED", "start_time" : "2019-08-05T18:42:22.998Z", "start_time_millis" : 1565030542998 } } } ``` These are only visible while the snapshot is being taken (or failed), since it reads from the cluster state rather than from the repository itself.	2019-08-07 08:29:49 -06:00
Benjamin Trent	be911e6a53	[ML][Data Frames] Fix null aggregation handling in indexer (#45061 ) (#45257 ) * [ML][Data Frames] Fix null aggregation handling in indexer * addressing PR comments * adjusting error messages	2019-08-07 07:01:13 -05:00
Tom Callahan	a7a419bee8	Change Ldap SDK License to LGPL-2.1 (#45116 ) We currently use the unboundid ldap SDK, which is triply licensed under GPL-2.0, LGPL-2.1, and the "UnboundID LDAP SDK Free Use License". We currently identify the license as the latter, but LGPL-2.1 is the one we should be using per our policy.	2019-08-06 16:48:09 -04:00
Jason Tedor	9a142ff25c	Introduce formal node ML role (#45174 ) This commit builds on the ability for plugins to introduce new roles to add a formal node ML role.	2019-08-06 13:00:05 -04:00
Zachary Tong	422aca9a5d	Fix Rollup job creation to work with templates (#43943 ) The PutJob API accidentally used an "expert" API of CreateIndexRequest. That API is semi-lenient to syntax; a type could be omitted and the request would work as expected. But if a type was omitted it would not merge with templates correctly, leading to index creation that only has the template and not the requested mappings in the request. This commit refactors the PutJob API to: - Include the type name - Use a less "expert" API in an attempt to future proof against errors - Uses an XContentBuilder instead of string replacing, removes json template	2019-08-06 10:53:44 -04:00
Jason Tedor	5b1b146099	Normalize environment paths (#45179 ) This commit applies a normalization process to environment paths, both in how they are stored internally, also their settings values. This normalization is done via two means: - we make the paths absolute - we remove redundant name elements from the path (what Java calls "normalization") This change ensures that when we compare and refer to these paths within the system, we are using a common ground. For example, prior to the change if the data path was relative, we would not compare it correctly to paths from disk usage. This is because the paths in disk usage were being made absolute.	2019-08-06 06:04:30 -04:00
Yannick Welsch	7aeb2fe73c	Add per-socket keepalive options (#44055 ) Uses JDK 11's per-socket configuration of TCP keepalive (supported on Linux and Mac), see https://bugs.openjdk.java.net/browse/JDK-8194298, and exposes these as transport settings. By default, these options are disabled for now (i.e. fall-back to OS behavior), but we would like to explore whether we can enable them by default, in particular to force keepalive configurations that are better tuned for running ES.	2019-08-06 10:45:44 +02:00
Hendrik Muhs	6b5a2513a9	[ML-DataFrame] introduce an abstraction for checkpointing (#44900 ) introduces an abstraction for how checkpointing and synchronization works, covering - retrieval of checkpoints - check for updates - retrieving stats information	2019-08-06 07:38:59 +02:00
Benjamin Trent	7bfaba98c2	[ML][Data Frame] cleaning up and adjusting failure tests (#45101 ) (#45144 )	2019-08-05 09:12:11 -05:00
Tim Brooks	984ba82251	Move nio channel initialization to event loop (#45155 ) Currently in the transport-nio work we connect and bind channels on the a thread before the channel is registered with a selector. Additionally, it is at this point that we set all the socket options. This commit moves these operations onto the event-loop after the channel has been registered with a selector. It attempts to set the socket options for a non-server channel at registration time. If that fails, it will attempt to set the options after the channel is connected. This should fix #41071.	2019-08-02 17:31:31 -04:00
Lisa Cawley	00235bbecd	[DOCS] Reformats the security APIs (#45124 )	2019-08-02 11:32:47 -07:00
Alison Goryachev	b607148ae9	[DOCS] Fix watcher email action docs (#44877 )	2019-08-02 14:02:08 -04:00
David Roberts	a1f0285f0e	[TEST] Only test US locale in day/month order test in FIPS JVM (#45141 ) In the FIPS JVM the JVM default locale seems to leak into places where it should be overridden. This change skips assertions in TimestampFormatFinderTests.testGuessIsDayFirstFromLocale that may be impacted. Fixes #45140	2019-08-02 15:04:47 +01:00
David Turner	9ff320d967	Use index for peer recovery instead of translog (#45137 ) Today we recover a replica by copying operations from the primary's translog. However we also retain some historical operations in the index itself, as long as soft-deletes are enabled. This commit adjusts peer recovery to use the operations in the index for recovery rather than those in the translog, and ensures that the replication group retains enough history for use in peer recovery by means of retention leases. Reverts #38904 and #42211 Relates #41536 Backport of #45136 to 7.x.	2019-08-02 15:00:43 +01:00
Christoph Büscher	3366726ad1	Enable reloading of synonym_graph filters (#45135 ) Reloading of synonym_graph filter doesn't work currently because the search time AnalysisMode doesn't get propagated to the TokenFilterFactory emitted by the graph filters getChainAwareTokenFilterFactory() method. This change fixes that. Closes #45127	2019-08-02 15:33:42 +02:00
Tomas Della Vedova	6b71621afc	Updated slm API spec parameters and URL (#44797 ) (#45102 )	2019-08-02 11:39:52 +02:00
David Roberts	f617585dbd	[ML] Improve CSV header row detection in find_file_structure (#45099 ) When doing a fieldwise Levenshtein distance comparison between CSV rows, this change ignores all fields that have long values, not just the longest field. This approach works better for CSV formats that have multiple freeform text fields rather than just a single "message" field. Fixes #45047	2019-08-02 09:08:21 +01:00
Tim Vernum	e21d58541a	Improve errors when TLS files cannot be read (#45122 ) This change improves the exception messages that are thrown when the system cannot read TLS resources such as keystores, truststores, certificates, keys or certificate-chains (CAs). This change specifically handles: - Files that do not exist - Files that cannot be read due to file-system permissions - Files that cannot be read due to the ES security-manager Backport of: #44787	2019-08-02 12:29:43 +10:00
Tim Vernum	590777150f	Explicitly fail if a realm only exists in keystore (#45091 ) There are no realms that can be configured exclusively with secure settings. Every realm that supports secure settings also requires one or more non-secure settings. However, sometimes a node will be configured with entries in the keystore for which there is nothing in elasticsearch.yml - this may be because the realm we removed from the yml, but not deleted from the keystore, or it could be because there was a typo in the realm name which has accidentially orphaned the keystore entry. In these cases the realm building would fail, but the error would not always be clear or point to the root cause (orphaned keystore entries). RealmSettings would act as though the realm existed, but then fail because an incorrect combination of settings was provided. This change causes realm building to fail early, with an explicit message about incorrect keystore entries. Backport of: #44471	2019-08-02 12:28:59 +10:00
Yogesh Gaikwad	ae5c01e2d2	Do not use scroll when finding duplicate API key (#45026 ) When we create API key we check if the API key with the name already exists. It searches with scroll enabled and this causes the request to fail when creating large number of API keys in parallel as it hits the number of open scroll limit (default 500). We do not need the search context to be created so this commit removes the scroll parameter from the search request for duplicate API key.	2019-08-02 10:16:48 +10:00
Dimitris Athanasiou	8a6675b994	[7.x][ML] Check dest index is empty when starting DF analytics (#45094 ) (#45112 ) If one tries to start a DF analytics job that has already run, the result will be that the task will fail after reindexing the dest index from the source index. The results of the prior run will be gone and the task state is not properly set to failed with the failure reason. This commit improves the behavior in this scenario. First, we set the task state to `failed` in a set of failures that were missed. Second, a validation is added that if the destination index exists, it must be empty.	2019-08-02 00:19:48 +03:00
Mark Vieira	c13285a382	Remove unnecessary plugin application and project configuration (#45100 )	2019-08-01 14:18:24 -07:00
Yannick Welsch	917510d3e4	Always use primary term of operation in InternalEngine (#45083 ) We keep adding the current primary term to operations for which we do not assign a sequence number. This does not make sense anymore as all operations which we care about have sequence numbers now. The goal of this commit is to clean things up in InternalEngine and reduce the complexity.	2019-08-01 17:30:00 +02:00
Przemysław Witek	6c87845fc1	Persist DatafeedTimingStats with RefreshPolicy.NONE by default (#44940 ) (#45079 )	2019-08-01 14:36:59 +02:00
Martijn van Groningen	aae2f0cff2	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-08-01 13:38:03 +07:00
Mayya Sharipova	0c68765088	Adds usage stats for vectors (#45023 ) Example of usage: _xpack/usage "vectors": { "available": true, "enabled": true, "dense_vector_fields_count" : 1, "sparse_vector_fields_count" : 1, "dense_vector_dims_avg_count" : 100 } Backport for #44512	2019-07-31 12:32:41 -04:00
Armin Braun	c7d7230524	Stop Recreating Wrapped Handlers in RestController (#44964 ) (#45040 ) * We shouldn't be recreating wrapped REST handlers over and over for every request. We only use this hook in x-pack and the wrapper there does not have any per request state. This is inefficient and could lead to some very unexpected memory behavior => I made the logic create the wrapper on handler registration and adjusted the x-pack wrapper implementation to correctly forward the circuit breaker and content stream flags	2019-07-31 17:11:34 +02:00
Ioannis Kakavas	56da35b706	Indicate that some user APIs handle built-in users (#44857 ) The Get Users API also returns users form the restricted realm or built-in users, as we call them in our docs. One can also change the passwords of built-in users with the Change Password API	2019-07-31 17:55:28 +03:00
Tim Vernum	3c17d4379d	Expand logging when SAML Audience condition fails (#45027 ) A mismatched configuration between the IdP and SP will often result in SAML authentication attempts failing because the audience condition is not met (because the IdP and SP disagree about the correct form of the SP's Entity ID). Previously the error message in this case did not provide sufficient information to resolve the issue because the IdP's expected audience would be truncated if it exceeeded 32 characters. Since the error did not provide both IDs in full, it was not possible to determine the correct fix (in detail) based on the error alone. This change expands the message that is included in the thrown exception, and also adds additional logging of every failed audience condition, with diagnostics of the match failure. Backport of: #44334	2019-07-31 19:40:17 +10:00
Benjamin Trent	3f48720d41	[ML][Data Frames] unify validation exceptions between PUT/_preview (#44983 ) (#45012 ) * [ML][Data Frames] unify validation exceptions between PUT/_preview * addressing PR comments	2019-07-30 13:05:07 -05:00
Benjamin Trent	22feedf289	[ML][Data Frame] add support for bucket_selector (#44718 ) (#45008 )	2019-07-30 11:32:58 -05:00
Armin Braun	548c767b6b	S3 3rd Party Test Goal (#44799 ) (#45004 ) * Create S3 Third Party Test Task that Covers the S3 CLI Tool * Adjust snapshot cli test tool tests to work with real S3 * Build adjustment * Clean up repo path before testing * Dedup the logic for asserting path contents by using the correct utility method here that somehow became unused	2019-07-30 17:16:41 +02:00
David Kyle	d0cbf0cc7f	Mute WatcherRestIT 20_minimal_body Relates to https://github.com/elastic/elasticsearch/issues/43988	2019-07-30 15:58:16 +01:00
David Turner	55f1dd8da6	Close nodes properly in Coordinator tests (#44967 ) Today closing a `ClusterNode` in an `AbstractCoordinatorTestCase` uses `onNode()` so has no effect if the node is not in the current list of nodes. It also discards the `Runnable` it creates without having run it, so has no effect anyway. This commit makes these tests much stricter about properly closing the nodes started during `Coordinator` tests, by tracking the persisted states that are opened, and adds an assertion to catch the trappy requirement that the closing node still belongs to the cluster.	2019-07-30 11:47:36 +01:00
David Kyle	e18e9fa8c5	Mute SnapshotLifecycleServiceTests#testPolicyCRUD Relates to https://github.com/elastic/elasticsearch/issues/44997	2019-07-30 10:36:27 +01:00
Andrey Ershov	5a0bd696fc	Snapshot tool S3 cleanup 7.x backport (#44575 ) Backport of #44551	2019-07-30 11:02:08 +02:00
Tim Vernum	f575370e2f	Fix broken short-circuit in getUnlicensedRealms (#44937 ) The existing equals check was broken, and would always be false. The correct behaviour is to return "Collections.emptyList()" whenever the the active(licensed)-realms equals the configured-realms. Backport of: #44399	2019-07-30 16:32:04 +10:00
Lee Hinman	598c4e72f9	[7.x] Rename indexlifecycle to ilm and snapshotlifecycle to sl… (#44977 ) * Rename indexlifecycle to ilm and snapshotlifecycle to slm (#44917) As a followup to #44725 and #44608, which renamed the packages within the x-pack project, this renames the packages within the core x-pack project. It also renames 'snapshotlifecycle' within the HLRC to slm. * Fix one more import	2019-07-29 15:51:14 -06:00
Dimitris Athanasiou	aef419c0b0	[7.x][ML] Catch any error thrown while closing data frame analytics process (#44958 ) (#44968 ) In case closing the process throws an exception we should be catching it no matter its type. The process may have terminated because of a fatal error in which case closing the process will throw a server error, not an `IOException`. If this happens we fail to mark the persistent task as failed and the task gets in limbo.	2019-07-29 21:59:10 +03:00
Benjamin Trent	3b514f0dae	[ML] update Instant serialization (#44765 ) (#44954 ) * [ML] update Instant serialization * addressing PR comments * removing unused import	2019-07-29 13:06:56 -05:00
Dimitris Athanasiou	9dd527328a	[ML] Outlier detection should only fetch docs that have the analyzed … (#44944 ) (#44959 ) As data frame rows with missing values for analyzed fields are skipped, we can be more efficient by including a query that only picks documents that have values for all analyzed fields. Besides improving the number of documents we go through, we also provide a more accurate measurement of how many rows we need which reduces the memory requirements. This also adds an integration test that runs outlier detection on data with missing fields.	2019-07-29 18:23:56 +03:00
Luca Cavanna	a3cc32da64	TaskListener#onFailure to accept Exception instead of Throwable (#44946 ) TaskListener accepts today Throwable in its onFailure method. Though looking at where it is called (TransportAction), it can never be notified of a Throwable. This commit changes the signature of TaskListener#onFailure so that it accepts an `Exception` rather than a `Throwable` as second argument.	2019-07-29 16:47:19 +02:00
David Kyle	d05f12dadb	[ML] Close any opened pipes if there is an error connecting to the process (#44869 )	2019-07-29 10:48:31 +01:00
Martijn van Groningen	db49cb505e	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-07-29 14:45:10 +07:00
James Baiera	480af1ccf2	Fix build errors (#44933 ) Add EnrichPlugin to test cases that update cluster state	2019-07-29 14:17:44 +07:00
Gordon Brown	d4b2d21339	Add option to filter ILM explain response (#44777 ) In order to make it easier to interpret the output of the ILM Explain API, this commit adds two request parameters to that API: - `only_managed`, which causes the response to only contain indices which have `index.lifecycle.name` set - `only_errors`, which causes the response to contain only indices in an ILM error state "Error state" is defined as either being in the `ERROR` step or having `index.lifecycle.name` set to a policy that does not exist.	2019-07-26 11:57:38 -04:00
Przemysław Witek	79121ea127	[7.x] Implement exponential average search time per hour statistics. (#44683 ) (#44897 )	2019-07-26 15:56:34 +02:00
Jason Tedor	6ea2b5dec0	Deprecate setting processors to more than available (#44889 ) Today the processors setting is permitted to be set to more than the number of processors available to the JVM. The processors setting directly sizes the number of threads in the various thread pools, with most of these sizes being a linear function in the number of processors. It doesn't make any sense to set processors very high as the overhead from context switching amongst all the threads will overwhelm, and changing the setting does not control how many physical CPU resources there are on which to schedule the additional threads. We have to draw a line somewhere and this commit deprecates setting processors to more than the number of available processors. This is the right place to draw the line given the linear growth as a function of processors in most of the thread pools, and that some are capped at the number of available processors already.	2019-07-26 17:06:44 +09:00
Ioannis Kakavas	ac131f986b	Document xpack.security.authc.saml.realm for Kibana (#44705 ) Since 7.3, it's possible to explicitly configure the SAML realm to be used in Kibana's configuration. This in turn, eliminates the need of properly setting `xpack.security.public.*` settings in Kibana and largely simplifies relevant documentation. This also changes `xpack.security.authProviders` to `xpack.security.authc.providers` as the former was deprecated in favor of the latter in 7.3 in Kibana	2019-07-26 09:38:49 +03:00
Ignacio Vera	821f6f893b	Upgrade to Lucene 8.2.0 release (#44859 ) (#44892 )	2019-07-26 08:14:59 +02:00
James Baiera	fda4db4fab	fixup! Merge branch '7.x' into enrich-7.x	2019-07-25 15:28:40 -04:00
Przemysław Witek	8bb8543fdf	Treat PostDataActionResponse.DataCounts.bucketCount as incremental rather than absolute (total). (#44803 ) (#44856 )	2019-07-25 20:46:56 +02:00
Lisa Cawley	c9909b09b5	[DOCS] Adds command reference for elasticsearch-croneval (#43946 )	2019-07-25 11:41:05 -07:00
James Baiera	c5528a25e6	Merge branch '7.x' into enrich-7.x	2019-07-25 13:12:56 -04:00
Hendrik Muhs	2ca6306452	do not assert on indexer state (#44854 ) remove the unreliable check for the state change fixes #44813	2019-07-25 16:39:24 +02:00
David Roberts	b2e969f4ba	[ML-DataFrame] Remove ID field from data frame indexer stats (#44848 ) This is a followup to #44350. The indexer stats used to be persisted standalone, but now are only persisted as part of a state-and-stats document. During the review of #44350 it was decided that we'll stick with this design, so there will never be a need for an indexer stats object to store its transform ID as it is stored on the enclosing document. This PR removes the indexer stats document ID. Backport of #44768	2019-07-25 15:19:32 +01:00
Albert Zaharovits	af937b14ae	SecurityIndexManager handle RuntimeEx while reading mapping (#44409 ) Fixes exception handling while reading and parsing `.security-*` mappings and templates.	2019-07-25 16:52:21 +03:00
Przemysław Witek	53f409e5ae	Add result_type field to TimingStats and DatafeedTimingStats documents (#44812 ) (#44841 )	2019-07-25 10:11:55 +02:00
Yannick Welsch	e0d4544ef6	Close connection manager on current thread in RemoteClusterConnection (#44805 ) The problem is that RemoteClusterConnection closes the connection manager asynchronously, which races with the threadpool being shutdown at the end of the test. Closes #44339 Closes #44610	2019-07-25 09:34:41 +02:00
Tanguy Leroux	9944e193f9	[7.x] Clean up ShardFollowTasks for deleted indices (#44702 ) (#44790 ) Deleting a follower index does not delete its ShardFollowTasks, potentially leaving many persistent tasks in the cluster that cannot be allocated on nodes and unnecessary fill the logs. This commit adds a cluster state listener (ShardFollowTaskCleaner) that completes (with a failure) any persistent task that refers to a non existent follower index. I think that this bug has been introduced by #34404: before this change the task would have been completed as failed and removed from the cluster state. Backport of #44702 and #44801 on 7.x	2019-07-25 09:33:57 +02:00
Andrei Stefan	fd74b63602	SQL: fix URI path being lost in case of hosted ES scenario (#44776 ) (cherry picked from commit 06dea859e8fddada868941aaae15e83b4f64babe)	2019-07-25 10:27:51 +03:00
Andrei Stefan	ee53f7e161	SQL: [Tests] Re-enable testDriverConfigurationWithSSLInURL test with more logging (#44800 ) (cherry picked from commit 5b9ccd72e9a3bb65c8b7b06979a75cb795c17111)	2019-07-25 10:10:45 +03:00
Andrei Stefan	2633d11eb7	Switch from using docvalue_fields to extracting values from _source (#44062 ) (#44804 ) * Switch from using docvalue_fields to extracting values from _source where applicable. Doing this means parsing the _source and handling the numbers parsing just like Elasticsearch is doing it when it's indexing a document. * This also introduces a minor limitation: aliases type of fields that are NOT part of a tree of sub-fields will not be able to be retrieved anymore. field_caps API doesn't shed any light into a field being an alias or not and at _source parsing time there is no way to know if a root field is an alias or not. Fields of the type "a.b.c.alias" can be extracted from docvalue_fields, only if the field they point to can be extracted from docvalue_fields. Also, not all fields in a hierarchy of fields can be evaluated to being an alias. (cherry picked from commit 8bf8a055e38f00df5f49c8d97f632f69d6e00c2c)	2019-07-25 10:02:41 +03:00
Gordon Brown	2ac54da60a	Fix swapped variables in error message (#44300 ) The alias name and index were in the incorrect order in this error message. This commit corrects the order.	2019-07-24 11:38:15 -04:00
Andrei Stefan	04cb3aebd5	Use hasValue() methods from aggregations' InspectionHelpers (#44745 ) Use InspectionHelper classes to decide if the aggregations should return null (in case there is no value) or the value itself. (cherry picked from commit dafd7b039b0da072750e8f57e7572d24f7aad44a)	2019-07-24 16:14:45 +03:00
James Rodewig	ad7c164dd0	[DOCS] Rewrite `regexp` query (#42711 )	2019-07-24 08:38:41 -04:00
Ioannis Kakavas	bfb2e323e9	mute test (#44809 ) see #44808	2019-07-24 15:00:50 +03:00
Przemysław Witek	26da573e94	[ML] [7.x] Only emit deprecation warning if there was actual change of a datafeed's job_id. (#44755 ) * Only emit deprecation warning if there was actual change of a datafeed's job_id. * Add @Deprecated annotation to DatafeedUpdate.Builder#setJobId method	2019-07-24 10:03:25 +02:00
James Baiera	c357f81aa7	Add soft limit for max concurrent policy executions (#43117 ) Adds a global soft limit on the number of concurrently executing enrich policies. Since an enrich policy is run on the generic thread pool, this is meant to limit policy runs separately from the generic thread pool capacity.	2019-07-23 16:03:14 -04:00
David Roberts	caf9411a72	[ML] Improve response format of data frame stats endpoint (#44743 ) This change adjusts the data frame transforms stats endpoint to return a structure that is easier to understand. This is a breaking change for clients of the data frame transforms stats endpoint, but the feature is in beta so stability is not guaranteed. Backport of #44350	2019-07-23 18:00:50 +01:00
Benjamin Trent	6f53865fde	[ML][Data Frame] Fixes failure state tests and failure setting handling (#44645 ) (#44698 ) * [ML][Data Frame] fixing flaky test * adjusting frequency * fixing tests * addressing PR comments	2019-07-23 08:33:12 -05:00
Przemysław Witek	16c8e18013	Deprecate the ability to update datafeed's job_id. (#44691 ) (#44742 )	2019-07-23 14:48:56 +02:00
Jason Tedor	e2c8f8dfa3	Rename ILM package to ilm (#44725 ) This commit renames the ILM package from indexlifecycle to ilm. We have all come to know index lifecycle management as ILM, the APIs and settings use ilm, and it would be nice of the package did too. This commit makes that change.	2019-07-23 16:46:38 +09:00
Jason Tedor	5878bde8dc	Rename SLM package to slm (#44608 ) This commit renames the SLM package from snapshotlifecycle to slm. We have all come to know index lifecycle management as ILM, the APIs and settings use ilm, and it would be nice of the package did too. For SLM, let's use slm for all of these including the package name from the beginning.	2019-07-23 07:35:06 +09:00
Benjamin Trent	4456850a8e	[7.x] [ML][Data Frame] Add optional defer_validation param to PUT (#44455 ) (#44697 ) * [ML][Data Frame] Add optional defer_validation param to PUT (#44455) * [ML][Data Frame] Add optional defer_validation param to PUT * addressing PR comments * reverting bad replace * addressing pr comments * Update put-transform.asciidoc * Update put-transform.asciidoc * Update put-transform.asciidoc * adjusting for backport * fixing imports * [DOCS] Fixes formatting in create data frame transform API	2019-07-22 15:12:55 -05:00
James Baiera	fc20264b99	Add Enrich index background task to cleanup old indices (#43746 ) This PR adds a background maintenance task that is scheduled on the master node only. The deletion of an index is based on if it is not linked to a policy or if the enrich alias is not currently pointing at it. Synchronization has been added to make sure that no policy executions are running at the time of cleanup, and if any executions do occur, the marking process delays cleanup until next run.	2019-07-22 14:41:22 -04:00
Benjamin Trent	06e21f7902	[7.x] [ML][Data Frame] adding force delete (#44590 ) (#44696 ) * [ML][Data Frame] adding force delete (#44590) * [ML][Data Frame] adding force delete * Update delete-transform.asciidoc * adjusting for backport	2019-07-22 13:13:25 -05:00
Ioannis Kakavas	3714cb63da	Allow parsing the value of java.version sysprop (#44017 ) We often start testing with early access versions of new Java versions and this have caused minor issues in our tests (i.e. #43141) because the version string that the JVM reports cannot be parsed as it ends with the string -ea. This commit changes how we parse and compare Java versions to allow correct parsing and comparison of the output of java.version system property that might include an additional alphanumeric part after the version numbers (see [JEP 223[(https://openjdk.java.net/jeps/223)). In short it handles a version number part, like before, but additionally a PRE part that matches ([a-zA-Z0-9]+). It also changes a number of tests that would attempt to parse java.specification.version in order to get the full version of Java. java.specification.version only contains the major version and is thus inappropriate when trying to compare against a version that might contain a minor, patch or an early access part. We know parse java.version that can be consistently parsed. Resolves #43141	2019-07-22 20:14:56 +03:00
Hendrik Muhs	4387d81e5b	add a test to check excecution flow (#44481 ) add a test for the execution flow of a2p indexer	2019-07-22 17:07:11 +02:00
Benjamin Trent	a948362d0a	[7.x] [ML][Data Frame] deregister scheduler on transform failure (#44569 ) (#44576 ) * [ML][Data Frame] deregister scheduler on transform failure (#44569) * fixing test * Update DataFrameRestTestCase.java * Update DataFrameTaskFailedStateIT.java * Update DataFramePivotRestIT.java	2019-07-22 09:06:48 -05:00
David Roberts	6d27eec30f	[ML-DataFrame] Use lenient expand open in data frame searches (#44633 ) Since #44344 we use IndicesOptions.LENIENT_EXPAND_OPEN when deciding which indices to include in checkpoint calculation. This change uses the same option when deciding which indices to search for data and which indices to get mappings from, otherwise there is a potential mismatch between the checkpoint details and what is searched elsewhere.	2019-07-22 11:30:33 +01:00
Alpar Torok	b34ac66d96	Mute multiple tests on Windows (7.x) (#44676 ) * Mute failing test tracked in #44552 * mute EvilSecurityTests tracking in #44558 * Fix line endings in ESJsonLayoutTests * Mute failing ForecastIT test on windows Tracking in #44609 * mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613 * fix mute testDefaultRenormalization * Increase busyWait timeout windows is slow * Mute failure unconfigured node name * mute x-pack internal cluster test windows tracking #44610 * Mute JvmErgonomicsTests on windows Tracking #44669 * mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671 * Mute NodeTests on Windows Tracking #44256	2019-07-22 11:32:29 +03:00
Tal Levy	7c84636029	Remove StreamOutput #writeOptionalStreamable and #writeStreamableList (#44602 ) (#44643 ) remove usages of writeOptionalStreamable and writeStreambaleList relates #34389.	2019-07-19 15:55:53 -07:00
Benjamin Trent	2e303fc5f7	[ML][Data Frame] adding dynamic cluster setting for failure retries (#44577 ) (#44639 ) This adds a new dynamic cluster setting `xpack.data_frame.num_transform_failure_retries`. This setting indicates how many times non-critical failures should be retried before a data frame transform is marked as failed and should stop executing. At the time of this commit; Min: 0, Max: 100, Default: 10	2019-07-19 16:17:39 -05:00
Ryan Ernst	f193d14764	Convert remaining Action Response/Request to writeable.reader (#44528 ) (#44607 ) This commit converts readFrom to ctor with StreamInput on the remaining ActionResponse and ActionRequest classes. relates #34389	2019-07-19 13:33:38 -07:00
Christoph Büscher	eafe54c81c	Fix AnalysisMode propagation in NamedAnalyzer (#44626 ) NamedAnalyzer should return the same AnalysisMode than any custom analyzer it wraps, otherwise AnalysisMode.ALL. This used to be only CustomAnalyzer in the past, but with the introduction of the ReloadableCustomAnalyzer this needs to be added as an option where the analysis mode gets propagated. Closes #44625	2019-07-19 18:18:43 +02:00
James Rodewig	d46545f729	[DOCS] Update anchors and links for Elasticsearch API relocation (#44500 )	2019-07-19 09:18:23 -04:00
Ryan Ernst	60785a9fa8	Convert several direct uses of Streamable to Writeable (#44586 ) (#44604 ) This commit converts several utility classes that implement Streamable to have StreamInput constructors. It also adds a default version of readFrom to Streamable so that overriding to throw UOE is not necessary. relates #34389	2019-07-18 21:25:44 -07:00
Ryan Ernst	13f46aa801	Convert index and persistent actions/response to writeable (#44582 ) (#44601 ) This commit converts several more classes from streamable to writeable in server, mostly within the o.e.index and o.e.persistent packages. relates #34389	2019-07-18 18:32:09 -07:00
Tal Levy	03f5084ac7	remove usages of #readOptionalStreamable, #readStreamableList. (#44578 ) (#44598 ) This commit removes references to Streamable from StreamInput. This is all a part of the effort to remove Streamable usage. relates #34389.	2019-07-18 16:19:02 -07:00
Lee Hinman	3001f7941f	Allow empty configuration for SLM policies (#44465 ) * Allow empty configuration for SLM policies When putting or updating a snapshot lifecycle policy it was not possible to elide the `config` map. This commit makes the configuration optional, the same way that it is when taking a snapshot. Relates to #38461 * Add Objects.requireNonNull for required parts of the policy	2019-07-18 16:20:31 -06:00
Lee Hinman	fe2ef66e45	Expose index age in ILM explain output (#44457 ) * Expose index age in ILM explain output This adds the index's age to the ILM explain output, for example: ``` { "indices" : { "ilm-000001" : { "index" : "ilm-000001", "managed" : true, "policy" : "full-lifecycle", "lifecycle_date" : "2019-07-16T19:48:22.294Z", "lifecycle_date_millis" : 1563306502294, "age" : "1.34m", "phase" : "hot", "phase_time" : "2019-07-16T19:48:22.487Z", ... etc ... } } } ``` This age can be used to tell when ILM will transition the index to the next phase, based on that phase's `min_age`. Resolves #38988 * Expose age in getters and in HLRC	2019-07-18 15:33:45 -06:00
Benjamin Trent	3477f5ae04	muting test testBulkIndexFailuresCauseTaskToFail (#44594 )	2019-07-18 15:03:50 -05:00
Benjamin Trent	d5ca72740e	[ML][Data Frame] adjust onFinish audit frequency (#44450 ) (#44508 )	2019-07-18 14:28:34 -05:00
Tal Levy	c8a8915b27	migrate rollup/monitoring/graph/watcher actions to Writeable (#44464 ) (#44538 ) this commit migrates leftover actions from a few x-pack plugins to the new Writeable.Reader infrastructure. relates #34389.	2019-07-18 08:42:56 -07:00
Andrey Ershov	ef6ddd15c6	Revert "Snapshot tool: S3 orphaned files cleanup (#44551)" This reverts commit `09edeeb3`	2019-07-18 17:21:45 +02:00
Nhat Nguyen	47cfc25fa0	Skip update if leader and follower settings identical (#44535 ) If the setting on the follower and the leader are identical after filtering out private and internal settings, then we should not call update setting (on the follower) as there's nothing to change. Moreover, this makes the ShardFollowTask abort as it considers ActionRequestValidationException (caused by an empty update setting request) as a fatal error. Closes #44521	2019-07-18 11:08:45 -04:00
Andrey Ershov	09edeeb38e	Snapshot tool: S3 orphaned files cleanup (#44551 ) A tool to work with snapshots. Co-authored by @original-brownbear. This commit adds snapshot tool and the single command cleanup, that cleans up orphaned files for S3. Snapshot tool lives in x-pack/snapshot-tool. (cherry picked from commit fc4aed44dd975d83229561090f957a95cc76b287)	2019-07-18 16:38:00 +02:00
Christoph Büscher	e9f257b4d9	Fix ReloadDetailsTests compilation	2019-07-18 11:55:22 +02:00
Christoph Büscher	eed2db8947	Add stream serialization to ReloadAnalyzersResponse (#44420 ) This change adds Writeable support to ReloadAnalyzersResponse that is required when using the transport client in 7.x. Closes #44383	2019-07-18 11:11:54 +02:00
David Kyle	0fc091f166	Enable XLint warnings for ML (#44346 ) Removes the warning suppression -Xlint:-deprecation,-rawtypes,-serial,-try,-unchecked. Many warnings were unchecked warnings in the test code often because of the use of mocks. These are suppressed with @SuppressWarning	2019-07-18 09:33:37 +01:00
Ryan Ernst	edd26339c5	Convert remaining request classes in xpack core to writeable.reader (#44524 ) (#44534 ) This commit converts all remaining classes extending ActionRequest in xpack core to have a StreamInput constructor. relates #34389	2019-07-18 01:11:45 -07:00
Tal Levy	a5ad59451c	migrate more ML actions off of using Request suppliers (#44462 ) (#44529 ) many classes still use the Streamable constructors of HandledTransportAction, this commit moves more of those classes to the new Writeable constructors. relates #34389.	2019-07-17 20:28:29 -07:00
Tal Levy	075a3f0e99	remove usage of ActionType#(String) (#44459 ) (#44526 ) this commit removes usage of the deprecated constructor with a single argument and no Writeable.Reader. The purpose of this is to reduce the boilerplate necessary for properly implementing a new action, as well as reducing the chances of using the incorrect super constructor while classes are being migrated to Writeable relates #34389.	2019-07-17 20:28:11 -07:00
Ryan Ernst	2a2686e6e7	Convert remaining ActionTypes to writeable in xpack core (#44467 ) (#44525 ) This commit converts all remaining ActionType response classes to writeable in xpack core. It also converts a few from server which were used by xpack core. relates #34389	2019-07-17 18:01:45 -07:00
Ryan Ernst	17c4b2b839	Convert MasterNodeRequest to implement Writeable.Reader (#44452 ) (#44513 ) This commit converts all MasterNodeRequest subclasses to fullfill Writeable.Reader constructors. relates #34389	2019-07-17 18:01:29 -07:00
Jason Tedor	39c5f98de7	Introduce test issue logging (#44477 ) Today we have an annotation for controlling logging levels in tests. This annotation serves two purposes, one is to control the logging level used in tests, when such control is needed to impact and assert the behavior of loggers in tests. The other use is when a test is failing and additional logging is needed. This commit separates these two concerns into separate annotations. The primary motivation for this is that we have a history of leaving behind the annotation for the purpose of investigating test failures long after the test failure is resolved. The accumulation of these stale logging annotations has led to excessive disk consumption. Having recently cleaned this up, we would like to avoid falling into this state again. To do this, we are adding a link to the test failure under investigation to the annotation when used for the purpose of investigating test failures. We will add tooling to inspect these annotations, in the same way that we have tooling on awaits fix annotations. This will enable us to report on the use of these annotations, and report when stale uses of the annotation exist.	2019-07-18 05:33:33 +09:00
Lisa Cawley	621ec7cb84	[DOCS] Removes out-dated info from Watcher limitations (#44252 )	2019-07-17 13:24:02 -07:00
Ryan Ernst	0755a13c9f	Convert AcknowledgedRequest to Writeable.Reader (#44412 ) (#44454 ) This commit adds constructors to AcknolwedgedRequest subclasses to implement Writeable.Reader, and ensures all future subclasses implement the same. relates #34389	2019-07-17 11:17:36 -07:00
Yannick Welsch	d98b3e4760	Move frozen indices to x-pack module (#44490 ) Backport of #44408 and #44286.	2019-07-17 16:53:10 +02:00
Ignacio Vera	eb348d2593	Upgrade to lucene-8.2.0-snapshot-6413aae226 (#44480 )	2019-07-17 13:28:28 +02:00
Ryan Ernst	6e50bafa8f	Convert Broadcast request and response to use writeable.reader (#44386 ) (#44453 ) This commit converts the request and response classes for broadcast actions to implement ctors for Writeable.Reader and forces all future implementations to implement the same. relates #34389	2019-07-16 23:24:02 -07:00
Tim Brooks	0a352486e8	Isolate nio channel registered from channel active (#44388 ) Registering a channel with a selector is a required operation for the channel to be handled properly. Currently, we mix the registeration with other setup operations (ip filtering, SSL initiation, etc). However, a fail to register is fatal. This PR modifies how registeration occurs to immediately close the channel if it fails. There are still two clear loopholes for how a user can interact with a channel even if registration fails. 1. through the exception handler. 2. through the channel accepted callback. These can perhaps be improved in the future. For now, this PR prevents writes from proceeding if the channel is not registered.	2019-07-16 17:18:57 -06:00
Jason Tedor	100cb89f3e	Avoid stack overflow in auto-follow coordinator (#44421 ) This commit avoids a situation where we might stack overflow in the auto-follower coordinator. In the face of repeated failures to get the remote cluster state, we would previously be called back on the same thread and then recurse to try again. If this failure persists, the repeated callbacks on the same thread would lead to a stack overflow. The repeated failures can occur, for example, if the connect queue is full when we attempt to make a connection to the remote cluster. This commit avoids this by truncating the call stack if we are called back on the same thread as the initial request was made on.	2019-07-17 07:39:11 +09:00
Jason Tedor	becbf450fa	Avoid NPE when checking for CCR index privileges (#44397 ) This commit avoids an NPE when checking for privileges to follow indices. The problem here is that in some cases we might not be able to read the authentication info from the thread context. In that case, a null user would be returned and we were not guarding against this.	2019-07-17 06:27:55 +09:00
Ryan Ernst	c26edb4c43	Ensure replication response/requests implement writeable (#44392 ) (#44446 ) This commit cleans up replication response and request so that the base class does not allow subclasses to implement Streamable. relates #34389	2019-07-16 12:53:08 -07:00
Tal Levy	901310a826	[7.x] Migrate ML Actions to use writeable ActionType (#44302 ) (#44391 ) * Migrate ML Actions to use writeable ActionType (#44302) This commit converts all the StreamableResponseActionType actions in the ML core module to be ActionType and leverage the Writeable infrastructure.	2019-07-16 12:41:10 -07:00
Przemysław Witek	9613700a63	[7.x] Implement MlConfigIndexMappingsFullClusterRestartIT test which verifies that .ml-config index mappings are properly updated during cluster upgrade (#44341 ) (#44366 )	2019-07-16 21:22:40 +02:00
James Rodewig	ac07eef86c	[DOCS] Remove :edit_url: overrides. (#44445 ) These overrides do not work in Asciidoctor and are no longer needed.	2019-07-16 15:04:44 -04:00
James Baiera	a3ba11cfc1	Improve CryptoService error message on missing secure file (#43623 ) (#44364 ) This improves the error message when encrypting of sensitive watcher data is configured, but no system file was specified in the keystore. This error message is displayed on startup. This also closes the input stream of the secure file properly. Closes #43619	2019-07-16 13:58:57 -04:00
Benjamin Trent	2c7ff812da	[ML] Add r_squared eval metric to regression (#44248 ) (#44378 ) * [ML] Add r_squared eval metric to regression * fixing tests and binarysoftclassification class * Update RSquared.java * Update x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/ml/dataframe/evaluation/regression/RSquared.java Co-Authored-By: David Kyle <david.kyle@elastic.co> * removing unnecessary debug test	2019-07-16 11:11:31 -05:00
Benjamin Trent	858dbfc074	[ML][Data Frame] treat bulk index failures as an indexing failure (#44351 ) (#44427 ) * [ML][Data Frame] treat bulk index failures as an indexing failure * removing redundant public modifier * changing to an ElasticsearchException * fixing redundant public modifier	2019-07-16 10:04:28 -05:00
Przemysław Witek	34bf6bcec0	Treat big changes in searchCount as significant and persist the document after such changes (#44413 ) (#44424 )	2019-07-16 16:15:32 +02:00
Jake Landis	eb7d43f4cf	Log write failures for watcher history document. (#44129 ) (#44357 ) The failure is correctly getting propagated, this commit adds support to explicitly look for .watch-history failures using the same logging strategy as triggered watch failures.	2019-07-16 08:48:09 -05:00
Lee Hinman	fb0461ac76	[7.x] Add Snapshot Lifecycle Management (#44382 ) * Add Snapshot Lifecycle Management (#43934) * Add SnapshotLifecycleService and related CRUD APIs This commit adds `SnapshotLifecycleService` as a new service under the ilm plugin. This service handles snapshot lifecycle policies by scheduling based on the policies defined schedule. This also includes the get, put, and delete APIs for these policies Relates to #38461 * Make scheduledJobIds return an immutable set * Use Object.equals for SnapshotLifecyclePolicy * Remove unneeded TODO * Implement ToXContentFragment on SnapshotLifecyclePolicyItem * Copy contents of the scheduledJobIds * Handle snapshot lifecycle policy updates and deletions (#40062) (Note this is a PR against the `snapshot-lifecycle-management` feature branch) This adds logic to `SnapshotLifecycleService` to handle updates and deletes for snapshot policies. Policies with incremented versions have the old policy cancelled and the new one scheduled. Deleted policies have their schedules cancelled when they are no longer present in the cluster state metadata. Relates to #38461 * Take a snapshot for the policy when the SLM policy is triggered (#40383) (This is a PR for the `snapshot-lifecycle-management` branch) This commit fills in `SnapshotLifecycleTask` to actually perform the snapshotting when the policy is triggered. Currently there is no handling of the results (other than logging) as that will be added in subsequent work. This also adds unit tests and an integration test that schedules a policy and ensures that a snapshot is correctly taken. Relates to #38461 * Record most recent snapshot policy success/failure (#40619) Keeping a record of the results of the successes and failures will aid troubleshooting of policies and make users more confident that their snapshots are being taken as expected. This is the first step toward writing history in a more permanent fashion. * Validate snapshot lifecycle policies (#40654) (This is a PR against the `snapshot-lifecycle-management` branch) With the commit, we now validate the content of snapshot lifecycle policies when the policy is being created or updated. This checks for the validity of the id, name, schedule, and repository. Additionally, cluster state is checked to ensure that the repository exists prior to the lifecycle being added to the cluster state. Part of #38461 * Hook SLM into ILM's start and stop APIs (#40871) (This pull request is for the `snapshot-lifecycle-management` branch) This change allows the existing `/_ilm/stop` and `/_ilm/start` APIs to also manage snapshot lifecycle scheduling. When ILM is stopped all scheduled jobs are cancelled. Relates to #38461 * Add tests for SnapshotLifecyclePolicyItem (#40912) Adds serialization tests for SnapshotLifecyclePolicyItem. * Fix improper import in build.gradle after master merge * Add human readable version of modified date for snapshot lifecycle policy (#41035) * Add human readable version of modified date for snapshot lifecycle policy This small change changes it from: ``` ... "modified_date": 1554843903242, ... ``` To ``` ... "modified_date" : "2019-04-09T21:05:03.242Z", "modified_date_millis" : 1554843903242, ... ``` Including the `"modified_date"` field when the `?human` field is used. Relates to #38461 * Fix test * Add API to execute SLM policy on demand (#41038) This commit adds the ability to perform a snapshot on demand for a policy. This can be useful to take a snapshot immediately prior to performing some sort of maintenance. ```json PUT /_ilm/snapshot/<policy>/_execute ``` And it returns the response with the generated snapshot name: ```json { "snapshot_name" : "production-snap-2019.04.09-rfyv3j9qreixkdbnfuw0ug" } ``` Note that this does not allow waiting for the snapshot, and the snapshot could still fail. It does record this information into the cluster state similar to a regularly trigged SLM job. Relates to #38461 * Add next_execution to SLM policy metadata (#41221) * Add next_execution to SLM policy metadata This adds the next time a snapshot lifecycle policy will be executed when retriving a policy's metadata, for example: ```json GET /_ilm/snapshot?human { "production" : { "version" : 1, "modified_date" : "2019-04-15T21:16:21.865Z", "modified_date_millis" : 1555362981865, "policy" : { "name" : "<production-snap-{now/d}>", "schedule" : "/30 * * * ?", "repository" : "repo", "config" : { "indices" : [ "foo-", "important" ], "ignore_unavailable" : true, "include_global_state" : false } }, "next_execution" : "2019-04-15T21:16:30.000Z", "next_execution_millis" : 1555362990000 }, "other" : { "version" : 1, "modified_date" : "2019-04-15T21:12:19.959Z", "modified_date_millis" : 1555362739959, "policy" : { "name" : "<other-snap-{now/d}>", "schedule" : "0 30 2 * ?", "repository" : "repo", "config" : { "indices" : [ "other" ], "ignore_unavailable" : false, "include_global_state" : true } }, "next_execution" : "2019-04-16T02:30:00.000Z", "next_execution_millis" : 1555381800000 } } ``` Relates to #38461 * Fix and enhance tests * Figured out how to Cron * Change SLM endpoint from /_ilm/* to /_slm/* (#41320) This commit changes the endpoint for snapshot lifecycle management from: ``` GET /_ilm/snapshot/<policy> ``` to: ``` GET /_slm/policy/<policy> ``` It mimics the ILM path only using `slm` instead of `ilm`. Relates to #38461 * Add initial documentation for SLM (#41510) * Add initial documentation for SLM This adds the initial documentation for snapshot lifecycle management. It also includes the REST spec API json files since they're sort of documentation. Relates to #38461 * Add `manage_slm` and `read_slm` roles (#41607) * Add `manage_slm` and `read_slm` roles This adds two more built in roles - `manage_slm` which has permission to perform any of the SLM actions, as well as stopping, starting, and retrieving the operation status of ILM. `read_slm` which has permission to retrieve snapshot lifecycle policies as well as retrieving the operation status of ILM. Relates to #38461 * Add execute to the test * Fix ilm -> slm typo in test * Record SLM history into an index (#41707) It is useful to have a record of the actions that Snapshot Lifecycle Management takes, especially for the purposes of alerting when a snapshot fails or has not been taken successfully for a certain amount of time. This adds the infrastructure to record SLM actions into an index that can be queried at leisure, along with a lifecycle policy so that this history does not grow without bound. Additionally, SLM automatically setting up an index + lifecycle policy leads to `index_lifecycle` custom metadata in the cluster state, which some of the ML tests don't know how to deal with due to setting up custom `NamedXContentRegistry`s. Watcher would cause the same problem, but it is already disabled (for the same reason). * High Level Rest Client support for SLM (#41767) * High Level Rest Client support for SLM This commit add HLRC support for SLM. Relates to #38461 * Fill out documentation tests with tags * Add more callouts and asciidoc for HLRC * Update javadoc links to real locations * Add security test testing SLM cluster privileges (#42678) * Add security test testing SLM cluster privileges This adds a test to `PermissionsIT` that uses the `manage_slm` and `read_slm` cluster privileges. Relates to #38461 * Don't redefine vars * Add Getting Started Guide for SLM (#42878) This commit adds a basic Getting Started Guide for SLM. * Include SLM policy name in Snapshot metadata (#43132) Keep track of which SLM policy in the metadata field of the Snapshots taken by SLM. This allows users to more easily understand where the snapshot came from, and will enable future SLM features such as retention policies. * Fix compilation after master merge * [TEST] Move exception wrapping for devious exception throwing Fixes an issue where an exception was created from one line and thrown in another. * Fix SLM for the change to AcknowledgedResponse * Add Snapshot Lifecycle Management Package Docs (#43535) * Fix compilation for transport actions now that task is required * Add a note mentioning the privileges needed for SLM (#43708) * Add a note mentioning the privileges needed for SLM This adds a note to the top of the "getting started with SLM" documentation mentioning that there are two built-in privileges to assist with creating roles for SLM users and administrators. Relates to #38461 * Mention that you can create snapshots for indices you can't read * Fix REST tests for new number of cluster privileges * Mute testThatNonExistingTemplatesAreAddedImmediately (#43951) * Fix SnapshotHistoryStoreTests after merge * Remove overridden newResponse functions that have been removed * Fix compilation for backport * Fix get snapshot output parsing in test * [DOCS] Add redirects for removed autogen anchors (#44380) * Switch <tt>...</tt> in javadocs for {@code ...}	2019-07-16 07:37:13 -06:00
Hendrik Muhs	6c1f740759	[ML-DataFrame] make checkpointing more robust (#44344 ) (#44414 ) make checkpointing more robust: - do not let checkpointing fail if indexes got deleted - treat missing seqNoStats as just created indices (checkpoint 0) - loglevel: do not treat failed updated checks as error fixes #43992	2019-07-16 13:43:13 +02:00
Przemysław Witek	3f3a3d3f2b	[7.x] Add DatafeedTimingStats.average_search_time_per_bucket_ms and TimingStats.total_bucket_processing_time_ms stats (#44125 ) (#44404 )	2019-07-16 12:51:29 +02:00
Ryan Ernst	c4cf98c538	Convert core security actions to use writeable ActionType (#44359 ) (#44390 ) This commit converts all the StreamableResponseActionType security classes in xpack core to ActionType, implementing Writeable for their response classes. relates #34389	2019-07-16 01:11:13 -07:00
Jason Tedor	be98a12cd0	Do not swallow I/O exception getting authentication (#44398 ) When getting authentication info from the thread context, it might be that we encounter an I/O exception. Today we swallow this exception and return a null authentication info to the caller. Yet, this could be hiding bugs or errors. This commits adjusts this behavior so that we no longer swallow the exception.	2019-07-16 16:14:15 +09:00
Ryan Ernst	e0b82e92f3	Convert BaseNode(s) Request/Response classes to Writeable (#44301 ) (#44358 ) This commit converts all BaseNodeResponse and BaseNodesResponse subclasses to implement Writeable.Reader instead of Streamable. relates #34389	2019-07-15 18:07:52 -07:00
Ryan Ernst	7e06888bae	Convert testclusters to use distro download plugin (#44253 ) (#44362 ) Test clusters currently has its own set of logic for dealing with finding different versions of Elasticsearch, downloading them, and extracting them. This commit converts testclusters to use the DistributionDownloadPlugin.	2019-07-15 17:53:05 -07:00
Lisa Cawley	753da8feac	[DOCS] Updates terminology for alerting features (#43945 )	2019-07-15 14:47:33 -07:00
Yannick Welsch	a848fc9bf4	Revert "Add usage stats for frozen indices (#44286 )" This reverts commit `5e73c49ec8`.	2019-07-15 21:41:25 +02:00
Yannick Welsch	7b68bfb4e6	Revert "Add frozen indices usage for all but transport client (#44286 )" This reverts commit `d2d40afc02`.	2019-07-15 21:41:21 +02:00
Yannick Welsch	d2d40afc02	Add frozen indices usage for all but transport client (#44286 ) Backport gone wrong.	2019-07-15 20:49:23 +02:00
Ryan Ernst	59658daef9	Separate streamable based master node actions (#44313 ) This commit creates new base classes for master node actions whose response types still implement Streamable. This simplifies both finding remaining classes to convert, as well as creating new master node actions that use Writeable for their responses. relates #34389	2019-07-15 09:20:20 -07:00
Yannick Welsch	5e73c49ec8	Add usage stats for frozen indices (#44286 ) Adds usage stats for frozen indices of the form: "frozen_indices" : { "available" : true, "enabled" : true, "indices_count" : 0 }	2019-07-15 17:34:46 +02:00
Armin Braun	d73e2f9c56	HLRC: Fix '+' Not Correctly Encoded in GET Req. (#33164 ) (#44324 ) * HLRC: Fix '+' Not Correctly Encoded in GET Req. * Encode `+` correctly as `%2B` in URL paths * Keep encoding `+` as space in URL parameters * Closes #33077	2019-07-15 10:21:54 +02:00
Ryan Ernst	fc6a31e141	Use specific version constant for wire bwc check (#44316 ) This commit modifies bwc behavior in FindFileStructureAction to check against a concrete version instead of Version.CURRENT. Checking against Version.CURRENT does not work since it is changing, in addition to it having different meanings on each branch. relates #42501	2019-07-14 19:05:14 -07:00
Hendrik Muhs	684b562381	[7.x][ML-DataFrame] Rewrite continuous logic to prevent terms count limit (#44287 ) Rewrites how continuous data frame transforms calculates and handles buckets that require an update. Instead of storing the whole set in memory, it pages through the updates using a 2nd cursor. This lowers memory consumption and prevents problems with limits at query time (max_terms_count). The list of updates can be re-retrieved in a failure case (#43662)	2019-07-13 06:58:04 +02:00
Ryan Ernst	1dcf53465c	Reorder HandledTransportAction ctor args (#44291 ) This commit moves the Supplier variant of HandledTransportAction to have a different ordering than the Writeable.Reader variant. The Supplier version is used for the legacy Streamable, and currently having the location of the Writeable.Reader vs Supplier in the same place forces using casts of Writeable.Reader to select the correct super constructor. This change in ordering allows easier migration to Writeable.Reader. relates #34389	2019-07-12 13:45:09 -07:00
Benjamin Trent	51ff6b420a	[ML][Data Frame] prevent task from attempting to run when failed (#44239 ) (#44292 )	2019-07-12 15:24:49 -05:00
Benjamin Trent	79c62fd724	[ML][Data Frame] Fixing default delay set in timesync (#44281 ) (#44293 ) * [ML][Data Frame] Fixing default delay set in timesync * disallowing explicit null, don't do duration check on write	2019-07-12 15:21:47 -05:00
James Baiera	7ad9beb087	Set auto expand replicas on enrich index after force merge is done. (#43600 )	2019-07-12 11:56:56 -04:00
Lisa Cawley	aa6b544fac	[DOCS] Moves Watcher troubleshooting page (#44250 )	2019-07-12 08:28:18 -07:00
Benjamin Trent	68cd675892	[ML][Data Frame] responding with 409 status code when failing _stop (#44231 ) (#44276 ) * [ML][Data Frame] responding with appropriate status code when failing _stop * adding null checks for persistent task data * addressing PR comments	2019-07-12 10:10:24 -05:00
Przemysław Witek	dd5f4ae00e	Update .ml-config mappings before indexing job, datafeed or df analytics config (#44216 ) (#44273 )	2019-07-12 16:49:48 +02:00
Przemysław Witek	40d3c60d7a	Make testDatafeedTimingStats_DatafeedJobIdUpdated test easier to debug (#44206 ) (#44268 )	2019-07-12 13:52:26 +02:00
Ioannis Kakavas	475752be75	Make plugin verification FIPS 140 compliant (#44266 ) This change makes the process of verifying the signature of official plugins FIPS 140 compliant by defaulting to use the BouncyCastle FIPS provider and adding a dependency to bcpg-fips that implement parts of openPGP in a FIPS compliant manner. In already FIPS 140 enabled environments that use the BouncyCastle FIPS provider, the bcfips dependency is redundant but doesn't cause an issue as it will be added only in the classpath of the cli-tools This is a backport of #44224	2019-07-12 14:34:15 +03:00
Albert Zaharovits	e490ecb7d3	Fix X509AuthenticationToken principal (#43932 ) Fixes a bug in the PKI authentication. This manifests when there are multiple PKI realms configured in the chain, with different principal parse patterns. There are a few configuration scenarios where one PKI realm might parse the principal from the Subject DN (according to the `username_pattern` realm setting) but another one might do the truststore validation (according to the truststore.* realm settings). This is caused by the two passes through the realm chain, first to build the authentication token and secondly to authenticate it, and that the X509AuthenticationToken sets the principal during construction.	2019-07-12 11:04:50 +03:00
Mark Vieira	263f76e5ea	Revert "[DOCS] Moves Watcher troubleshooting page (#44144 )" This reverts commit `11375926ec`.	2019-07-11 17:13:08 -07:00
Lisa Cawley	11375926ec	[DOCS] Moves Watcher troubleshooting page (#44144 )	2019-07-11 14:42:32 -07:00
Mayya Sharipova	32cb47b91c	Add l1norm and l2norm distances for vectors (#44116 ) Add L1norm - Manhattan distance Add L2norm - Euclidean distance relates to #37947	2019-07-11 14:30:02 -04:00
Michael Basnight	b4b2ad3593	Ensure enrich policy is immutable (#43604 ) This commit ensures the policy cannot be overwritten. An error is thrown if the policy exists. All tests have been updated accordingly.	2019-07-11 13:23:12 -05:00
Benjamin Trent	40cc081ad3	[ML][Data Frame] adds index validations to _start data frame transform (#44191 ) (#44227 ) * [ML][Data Frame] adds index validations to _start data frame transform * addressing pr comments	2019-07-11 12:50:50 -05:00
Andrei Stefan	e9f9f00940	SQL: add pretty printing to JSON format (#43756 ) (#44220 ) (cherry picked from commit cbd9d4c259bf5a541bc49f65f7973174a36df449)	2019-07-11 20:02:24 +03:00
Benjamin Trent	c82d9c5b50	[ML] Adds support for regression.mean_squared_error to eval API (#44140 ) (#44218 ) * [ML] Adds support for regression.mean_squared_error to eval API * addressing PR comments * fixing tests	2019-07-11 09:22:52 -05:00
Nick Knize	374030a53f	Upgrade to lucene-8.2.0-snapshot-860e0be5378 (#44171 ) (#44184 ) Upgrades lucene library to lucene-8.2.0-snapshot-860e0be5378	2019-07-11 09:17:22 -05:00
Yannick Welsch	2ee07f1ff4	Simplify port usage in transport tests (#44157 ) Simplifies AbstractSimpleTransportTestCase to use JVM-local ports and also adds an assertion so that cases like #44134 can be more easily debugged. The likely reason for that one is that a test, which was repeated again and again while always spawning a fresh Gradle worker (due to Gradle daemon) kept increasing Gradle worker IDs, causing an overflow at some point.	2019-07-11 13:35:37 +02:00
David Roberts	5886aefeed	[ML] Wait for .ml-config primary before assigning persistent tasks (#44170 ) Now that ML job configs are stored in an index rather than cluster state, availability of the .ml-config index is very important to the operation of ML. When a cluster starts up the ML persistent tasks will be considered for node assignment very early on. It is best in this case if assignment is deferred until after the .ml-config index is available. The introduction of data frame analytics jobs has made this problem worse, because anomaly detection jobs already waited for the primary shards of the .ml-state, .ml-anomalies-shared and .ml-meta indices to be available before doing node assignment, and by coincidence this would probably lead to the primary shards of .ml-config also being searchable. But data frame analytics jobs had no other index checks prior to this change. This fixes problem 2 of #44156	2019-07-11 11:43:39 +01:00
Armin Braun	8a554f9737	Remove IncompatibleSnapshots Logic from Codebase (#44096 ) (#44183 ) * The incompatible snapshots logic was created to track 1.x snapshots that became incompatible with 2.x * It serves no purpose at this point * It adds an additional GET request to every loading of RepositoryData (from loading the incompatible snapshots blob)	2019-07-11 07:15:51 +02:00
Igor Motov	df2e1fb43e	Geo: add validator that only checks altitude (#43893 ) By default, we don't check ranges while indexing geo_shapes. As a result, it is possible to index geoshapes that contain contain coordinates outside of -90 +90 and -180 +180 ranges. Such geoshapes will currently break SQL and ML retrieval mechanism. This commit removes these restriction from the validator is used in SQL and ML retrieval.	2019-07-10 16:55:03 -04:00
Ryan Ernst	c6efb9be2a	Convert ReplicationResponse to Writeable (#43953 ) This commit convers ReplicationResponse and all its subclasses to support Writeable.Reader as a constructor. relates #34389	2019-07-10 12:45:10 -07:00
Ryan Ernst	fb77d8f461	Removed writeTo from TransportResponse and ActionResponse (#44092 ) The base classes for transport requests and responses currently implement Streamable and Writeable. The writeTo method on these base classes is implemented with an empty implementation. Not only does this complicate subclasses to think they need to call super.writeTo, but it also can lead to not implementing writeTo when it should have been implemented, or extendiong one of these classes when not necessary, since there is nothing to actually implement. This commit removes the empty writeTo from these base classes, and fixes subclasses to not call super and in some cases implement an empty writeTo themselves. relates #34389	2019-07-10 12:42:04 -07:00
Lisa Cawley	5b71340f99	[DOCS] Moves Watcher limitations (#44141 )	2019-07-10 11:17:12 -07:00
Michael Basnight	d2c3f4bae9	Validate read priv of enrich source indices (#43595 ) This commit adds permissions validation on the indices provided in the enrich policy. These indices should be validated at store time so as not to have cryptic error messages in the event the user does not have permissions to access said indices.	2019-07-10 13:09:10 -05:00
Zachary Tong	92ad588275	Remove generic on AggregatorFactory (#43664 ) (#44079 ) AggregatorFactory was generic over itself, but it doesn't appear we use this functionality anywhere (e.g. to allow the super class to declare arguments/return types generically for subclasses to override). Most places use a wildcard constraint, and even when a concrete type is specified it wasn't used. But since AggFactories are widely used, this led to the generic touching many pieces of code and making type signatures fairly complex	2019-07-10 13:20:28 -04:00
David Roberts	07f53e39b3	[ML] Fix ML memory tracker lockup when inner step fails (#44158 ) When the ML memory tracker is refreshed and a refresh is already in progress the idea is that the second and subsequent refresh requests receive the same response as the currently in progress refresh. There was a bug that if a refresh failed then the ML memory tracker's view of whether a refresh was in progress was not reset, leading to every subsequent request being registered to receive a response that would never come. This change makes the ML memory tracker pass on failures as well as successes to all interested parties and reset the list of interested parties so that further refresh attempts are possible after either a success or failure. This fixes problem 1 of #44156	2019-07-10 15:46:46 +01:00
Andrei Stefan	bb3e5351b5	SQL: double quotes escaping bug fix (#43829 ) (cherry picked from commit d589dcad18c3708913e13c757b91c846aeb35bb4)	2019-07-10 16:05:22 +03:00
Albert Zaharovits	018d946bba	[DOC] Backup & Restore Security Configuration (#42970 ) This commit documents the backup and restore of a cluster's security configuration. It is not possible to only backup (or only restore) security configuration, independent to the rest of the cluster's conf, so this describes how a full configuration backup&restore will include security as well. Moreover, it explains how part of the security conf data resides on the special .security index and how to backup that using regular data snapshot API. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> Co-Authored-By: Tim Vernum <tim@adjective.org>	2019-07-10 14:53:56 +03:00
Andrei Stefan	9567f337f5	SQL: handle SQL not being available in a more graceful way (#43665 ) * Add test for SQL not being available error message in JDBC. * Add a new qa sub-project that explicitly disables SQL XPack module in Gradle. (cherry picked from commit 8a1ac8a3a88a325ec9b99963e0fa288c18ee0ee5)	2019-07-10 14:36:24 +03:00
Andrei Stefan	9957b66d2e	SQL: concrete indices array size bug fix (#43878 ) * The created array didn't have the correct initial size while attempting to resolve multiple indices (cherry picked from commit 341006e9913e831408f5bbc7f8ad8c453a7f630e)	2019-07-10 14:36:23 +03:00
Przemysław Witek	44781e415e	[7.x] [ML] Add DatafeedTimingStats to datafeed GetDatafeedStatsAction.Response (#43045 ) (#44118 )	2019-07-10 11:51:44 +02:00
David Roberts	853ddb5a07	[ML] Fix custom timestamp override with dot-separated fractional seconds (#44127 ) Custom timestamp overrides provided to the find_file_structure endpoint produced an invalid Grok pattern if the fractional seconds separator was a dot rather than a comma or colon. This commit fixes that problem and adds tests for this sort of timestamp override. Fixes #44110	2019-07-10 10:27:08 +01:00
David Roberts	cb62d4acdf	[ML-DataFrame] Add a frequency option to transform config, default 1m (#44120 ) Previously a data frame transform would check whether the source index was changed every 10 seconds. Sometimes it may be desirable for the check to be done less frequently. This commit increases the default to 60 seconds but also allows the frequency to be overridden by a setting in the data frame transform config.	2019-07-10 09:59:00 +01:00
marcos ramos	88ee47c9ba	Fix OIDC documentation settings (#44115 ) Current kibana setting is xpack.security.auth.oidc.realm, but the correct one is xpack.security.authc.oidc.realm	2019-07-09 18:44:35 +03:00
Sachin Frayne	389c923a82	[Docs] Fix json syntax in watcher compare condition (#44032 )	2019-07-09 13:43:18 +02:00
Armin Braun	9eac5ceb1b	Dry up inputstream to bytesreference (#43675 ) (#44094 ) * Dry up Reading InputStream to BytesReference * Dry up spots where we use the same pattern to get from an InputStream to a BytesReferences	2019-07-09 09:18:25 +02:00
Christoph Büscher	8e8d7667cb	[Tests] Fix type inference issue (#44063 )	2019-07-08 17:34:35 +02:00
David Kyle	5fc12917c3	Data frame task failure does not make a 500 response (#44058 ) Data frame task responses had logic to return a HTTP 500 status code if there was any node or task failures even if other tasks in the same request reported correctly. This is different to how other task responses are handled where a 200 is always returned leaving the client should check for failures. Returning a 500 also breaks the high level rest client so always return a 200 Closes #44011	2019-07-08 11:53:11 +01:00
Ioannis Kakavas	9beb51fc44	Revert "Mute testEnableDisableBehaviour (#42929 )" This reverts commit `6ee578c6eb`.	2019-07-08 08:52:21 +03:00
Nhat Nguyen	9089820d8f	Enable indexing optimization using sequence numbers on replicas (#43616 ) This PR enables the indexing optimization using sequence numbers on replicas. With this optimization, indexing on replicas should be faster and use less memory as it can forgo the version lookup when possible. This change also deactivates the append-only optimization on replicas. Relates #34099	2019-07-05 22:12:08 -04:00
Dimitris Athanasiou	d3ddedf9fc	[7.x][ML] Add missing doc links to df-analytics rest spec and HLRC javadocs (#44025 ) (#44033 )	2019-07-06 02:03:29 +03:00
Mayya Sharipova	37e1ad7062	Forbid empty doc values on vector functions (#43944 ) Currently when a document misses a vector value, vector function returns 0 as a score for this document. We think this is incorrect behaviour. With this change, an error will be thrown if vector functions are used with docs that are missing vector doc values. Also VectorScriptDocValues is modified to allow size() function, which can be used to check if a document has a value for the vector field.	2019-07-05 18:09:06 -04:00
Dimitris Athanasiou	a1a62fded3	[7.x][ML] Stop df-analytics action request should filter tasks (#44016 ) (#44023 ) As a `BaseTasksRequest`, `StopDataFrameAnalyticsAction.Request` should implement a `match` method that makes sure only df-analytics tasks are applied.	2019-07-05 23:10:45 +03:00
Yannick Welsch	504a43d43a	Move ConnectionManager to async APIs (#42636 ) This commit converts the ConnectionManager's openConnection and connectToNode methods to async-style. This will allow us to not block threads anymore when opening connections. This PR also adapts the cluster coordination subsystem to make use of the new async APIs, allowing to remove some hacks in the test infrastructure that had to account for the previous synchronous nature of the connection APIs.	2019-07-05 20:40:22 +02:00
Yannick Welsch	1220ff5b6d	Publish to self through transport (#43994 ) This commit ensures that cluster state publications to self also go through the transport layer. This allows voting-only nodes to intercept the publication to self. Fixes an issue discovered by a test failure where a voting-only node, which was the only bootstrapped node, would not step down as master after state transfer because publishing to self would succeed. Closes #43631	2019-07-05 13:00:52 +02:00
Dimitris Athanasiou	30b20920b9	[7.x][ML] Report correct count for df-analytics get-stats API (#43969 ) (#43981 ) The count should match the number of all df-analytics that matched the id in the request. However, we set the count to the number of df-analytics returned which was bound to the `size` parameter. This commit fixes this by setting the count to the count of the `get` response.	2019-07-05 10:28:57 +03:00
Martijn van Groningen	adc06ffd89	take builtin role into account in docs tests	2019-07-05 08:06:18 +02:00
Jim Ferenczi	cdf55cb5c5	Refactor index engines to manage readers instead of searchers (#43860 ) This commit changes the way we manage refreshes in the index engines. Instead of relying on a SearcherManager, this change uses a ReaderManager that creates ElasticsearchDirectoryReader when needed. Searchers are now created on-demand (when acquireSearcher is called) from the current ElasticsearchDirectoryReader. It also slightly changes the Engine.Searcher to extend IndexSearcher in order to simplify the usage in the consumer.	2019-07-04 22:49:43 +02:00
Hendrik Muhs	4128b9b4f7	audit message missing for autostop call onStop when auto stopping (#43984) fixes #43977	2019-07-04 21:40:42 +02:00
Martijn van Groningen	9528c59fb3	added a basic test that enriching data works	2019-07-04 17:42:45 +02:00
Martijn van Groningen	1dd3d14f09	take into account `manage_enrich` builtin role	2019-07-04 16:51:48 +02:00
Martijn van Groningen	ac119b07e7	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-07-04 15:50:11 +02:00
Benjamin Trent	36f7259737	[ML] Fix datafeed checks when a concrete remote index is present (#43923 ) A bug was introduced in 6.6.0 when we added support for rollup indices. Rollup caps does NOT support looking at remote indices, consequently, since we always look up rollup caps, the datafeed fails with an error if its config includes a concrete remote index. (When all remote indices in a datafeed config are wildcards the problem did not occur.) The rollups feature does not support remote indices, so if there is any remote index in a datafeed config (wildcarded or not), we can skip the rollup cap checks. This PR implements that change.	2019-07-04 13:31:45 +01:00
Martijn van Groningen	7ba6e1752a	required changes after merge	2019-07-04 13:17:22 +02:00
Martijn van Groningen	653f1436a0	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-07-04 13:05:10 +02:00
Alan Woodward	4b99255fed	Add name() method to TokenizerFactory (#43909 ) This brings TokenizerFactory into line with CharFilterFactory and TokenFilterFactory, and removes the need to pass around tokenizer names when building custom analyzers. As this means that TokenizerFactory is no longer a functional interface, the commit also adds a factory method to TokenizerFactory to make construction simpler.	2019-07-04 11:28:55 +01:00
Alpar Torok	1b6109517a	Mute failing test Tracking in #43960	2019-07-04 12:13:02 +03:00
Benjamin Trent	7063a40411	[7.x] [ML][Data Frame] Adding bwc tests for pivot transform (#43506 ) (#43929 ) * [ML][Data Frame] Adding bwc tests for pivot transform (#43506) * [ML][Data Frame] Adding bwc tests for pivot transform * adding continuous transforms * adding continuous dataframes to bwc * adding continuous data frame tests * Adding rolling upgrade tests for continuous df * Fixing test * Adjusting indices used in BWC, and handling NPE for seq_no_stats * updating and muting specific bwc test * Adjusting bwc tests for backport	2019-07-03 16:39:38 -05:00
Martijn van Groningen	397150fa1e	Add enrich coordinator proxy action (#43801 ) Introduced proxy api the handle the search request load that originates from enrich processor. The enrich processor can execute many search requests that execute asynchronously in parallel and that can easily overwhelm the search thread pool on nodes. In order to protect this the Coordinator queues the search requests and only executes a fixed number of search requests in parallel. Besides this; the Coordinator tries to include as much as possible search requests (up to a defined maximum) inside a multi search request in order to reduce the number of remote api calls to be made from the node that performs ingestion.	2019-07-03 15:50:40 +02:00
Christoph Büscher	662f517f4e	Add _reload_search_analyzers endpoint to HLRC (#43733 ) This change adds the new endpoint that allows reloading of search analyzers to the high-level java rest client. Relates to #43313	2019-07-03 12:05:59 +02:00
Dimitris Athanasiou	96b0b27f18	[7.x][ML] Set df-analytics task state to failed when appropriate (#43880 ) (#43906 ) This introduces a `failed` state to which the data frame analytics persistent task is set to when something unexpected fails. It could be the process crashing, the results processor hitting some error, etc. The failure message is then captured and set on the task state. From there, it becomes available via the _stats API as `failure_reason`. The df-analytics stop API now has a `force` boolean parameter. This allows the user to call it for a failed task in order to reset it to `stopped` after we have ensured the failure has been communicated to the user. This commit also adds the analytics version in the persistent task params as this allows us to prevent tasks to run on unsuitable nodes in the future.	2019-07-03 12:41:56 +03:00
Jay Modi	1e0f67fb38	Deprecate transport profile security type setting (#43237 ) This commit deprecates the `transport.profiles.*.xpack.security.type` setting. This setting is used to configure a profile that would only allow client actions. With the upcoming removal of the transport client the setting should also be deprecated so that it may be removed in a future version.	2019-07-03 19:31:55 +10:00
Alexander Reelsen	9077c4402f	Watcher: Allow to execute actions for each element in array (#41997 ) This adds the ability to execute an action for each element that occurs in an array, for example you could sent a dedicated slack action for each search hit returned from a search. There is also a limit for the number of actions executed, which is hardcoded to 100 right now, to prevent having watches run forever. The watch history logs each action result and the total number of actions the were executed. Relates #34546	2019-07-03 11:28:50 +02:00
Tim Vernum	2a8f30eb9a	Support builtin privileges in get privileges API (#43901 ) Adds a new "/_security/privilege/_builtin" endpoint so that builtin index and cluster privileges can be retrieved via the Rest API Backport of: #42134	2019-07-03 19:08:28 +10:00
Tim Vernum	deacc2038e	Always attach system user to internal actions (#43902 ) All valid licenses permit security, and the only license state where we don't support security is when there is a missing license. However, for safety we should attach the system (or xpack/security) user to internally originated actions even if the license is missing (or, more strictly, doesn't support security). This allows all nodes to communicate and send internal actions (shard state, handshake/pings, etc) even if a license is transitioning between a broken state and a valid state. Relates: #42215 Backport of: #43468	2019-07-03 19:07:16 +10:00
Tim Vernum	31b19bd022	Use separate BitSet cache in Doc Level Security (#43899 ) Document level security was depending on the shared "BitsetFilterCache" which (by design) never expires its entries. However, when using DLS queries - particularly templated ones - the number (and memory usage) of generated bitsets can be significant. This change introduces a new cache specifically for BitSets used in DLS queries, that has memory usage constraints and access time expiry. The whole cache is automatically cleared if the role cache is cleared. Individual bitsets are cleared when the corresponding lucene index reader is closed. The cache defaults to 50MB, and entries expire if unused for 7 days. Backport of: #43669	2019-07-03 18:04:06 +10:00
Tim Vernum	461aa39daf	Switch WriteActionsTests.testBulk to use hamcrest (#43897 ) If an item in the bulk request fails, that could be for a variety of reasons - it may be that the underlying behaviour of security has changed, or it may just be a transient failure during testing. Simply asserting a `true`/`false` value produces failure messages that are difficult to diagnose and debug. Using hamcert (`assertThat`) will make it easier to understand the causes of failures in this test. Backport of: #43725	2019-07-03 16:29:28 +10:00
Tim Vernum	14884c871f	Document API-Key APIs require manage_api_key priv (#43869 ) Add the "Authorization" section to the API key API docs. These APIs require The new manage_api_key cluster privilege. Relates: #43865 Backport of: #43811	2019-07-03 13:51:44 +10:00
Jake Landis	6e9ccda2c5	ilm test - allow more time for policy completion (#43844 )	2019-07-02 22:05:18 -05:00
Jake Landis	0a79f4ca70	Extend timeout for TimeSeriesLifecycleActionsIT> testFullPolicy (#43891 )	2019-07-02 22:05:04 -05:00
Mayya Sharipova	756c42f99f	Add dims parameter to dense_vector mapping (#43444 ) (#43895 ) Typically, dense vectors of both documents and queries must have the same number of dimensions. Different number of dimensions among documents or query vector indicate an error. This PR enforces that all vectors for the same field have the same number of dimensions. It also enforces that query vectors have the same number of dimensions.	2019-07-02 21:14:16 -04:00
Benjamin Trent	fb825a6470	[7.x] [ML][Data Frame] add node attr to GET _stats (#43842 ) (#43894 ) * [ML][Data Frame] add node attr to GET _stats (#43842) * [ML][Data Frame] add node attr to GET _stats * addressing testing issues with node.attributes * adjusting for backport	2019-07-02 19:35:37 -05:00
Benjamin Trent	2c97e26ce8	[ML][Data Frame] fix progress measurement for continuous transforms (#43838 ) (#43887 ) * [ML][Data Frame] fix progress measurement for continuous transforms * Update DataFrameIndexer.java	2019-07-02 19:35:09 -05:00
Jake Landis	eb73bed40d	7x watcher backport testfixes (#43848 ) * fix org.elasticsearch.xpack.watcher.test.integration.RejectedExecutionTests (#41777) This commit un-mutes org.elasticsearch.xpack.watcher.test.integration.RejectedExecutionTests which was failing intermittently due to a logic bug. It is not possible to use the real Watcher scheduler (which is needed for this test) and reliabliby count the .triggered-watches since current count of documents in the .triggered-watches index is based on the timing of the scheduler and the ability to delete based on the Watcher and Write thread pools. This commit simply removes the .triggered-watch check and relies soley on the .watcher-history index as an indication that operations that can occur when the Watcher threadpool is rejecting. closes #41734 * fix unlikely bug that can prevent Watcher from restarting (#42030) The bug fixed here is unlikely to happen. It requires ES to be started with ILM disabled, Watcher enabled, and Watcher explicitly stopped and restarted. Due to template validation Watcher does not fully start and can result in a partially started state. This is an unlikely scenerio outside of the testing framework. Note - this bug was introduced while the test that would have caught it was muted. The test remains muted since the underlying cuase of the random failures has not been identified. When this test is un-muted it will now work.	2019-07-02 12:16:06 -05:00
Christoph Büscher	31cf96e7bf	Return reloaded analyzers in _reload_search_ananlyzer response (#43813 ) Currently the repsonse of the "_reload_search_analyzer" endpoint contains the index names and nodeIds of indices were analyzers reloading was triggered. This change add the names of the search-time analyzers that were reloaded. Closes #43804	2019-07-02 18:51:15 +02:00
Dimitris Athanasiou	1ea53979b5	[7.x][ML] Get df-analytics action should require monitor privilege (#43831 ) (#43866 )	2019-07-02 16:00:54 +03:00
Tim Vernum	8d099dad38	Add "manage_api_key" cluster privilege (#43865 ) This adds a new cluster privilege for manage_api_key. Users with this privilege are able to create new API keys (as a child of their own user identity) and may also get and invalidate any/all API keys (including those owned by other users). Backport of: #43728	2019-07-02 21:57:42 +10:00
Benjamin Trent	b95ee7ebb2	[7.x] [ML][Data Frame] using transform creation version for node assignment (#43764 ) (#43843 ) * [ML][Data Frame] using transform creation version for node assignment (#43764) * [ML][Data Frame] using transform creation version for node assignment * removing unused imports * Addressing PR comment * adjusing for backport	2019-07-02 06:52:34 -05:00
Benjamin Trent	82c1ddc117	[7.x] [ML][Data Frame] Add deduced mappings to _preview response payload (#43742 ) (#43849 ) * [ML][Data Frame] Add deduced mappings to _preview response payload (#43742) * [ML][Data Frame] Add deduced mappings to _preview response payload * updating preview docs * fixing code for backport	2019-07-02 06:52:14 -05:00
Tanguy Leroux	b977f019b8	Expose translog stats in ReadOnlyEngine (#43752 ) (#43823 ) Backport of #43752 for 7.x.	2019-07-02 13:39:00 +02:00
Ioannis Kakavas	c8ed271937	Use URLEncoder#encode(String, String) as URLEncoder#encode(String, Charset) is only available since Java 10	2019-07-02 14:20:29 +03:00
Ioannis Kakavas	4ea17b76dc	Fix credentials encoding for OIDC token request (#43808 ) As defined in https://tools.ietf.org/html/rfc6749#section-2.3.1 both client id and client secret need to be encoded with the application/x-www-form-urlencoded encoding algorithm when used as credentials for HTTP Basic Authentication in requests to the OP. Resolves #43709	2019-07-02 13:36:00 +03:00
Tomas Della Vedova	4cdb24bceb	Use explicit string keys in data_frame test (#43854 )	2019-07-02 11:06:29 +02:00
Albert Zaharovits	4eb89a6912	UserRoleMapper non-null groups and metadata (#43836 ) This is an odd backport of #41774 UserRoleMapper.UserData is constructed by each realm and it is used to "match" role mapping expressions that eventually supply the role names of the principal. This PR filters out `null` collection values (lists and maps), for the groups and metadata, which get to take part in the role mapping, in preparation for using Java 9 collection APIs. It filters them as soon as possible, during the construction.	2019-07-02 00:10:15 +03:00
Christoph Büscher	fe3f9f0c6b	Yet another `the the` cleanup (#43815 )	2019-07-01 20:22:19 +02:00
Martijn van Groningen	785aedebad	Add restart node enrich tests. (#43579 ) This test verifies that enrich policies still exist after a full cluster restart. If EnrichPolicy is not registered as named xcontent in EnrichPlugin class then this test fails.	2019-07-01 17:36:01 +02:00
Yogesh Gaikwad	031d5e96ac	HLRC changes for kerberos grant type (#43642 ) (#43822 ) The TODO from last PR for kerbero grant type was missed. This commit adds the changes for kerberos grant type in HLRC.	2019-07-02 00:55:02 +10:00
Benjamin Trent	8108834534	[ML][Data Frame] account for delay in writing stats docs (#43703 ) (#43819 )	2019-07-01 09:14:44 -05:00
Benjamin Trent	4c95c0c456	[ML][Data Frame] reduce audit frequency, change log msg, and level (#43771 ) (#43818 )	2019-07-01 09:14:26 -05:00
Mark Vieira	13887c01cc	Remove compile-time dependency on test fixtures (#43651 )	2019-07-01 14:59:41 +03:00
Julie Tibshirani	ffa5919d7c	Add support for 'flattened object' fields. (#43762 ) This commit merges the `object-fields` feature branch. The new 'flattened object' field type allows an entire JSON object to be indexed into a field, and provides limited search functionality over the field's contents.	2019-07-01 12:08:50 +03:00
Hendrik Muhs	a58d231f4d	relax trigger count for transform stats test (#43753 ) relax trigger count test as we can not guarantee it due to async behaviour	2019-07-01 10:30:40 +02:00
Alpar Torok	717d14a7e2	Backport: convert x pack qa (#43763 ) * Revert "Revert "Test clusters: convert x-pack qa tests (#43283)" (#43549)" This reverts commit `8d9a971259`. * Fix failing test	2019-07-01 10:38:56 +03:00
Dimitris Athanasiou	3bdb9d5f08	[7.x][ML] Correct df-analytics version introduced to 7.3.0 (#43784 ) (#43795 )	2019-07-01 09:19:04 +03:00
Martijn van Groningen	237f2bd60a	Make ingest executing non blocking (#43361 ) Added an additional method to the Processor interface to allow a processor implementation to make a non blocking call. Also added semaphore in order to avoid search thread pools from rejecting search requests originating from the match processor. This is a temporary workaround.	2019-07-01 08:01:46 +02:00
Ryan Ernst	3a2c698ce0	Rename Action to ActionType (#43778 ) Action is a class that encapsulates meta information about an action that allows it to be called remotely, specifically the action name and response type. With recent refactoring, the action class can now be constructed as a static constant, instead of needing to create a subclass. This makes the old pattern of creating a singleton INSTANCE both misnamed and lacking a common placement. This commit renames Action to ActionType, thus allowing the old INSTANCE naming pattern to be TYPE on the transport action itself. ActionType also conveys that this class is also not the action itself, although this change does not rename any concrete classes as those will be removed organically as they are converted to TYPE constants. relates #34389	2019-06-30 22:00:17 -07:00
Martijn van Groningen	adcba69d96	required changes after merge	2019-06-30 21:40:40 +02:00
Martijn van Groningen	eb8e03bc8b	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-30 21:32:51 +02:00
Dimitris Athanasiou	8f49d01113	[7.x][ML] Rename df-analytics `_id_copy` to `ml__id_copy` (#43754 ) (#43783 ) Renames `_id_copy` to `ml__id_copy` as field names starting with underscore are deprecated. The new field name `ml__id_copy` was chosen as an obscure enough field that users won't have in their data. Otherwise, this field is only intented to be used by df-analytics.	2019-06-30 19:37:00 +03:00
Albert Zaharovits	5e17bc5dcc	Consistent Secure Settings #40416 Introduces a new `ConsistentSecureSettingsValidatorService` service that exposes a single public method, namely `allSecureSettingsConsistent`. The method returns `true` if the local node's secure settings (inside the keystore) are equal to the master's, and `false` otherwise. Technically, the local node has to have exactly the same secure settings - setting names should not be missing or in surplus - for all `SecureSetting` instances that are flagged with the newly introduced `Property.Consistent`. It is worth highlighting that the `allSecureSettingsConsistent` is not a consensus view across the cluster, but rather the local node's perspective in relation to the master.	2019-06-29 23:26:17 +03:00
David Roberts	b599c68d23	[ML] Assert that a no-op job creates no results nor state (#43681 ) If a job is opened and then closed and does nothing in between then it should not persist any results or state documents. This change adapts the no-op job test to assert no results in addition to no state, and to log any documents that cause this assertion to fail. Relates elastic/ml-cpp#512 Relates #43680	2019-06-29 14:57:49 +01:00
Ryan Ernst	28ab77a023	Add StreamableResponseAction to aid in deprecation of Streamable (#43770 ) The Action base class currently works for both Streamable and Writeable response types. This commit intorduces StreamableResponseAction, for which only the legacy Action implementions which provide newResponse() will extend. This eliminates the need for overriding newResponse() with an UnsupportedOperationException. relates #34389	2019-06-28 21:40:00 -07:00
David Roberts	7951c63b91	[ML] Mark ml-cpp dependency as regularly changing (#43760 ) Since #41817 was merged the ml-cpp zip file for any given version has been cached indefinitely by Gradle. This is problematic, particularly in the case of the master branch where the version 8.0.0-SNAPSHOT will be in use for more than a year. This change tells Gradle that the ml-cpp zip file is a "changing" dependency, and to check whether it has changed every two hours. Two hours is a compromise between checking on every build and annoying developers with slow internet connections and checking rarely causing bug fixes in the ml-cpp code to take a long time to propagate through to elasticsearch PRs that rely on them.	2019-06-28 21:21:18 +01:00
Benjamin Trent	67a3c656c3	[7.x] [ML][Data Frame] removing format support (#43659 ) (#43747 ) * [ML][Data Frame] removing format support (#43659) * Fixing conflicts	2019-06-28 10:02:37 -05:00
Jim Ferenczi	7ca69db83f	Refactor IndexSearcherWrapper to disallow the wrapping of IndexSearcher (#43645 ) This change removes the ability to wrap an IndexSearcher in plugins. The IndexSearcherWrapper is replaced by an IndexReaderWrapper and allows to wrap the DirectoryReader only. This simplifies the creation of the context IndexSearcher that is used on a per request basis. This change also moves the optimization that was implemented in the security index searcher wrapper to the ContextIndexSearcher that now checks the live docs to determine how the search should be executed. If the underlying live docs is a sparse bit set the searcher will compute the intersection betweeen the query and the live docs instead of checking the live docs on every document that match the query.	2019-06-28 16:28:02 +02:00
Alpar Torok	d1a4d8866d	Add missing dependencies so we can build in parallel (#43672 )	2019-06-28 16:41:18 +03:00
Dimitris Athanasiou	86c853a7c2	[7.x][ML] Rename outlier score setting to feature_influence_threshold (#43705 ) (#43734 ) Renames outlier score setting `minimum_score_to_write_feature_influence` to `feature_influence_threshold`.	2019-06-28 13:28:25 +03:00
Dimitris Athanasiou	cab879118d	[7.x][ML] Support multiple source indices for df-analytics (#43702 ) (#43731 ) This commit adds support for multiple source indices. In order to deal with multiple indices having different mappings, it attempts a best-effort approach to merge the mappings assuming there are no conflicts. In case conflicts exists an error will be returned. To allow users creating custom mappings for special use cases, the destination index is now allowed to exist before the analytics job runs. In addition, settings are no longer copied except for the `index.number_of_shards` and `index.number_of_replicas`.	2019-06-28 13:28:03 +03:00
Christoph Büscher	2cc7f5a744	Allow reloading of search time analyzers (#43313 ) Currently changing resources (like dictionaries, synonym files etc...) of search time analyzers is only possible by closing an index, changing the underlying resource (e.g. synonym files) and then re-opening the index for the change to take effect. This PR adds a new API endpoint that allows triggering reloading of certain analysis resources (currently token filters) that will then pick up changes in underlying file resources. To achieve this we introduce a new type of custom analyzer (ReloadableCustomAnalyzer) that uses a ReuseStrategy that allows swapping out analysis components. Custom analyzers that contain filters that are markes as "updateable" will automatically choose this implementation. This PR also adds this capability to `synonym` token filters for use in search time analyzers. Relates to #29051	2019-06-28 09:55:40 +02:00
Przemysław Witek	94f18da5df	Add version and create_time to data frame analytics config (#43683 ) (#43712 )	2019-06-28 07:37:21 +02:00
Ryan Ernst	5b4089e57e	Remove nodeId from BaseNodeRequest (#43658 ) TransportNodesAction provides a mechanism to easily broadcast a request to many nodes, and collect the respones into a high level response. Each node has its own request type, with a base class of BaseNodeRequest. This base request requires passing the nodeId to which the request will be sent. However, that nodeId is not used anywhere. It is private to the base class, yet serialized to each node, where the node could just as easily find the nodeId of the node it is on locally. This commit removes passing the nodeId through to the node request creation, and guards its serialization so that we can remove the base request class altogether in the future.	2019-06-27 18:45:14 -07:00
Igor Motov	3607876a71	Geo: Makes coordinate validator in libs/geo plugable (#43657 ) Moves coordinate validation from Geometry constructors into parser. Relates #43644	2019-06-27 19:53:41 -04:00
Nhat Nguyen	ce8771feb7	Do not use MockInternalEngine in GatewayIndexStateIT (#43716 ) GatewayIndexStateIT#testRecoverBrokenIndexMetadata replies on the flushing on shutdown. This behaviour, however, can be randomly disabled in MockInternalEngine. Closes #43034	2019-06-27 18:28:04 -04:00
Przemysław Witek	68dbbd8793	Deduplicate two similar TimeUtils classes. (#43697 ) * Deduplicate org.elasticsearch.xpack.core.dataframe.utils.TimeUtils and org.elasticsearch.xpack.core.ml.utils.time.TimeUtils into a common class: org.elasticsearch.xpack.core.common.time.TimeUtils. * Add unit tests for parseTimeField and parseTimeFieldToInstant methods	2019-06-27 18:51:48 +02:00
Yannick Welsch	6744344ef2	Handle situation where only voting-only nodes are bootstrapped (#43628 ) Adds support for the situation where only voting-only nodes are bootstrapped. In that case, they will still try to become elected and bring full master nodes into the cluster.	2019-06-27 18:10:15 +02:00
David Roberts	f39619d182	[ML] Don't write timing stats on no-op (#43680 ) Similar to elastic/ml-cpp#512, if a job opens and closes and does nothing in between we shouldn't write timing stats to the results index.	2019-06-27 16:37:54 +01:00
Jim Ferenczi	329d05f61e	Fix UOE on search requests that match a sparse role query (#43668 ) Search requests executed through the SecurityIndexSearcherWrapper throw an UnsupportedOperationException if they match a sparse role query. When low level cancellation is activated (which is the default since #42857), the context index searcher creates a weight that doesn't handle #scorer. This change fixes this bug and adds a test to ensure that we check this case.	2019-06-27 16:56:56 +02:00
Martijn van Groningen	683e116601	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-27 08:35:37 +02:00
Przemysław Witek	ba518722a2	[7.x] [ML] Tag destination index with data frame metadata (#43567 ) (#43660 )	2019-06-27 08:08:39 +02:00
Benjamin Trent	d05593c3ad	[ML][Data Frame] adds tests for continuous DF (#43601 ) (#43654 )	2019-06-26 14:59:19 -05:00
Benjamin Trent	52e26bbc42	[ML][Data Frame] improve pivot nested field validations (#43548 ) (#43636 ) * [ML][Data Frame] improve pivot nested field validations * addressing pr comments	2019-06-26 13:35:51 -05:00
Armin Braun	c00e305d79	Optimize Selector Wakeups (#43515 ) (#43650 ) * Use atomic boolean to guard wakeups * Don't trigger wakeups from the select loops thread itself for registering and closing channels * Don't needlessly queue writes Co-authored-by: Tim Brooks <tim@uncontended.net>	2019-06-26 20:00:42 +02:00
David Kyle	e1f761dfc7	[Ml Data Frame] Size the GET stats search by number of Ids requested (#43206 ) Set the size of the search request to the number of ids limited by 10,000	2019-06-26 17:01:12 +01:00
Benjamin Trent	c121b00c98	[7.x] [ML][Data Frame] Add support for allow_no_match for endpoints (#43490 ) (#43637 ) * [ML][Data Frame] Add support for allow_no_match for endpoints (#43490) * [ML][Data Frame] Add support for allow_no_match parameter in endpoints Adds support for: * Get Transforms * Get Transforms stats * stop transforms * Update DataFrameTransformDocumentationIT.java	2019-06-26 10:09:56 -05:00
David Roberts	31dc5b7d3a	[TEST] Wait for replicas before stopping nodes in ML distributed test (#43622 ) If we stop a node before replicas exist then the test can fail because we lose a whole index if we stop the node with the primary on.	2019-06-26 11:52:53 +01:00
David Roberts	558e323c89	[ML] Introduce a setting for the process connect timeout (#43234 ) This change introduces a new setting, xpack.ml.process_connect_timeout, to enable the timeout for one of the external ML processes to connect to the ES JVM to be increased. The timeout may need to be increased if many processes are being started simultaneously on the same machine. This is unlikely in clusters with many ML nodes, as we balance the processes across the ML nodes, but can happen in clusters with a single ML node and a high value for xpack.ml.node_concurrent_job_allocations.	2019-06-26 09:22:04 +01:00
Yannick Welsch	2049f715b3	Add voting-only master node (#43410 ) A voting-only master-eligible node is a node that can participate in master elections but will not act as a master in the cluster. In particular, a voting-only node can help elect another master-eligible node as master, and can serve as a tiebreaker in elections. High availability (HA) clusters require at least three master-eligible nodes, so that if one of the three nodes is down, then the remaining two can still elect a master amongst them-selves. This only requires one of the two remaining nodes to have the capability to act as master, but both need to have voting powers. This means that one of the three master-eligible nodes can be made as voting-only. If this voting-only node is a dedicated master, a less powerful machine or a smaller heap-size can be chosen for this node. Alternatively, a voting-only non-dedicated master node can play the role of the third master-eligible node, which allows running an HA cluster with only two dedicated master nodes. Closes #14340 Co-authored-by: David Turner <david.turner@elastic.co>	2019-06-26 08:07:56 +02:00
Yogesh Gaikwad	480453aa24	Make role descriptors optional when creating API keys (#43481 ) (#43614 ) This commit changes the `role_descriptors` field from required to optional when creating API key. The default behavior in .NET ES client is to omit properties with `null` value requiring additional workarounds. The behavior for the API does not change. Field names (`id`, `name`) in the invalidate api keys API documentation have been corrected where they were wrong. Closes #42053	2019-06-26 14:30:51 +10:00
Yogesh Gaikwad	58179af5af	Enable Kerberos tests (#43519 ) (#43612 ) Now that the fix krb5-kdc fixture (entropy problem in docker container) is in and the converting `kerberos-tests` to testclusters is done, enabling the kerberos-tests Closes #40678	2019-06-26 12:55:41 +10:00
Przemysław Witek	76a750a0a0	Remove unused mapStringsOrdered method (#42513 ) (#43585 )	2019-06-25 20:43:38 +02:00
Tanguy Leroux	0dc1c12f13	Fix indices shown in _cat/indices (#43286 ) After two recent changes (#38824 and #33888), the _cat/indices API no longer report information for active recovering indices and non-replicated closed indices. It also misreport replicated closed indices that are potentially not authorized for the user. This commit changes how the cat action works by first using the Get Settings API in order to resolve authorized indices. It then uses the Cluster State, Cluster Health and Indices Stats APIs to retrieve information about the indices. Closes #39933	2019-06-25 20:02:34 +02:00
Martijn van Groningen	d6a7fd9f30	unmuted test	2019-06-25 19:54:00 +02:00
Dimitris Athanasiou	126c2fd2d5	[7.x][ML] Machine learning data frame analytics (#43544 ) (#43592 ) This merges the initial work that adds a framework for performing machine learning analytics on data frames. The feature is currently experimental and requires a platinum license. Note that the original commits can be found in the `feature-ml-data-frame-analytics` branch. A new set of APIs is added which allows the creation of data frame analytics jobs. Configuration allows specifying different types of analysis to be performed on a data frame. At first there is support for outlier detection. The APIs are: - PUT _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id}/_stats - POST _ml/data_frame/analysis/{id}/_start - POST _ml/data_frame/analysis/{id}/_stop - DELETE _ml/data_frame/analysis/{id} When a data frame analytics job is started a persistent task is created and started. The main steps of the task are: 1. reindex the source index into the dest index 2. analyze the data through the data_frame_analyzer c++ process 3. merge the results of the process back into the destination index In addition, an evaluation API is added which packages commonly used metrics that provide evaluation of various analysis: - POST _ml/data_frame/_evaluate	2019-06-25 20:29:11 +03:00
James Baiera	1b902aa746	Make enrich processor use search action through a client (#43311 ) Add client to processor parameters in the ingest service. Remove the search provider function from the processor parameters. ExactMatchProcessor and Factory converted to use client. Remove test cases that are no longer applicable from processor.	2019-06-25 13:09:08 -04:00
Benjamin Trent	970e157eac	[ML][Data Frame] Adjusting error message (#43455 ) (#43580 ) * Adjusting error message * Update TransportPutDataFrameTransformAction.java * Update TransportPutDataFrameTransformAction.java	2019-06-25 10:09:39 -05:00
Przemysław Witek	c702cd7415	[7.x] Implement XContentParser.genericMap and XContentParser.genericMapOrdered methods (#42059 ) (#43575 )	2019-06-25 16:04:54 +02:00
Przemysław Witek	b15e40ffad	Extract TimingStats-related functionality into TimingStatsReporter (#43371 ) (#43557 )	2019-06-25 15:48:39 +02:00
Martijn van Groningen	36f0e8a8bb	Added multi node enrich tests and fixed serialization issues. (#43386 ) The test for now tests the enrich APIs in a multi node environment. Picked EsIntegTestCase test over a real qa module in order to avoid adding another module that starts a test cluster.	2019-06-25 14:03:10 +02:00
David Roberts	9c285ddbab	[ML] Improve message when native controller cannot connect (#43565 ) The error message if the native controller failed to run (for example due to running Elasticsearch on an unsupported platform) was not easy to understand. This change removes pointless detail from the message and adds some hints about likely causes. Fixes #42341	2019-06-25 12:06:54 +01:00
Martijn van Groningen	d0634e444d	Fixed compile errors after merge.	2019-06-25 10:12:16 +02:00
Martijn van Groningen	f587519f17	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-25 10:09:51 +02:00
Lee Hinman	8d9a971259	Revert "Test clusters: convert x-pack qa tests (#43283 )" (#43549 ) This reverts commit `ccaa8c33ba`.	2019-06-24 17:16:29 -06:00
James Baiera	c0d5ec87e1	Set enrich indices to be read only before swapping their aliases (#42874 )	2019-06-24 15:14:11 -04:00
Tim Brooks	38516a4dd5	Move nio ip filter rule to be a channel handler (#43507 ) Currently nio implements ip filtering at the channel context level. This is kind of a hack as the application logic should be implemented at the handler level. This commit moves the ip filtering into a channel handler. This requires adding an indicator to the channel handler to show when a channel should be closed.	2019-06-24 10:03:24 -06:00
Gordon Brown	fac7efba9a	[7.x] Account for node versions during allocation in ILM Shrink (#43300 ) This commit ensures that ILM's Shrink action will take node versions into account when choosing which node to allocate to when shrinking an index. Prior to this change, ILM could pick a node with a lower version than some shards are already allocated to, which causes the new allocation to fail as shards can't be relocated onto a node with a lower version than they are already on. As part of this, when making the decision about which node to allocate to prior to Shrink, all shards in the index are considered, rather than choosing a random shard to consider. Further, the unit tests for the logic that chooses a node to allocate shards to pre-shrink has been improved to validate the behavior in more realistic and varied initial conditions.	2019-06-24 10:02:49 -06:00
Michael Basnight	6945e5d5e6	Add role for enrich processor (#42677 ) This commit adds the manage_enrich privilege, which grants access to all of the enrich processor lifecycle actions. In addition this commit also creates a role which grants access to the generated indices. Relates #41939 Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2019-06-24 10:47:01 -05:00
Mayya Sharipova	813551e070	Fix eclipse build gradle for vectors project Closes #43496	2019-06-24 09:22:48 -04:00
Martijn van Groningen	101cf384ba	Replace Streamable w/ Writable in AcknowledgedResponse and subclasses (backport 7.x) (#43525 ) This commit replaces usages of Streamable with Writeable for the AcknowledgedResponse and its subclasses, plus associated actions. Note that where possible response fields were made final and default constructors were removed. This is a large PR, but the change is mostly mechanical. Relates to #34389 Backport of #43414	2019-06-24 13:47:37 +02:00
Alpar Torok	ccaa8c33ba	Test clusters: convert x-pack qa tests (#43283 )	2019-06-24 12:20:46 +03:00
Alpar Torok	ea44da6069	Testclusters: conver remaining x-pack (#43335 ) Convert x-pack tests	2019-06-24 12:07:42 +03:00
Martijn van Groningen	df9f06213d	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-21 19:58:04 +02:00
Benjamin Trent	f4b75d6d14	[7.x] [ML][Data Frame] Add version and create_time to transform config (#43384 ) (#43480 ) * [ML][Data Frame] Add version and create_time to transform config (#43384) * [ML][Data Frame] Add version and create_time to transform config * s/transform_version/version s/Date/Instant * fixing getter/setter for version * adjusting for backport	2019-06-21 09:11:44 -05:00
David Kyle	73221d2265	[ML] Resolve NetworkDisruptionIT (#43441 ) After the network disruption a partition is created, one side of which can form a cluster the other can't. Ensure requests are sent to a node on the correct side of the cluster	2019-06-21 10:24:02 +01:00
Simon Willnauer	424ef4f158	SecurityIndexSearcherWrapper doesn't always carry over caches and similarity (#43436 ) If DocumentLevelSecurity is enabled SecurityIndexSearcherWrapper doesn't carry over the cache, cache policy and similarity from the incoming searcher.	2019-06-21 10:19:10 +02:00
Tim Vernum	059eb55108	Use SecureString for password length validation (#43465 ) This replaces the use of char[] in the password length validation code, with the use of SecureString Although the use of char[] is not in itself problematic, using a SecureString encourages callers to think about the lifetime of the password object and to clear it after use. Backport of: #42884	2019-06-21 17:11:07 +10:00
Armin Braun	21515b9ff1	Fix IpFilteringIntegrationTests (#43019 ) (#43434 ) * Increase timeout to 5s since we saw 500ms+ GC pauses on CI * closes #40689	2019-06-20 22:31:59 +02:00
Yannick Welsch	7f8e1454ab	Advance checkpoints only after persisting ops (#43205 ) Local and global checkpoints currently do not correctly reflect what's persisted to disk. The issue is that the local checkpoint is adapted as soon as an operation is processed (but not fsynced yet). This leaves room for the history below the global checkpoint to still change in case of a crash. As we rely on global checkpoints for CCR as well as operation-based recoveries, this has the risk of shard copies / follower clusters going out of sync. This commit required changing some core classes in the system: - The LocalCheckpointTracker keeps track now not only of the information whether an operation has been processed, but also whether that operation has been persisted to disk. - TranslogWriter now keeps track of the sequence numbers that have not been fsynced yet. Once they are fsynced, TranslogWriter notifies LocalCheckpointTracker of this. - ReplicationTracker now keeps track of the persisted local and persisted global checkpoints of all shard copies when in primary mode. The computed global checkpoint (which represents the minimum of all persisted local checkpoints of all in-sync shard copies), which was previously stored in the checkpoint entry for the local shard copy, has been moved to an extra field. - The periodic global checkpoint sync now also takes async durability into account, where the local checkpoints on shards only advance when the translog is asynchronously fsynced. This means that the previous condition to detect inactivity (max sequence number is equal to global checkpoint) is not sufficient anymore. - The new index closing API does not work when combined with async durability. The shard verification step is now requires an additional pre-flight step to fsync the translog, so that the main verify shard step has the most up-to-date global checkpoint at disposition.	2019-06-20 11:12:38 +02:00
Andrei Stefan	fe0f9055d8	Fix NPE in case of subsequent scrolled requests for a CSV/TSV formatted response (#43365 ) (cherry picked from commit 0ef7bb0f8b07cd0392d37f96ca9360821b19315a)	2019-06-20 11:26:11 +03:00
Martijn van Groningen	9de4e878f7	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-20 09:44:31 +02:00
Jason Tedor	1f1a035def	Remove stale test logging annotations (#43403 ) This commit removes some very old test logging annotations that appeared to be added to investigate test failures that are long since closed. If these are needed, they can be added back on a case-by-case basis with a comment associating them to a test failure.	2019-06-19 22:58:22 -04:00
Lee Hinman	c2bf628a6d	[7.x] Narrow period of Shrink action in which ILM prevents stopping (#43254 ) (#43393 ) * Narrow period of Shrink action in which ILM prevents stopping Prior to this change, we would prevent stopping of ILM if the index was anywhere in the shrink action. This commit changes `IndexLifecycleService` to allow stopping when in any of the innocuous steps during shrink. This changes ILM only to prevent stopping if absolutely necessary. Resolves #43253 * Rename variable for ignore actions -> ignore steps * Fix comment * Factor test out to test all stoppable steps	2019-06-19 16:37:41 -06:00
Benjamin Trent	77ce3260dd	[ML][Data Frame] make response.count be total count of hits (#43241 ) (#43389 ) * [ML][Data Frame] make response.count be total count of hits * addressing line length check * changing response count for filters * adjusting serialization, variable name, and total count logic * making count mandatory for creation	2019-06-19 16:19:06 -05:00
Benjamin Trent	b333ced5a7	[7.x] [ML][Data Frame] adds new pipeline field to dest config (#43124 ) (#43388 ) * [ML][Data Frame] adds new pipeline field to dest config (#43124) * [ML][Data Frame] adds new pipeline field to dest config * Adding pipeline support to _preview * removing unused import * moving towards extracting _source from pipeline simulation * fixing permission requirement, adding _index entry to doc * adjusting for java 8 compatibility * adjusting bwc serialization version to 7.3.0	2019-06-19 16:18:27 -05:00
Christos Soulios	d1637ca476	Backport: Refactor aggregation base classes to remove doEquals() and doHashCode() (#43363 ) This PR is a backport a of #43214 from v8.0.0 A number of the aggregation base classes have an abstract doEquals() and doHashCode() (e.g. InternalAggregation.java, AbstractPipelineAggregationBuilder.java). Theoretically this is so the sub-classes can add to the equals/hashCode and don't need to worry about calling super.equals(). In practice, it's mostly just confusing/inconsistent. And if there are more than two levels, we end up with situations like InternalMappedSignificantTerms which has to call super.doEquals() which defeats the point of having these overridable methods. This PR removes the do versions and just use equals/hashCode ensuring the super when necessary.	2019-06-19 22:31:06 +03:00
Lee Hinman	d81ce9a647	Return 0 for negative "free" and "total" memory reported by the OS (#42725 ) * Return 0 for negative "free" and "total" memory reported by the OS We've had a situation where the MX bean reported negative values for the free memory of the OS, in those rare cases we want to return a value of 0 rather than blowing up later down the pipeline. In the event that there is a serialization or creation error with regard to memory use, this adds asserts so the failure will occur as soon as possible and give us a better location for investigation. Resolves #42157 * Fix test passing in invalid memory value * Fix another test passing in invalid memory value * Also change mem check in MachineLearning.machineMemoryFromStats * Add background documentation for why we prevent negative return values * Clarify comment a bit more	2019-06-19 10:35:48 -06:00
Gordon Brown	23a3471394	Fix randomization in testPerformActionAttrsRequestFails (#43304 ) The randomization in this test would occasionally generate duplicate node attribute keys, causing spurious test failures. This commit adjusts the randomization to not generate duplicate keys and cleans up the data structure used to hold the generated keys.	2019-06-19 10:34:39 -06:00
Martijn van Groningen	a4c45b5d70	Replace Streamable w/ Writeable in SingleShardRequest and subclasses (#43222 ) (#43364 ) Backport of: https://github.com/elastic/elasticsearch/pull/43222 This commit replaces usages of Streamable with Writeable for the SingleShardRequest / TransportSingleShardAction classes and subclasses of these classes. Note that where possible response fields were made final and default constructors were removed. Relates to #34389	2019-06-19 16:15:09 +02:00
Przemysław Witek	86b58d9ff3	Rename AutoDetectResultProcessor* to AutodetectResultProcessor* for consistency with other classes where the spelling is "Autodetect" (#43359 ) (#43366 )	2019-06-19 15:31:26 +02:00
Yogesh Gaikwad	2f173402ec	Add kerberos grant_type to get token in exchange for Kerberos ticket (#42847 ) (#43355 ) Kibana wants to create access_token/refresh_token pair using Token management APIs in exchange for kerberos tickets. `client_credentials` grant_type requires every user to have `cluster:admin/xpack/security/token/create` cluster privilege. This commit introduces `_kerberos` grant_type for generating `access_token` and `refresh_token` in exchange for a valid base64 encoded kerberos ticket. In addition, `kibana_user` role now has cluster privilege to create tokens. This allows Kibana to create access_token/refresh_token pair in exchange for kerberos tickets. Note: The lifetime from the kerberos ticket is not used in ES and so even after it expires the access_token/refresh_token pair will be valid. Care must be taken to invalidate such tokens using token management APIs if required. Closes #41943	2019-06-19 18:26:52 +10:00
Jason Tedor	42cc27e74f	Remove token service trace logging in tests This commit removes some trace logging for the token service in the rolling upgrade tests. If there is an active investigation here, it would be best to annotate this line with a comment in the source indicating such. From my digging, it does not appear there is an active investigation that relies on this logging, so we remove it.	2019-06-18 22:32:38 -04:00
Jason Tedor	fa09113080	Remove trace logging for ML native multi-node tests This trace logging looks like it was copy/pasted from another test, where the logging in that test was only added to investigate a test failure. This commit removes the trace logging.	2019-06-18 22:28:27 -04:00
Jason Tedor	b98574240d	Remove trace logging from ML datafeeds in tests This was added to investigate a test failure over two years ago, yet left behind. Since the test failure has been addressed since then, this commit removes the trace logging.	2019-06-18 22:24:36 -04:00
Igor Motov	9f7d1ff2de	Geo: Add coerce support to libs/geo WKT parser (#43273 ) Adds support for coercing not closed polygons and ignoring Z value to libs/geo WKT parser. Closes #43173	2019-06-18 14:41:01 -04:00
Mayya Sharipova	aa6248d4d7	Move dense_vector and sparse_vector to module (#43280 ) (#43333 )	2019-06-18 11:56:04 -04:00
Przemysław Witek	459d57f4c5	[7.x] [ML] BWC tests for job_stats.timing_stats field (#43267 ) (#43293 )	2019-06-18 15:32:34 +02:00
Alpar Torok	5a9c48369b	TestClusters: Convert the security plugin (#43242 ) * TestClusters: Convert the security plugin This PR moves security tests to use TestClusters. The TLS test required support in testclusters itself, so the correct wait condition is configgured based on the cluster settings. * PR review	2019-06-18 11:55:44 +03:00
Alpar Torok	94930d0e84	Testclusters: convert ml qa tests (#43229 ) * Testclusters: convert ml qa tests This PR converts the ML tests to use testclusters.	2019-06-18 11:55:11 +03:00
David Roberts	da97325790	[ML] Speed up persistent task rechecks in ML failover tests (#43291 ) The ML failover tests sometimes need to wait for jobs to be assigned to new nodes following a node failure. They wait 10 seconds for this to happen. However, if the node that failed was the master node and a new master was elected then this 10 seconds might not be long enough as a refresh of the memory stats will delay job assignment. Once the memory refresh completes the persistent task will be assigned when the next cluster state update occurs or after the periodic recheck interval, which defaults to 30 seconds. Rather than increase the length of the wait for assignment to 31 seconds, this change decreases the periodic recheck interval to 1 second. Fixes #43289	2019-06-18 09:19:20 +01:00
Nhat Nguyen	0c5086d2f3	Rebuild version map when opening internal engine (#43202 ) With this change, we will rebuild the live version map and local checkpoint using documents (including soft-deleted) from the safe commit when opening an internal engine. This allows us to safely prune away _id of all soft-deleted documents as the version map is always in-sync with the Lucene index. Relates #40741 Supersedes #42979	2019-06-17 18:08:09 -04:00
Benjamin Trent	365f87c622	[ML][Data Frame] only complete task after state persistence (#43230 ) (#43294 ) * [ML][Data Frame] only complete task after state persistence There is a race condition where the task could be completed, but there is still a pending document write. This change moves the task cancellation into the actionlistener of the state persistence. intermediate commit intermediate commit * removing unused import * removing unused const * refreshing internal index after waiting for task to complete * adjusting test data generation	2019-06-17 16:49:00 -05:00
Martijn Laarman	8b1b9f8ab9	Introduce stability description to the REST API specification (#38413 ) (#43278 ) * introduce state to the REST API specification * change state over to stability * CCR is no GA updated to stable * SQL is now GA so marked as stable * Introduce `internal` as state for API's, marks stable in terms of lifetime but unstable in terms of guarantees on its output format since it exposes internal representations * make setting a wrong stability value, or not setting it at all an error that causes the YAML test suite to fail * update spec files to be explicit about their stability state * Document the fact that stability needs to be defined Otherwise the YAML test runner will fail (with a nice exception message) * address check style violations * update rest spec unit tests to include stability * found one more test spec file not declaring stability, made sure stability appears after documentation everywhere * cluster.state is stable, mark response in some way to denote its a key value format that can be changed during minors * mark data frame API's as beta * remove internal and private as states for an API * removed the wrong enum values in the Stability Enum in the previous commit (cherry picked from commit 61c34bbd92f8f7e5f22fa411c6b682b0ebd8a99d)	2019-06-17 16:57:13 +02:00
Lee Hinman	21da84edbc	Make ILM force merging best effort (#43246 ) It's possible for force merges kicked off by ILM to silently stop (due to a node relocating for example). In which case, the segment count may not reach what the user configured. In the subsequent `SegmentCountStep` waiting for the expected segment count may wait indefinitely. Because of this, this commit makes force merges "best effort" and then changes the `SegmentCountStep` to simply report (at INFO level) if the merge was not successful. Relates to #42824 Resolves #43245	2019-06-17 08:45:22 -06:00
David Roberts	3effe264da	[ML] Fix problem with lost shards in distributed failure test (#43153 ) We were stopping a node in the cluster at a time when the replica shards of the .ml-state index might not have been created. This change moves the wait for green status to a point where the .ml-state index exists. Fixes #40546 Fixes #41742 Forward port of #43111	2019-06-17 09:28:56 +01:00
Przemysław Witek	b2613a123d	[7.x] Report exponential_avg_bucket_processing_time which gives more weight to recent buckets (#43189 ) (#43263 )	2019-06-17 08:58:26 +02:00
Alpar Torok	a191ebabba	TestClusters: convert kerberos-tests (#43232 ) Looks like cluster formation tasks no longer plays nice wit test.fixtures so we just convert this to use testclusters.	2019-06-17 09:28:04 +03:00
David Roberts	3928c624a3	[ML] Close sample stream in post_data endpoint (#43235 ) A static code analysis revealed that we are not closing the input stream in the post_data endpoint. This actually makes no difference in practice, as the particular InputStream implementation in this case is org.elasticsearch.common.bytes.BytesReferenceStreamInput and its close() method is a no-op. However, it is good practice to close the stream anyway.	2019-06-14 17:54:54 +01:00
Benjamin Trent	8c66149e2d	[ML][Data Frame] have sum map to a double to prevent overflows (#43213 ) (#43219 )	2019-06-14 10:43:36 -05:00
Marios Trivyzas	9cd89c3453	SQL: Increase hard limit for sorting on aggregates (#43220 ) To be consistent with the `search.max_buckets` default setting, set the hard limit of the PriorityQueue used for in memory sorting, when sorting on an aggregate function, to 10000. Fixes: #43168 (cherry picked from commit 079e012fdea68ea0a7daae078359495047e9c407)	2019-06-14 13:51:38 +02:00
Alpar Torok	cce5b0f018	Convert dataframes to use testclusters (#43032 )	2019-06-14 11:02:39 +03:00
Przemysław Witek	65a584b6fb	[7.x] Report timing stats as part of the Job stats response (#42709 ) (#43193 )	2019-06-14 09:03:14 +02:00
Marios Trivyzas	3c73602524	SQL: Fix wrong results when sorting on aggregate (#43154 ) - Previously, when shorting on an aggregate function the bucket processing ended early when the explicit (LIMIT XXX) or the impliciti limit of 512 was reached. As a consequence, only a set of grouping buckets was processed and the results returned didn't reflect the global ordering. - Previously, the priority queue shorting method had an inverse comparison check and the final response from the priority queue was also returned in the inversed order because of the calls to the `pop()` method. Fixes: #42851 (cherry picked from commit 19909edcfdf5792b38c1363b07379783ebd0e6c4)	2019-06-13 21:59:20 +02:00
Jason Tedor	5bc3b7f741	Enable node roles to be pluggable (#43175 ) This commit introduces the possibility for a plugin to introduce additional node roles.	2019-06-13 15:15:48 -04:00
Ryan Ernst	c3ce3f6891	Add native code info to ML info api (#43172 ) The machine learning feature of xpack has native binaries with a different commit id than the rest of code. It is currently exposed in the xpack info api. This commit adds that commit information to the ML info api, so that it may be removed from the info api.	2019-06-13 11:38:58 -07:00
Przemyslaw Gomulka	8f7cd84422	Disable x-pack:qa:kerberos-tests due to failures (#43208 ) relates #40678	2019-06-13 20:19:17 +02:00
Alpar Torok	4ba94a5051	Testclusters: convert ccr tests (#42313 )	2019-06-13 19:19:36 +03:00
Martijn van Groningen	c8e6474eef	Changes required for merging in 7.x branch.	2019-06-13 16:58:27 +02:00
Martijn van Groningen	1f3db7eb3e	Merge remote-tracking branch 'es/7.x' into enrich-7.x	2019-06-13 16:49:38 +02:00
David Roberts	43665183c2	[ML] Restrict detection of epoch timestamps in find_file_structure (#43188 ) Previously 10 digit numbers were considered candidates to be timestamps recorded as seconds since the epoch and 13 digit numbers as timestamps recorded as milliseconds since the epoch. However, this meant that we could detect these formats for numbers that would represent times far in the future. As an example ISBN numbers starting with 9 were detected as milliseconds since the epoch since they had 13 digits. This change tweaks the logic for detecting such timestamps to require that they begin with 1 or 2. This means that numbers that would represent times beyond about 2065 are no longer detected as epoch timestamps. (We can add 3 to the definition as we get closer to the cutoff date.)	2019-06-13 13:15:41 +01:00
Alpar Torok	167e51335d	Convert ILM tests to use testclusters (#43076 ) Also improove the error message when bin scripts are not found	2019-06-13 12:24:48 +03:00
Alpar Torok	eb7a8bb4a4	Testclusters: graph (#43033 ) Convert x-pack graph to use testClusters	2019-06-13 09:50:59 +03:00
Simon Willnauer	f70141c862	Only load FST off heap if we are actually using mmaps for the term dictionary (#43158 ) Given the significant performance impact that NIOFS has when term dicts are loaded off-heap this change enforces FstLoadMode#AUTO that loads term dicts off heap only if the underlying index input indicates a memory map. Relates to #43150	2019-06-13 07:54:02 +02:00
Yogesh Gaikwad	4ae1e30a98	Enable krb5kdc-fixture, kerberos tests mount urandom for kdc container (#41710 ) (#43178 ) Infra has fixed #10462 by installing `haveged` on CI workers. This commit enables the disabled fixture and tests, and mounts `/dev/urandom` for the container so there is enough entropy required for kdc. Note: hdfs-repository tests have been disabled, will raise a separate issue for it. Closes #40624 Closes #40678	2019-06-13 13:02:16 +10:00
Benjamin Trent	ec50d4d281	[ML][Data Frame] write a warning audit on bulk index failures (#43106 ) (#43171 ) * [ML][Data Frame] write a warning audit on bulk index failures * adding failure message and moving to use volalitile	2019-06-12 14:50:17 -05:00
Benjamin Trent	aff4795441	[ML][Data Frame] cleaning up tests since tasks are cancelled onfinish (#43136 ) (#43166 ) * [ML][Data Frame] cleaning up usage test since tasks are cancelled onfinish * Update DataFrameUsageIT.java * Fixing additional test, waiting for task to complete * removing unused import * unmuting test	2019-06-12 14:39:38 -05:00
Benjamin Trent	b110164bf4	[ML][Data Frame] add the src priv check for view_index_metadata (#43118 ) (#43161 )	2019-06-12 13:22:46 -05:00
Benjamin Trent	f13f55ede3	[ML][Data Frame] change failure count reset logic (#43064 ) (#43159 )	2019-06-12 13:22:34 -05:00
David Kyle	597ae5c7b8	[ML DataFrame] Reject Data Frame Ids containing upper case characters (#43145 )	2019-06-12 18:13:18 +01:00
James Baiera	9d56a0365f	Limit a enrich policy execution to only one at a time (#42535 ) Add a keyed lock mechanism to the policy executor to ensure that an enrich policy can only have one execution happening at a time.	2019-06-12 10:45:10 -04:00
Yannick Welsch	110f0c5b7e	Mute testDataFrameTransformCrud Relates to #43139	2019-06-12 14:12:01 +02:00
Dimitris Athanasiou	b28e006f7c	[ML] Lock down extraction method when possible (#43104 ) (#43140 )	2019-06-12 14:07:17 +03:00
Luca Cavanna	afeda1a7b9	Split search in two when made against throttled and non throttled searches (#42510 ) When a search on some indices takes a long time, it may cause problems to other indices that are being searched as part of the same search request and being written to as well, because their search context needs to stay open for a long time. This is especially a problem when searching against throttled and non-throttled indices as part of the same request. The problem can be generalized though: this may happen whenever read-only indices are searched together with indices that are being written to. Search contexts staying open for a long time is only an issue for indices that are being written to, in practice. This commit splits the search in two sub-searches: one for read-only indices, and one for ordinary indices. This way the two don't interfere with each other. The split is done only when size is greater than 0, no scroll is provided and query_then_fetch is used as search type. Otherwise, the search executes like before. Note that the returned num_reduce_phases reflect the number of reduction phases that were run. If the search is split in two, there are three reductions: one non-final for each search, and a final one that merges the results of the previous two. Closes #40900	2019-06-12 11:25:03 +02:00
Nhat Nguyen	5692be2161	Fix timing issue in CcrRetentionLeaseIT (#43054 ) In these tests, we sleep for a small multiple of the renew interval, then check that the retention leases are not changed. If a renewal request takes longer than that interval because of GC or slow CI, then the retention leases are not the same as before sleep. With this change, we relax to assert that we eventually stop the renewable process. Closes #39509	2019-06-11 18:03:16 -04:00
Benjamin Trent	7ff3d86cf0	[ML][Data Frame] adding dest.index and id validations (#43053 ) (#43109 ) * [ML][Data Frame] adding dest.index and id validations * adjusting message format * Adjusting id validity pattern * Update DataFrameStrings.java	2019-06-11 15:55:18 -05:00
Benjamin Trent	e384bf0276	[ML-DataFrame] stop task at completion of data frame function (#42955 ) (#43114 ) * stop data frame task after it finishes * test auto stop * adapt tests * persist the state correctly and move stop into listener * Calling `onStop` even if persistence fails, changing `stop` to rely on doSaveState	2019-06-11 15:55:02 -05:00
Ryan Ernst	172cd4dbfa	Remove description from xpack feature sets (#43065 ) The description field of xpack featuresets is optionally part of the xpack info api, when using the verbose flag. However, this information is unnecessary, as it is better left for documentation (and the existing descriptions describe anything meaningful). This commit removes the description field from feature sets.	2019-06-11 09:22:58 -07:00
David Roberts	d3136f99e6	[ML] Fix race condition when closing time checker (#43098 ) The tests for the ML TimeoutChecker rely on threads not being interrupted after the TimeoutChecker is closed. This change ensures this by making the close() and setTimeoutExceeded() methods synchronized so that the code inside them cannot execute simultaneously. Fixes #43097	2019-06-11 16:39:17 +01:00
Nhat Nguyen	5d3849215b	CCR should not replicate private/internal settings (#43067 ) With this change, CCR will not replicate internal or private settings to follower indices. Closes #41268	2019-06-11 06:59:09 -04:00
Martijn Laarman	cb7ce865b7	remove path from rest-api-spec (#41452 ) (#43084 ) (cherry picked from commit f5fde1d0843d2f0f53d3b9a15b9cfc8b94471ab7)	2019-06-11 12:52:36 +02:00
Ioannis Kakavas	1776d6e055	Refresh remote JWKs on all errors (#42850 ) It turns out that key rotation on the OP, can manifest as both a BadJWSException and a BadJOSEException in nimbus-jose-jwt. As such we cannot depend on matching only BadJWSExceptions to determine if we should poll the remote JWKs for an update. This has the side-effect that a remote JWKs source will be polled exactly one additional time too for errors that have to do with configuration, or for errors that might be caused by not synched clocks, forged JWTs, etc. ( These will throw a BadJWTException which extends BadJOSEException also )	2019-06-11 11:01:54 +03:00
Benjamin Trent	79052050bf	[ML] Adding support for geo_shape, geo_centroid, geo_point in datafeeds (#42969 ) (#43069 ) * [ML] Adding support for geo_shape, geo_centroid, geo_point in datafeeds * only supporting doc_values for geo_point fields * moving validation into GeoPointField ctor	2019-06-10 21:52:53 -05:00
Benjamin Trent	eadfe05587	[ML] Changes slice specification to auto. See #42996 (#43039 ) (#43070 )	2019-06-10 21:52:22 -05:00
Nhat Nguyen	53eb630700	Fix NPE in CcrRetentionLeaseIT (#43059 ) The retention leases stats is null if the processing shard copy is being closed. In this the case, we should check against null then retry to avoid failing a test. Closes #41237	2019-06-10 17:58:37 -04:00
Nhat Nguyen	f2e66e22eb	Increase waiting time when check retention locks (#42994 ) WriteActionsTests#testBulk and WriteActionsTests#testIndex sometimes fail with a pending retention lock. We might leak retention locks when switching to async recovery. However, it's more likely that ongoing recoveries prevent the retention lock from releasing. This change increases the waiting time when we check for no pending retention lock and also ensures no ongoing recovery in WriteActionsTests. Closes #41054	2019-06-10 17:58:37 -04:00
Nhat Nguyen	4191df6e1d	Unmute IndexFollowingIT#testFollowIndex Fixed in #41987	2019-06-10 17:58:37 -04:00
Benjamin Trent	1ddc4c8fc6	[ML][Data Frame] Removes slice specification from DBQ. See #42996 (#43036 ) (#43055 )	2019-06-10 13:40:55 -05:00
Dimitris Athanasiou	76a92b49a8	[ML] Get resources action should be lenient when sort field is unmapped (#42991 ) (#43046 ) Get resources action sorts on the resource id. When there are no resources at all, then it is possible the index does not contain a mapping for the resource id field. In that case, the search api fails by default. This commit adjusts the search request to ignore unmapped fields. Closes elastic/kibana#37870	2019-06-10 19:50:19 +03:00
David Roberts	bf5d56053a	[TEST] Adding a BWC test for ML categorization config (#42988 ) This test coverage was previously missing. Backport of #42981	2019-06-10 15:39:28 +01:00
Alan Woodward	8e23e4518a	Move construction of custom analyzers into AnalysisRegistry (#42940 ) Both TransportAnalyzeAction and CategorizationAnalyzer have logic to build custom analyzers for index-independent analysis. A lot of this code is duplicated, and it requires the AnalysisRegistry to expose a number of internal provider classes, as well as making some assumptions about when analysis components are constructed. This commit moves the build logic directly into AnalysisRegistry, reducing the registry's API surface considerably.	2019-06-10 14:33:25 +01:00
David Turner	68339f90e9	Mute AutodetectMemoryLimitIT#testTooManyPartitions Relates #43013	2019-06-10 09:20:36 +01:00
Andrei Stefan	036f9c4a55	SQL: cover the Integer type when extracting values from _source (#42859 ) * Take into consideration a wider range of Numbers when extracting the values from source, more specifically - BigInteger and BigDecimal. (cherry picked from commit 561b8d73dd7b03c50242e4e3f0128b2142959176)	2019-06-10 09:25:56 +03:00
Jason Tedor	63bad28005	Do not allow modify aliases on followers (#43017 ) Now that aliases are replicated by a follower from its leader, this commit prevents directly modifying aliases on follower indices.	2019-06-09 22:53:54 -04:00
Jason Tedor	915d2f2daa	Refactor put mapping request validation for reuse (#43005 ) This commit refactors put mapping request validation for reuse. The concrete case that we are after here is the ability to apply effectively the same framework to indices aliases requests. This commit refactors the put mapping request validation framework to allow for that.	2019-06-09 10:19:04 -04:00
Benjamin Trent	553c73b22d	[ML][Data Frame] allow null values for aggs with sparse data (#42966 ) (#42998 ) * [ML][Data Frame] allow null values for aggs with sparse data * Making classes static, memory allocation optimization	2019-06-07 15:43:06 -05:00
Benjamin Trent	755ba72896	[ML][Data frame] make sure that fields exist when creating progress (#42943 ) (#42984 )	2019-06-07 10:13:18 -05:00
jalvar08	b77be89c9a	Remove Comma in Example (#41873 ) The comma is there in error as there are no other parameter after 'value'	2019-06-07 08:39:27 -04:00
Henning Andersen	dea935ac31	Reindex max_docs parameter name (#42942 ) Previously, a reindex request had two different size specifications in the body: * Outer level, determining the maximum documents to process * Inside the source element, determining the scroll/batch size. The outer level size has now been renamed to max_docs to avoid confusion and clarify its semantics, with backwards compatibility and deprecation warnings for using size. Similarly, the size parameter has been renamed to max_docs for update/delete-by-query to keep the 3 interfaces consistent. Finally, all 3 endpoints now support max_docs in both body and URL. Relates #24344	2019-06-07 12:16:36 +02:00
Tim Vernum	090d42d3e6	Permit API Keys on Basic License (#42973 ) Kibana alerting is going to be built using API Keys, and should be permitted on a basic license. This commit moves API Keys (but not Tokens) to the Basic license Relates: elastic/kibana#36836 Backport of: #42787	2019-06-07 14:18:05 +10:00
Tim Brooks	667c613d9e	Remove `nonApplicationWrite` from `SSLDriver` (#42954 ) Currently, when the SSLEngine needs to produce handshake or close data, we must manually call the nonApplicationWrite method. However, this data is only required when something triggers the need (starting handshake, reading from the wire, initiating close, etc). As we have a dedicated outbound buffer, this data can be produced automatically. Additionally, with this refactoring, we combine handshake and application mode into a single mode. This is necessary as there are non-application messages that are sent post handshake in TLS 1.3. Finally, this commit modifies the SSLDriver tests to test against TLS 1.3.	2019-06-06 17:44:40 -04:00
henryptung	61b62125b8	Wire query cache into sorting nested-filter computation (#42906 ) Don't use Lucene's default query cache when filtering in sort. Closes #42813	2019-06-06 21:16:58 +02:00
Gordon Brown	e35b240a43	Fix hang in test for "too many fields" dep. check (#42909 ) This commit fixes a rare case where the method to randomly generate a very large mapping could enter an infinite loop.	2019-06-06 08:28:32 -06:00

... 14 15 16 17 18 ...

4824 Commits