OpenSearch

Commit Graph

Author	SHA1	Message	Date
Igor Motov	d2f9ccbe18	Geo: Refactor libs/geo parsers (#42549 ) Refactors the WKT and GeoJSON parsers from an utility class into an instantiatable objects. This is a preliminary step in preparation for moving out coordinate validators from Geometry constructors. This should allow us to make validators plugable.	2019-05-29 20:07:27 -04:00
David Kyle	c5a410f68b	[ML Data Frame] Set DF task state when stopping (#42516 ) Set the state to stopped prior to persisting	2019-05-29 16:39:44 +01:00
Hendrik Muhs	345ff21ae5	[ML-DataFrame] rewrite start and stop to answer with acknowledged (#42589 ) rewrite start and stop to answer with acknowledged fixes #42450	2019-05-29 11:14:32 +02:00
Hendrik Muhs	ace96a2b6e	check position before and after latch (#42623 ) check position before and after latch #fixes 42084	2019-05-28 21:23:29 +02:00
David Kyle	aea600fe7d	[Ml Data Frame] Return bad_request on preview when config is invalid (#42447 )	2019-05-28 15:36:50 +01:00
David Turner	746a2f41fd	Remove PRE_60_NODE_CHECKPOINT (#42531 ) This commit removes the obsolete `PRE_60_NODE_CHECKPOINT` constant for dealing with 5.x nodes' lack of sequence number support. Backport of #42527	2019-05-28 12:25:53 +01:00
Daniel Mitterdorfer	635ce0ca6d	Mute AsyncTwoPhaseIndexerTests#testStateMachine() (#42610 ) Relates #42084 Relates #42609	2019-05-28 10:20:22 +02:00
Nhat Nguyen	2077f9ffbc	Reset mock transport service in CcrRetentionLeaseIT (#42600 ) testRetentionLeaseIsAddedIfItDisappearsWhileFollowing does not reset the mock transport service after test. Surviving transport interceptors from that test can sneaky remove retention leases and make other tests fail. Closes #39331 Closes #39509 Closes #41428 Closes #41679 Closes #41737 Closes #41756	2019-05-27 21:51:25 -04:00
Armin Braun	a96606d962	Safer Wait for Snapshot Success in ClusterPrivilegeTests (#40943 ) (#42575 ) * Safer Wait for Snapshot Success in ClusterPrivilegeTests * The snapshot state returned by the API might become SUCCESS before it's fully removed from the cluster state. * We should fix this race in the transport API but it's not trivial and will be part of the incoming big round of refactoring the repository interaction, this added check fixes the test for now * closes #38030	2019-05-27 12:08:20 +02:00
Armin Braun	a5ca20a250	Some Cleanup in o.e.i.engine (#42278 ) (#42566 ) * Some Cleanup in o.e.i.engine * Remove dead code and parameters * Reduce visibility in some obvious spots * Add missing `assert`s (not that important here since the methods themselves will probably be dead-code eliminated) but still	2019-05-27 11:04:54 +02:00
Nhat Nguyen	85e60850af	Add debug log for retention leases (#42557 ) We need more information to understand why CcrRetentionLeaseIT is failing. This commit adds some debug log to retention leases and enables them in CcrRetentionLeaseIT.	2019-05-26 16:04:47 -04:00
Nhat Nguyen	d6e2f4a43e	Enable recoveries trace log in CcrRetentionLeaseIT Tracked #41679	2019-05-24 22:16:14 -04:00
Tanguy Leroux	6bec876682	Improve Close Index Response (#39687 ) This changes the `CloseIndexResponse` so that it reports closing result for each index. Shard failures or exception are also reported per index, and the global acknowledgment flag is computed from the index results only. The response looks like: ``` { "acknowledged" : true, "shards_acknowledged" : true, "indices" : { "docs" : { "closed" : true } } } ``` The response reports shard failures like: ``` { "acknowledged" : false, "shards_acknowledged" : false, "indices" : { "docs-1" : { "closed" : true }, "docs-2" : { "closed" : false, "shards" : { "1" : { "failures" : [ { "shard" : 1, "index" : "docs-2", "status" : "BAD_REQUEST", "reason" : { "type" : "index_closed_exception", "reason" : "closed", "index_uuid" : "JFmQwr_aSPiZbkAH_KEF7A", "index" : "docs-2" } } ] } } }, "docs-3" : { "closed" : true } } } ``` Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2019-05-24 21:57:55 -04:00
Julie Tibshirani	3a6c2525ca	Deprecate support for chained multi-fields. (#42330 ) This PR contains a straight backport of #41926, and also updates the migration documentation and deprecation info API for 7.x.	2019-05-24 15:55:06 -07:00
David Roberts	48dc0dca57	[ML] Use map and filter instead of flatMap in find_file_structure (#42534 ) Using map and filter avoids the garbage from all the Stream.of calls that flatMap necessitated. Performance is better when there are masses of fields.	2019-05-24 20:12:06 +01:00
David Roberts	34de68b007	[ML] Fix possible race condition when closing an opening job (#42506 ) This change fixes a race condition that would result in an in-memory data structure becoming out-of-sync with persistent tasks in cluster state. If repeated often enough this could result in it being impossible to open any ML jobs on the affected node, as the master node would think the node had capacity to open another job but the chosen node would error during the open sequence due to its in-memory data structure being full. The race could be triggered by opening a job and then closing it a tiny fraction of a second later. It is unlikely a user of the UI could open and close the job that fast, but a script or program calling the REST API could. The nasty thing is, from the externally observable states and stats everything would appear to be fine - the fast open then close sequence would appear to leave the job in the closed state. It's only later that the leftovers in the in-memory data structure might build up and cause a problem.	2019-05-24 20:11:58 +01:00
Hendrik Muhs	6d47ee9268	[ML-DataFrame] add support for fixed_interval, calendar_interval, remove interval (#42427 ) * add support for fixed_interval, calendar_interval, remove interval * adapt HLRC * checkstyle * add a hlrc to server test * adapt yml test * improve naming and doc * improve interface and add test code for hlrc to server * address review comments * repair merge conflict * fix date patterns * address review comments * remove assert for warning * improve exception message * use constants	2019-05-24 20:30:17 +02:00
Igor Motov	e28a9e99c4	SQL: Moves the JTS-based tests suppression to Before (#42526 ) Moves the test suppression from `ClassRule` to `Before`, where it is properly handled in the CI build. Fixes #42221	2019-05-24 13:58:53 -04:00
Hendrik Muhs	7cee294acf	[ML-DataFrame]backport dataframe changes from 42202, using client instead of transport (#42468 ) backport dataframe changes from #42202, using client instead of transport	2019-05-24 11:05:30 +02:00
David Roberts	f472186b9f	[ML] Improve file structure finder timestamp format determination (#41948 ) This change contains a major refactoring of the timestamp format determination code used by the ML find file structure endpoint. Previously timestamp format determination was done separately for each piece of text supplied to the timestamp format finder. This had the drawback that it was not possible to distinguish dd/MM and MM/dd in the case where both numbers were 12 or less. In order to do this sensibly it is best to look across all the available timestamps and see if one of the numbers is greater than 12 in any of them. This necessitates making the timestamp format finder an instantiable class that can accumulate evidence over time. Another problem with the previous approach was that it was only possible to override the timestamp format to one of a limited set of timestamp formats. There was no way out if a file to be analysed had a timestamp that was sane yet not in the supported set. This is now changed to allow any timestamp format that can be parsed by a combination of these Java date/time formats: yy, yyyy, M, MM, MMM, MMMM, d, dd, EEE, EEEE, H, HH, h, mm, ss, a, XX, XXX, zzz Additionally S letter groups (fractional seconds) are supported providing they occur after ss and separated from the ss by a dot, comma or colon. Spacing and punctuation is also permitted with the exception of the question mark, newline and carriage return characters, together with literal text enclosed in single quotes. The full list of changes/improvements in this refactor is: - Make TimestampFormatFinder an instantiable class - Overrides must be specified in Java date/time format - Joda format is no longer accepted - Joda timestamp formats in outputs are now derived from the determined or overridden Java timestamp formats, not stored separately - Functionality for determining the "best" timestamp format in a set of lines has been moved from TextLogFileStructureFinder to TimestampFormatFinder, taking advantage of the fact that TimestampFormatFinder is now an instantiable class with state - The functionality to quickly rule out some possible Grok patterns when looking for timestamp formats has been changed from using simple regular expressions to the much faster approach of using the Shift-And method of sub-string search, but using an "alphabet" consisting of just 1 (representing any digit) and 0 (representing non-digits) - Timestamp format overrides are now much more flexible - Timestamp format overrides that do not correspond to a built-in Grok pattern are mapped to a %{CUSTOM_TIMESTAMP} Grok pattern whose definition is included within the date processor in the ingest pipeline - Grok patterns that correspond to multiple Java date/time patterns are now handled better - the Grok pattern is accepted as matching broadly, and the required set of Java date/time patterns is built up considering all observed samples - As a result of the more flexible acceptance of Grok patterns, when looking for the "best" timestamp in a set of lines timestamps are considered different if they are preceded by a different sequence of punctuation characters (to prevent timestamps far into some lines being considered similar to timestamps near the beginning of other lines) - Out-of-the-box Grok patterns that are considered now include %{DATE} and %{DATESTAMP}, which have indeterminate day/month ordering - The order of day/month in formats with indeterminate day/month order is determined by considering all observed samples (plus the server locale if the observed samples still do not suggest an ordering) Relates #38086 Closes #35137 Closes #35132	2019-05-24 09:10:08 +01:00
Tim Vernum	567c0d331f	Fix settings prefix for realm truststore password (#42413 ) As part of #30241 realm settings were changed to be true affix settings. In the process of this change, the "ssl." prefix was lost from the realm truststore password. It should be: xpack.security.authc.realms.<type>.<name>.ssl.truststore.password Due to a mismatch between the way we define SSL settings and load SSL contexts, there was no way to define this legacy password setting in a realm config. The settings validation would reject "ssl.truststore.password" but the SSL service would ignore "truststore.password" Backport of: #42336	2019-05-24 13:16:26 +10:00
Ryan Ernst	a49bafc194	Split document and metadata fields in GetResult (#38373 ) (#42456 ) This commit makes creators of GetField split the fields into document fields and metadata fields. It is part of larger refactoring that aims to remove the calls to static methods of MapperService related to metadata fields, as discussed in #24422.	2019-05-23 14:01:07 -07:00
Costin Leau	a48125a9f7	Fix FROZEN indices backport	2019-05-23 21:30:41 +03:00
Costin Leau	9fdf4215dd	Docs: Documentation for the upcoming SQL support of frozen indices (#41863 ) (cherry picked from commit a3cc03eb1503df24c1706a721fcc9af38c3b2873) (cherry picked from commit f42dcf2ffd7bd25f3f91aa6127515f393cd1860f)	2019-05-23 21:16:16 +03:00
Costin Leau	d5f04d29c9	SQL: Add support for FROZEN indices (#41558 ) Allow querying of FROZEN indices both through dedicated SQL grammar extension: > SELECT field FROM FROZEN index and also through driver configuration parameter, namely: > index.include.frozen: true/false Fix #39390 Fix #39377 (cherry picked from commit 2445a933915f420c7f51e8505afa0a7978ce6b0f)	2019-05-23 21:16:16 +03:00
Zachary Tong	6d8a0e36ec	Re-mute all ml_datafeed_crud rolling upgrade tests AwaitsFix https://github.com/elastic/elasticsearch/issues/42258 Thought this was fixed, but throwing deprecation warnings at an unexpected time so putting this back on mute until we figure it out.	2019-05-23 09:50:27 -04:00
David Kyle	a23257ce06	[ML Data Frame] Account for completed data frames in test (#42351 ) When asserting on the checkpoint value if the DF has completed the checkpoint will be 1 else 0. Similarly state may be started or indexing. Closes #42309	2019-05-23 14:05:09 +01:00
Jim Ferenczi	b88e80ab89	Upgrade to Lucene 8.1.0 (#42214 ) This commit upgrades to the GA release of Lucene 8.1.0	2019-05-23 11:46:45 +02:00
Jim Ferenczi	4ca5649a0d	Upgrade to lucene 8.1.0-snapshot-e460356abe (#40952 )	2019-05-23 11:45:33 +02:00
Mengwei Ding	fa98cbe320	Add .code_internal-* index pattern to kibana user (#42247 ) (#42387 )	2019-05-22 20:25:45 -07:00
Luca Cavanna	c2af62455f	Cut over SearchResponse and SearchTemplateResponse to Writeable (#41855 ) Relates to #34389	2019-05-22 18:47:54 +02:00
Luca Cavanna	29c9bb9181	Clean up ShardId usage of Streamable (#41843 ) ShardId already implements Writeable so there is no need for it to implement Streamable too. Also the readShardId static method can be easily replaced with direct usages of the constructor that takes a StreamInput as argument.	2019-05-22 18:47:54 +02:00
Yannick Welsch	5d8605c790	Fix testAutoFollowManyIndices On a slow CI worker, the test was failing an assertion. Closes #41234	2019-05-22 17:33:34 +02:00
David Kyle	075cc7c5cf	[ML Data Frame] Persist data frame after state changes (#42347 )	2019-05-22 15:40:40 +01:00
David Kyle	f696769a39	Mute Data Frame integration tests Relates to https://github.com/elastic/elasticsearch/issues/42344	2019-05-22 15:03:13 +01:00
Simon Willnauer	a79cd77e5c	Remove IndexShard dependency from Repository (#42213 ) * Remove IndexShard dependency from Repository In order to simplify repository testing especially for BlobStoreRepository it's important to remove the dependency on IndexShard and reduce it to Store and MapperService (in the snapshot case). This significantly reduces the dependcy footprint for Repository and allows unittesting without starting nodes or instantiate entire shard instances. This change deprecates the old method signatures and adds a unittest for FileRepository to show the advantage of this change. In addition, the unittesting surfaced a bug where the internal file names that are private to the repository were used in the recovery stats instead of the target file names which makes it impossible to relate to the actual lucene files in the recovery stats. * don't delegate deprecated methods * apply comments * test	2019-05-22 14:27:11 +02:00
Alpar Torok	eb1639c5fc	TestClusters: Convert docs (#42100 ) * TestClusters: Convert docs	2019-05-22 14:44:08 +03:00
Ioannis Kakavas	aab97f1311	Fail early when rp.client_secret is missing in OIDC realm (#42256 ) rp.client_secret is a required secure setting. Make sure we fail with a SettingsException and a clear, actionable message when building the realm, if the setting is missing.	2019-05-22 13:20:41 +03:00
Ioannis Kakavas	ccdc0e6b3e	Merge claims from userinfo and ID Token correctly (#42277 ) Enhance the handling of merging the claims sets of the ID Token and the UserInfo response. JsonObject#merge would throw a runtime exception when attempting to merge two objects with the same key and different values. This could happen for an OP that returns different vales for the same claim in the ID Token and the UserInfo response ( Google does that for profile claim ). If a claim is contained in both sets, we attempt to merge the values if they are objects or arrays, otherwise the ID Token claim value takes presedence and overwrites the userinfo response.	2019-05-22 13:15:41 +03:00
Ioannis Kakavas	7af30345b4	Revert "mute failing filerealm hash caching tests (#42304 )" This reverts commit `39fbed1577`.	2019-05-22 13:15:00 +03:00
Dimitris Athanasiou	a6eb20ad35	[ML] Include node name when native controller cannot start process (#42225 ) (#42338 ) This adds the node name where we fail to start a process via the native controller to facilitate debugging as otherwise it might not be known to which node the job was allocated.	2019-05-22 12:42:04 +03:00
Yannick Welsch	770d8e9e39	Remove usage of max_local_storage_nodes in test infrastructure (#41652 ) Moves the test infrastructure away from using node.max_local_storage_nodes, allowing us in a follow-up PR to deprecate this setting in 7.x and to remove it in 8.0. This also changes the behavior of InternalTestCluster so that starting up nodes will not automatically reuse data folders of previously stopped nodes. If this behavior is desired, it needs to be explicitly done by passing the data path from the stopped node to the new node that is started.	2019-05-22 11:04:55 +02:00
Hendrik Muhs	ad24231c1a	[ML-DataFrame] validate group name to not contain invalid characters (#42292 ) disallows of creating groupBy field with '[', ']', '>' in the name to be consistent with aggregations	2019-05-22 09:39:59 +02:00
Hendrik Muhs	3493f3b637	move latch await to doNextSearch (#42275 ) move latch await to doNextSearch, fixes a race condition when the executor thread is faster than the coordinator thread fixes #42084	2019-05-22 09:39:59 +02:00
Ioannis Kakavas	34dda75cdf	Ensure SHA256 is not used in tests (#42289 ) SHA256 was recently added to the Hasher class in order to be used in the TokenService. A few tests were still using values() to get the available algorithms from the Enum and it could happen that SHA256 would be picked up by these. This change adds an extra convenience method (Hasher#getAvailableAlgoCacheHash) and enures that only this and Hasher#getAvailableAlgoStoredHash are used for getting the list of available password hashing algorithms in our tests.	2019-05-22 09:54:24 +03:00
Ioannis Kakavas	cdf9485e33	Allow Kibana user to use the OpenID Connect APIs (#42305 ) Add the manage_oidc privilege to the kibana user and to the role privileges list	2019-05-22 09:44:37 +03:00
Tim Vernum	c5f191f6af	Add cluster restart for security on basic (#42217 ) This performs a simple restart test to move a basic licensed cluster from no security (the default) to security & transport TLS enabled. Backport of: #41933	2019-05-22 14:27:45 +10:00
Ed Savage	685a206891	Merge branch '7.x' of github.com:elastic/elasticsearch into 7.x	2019-05-21 19:14:17 +01:00
David Kyle	7e4d3c695b	[ML Data Frame] Persist and restore checkpoint and position (#41942 ) Persist and restore Data frame's current checkpoint and position	2019-05-21 18:57:13 +01:00
Ed Savage	d97f4d5e28	[ML][TEST] Fix limits in AutodetectMemoryLimitIT (#42279 ) Re-enable muted tests and accommodate recent backend changes that result in higher memory usage being reported for a job at the start of its life-cycle	2019-05-21 18:44:47 +01:00
Tal Levy	39fbed1577	mute failing filerealm hash caching tests (#42304 ) some tests are failing after the introduction of #41792. relates #42267 and #42289.	2019-05-21 10:40:14 -07:00
Dimitris Athanasiou	a4e6fb4dd2	[ML] Fix logger declaration in ML plugins (#42222 ) (#42238 ) This corrects what appears to have been a copy-paste error where the logger for `MachineLearning` and `DataFrame` was wrongly set to be that of `XPackPlugin`.	2019-05-21 18:03:24 +03:00
David Kyle	0fd42ce1f5	[ML Data Frame] Start directly data frame rather than via the scheduler (#42224 ) Trigger indexer start directly to put the indexer in INDEXING state immediately	2019-05-21 15:48:45 +01:00
jimczi	0449869511	Fix unchecked warning in RollupIndexerIndexingTests#testSimpleDateHistoWithOverlappingDelay	2019-05-21 12:28:57 +02:00
David Kyle	ffefc66260	Mute failing AsyncTwoPhaseIndexerTests See https://github.com/elastic/elasticsearch/issues/42084	2019-05-21 10:24:46 +01:00
David Kyle	24144aead2	[ML] Complete the Data Frame task on stop (#41752 ) (#42063 ) Wait for indexer to stop then complete the persistent task on stop. If the wait_for_completion is true the request will not return until stopped.	2019-05-21 10:24:20 +01:00
Tim Vernum	7b3a9c7033	Do not refresh realm cache unless required (#42212 ) If there are no realms that depend on the native role mapping store, then changes should it should not perform any cache refresh. A refresh with an empty realm array will refresh all realms. This also fixes a spurious log warning that could occur if the role mapping store was notified that the security index was recovered before any realm were attached. Backport of: #42169	2019-05-21 18:14:22 +10:00
Jim Ferenczi	ec63160243	Fix max boundary for rollups job that use a delay (#42158 ) Rollup jobs can define how long they should wait before rolling up new documents. However if the delay is smaller or if it's not a multiple of the rollup interval the job can create incomplete buckets because the max boundary for a job is computed from the time when the job started rounded to the interval minus the delay. This change fixes this computation by applying the delay substraction before the rounding in order to ensure that we never create a boundary that falls in a middle of a bucket.	2019-05-21 08:48:53 +02:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (e.g. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
Alexander Reelsen	c72c76b5ea	Update to joda time 2.10.2 (#42199 )	2019-05-20 16:58:54 +02:00
Ioannis Kakavas	b4a413c4d0	Hash token values for storage (#41792 ) (#42220 ) This commit changes how access tokens and refresh tokens are stored in the tokens index. Access token values are now hashed before being stored in the id field of the `user_token` and before becoming part of the token document id. Refresh token values are hashed before being stored in the token field of the `refresh_token`. The tokens are hashed without a salt value since these are v4 UUID values that have enough entropy themselves. Both rainbow table attacks and offline brute force attacks are impractical. As a side effect of this change and in order to support multiple concurrent refreshes as introduced in #39631, upon refreshing an <access token, refresh token> pair, the superseding access token and refresh tokens values are stored in the superseded token doc, encrypted with a key that is derived from the superseded refresh token. As such, subsequent requests to refresh the same token in the predefined time window will return the same superseding access token and refresh token values, without hitting the tokens index (as this only stores hashes of the token values). AES in GCM mode is used for encrypting the token values and the key derivation from the superseded refresh token uses a small number of iterations as it needs to be quick. For backwards compatibility reasons, the new behavior is only enabled when all nodes in a cluster are in the required version so that old nodes can cope with the token values in a mixed cluster during a rolling upgrade.	2019-05-20 17:55:29 +03:00
Jay Modi	dbbdcea128	Update ciphers for TLSv1.3 and JDK11 if available (#42082 ) This commit updates the default ciphers and TLS protocols that are used when the runtime JDK supports them. New cipher support has been introduced in JDK 11 and 12 along with performance fixes for AES GCM. The ciphers are ordered with PFS ciphers being most preferred, then AEAD ciphers, and finally those with mainstream hardware support. When available stronger encryption is preferred for a given cipher. This is a backport of #41385 and #41808. There are known JDK bugs with TLSv1.3 that have been fixed in various versions. These are: 1. The JDK's bundled HttpsServer will endless loop under JDK11 and JDK 12.0 (Fixed in 12.0.1) based on the way the Apache HttpClient performs a close (half close). 2. In all versions of JDK 11 and 12, the HttpsServer will endless loop when certificates are not trusted or another handshake error occurs. An email has been sent to the openjdk security-dev list and #38646 is open to track this. 3. In JDK 11.0.2 and prior there is a race condition with session resumption that leads to handshake errors when multiple concurrent handshakes are going on between the same client and server. This bug does not appear when client authentication is in use. This is JDK-8213202, which was fixed in 11.0.3 and 12.0. 4. In JDK 11.0.2 and prior there is a bug where resumed TLS sessions do not retain peer certificate information. This is JDK-8212885. The way these issues are addressed is that the current java version is checked and used to determine the supported protocols for tests that provoke these issues.	2019-05-20 09:45:36 -04:00
Lisa Cawley	fd2d4d761b	[DOCS] Updates TLS configuration info (#41983 )	2019-05-20 09:13:37 -04:00
Ed Savage	840af87a74	[ML] Temporarily muting failing tests Muting a number of AutoDetectMemoryLimitIT tests to give CI a chance to settle before easing in required backend changes. relates elastic/ml-cpp#486 relates #42086	2019-05-19 08:29:50 -04:00
Ed Savage	a68b04e47b	[ML] Improve hard_limit audit message (#42086 ) Improve the hard_limit memory audit message by reporting how many bytes over the configured memory limit the job was at the point of the last allocation failure. Previously the model memory usage was reported, however this was inaccurate and hence of limited use - primarily because the total memory used by the model can decrease significantly after the models status is changed to hard_limit but before the model size stats are reported from autodetect to ES. While this PR contains the changes to the format of the hard_limit audit message it is dependent on modifications to the ml-cpp backend to send additional data fields in the model size stats message. These changes will follow in a subsequent PR. It is worth noting that this PR must be merged prior to the ml-cpp one, to keep CI tests happy.	2019-05-17 17:40:08 -04:00
Benjamin Trent	f2447364fd	[ML] adds geo_centroid aggregation support to data frames (#42088 ) (#42094 )	2019-05-17 16:51:05 -04:00
Igor Motov	076ca75ea5	SQL: Suppress geo tests failing on tr-TR locale (#42200 ) Due to a bug in JTS WKT parser, JTS cannot parse most of WKT shapes if the shape type is written in the lower case. For examples `point (1 2)` is causing JTS inside H2GIS to fail on tr-TR locale as a result of case-insensitive comparison.	2019-05-17 16:00:54 -04:00
David Roberts	226df35d96	[ML] Improve message misformation error in file structure finder (#42175 ) This change replaces the extremely unfriendly message "Number of messages analyzed must be positive" in the case where the sample lines were incorrectly grouped into just one message to an error that more helpfully explains the likely root cause of the problem.	2019-05-16 18:29:38 +01:00
Ryan Ernst	fa1d1d1f57	Deprecate the native realm migration tool (#42142 ) The migrate tool was added when the native realm was created, to aid users in converting from file realms that were per node, into the cluster managed native realm. While this tool was useful at the time, users should now be using the native realm directly. This commit deprecates the tool, to be removed in a followup for 8.0.	2019-05-16 09:52:31 -04:00
Marios Trivyzas	7473742e6e	SQL: Fix issue regarding INTERVAL * number (#42014 ) Interval * integer number is a valid operation which previously was only supported for foldables (literals) and not when a field was involved. That was because: 1. There was no common type returned for that combination 2. The `BinaryArithmeticOperation` was permitting the multiplication (called by fold()) but the BinaryArithmeticProcessor didn't allow it Moreover the error message for invalid arithmetic operations was wrong because of the issue with the overloading methods of `LoggerMessageFormat.format`. Fixes: #41239 Fixes: #41200 (cherry picked from commit 91039bab12d3ef27d6eac9cdc891a3b3ad0c694d)	2019-05-15 16:06:55 -04:00
Tim Vernum	9191b02213	Enforce transport TLS on Basic with Security (#42150 ) If a basic license enables security, then we should also enforce TLS on the transport interface. This was already the case for Standard/Gold/Platinum licenses. For Basic, security defaults to disabled, so some of the process around checking whether security is actuallY enabled is more complex now that we need to account for basic licenses.	2019-05-15 13:59:27 -04:00
Igor Motov	70ea3cf847	SQL: Add initial geo support (#42031 ) (#42135 ) Adds an initial limited implementations of geo features to SQL. This implementation is based on the [OpenGIS® Implementation Standard for Geographic information - Simple feature access](http://www.opengeospatial.org/standards/sfs), which is the current standard for GIS system implementation. This effort is concentrate on SQL option AKA ISO 19125-2. Queries that are supported as a result of this initial implementation Metadata commands - `DESCRIBE table` - returns the correct column types `GEOMETRY` for geo shapes and geo points. - `SHOW FUNCTIONS` - returns a list that includes supported `ST_` functions - `SYS TYPES` and `SYS COLUMNS` display correct types `GEO_SHAPE` and `GEO_POINT` for geo shapes and geo points accordingly. Returning geoshapes and geopoints from elasticsearch - `SELECT geom FROM table` - returns the geoshapes and geo_points as libs/geo objects in JDBC or as WKT strings in console. - `SELECT ST_AsWKT(geom) FROM table;` and `SELECT ST_AsText(geom) FROM table;`- returns the geoshapes ang geopoints in their WKT representation; Using geopoints to elasticsearch - The following functions will be supported for geopoints in queries, sorting and aggregations: `ST_GeomFromText`, `ST_X`, `ST_Y`, `ST_Z`, `ST_GeometryType`, and `ST_Distance`. In most cases when used in queries, sorting and aggregations, these function are translated into script. These functions can be used in the SELECT clause for both geopoints and geoshapes. - `SELECT * FROM table WHERE ST_Distance(ST_GeomFromText(POINT(1 2), point) < 10;` - returns all records for which `point` is located within 10m from the `POINT(1 2)`. In this case the WHERE clause is translated into a range query. Limitations: Geoshapes cannot be used in queries, sorting and aggregations as part of this initial effort. In order to fully take advantage of geoshapes we would need to have access to geoshape doc values, which is coming in #37206. `ST_Z` cannot be used on geopoints in queries, sorting and aggregations since we don't store altitude in geo_point doc values. Relates to #29872 Backport of #42031	2019-05-14 18:57:12 -05:00
Gordon Brown	a85189a558	Remove toStepKeys from LifecycleAction (#41775 ) The `toStepKeys()` method was only called in its own test case. The real list of StepKeys that's used in action execution is generated from the list of actual step objects returned by `toSteps()`. This commit removes that method.	2019-05-10 16:06:42 -06:00
Benjamin Trent	febee07dcc	[ML] adding pivot.max_search_page_size option for setting paging size (#41920 ) (#42079 ) * [ML] adding pivot.size option for setting paging size * Changing field name to address PR comments * fixing ctor usage * adjust hlrc for field name change	2019-05-10 13:22:31 -05:00
Benjamin Trent	0931815355	[ML] properly nesting objects in document source (#41901 ) (#42077 ) * [ML] properly nesting objects in document source * Throw exception on agg extraction failure, cause it to fail df * throwing error to stop df if unsupported agg is found	2019-05-10 13:22:12 -05:00
Tal Levy	5640197632	Refactor TransportSingleShardAction to serialize Writeable responses (#41985 ) (#42040 ) Previously, TransportSingleShardAction required constructing a new empty response object. This response object's Streamable readFrom was used. As part of the migration to Writeable, the interface here was updated to leverage Writeable.Reader. relates to #34389.	2019-05-09 22:08:31 -07:00
Benjamin Trent	b23b06dded	[ML] verify that there are no duplicate leaf fields in aggs (#41895 ) (#42025 ) * [ML] verify that there are no duplicate leaf fields in aggs * addressing pr comments * addressing PR comments * optmizing duplication check	2019-05-09 14:29:10 -05:00
Benjamin Trent	bf5a40c754	[ML] relax set upgrade mode test to match what is guaranteed (#41958 ) (#41979 ) * [ML] relax set upgrade mode test to match what is guaranteed * removing unused import	2019-05-09 14:28:50 -05:00
Lisa Cawley	9284a70ec8	[DOCS] Updates security configuration overview (#41982 )	2019-05-09 07:50:51 -07:00
David Kyle	c0d67919c8	Mute ApiKeyIntegTests See https://github.com/elastic/elasticsearch/issues/41747	2019-05-09 13:24:52 +01:00
Zachary Tong	f410f91f13	Cleanup RollupSearch exceptions, disallow partial results (#41272 ) - msearch exceptions should be thrown directly instead of wrapping in a RuntimeException - Do not allow partial results (where some indices are missing), instead throw an exception if any index is missing	2019-05-08 12:38:42 -04:00
David Kyle	ba9d2ccc1f	[ML Data Frame] Set executing nodes in task actions (#41798 ) Direct the task request to the node executing the task and also refactor the task responses so all errors are returned and set the HTTP status code based on presence of errors.	2019-05-08 12:25:36 +01:00
Alpar Torok	711ace0533	Testclusters: support for security and convert example plugins (#41864 ) testclusters detect from settings that security is enabled if a user is not specified using the DSL introduced in this PR, a default one is created the appropriate wait conditions are used authenticating with the first user defined in the DSL ( or the default user ). an example DSL to create a user is user username:"test_user" password:"x-pack-test-password" role: "superuser" all keys are optional and default to the values shown in this example	2019-05-08 14:04:00 +03:00
Jason Tedor	8bea3c3a58	Enable trace logging in CCR retention lease tests These tests are failing somewhat mysteriously, indicating that when we renew retention leaess during a restore that our retention leases that we added before starting the restore suddenly do not exist. To make sense of this, this commit enables trace logging.	2019-05-07 22:44:55 -04:00
Marios Trivyzas	d5b0badeb7	SQL: Remove CircuitBreaker from parser (#41835 ) The CircuitBreaker was introduced as means of preventing a `StackOverflowException` during the build of the AST by the parser. The ANTLR4 grammar causes a weird behaviour for a Parser Listener. The `enterEveryRule()` method is often called with a different parsing context than the respective `exitEveryRule()`. This makes it difficult to keep track of the tree's depth, and a custom Map was used as an attempt of matching the contextes as they are encounter during `enter` and during `exit` of the rules. This approach had 2 important drawbacks: 1. It's hard to maintain this custom Map as the grammar changes. 2. The CircuitBreaker could often lead to false positives which caused valid queries to return an Exception and prevent them from executing. So, this removes completely the CircuitBreaker which is replaced be a simple handling of the `StackOverflowException` Fixes: #41471 (cherry picked from commit 1559a8e2dbd729138b52e89b7e80264c9f4ad1e7)	2019-05-07 23:25:37 +03:00
Ioannis Kakavas	58041f3fdb	Remove op.name configuration setting (#41445 ) This setting was not eventually used in the realm and thus can be removed	2019-05-07 19:01:55 +03:00
Jason Tedor	d7fd51a84e	Provide names for all artifact repositories (#41857 ) This commit adds a name for each Maven and Ivy repository used in the build.	2019-05-07 06:35:28 -04:00
Tim Vernum	3508b6c641	Log warning when unlicensed realms are skipped (#41828 ) Because realms are configured at node startup, but license levels can change dynamically, it is possible to have a running node that has a particular realm type configured, but that realm is not permitted under the current license. In this case the realm is silently ignored during authentication. This commit adds a warning in the elasticsearch logs if authentication fails, and there are realms that have been skipped due to licensing. This message is not intended to imply that the realms could (or would) have successfully authenticated the user, but they may help reduce confusion about why authentication failed if the caller was expecting the authentication to be handled by a particular realm that is in fact unlicensed. Backport of: #41778	2019-05-07 09:55:48 +10:00
Ryan Ernst	6fd8924c5a	Switch run task to use real distro (#41590 ) The run task is supposed to run elasticsearch with the given plugin or module. However, for modules, this is most realistic if using the full distribution. This commit changes the run setup to use the default or oss as appropriate.	2019-05-06 12:34:07 -07:00
Benjamin Trent	50fc27e9a0	[ML] addresses preview bug, and adds check to PUT (#41803 ) (#41850 )	2019-05-06 10:56:26 -05:00
Tim Brooks	927013426a	Read multiple TLS packets in one read call (#41820 ) This is related to #27260. Currently we have a single read buffer that is no larger than a single TLS packet. This prevents us from reading multiple TLS packets in a single socket read call. This commit modifies our TLS work to support reading similar to the plaintext case. The data will be copied to a (potentially) recycled TLS packet-sized buffer for interaction with the SSLEngine.	2019-05-06 09:51:32 -06:00
Daniel Mitterdorfer	8580053818	Mute PermissionsIT#testWhen[...]ByILMPolicy (#41859 ) Relates #41440 Relates #41858	2019-05-06 16:15:37 +02:00
Hendrik Muhs	d54a921032	remove unused import	2019-05-06 10:14:35 +02:00
Hendrik Muhs	0c03707704	[ML-DataFrame] reset/clear the position after indexer is done (#41736 ) reset/clear the position after indexer is done	2019-05-06 09:41:51 +02:00
Tim Vernum	ee84038699	Update security acknowledgement messages for basic (#41825 ) When applying a license update, we provide "acknowledgement messages" that indicate which features will be affected by the change in license. This commit updates the messages that are provided when installing a basic license, so that they reflect the changes made to the security features that are included in that license type. Backport of: #41776	2019-05-06 16:40:38 +10:00
Hicham Mallah	4a88da70c5	Add index name to cluster block exception (#41489 ) Updates the error message to reveal the index name that is causing it. Closes #40870	2019-05-04 19:11:59 -04:00
Jason Tedor	f4da98ca3d	Use a proper repository for ml-cpp artifacts (#41817 ) This switches the strategy used to download machine learning artifacts from a manual download through S3 to using an Ivy repository on top of S3. This gives us all the benefits of Gradle dependency resolution including local caching.	2019-05-04 12:44:19 -04:00
Benjamin Trent	b69e28177b	[ML] rewriting stats gathering to use callbacks instead of a latch (#41793 ) (#41804 )	2019-05-03 18:18:27 -05:00
Nhat Nguyen	c7924014fa	Verify consistency of version and source in disruption tests (#41614 ) (#41661 ) With this change, we will verify the consistency of version and source (besides id, seq_no, and term) of live documents between shard copies at the end of disruption tests.	2019-05-03 18:47:14 -04:00
Jason Tedor	241c4ef97a	Use https for artifact locations This commit switches to using https for some artifact locations.	2019-05-03 16:15:48 -04:00

1 2 3 4 5 ...

3245 Commits