OpenSearch

Commit Graph

Author	SHA1	Message	Date
Andrei Stefan	fbba65d8b3	SQL: SubSelect unresolved bugfix (#55956 ) (#56055 ) * Resolve the missing refs only after the aggregate tree is resolved (cherry picked from commit 10167b1cf2df6b074a1ba0c8e73c261ff9e9d1db)	2020-05-01 07:48:11 +03:00
Ryan Ernst	52b9d8d15e	Convert remaining license methods to isAllowed (#55908 ) (#55991 ) This commit converts the remaining isXXXAllowed methods to instead of use isAllowed with a Feature value. There are a couple other methods that are static, as well as some licensed features that check the license directly, but those will be dealt with in other followups.	2020-04-30 15:52:22 -07:00
Igor Motov	d8f9df771d	Expose agg usage in Feature Usage API (#55732 ) (#56048 ) Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746	2020-04-30 12:53:36 -04:00
Przemko Robakowski	797f63e743	[7.x] Emit deprecation warning if multiple v1 templates match with a new index (#55558 ) (#56038 ) * Emit deprecation warning if multiple v1 templates match with a new index (#55558) * Emit deprecation warning if multiple v1 templates match with a new index * DEPRECATION_LOGGER rename	2020-04-30 17:36:17 +02:00
Luca Cavanna	fc6422ffcc	Consolidate DelayableWriteable (#55932 ) This commit includes a number of minor improvements around `DelayableWriteable`: javadocs were expanded and reworded, `get` was renamed to `expand` and `DelayableWriteable` no longer implements `Supplier`. Also a couple of methods are now private instead of package private.	2020-04-30 17:16:58 +02:00
Benjamin Trent	c36bcb4dd0	[ML] fixing file structure finder multiline merge max for delimited formats (#56023 ) (#56035 ) This commit correctly sets the maxLinesPerRow in the CsvPreference for delimited files given the file structure finder settings. Previously, it was silently ignored.	2020-04-30 10:51:32 -04:00
Benjamin Trent	04b1f6498b	[ML] using new fixed interval in ml tests (#56021 ) (#56031 ) This commit removes deprecated references to DateHistogram.interval from ml tests	2020-04-30 10:26:39 -04:00
Dimitris Athanasiou	17b904def5	[7.x][ML] Decouple DFA progress testing from analyses phases (#55925 ) (#56024 ) This refactors native integ tests to assert progress without expecting explicit phases for analyses. We can test those with yaml tests in a single place. Backport of #55925	2020-04-30 17:05:47 +03:00
William Brafford	273ff6a105	Make xpack.ilm.enabled setting a no-op (#55592 ) (#55980 ) * Make xpack.ilm.enabled setting a no-op * Add watcher setting to not use ILM * Update documentation for no-op setting * Remove NO_ILM ml index templates * Remove unneeded setting from test setup * Inline variable definitions for ML templates * Use identical parameter names in templates * New ILM/watcher setting falls back to old setting * Add fallback unit test for watcher/ilm setting	2020-04-30 09:50:18 -04:00
David Kyle	c204353249	[ML] Wait for model loaded and cached in ModelLoadingServiceTests (#56014 ) Fixes test by exposing the method ModelLoadingService::addModelLoadedListener() so that the test class can be notified when a model is loaded which happens in a background thread	2020-04-30 13:32:07 +01:00
Yang Wang	317d9fb88f	Remove synthetic role names of API keys as they confuse users (#56005 ) (#56011 ) Synthetic role names of API keys add confusion to users. This happens to API responses as well as audit logs. The PR removes them for clarity.	2020-04-30 21:32:55 +10:00
Hendrik Muhs	d3bcef2962	[7.x][Transform] implement throttling in indexer (#55011 ) (#56002 ) implement throttling in async-indexer used by rollup and transform. The added docs_per_second parameter is used to calculate a delay before the next search request is send. With re-throttle its possible to change the parameter at runtime. When stopping a running job, its ensured that despite throttling the indexer stops in reasonable time. This change contains the groundwork, but does not expose the new functionality. relates #54862 backport: #55011	2020-04-30 11:20:35 +02:00
Ioannis Kakavas	3c7c9573b4	Fix PemKeyConfigTests (#55577 ) (#55996 ) We were creating PemKeyConfig objects using different private keys but always using testnode.crt certificate that uses the RSA public key. The PemKeyConfig was built but we would then later fail to handle SSL connections during the TLS handshake eitherway. This became obvious in FIPS tests where the consistency checks that FIPS 140 mandates kick in and failed early becausethe private key was of different type than the public key	2020-04-30 12:05:27 +03:00
Yang Wang	84a2f1adf2	Resolve anonymous roles and deduplicate roles during authentication (#53453 ) (#55995 ) Anonymous roles resolution and user role deduplication are now performed during authentication instead of authorization. The change ensures: * If anonymous access is enabled, user will be able to see the anonymous roles added in the roles field in the /_security/_authenticate response. * Any duplication in user roles are removed and will not show in the above authenticate response. * In any other case, the response is unchanged. It also introduces a behaviour change: the anonymous role resolution is now authentication node specific, previously it was authorization node specific. Details can be found at #47195 (comment)	2020-04-30 17:34:14 +10:00
Lisa Cawley	006e00ed0a	[DOCS] Adds documentation for secondary authorization headers (#55365 ) (#55986 )	2020-04-29 16:29:38 -07:00
Lisa Cawley	5100fd7eb2	[DOCS] Add token based authn documentation (#55957 )	2020-04-29 14:47:02 -07:00
Christos Soulios	43dab77186	[7.x] Modified searchAndReduce() to return empty agg when no docs exist (#55967 ) Backports #55826 to 7.x Modified AggregatorTestCase.searchAndReduce() method so that it returns an empty aggregation result when no documents have been inserted. Also refactored several aggregation tests so they do not re-implement method AggregatorTestCase.testCase() Fixes #55824	2020-04-30 00:28:32 +03:00
jimczi	86ee8974d0	Revert "Mute failing tests in AsyncSearchActionIT" This reverts commit `2fe4801ca1`.	2020-04-29 22:22:21 +02:00
Mark Vieira	2fe4801ca1	Mute failing tests in AsyncSearchActionIT	2020-04-29 10:59:10 -07:00
Dimitris Athanasiou	c5aa281171	[7.x][ML] Remove error on parsing progress for unknown phase in DFA (#55926 ) (#55954 ) On second thought, this check does not seem to be adding value. We can test that the phases are as we expect them for each analysis by adding yaml tests. Those would fail if we introduce new phases from c++ accidentally or without coordination. This would achieve the same thing. At the same time we would not have to comment out this code each time a new phase is introduced. Instead we can just temporarily mute those yaml tests. Note I will add those tests right after the imminent new phases are added to the c++ side. Backport of #55926	2020-04-29 20:11:33 +03:00
Benjamin Trent	edd049f9cd	[ML] Allow a certain number of ill-formatted rows when delimited format is specified (#55735 ) (#55944 ) While it is good to not be lenient when attempting to guess the file format, it is frustrating to users when they KNOW it is CSV but there are a few ill-formatted rows in the file (via some entry error, etc.). This commit allows for up to 10% of sample rows to be considered "bad". These rows are effectively ignored while guessing the format. This percentage of "allows bad rows" is only applied when the user has specified delimited formatting options. As the structure finder needs some guidance on what a "bad row" actually means. related to https://github.com/elastic/elasticsearch/issues/38890	2020-04-29 11:15:21 -04:00
Jim Ferenczi	293c81dd59	Fix AsyncSearchActionIT#testTermsAggregation (#55924 ) This commit fixes the initialization of total hits in the async search response. Relates #55683 Closes #55920	2020-04-29 15:44:10 +02:00
Jake Landis	ae4d980c8c	[7.x] json spec - add description for autoscaling (#55748 ) (#55901 )	2020-04-29 08:40:11 -05:00
Andrei Dan	6a0e1e161b	ILM stop step execution if writeIndex is false (#54805 ) (#55923 ) (cherry picked from commit 47a9fd760f7bf2cc6cd778485dc057b6aaf07709) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-29 13:39:37 +01:00
Christos Soulios	02bf0c586a	[7.x] Histogram field type support for Sum aggregation (#55916 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285 Backports #55681 to 7.x	2020-04-29 15:06:12 +03:00
David Roberts	6ad497bfda	Muting AsyncSearchActionIT.testTermsAggregation Due to https://github.com/elastic/elasticsearch/issues/55920	2020-04-29 12:34:47 +01:00
Dimitris Athanasiou	d9685a0f19	[7.x][ML] Validate at least one feature is available for DF analytics (#55876 ) (#55914 ) We were previously checking at least one supported field existed when the _explain API was called. However, in the case of analyses with required fields (e.g. regression) we were not accounting that the dependent variable is not a feature and thus if the source index only contains the dependent variable field there are no features to train a model on. This commit adds a validation that at least one feature is available for analysis. Note that we also move that validation away from `ExtractedFieldsDetector` and the _explain API and straight into the _start API. The reason for doing this is to allow the user to use the _explain API in order to understand why they would be seeing an error like this one. For example, the user might be using an index that has fields but they are of unsupported types. If they start the job and get an error that there are no features, they will wonder why that is. Calling the _explain API will show them that all their fields are unsupported. If the _explain API was failing instead, there would be no way for the user to understand why all those fields are ignored. Closes #55593 Backport of #55876	2020-04-29 11:39:58 +03:00
David Roberts	61ac09ae21	[ML] Add daily_model_snapshot_retention_after_days to job config (#55891 ) This change adds a new setting, daily_model_snapshot_retention_after_days, to the anomaly detection job config. Initially this has no effect, the effect will be added in a followup PR. This PR gets the complexities of making changes that interact with BWC over well before feature freeze. Backport of #55878	2020-04-29 09:12:53 +01:00
Nik Everett	a5d0409a8f	Save memory in on aggs in async search (#55683 ) (#55879 ) This replaces a reference to the result of partially reducing aggregations that async search keeps with a reference to the serialized form of the result of the partial reduction which we need to keep anyway.	2020-04-28 16:23:30 -04:00
Larry Gregory	47d252424b	Backport: Deprecate the kibana reserved user (#54967 ) (#55822 )	2020-04-28 10:30:25 -04:00
Christos Soulios	fae9ec13dd	Removed ValuesSourceRegistry.registerAny() (#55846 ) * Backports #55747 to 7.x * All ValuesSourceTypes must be registered explicitly * Removed lambdas in ValuesSourceRegistry	2020-04-28 15:44:42 +03:00
Adrien Grand	58c3bb5ae1	Repurpose `ignore_throttled` to be only about frozen indices. (#55047 ) (#55852 ) This has no practical impact on users since frozen indices are the only throttled indices today. However this has an impact on upcoming features that would use search throttling. Filtering out throttled indices made sense a couple years ago, but as we're now improving support for slow requests with `_async_search` and exploring ways to reduce storage costs, this feature has most likely become a trap, that we'd like to not have with upcoming features that would use search throttling. Relates #54058	2020-04-28 14:31:54 +02:00
David Turner	3f2d10d8fc	Permit searches to be concurrent to prewarming (#55795 ) Today when prewarming a searchable snapshot we use the `SparseFileTracker` to lock each (part of a) snapshotted blob, blocking any other readers from accessing this data until the whole part is available. This commit changes this strategy: instead we optimistically start to download the blob without any locking, and then lock much smaller ranges after each individual `read()` call. This may mean that some bytes are downloaded twice, but reduces the time that other readers may need to wait before the data they need is available. As a best-effort optimisation we try to request the smallest possible single range of missing bytes in the part by first checking how many of the initial and terminal bytes of the part are already present in cache. In particular if the part is already fully cached before prewarming then this check means we skip the part entirely.	2020-04-28 10:44:05 +01:00
Tim Brooks	80662f31a1	Introduce mechanism to stub request handling (#55832 ) Currently there is a clear mechanism to stub sending a request through the transport. However, this is limited to testing exceptions on the sender side. This commit reworks our transport related testing infrastructure to allow stubbing request handling on the receiving side.	2020-04-27 16:57:15 -06:00
Tal Levy	6ba5148ead	Add geo_shape support for the geo_centroid aggregation (#55602 ) (#55819 ) this commit leverages the new geo_shape doc values to register a new geo_centroid aggregator that works on geo_shape field.	2020-04-27 12:16:10 -07:00
Ioannis Kakavas	ca5d677130	Mute-55816 (#55818 ) See #55816	2020-04-27 21:26:02 +03:00
Hendrik Muhs	4b93f17b24	[Transform] improve TransformRestTestCase robustness (#55786 ) handles/retries temporary SearchPhaseExecutionErrors fixes #54810	2020-04-27 17:17:53 +02:00
Jake Landis	6f392cf5b9	[7.x] json spec - add description for searchable snapshots (#55746 ) (#55809 )	2020-04-27 10:08:09 -05:00
Mark Tozzi	22a98ec279	Aggregation support for Value Scripts that change types (#54830 ) (#55752 )	2020-04-27 09:57:05 -04:00
Dimitris Athanasiou	abab4c4d4f	[7.x][ML] Do not fail DFA task when it's stopped whilst reindexing (#55797 ) (#55800 ) Adding to #55659, we missed another way we could set the task to failed due to task cancellation. CI revealed that we might also get a `SearchPhaseExecutionException` whose cause is a `TaskCancelledException`. That exception is not wrapped so unwrapping it will not return the underlying `TaskCancelledException`. Thus to be complete in catching this, we also need to check the error's cause. Closes #55068 Backport of #55797	2020-04-27 16:03:57 +03:00
Dimitris Athanasiou	7f100c1196	[7.x][ML] Allow analytics process define its own progress phases (#55763 ) (#55791 ) This is a continuation from #55580. Now that we're parsing phase progresses from the analytics process we change `ProgressTracker` to allow for custom phases between the `loading_data` and `writing_results` phases. Each `DataFrameAnalysis` may declare its own phases. This commit sets things in place for the analytics process to start reporting different phases per analysis type. However, this is still preserving existing behaviour as all analyses currently declare a single `analyzing` phase. Backport of #55763	2020-04-27 13:30:05 +03:00
Ioannis Kakavas	d56f25acb4	Validate hashing algorithm in users tool (#55628 ) (#55734 ) This change adds validation when running the users tool so that if Elasticsearch is expected to run in a JVM that is configured to be in FIPS 140 mode and the password hashing algorithm is not compliant, we would throw an error. Users tool uses the configuration from the node and this validation would also happen upon node startup but users might be added in the file realm before the node is started and we would have the opportunity to notify the user of this misconfiguration. The changes in #55544 make this much less probable to happen in 8 since the default algorithm will be compliant but this change can act as a fallback in anycase and makes for a better user experience.	2020-04-27 12:23:41 +03:00
Ioannis Kakavas	38b55f06ba	Fix concurrent refresh of tokens (#55114 ) (#55733 ) Our handling for concurrent refresh of access tokens suffered from a race condition where: 1. Thread A has just finished with updating the existing token document, but hasn't stored the new tokens in a new document yet 2. Thread B attempts to refresh the same token and since the original token document is marked as refreshed, it decrypts and gets the new access token and refresh token and returns that to the caller of the API. 3. The caller attempts to use the newly refreshed access token immediately and gets an authentication error since thread A still hasn't finished writing the document. This commit changes the behavior so that Thread B, would first try to do a Get request for the token document where it expects that the access token it decrypted is stored(with exponential backoff ) and will not respond until it can verify that it reads it in the tokens index. That ensures that we only ever return tokens in a response if they are already valid and can be used immediately It also adjusts TokenAuthIntegTests to test authenticating with the tokens each thread receives, which would fail without the fix. Resolves: #54289	2020-04-27 12:23:17 +03:00
David Roberts	3ba44a5af8	[ML] Adding failed_category_count to model_size_stats (#55761 ) The failed_category_count statistic records the number of times categorization wanted to create a new category but couldn't because the job had reached its model_memory_limit. Backport of #55716	2020-04-25 10:36:49 +01:00
Aleksandr Maus	ad54cca823	EQL: implement math functions: add, divide, module, multiply, subtract (#55137 ) (#55737 ) * EQL: implement math functions: add, divide, module, multiply, subtract	2020-04-24 15:52:27 -04:00
James Rodewig	c1b0548db0	[DOCS] Document EQL search REST API (#52384 )	2020-04-24 15:36:01 -04:00
Nick Knize	b0e8a8a4d1	[Backport] Refactor Spatial Field Mappers (#55696 ) This commit refactors all spatial Field Mappers to a common AbstractGeometryFieldMapper that implements shared parameter functionality (e.g., ignore_malformed, ignore_z_value) and provides a common framework for overriding type parsing, and building in xpack. Common shape functionality is implemented in a new AbstractShapeGeometryFieldMapper that is reused and overridden in GeoShapeFieldMapper, GeoShapeFieldMapperWithDocValues, LegacyGeoShapeFieldMapper, and ShapeFieldMapper. This abstraction provides a reusable foundation for adding new xpack features; such as coordinate reference system support.	2020-04-24 14:05:16 -05:00
Mark Tozzi	87b4979c24	[7.x] Make ValuesSourceRegistry immutable after initilization #55493 (#55697 )	2020-04-24 13:33:38 -04:00
Jason Tedor	22a8b60187	Reduce code duplication in CCR non-compliance tests This commit removes some code duplication in the CCR non-compliance tests by refactoring an assertion method so that it can be used in both tests that are present there.	2020-04-24 13:24:56 -04:00
Tanguy Leroux	41ddbd4188	Allow to prewarm the cache for searchable snapshot shards (#55322 ) Relates #50999	2020-04-24 18:03:34 +02:00
Dimitris Athanasiou	210b7f1b76	[7.x][ML] Remove parsing of old progress format in DF Analytics (#55711 ) (#55720 ) Since #55580 we've introduced a new format for parsing progress from the data frame analytics process. As the process is now writing out progress in this new way, we can remove the parsing of the old format. Backport of #55711	2020-04-24 16:50:56 +03:00
David Turner	aa9a2bce37	Avoid accidental contiguous read (#55713 ) If we choose to read from two random positions that are 1024 bytes apart then this counts as a contiguous read for stats purposes, failing this test. This commit ensures that we always perform a non-contiguous read.	2020-04-24 11:50:31 +01:00
David Turner	de30550aea	Relax elapsed time stats assertion (#55710 ) `SearchableSnapshotDirectoryStatsTests#testCachedBytesReadsAndWrites` asserts that each write takes one clock tick, but we now permit concurrent reads and writes so each write might take longer. This commit relaxes the assertion to match. Closes #55707	2020-04-24 10:21:08 +01:00
Przemysław Witek	c89917c799	Register DFA jobs on putAnalytics rather than via a separate method (#55458 ) (#55708 )	2020-04-24 10:59:32 +02:00
Dimitris Athanasiou	b8379872a7	[7.x][ML] Logs error when DFA task is set to failed (#55545 ) (#55668 ) Also unmutes the integ test that stops and restarts an outlier detection job with the hope of learning more of the failure in #55068. Backport of #55545 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-24 11:06:07 +03:00
Jim Ferenczi	0a6c74b7d3	AsyncSearchMaintenanceService should stop when closing a node (#55651 ) This change turns the AsyncSearchMaintenanceService into an AbstractLifecycleComponent and ensures that the service is stopped when a node is closing. Closes #55646	2020-04-24 09:38:40 +02:00
Hendrik Muhs	b213209f0c	[Rollup] improve stopping tests (#55666 ) improve tests related to stopping using a client that answers and can be synchronized with the test thread in order to test special situations relates #55011	2020-04-24 08:48:36 +02:00
Jay Modi	30f8c326fe	Test: fix SSLReloadDuringStartupIntegTests (#55637 ) This commit fixes reproducible test failures with the SSLReloadDuringStartupIntegTests on the 7.x branch. The failures only occur on 7.x due to the existence of the transport client and its usage in our test infrastructure. This change removes the randomized usage of transport clients when retrieving a client from a node in the internal cluster. Transport clients do not support the reloading of files for TLS configuration changes but if we build one from the nodes settings and attempt to use it after the files have been changed, the client will not know about the changes and the TLS connection will fail. Closes #55524	2020-04-23 21:36:43 -06:00
Ryan Ernst	97c4b64fb1	Add isAllowed license utility (#55424 ) (#55700 ) License state is currently made up of boolean methods that check whether a particular feature is allowed by the current license state. Each new feature must copy/past boiler plate code. While that has gotten easier with utilities like isAllowedByLicense, this is still more cumbersome than should be necessary. This commit adds a general purpose isAllowed method which takes a new Feature enum, where each value of the enum defines the minimum license mode and whether the license must be active to be allowed. Only security features are converted in this PR, in order to keep the commit size relatively small. The rest of the features will be converted in a followup.	2020-04-23 16:28:28 -07:00
Zachary Tong	715c90bf7d	Aggs must specify a `field` or `script` (or both) (#52226 ) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early)	2020-04-23 19:23:41 -04:00
jimczi	c857adf603	Fix AsyncSearchTaskTests#testWithFetchFailures Fix usage of a possible invalid random range [1, 0]. Relates #55688	2020-04-24 00:45:17 +02:00
Jim Ferenczi	31d1727698	Fix (de)serialization of async search failures (#55688 ) The (de)serialization code of the async search response cannot handle exceptions that extend ElasticsearchException (e.g. ScriptException). This commit fixes this bug by serializing the error with the more generic StreamInput#writeException.	2020-04-24 00:44:43 +02:00
Igor Motov	8c7ef2417f	Make AsyncSearchIndexService reusable (#55598 ) EQL will require very similar functionality to async search. This PR refactors AsyncSearchIndexService to make it reusable for EQL. Supersedes #55119 Relates to #49638	2020-04-23 18:02:17 -04:00
Nick Knize	96a02089c2	Refactor GeoShape DocValues in spatial xpack (#55691 ) This commit refactors geo_shape doc values, fielddata, and utility classes from the single mapper package in x-pack spatial plugin to a package structure that is consistent with the server module.	2020-04-23 15:32:23 -05:00
David Roberts	46be9959a0	[ML] Audit when unassigned datafeeds are stopped (#55667 ) Previously audit messages were indexed when datafeeds that were assigned to a node were stopped, but not datafeeds that were unassigned at the time they were stopped. This change adds auditing for the unassigned case. Backport of #55656	2020-04-23 20:46:35 +01:00
Dan Hermann	dd5c96c2ed	[7.x] Rollover for data streams	2020-04-23 12:04:34 -05:00
Zachary Tong	4f483ac370	Fix half-float range in SupportedTypeTests (#55409 ) Also adds a comment to the half-float number field type tests indicating why 70000 is used instead of 65504	2020-04-23 11:36:37 -04:00
Dimitris Athanasiou	4b11adf074	[7.x][ML] Do not fail DFA task that is stopped during reindexing (#55659 ) (#55663 ) While we were catching `TaskCancelledException` while we wait for reindexing to complete, we missed the fact that this exception may be wrapped in a multi-node cluster. This is the reason we may still fail the task when stop is called while reindexing. Some times we're lucky and the exception is thrown by the same node that runs the job. Then the exception is not wrapped and things work fine. But when that is not the case the exception is wrapped, we fail to catch it, and set the task to failed. The fix is to simply unwrap the exception when we check it it is `TaskCancelledException`. Closes #55068 Backport of #55659	2020-04-23 15:57:01 +03:00
Tanguy Leroux	8669766a81	Reduce contention in CacheFile.fileLock() method (#55662 ) The CacheFile.fileLock() method is used to acquire a lock on a cache file so that the file can't be deleted (or its file handle closed) during the execution of a read or a write operation. Today this lock is obtained by first acquiring the eviction lock (the write lock of the readwrite lock), then by checking if the cache file is evicted and the file channel still open, and finally by obtaining the file lock (the read lock of the readwrite lock). Acquiring the read lock while the eviction lock is held ensures that the cache file eviction cannot start in the meanwhile. But eviction starts (and terminations) also acquire the eviction lock; and this lock cannot be obtained while a read lock is held (the write lock of a readwrite lock is exclusive). If we were acquiring a read lock and checking the eviction flag and file channel existence while holding the read lock we know that no eviction can start or finish until the read lock is released.	2020-04-23 14:40:27 +02:00
Rory Hunter	d66af46724	Always use deprecateAndMaybeLog for deprecation warnings (#55319 ) Backport of #55115. Replace calls to deprecate(String,Object...) with deprecateAndMaybeLog(...), with an appropriate key, so that all messages can potentially be deduplicated.	2020-04-23 09:20:54 +01:00
David Roberts	87f4751eca	[ML] Make find_file_structure recognize Kibana CSV report timestamps (#55609 ) The Kibana CSV export feature uses a non-standard timestamp format. This change adds it to the formats the find_file_structure endpoint recognizes out-of-the-box, to make round-tripping data from Kibana back to Kibana via CSV files easier. Fixes #55586	2020-04-23 08:39:07 +01:00
Jake Landis	25ea6a74f0	[7.x] Validate REST specs against schema (#55117 ) (#55563 ) A JSON schema was recently introduced for the REST API specification. #54252 This PR introduces a 3rd party validation tool to ensure that the REST specification conforms to the schema. The task is applied to the 3 projects that contain REST API specifications. The plugin wires this task into the precommit commit task, and should be considered as part of the public API for the build tools for any plugin developer to contribute their plugin's specification. An ignore parameter has been introduced for the task to allow specific file to be ignored from the validation. The ignored files in this PR will soon get issues logged and a link so they can be fixed. Closes #54314	2020-04-22 14:14:03 -05:00
Albert Zaharovits	82ed0ab420	Update the audit logfile list of system users (#55578 ) Out of the box "access granted" audit events are not logged for system users. The list of system users was stale and included only the _system and _xpack users. This commit expands this list with _xpack_security and _async_search, effectively reducing the auditing noise by not logging the audit events of these system users out of the box. Closes #37924	2020-04-22 21:59:31 +03:00
Tal Levy	c370b83bd7	Fix locale lowercase test issue in GenerateSnapshotNameStepTests (#55597 ) (#55605 ) The testPerformAction test has been failing periodically due to how Hamcrest's containsStringIgnoringCase does not lowercase using the same Locale set in the test infrastructure. This commit falls back to explicitly lowercasing using the root locale	2020-04-22 11:29:57 -07:00
Tal Levy	f27ce69f0c	[backport] Add geo_bounds aggregation support for geo_shape (#55328 ) (#55600 ) This commit adds a new GeoShapeBoundsAggregator to the spatial plugin and registers it with the GeoShapeValuesSourceType. This enables geo_bounds aggregations on geo_shape fields	2020-04-22 11:29:35 -07:00
Tal Levy	0844455505	Add geo_shape mapper supporting doc-values in Spatial Plugin (#55037 ) (#55500 ) After #53562, the `geo_shape` field mapper is registered within a module. This opens the door for introducing a new `geo_shape` field mapper into the Spatial Plugin that has doc-values support. This is very much an extension of server's GeoShapeFieldMapper, but with the addition of the doc values implementation.	2020-04-22 08:12:54 -07:00
Dimitris Athanasiou	50a5afed15	[7.x][ML] Prepare parsing phase_progress from DFA process (#55580 ) (#55587 ) Data frame analytics process currently reports progress as an integer `progress_percent`. We parse that and report it from the _stats API as the progress of the `analyzing` phase. However, we want to allow the DFA process to report progress for more than one phase. This commit prepares for this by parsing `phase_progress` from the process, an object that contains the `phase` name plus the `progress_percent` for that phase. Backport of #55580	2020-04-22 16:38:32 +03:00
Benjamin Trent	7c81cd7833	[ML] explicitly disallow partial results in datafeed extractors (#55537 ) (#55585 ) Instead of doing our own checks against REST status, shard counts, and shard failures, this commit changes all our extractor search requests to set `.setAllowPartialSearchResults(false)`. - Scrolls are automatically cleared when a search failure occurs with `.setAllowPartialSearchResults(false)` set. - Code error handling is simplified closes https://github.com/elastic/elasticsearch/issues/40793	2020-04-22 09:07:44 -04:00
David Roberts	810caf5ffe	[ML] Test that audit message is written when closing unassigned job (#55582 ) Issue #55521 suggested that audit messages were not written when closing an unassigned job. This is not the case, but we didn't have a test to prove it. Backport of #55571	2020-04-22 13:23:43 +01:00
David Roberts	2dc5586afe	[ML] Add effective max model memory limit to ML info (#55581 ) The ML info endpoint returns the max_model_memory_limit setting if one is configured. However, it is still possible to create a job that cannot run anywhere in the current cluster because no node in the cluster has enough memory to accommodate it. This change adds an extra piece of information, limits.effective_max_model_memory_limit, to the ML info response that returns the biggest model memory limit that could be run in the current cluster assuming no other jobs were running. The idea is that the ML UI will be able to warn users who try to create jobs with higher model memory limits that their jobs will not be able to start unless they add a bigger ML node to their cluster. Backport of #55529	2020-04-22 12:28:50 +01:00
David Roberts	da5aeb8be7	[ML] Return assigned node in start/open job/datafeed response (#55570 ) Adds a "node" field to the response from the following endpoints: 1. Open anomaly detection job 2. Start datafeed 3. Start data frame analytics job If the job or datafeed is assigned to a node immediately then this field will return the ID of that node. In the case where a job or datafeed is opened or started lazily the node field will contain an empty string. Clients that want to test whether a job or datafeed was opened or started lazily can therefore check for this. Backport of #55473	2020-04-22 12:06:53 +01:00
David Kyle	e99ef3542c	Mute ModelLoadingServiceTests::testMaxCachedLimitReached	2020-04-22 11:53:07 +01:00
Tim Vernum	8b566aea47	Fix use of password protected PKCS#8 keys for SSL (#55567 ) PEMUtils would incorrectly fill the encryption password with zeros (the '\0' character) after decrypting a PKCS#8 key. Since PEMUtils did not take ownership of this password it should not zero it out because it does not know whether the caller will use that password array again. This is actually what PEMKeyConfig does - it uses the key encryption password as the password for the ephemeral keystore that it creates in order to build a KeyManager. Backport of: #55457	2020-04-22 16:38:51 +10:00
Yang Wang	32e46bf552	Fix certutil http for empty password with JDK 11 and lower (#55437 ) (#55565 ) Fix elasticseaerch-certutil http command so that it correctly accepts empty keystore password with JDK version 11 and lower.	2020-04-22 15:03:10 +10:00
David Kyle	8e8c6b4aee	Fix accounting in ModelLoadingServiceTests (#55307 ) (#55547 ) In the test after the first load event is is not known which models are cached as loading a later one will evict an earlier one and the order is not known. The models could have been loaded 1 or 2 times not exactly twice	2020-04-21 19:25:06 +01:00
Armin Braun	db7eb8e8ff	Remove Redundant CS Update on Snapshot Finalization (#55276 ) (#55528 ) This change folds the removal of the in-progress snapshot entry into setting the safe repository generation. Outside of removing an unnecessary cluster state update, this also has the advantage of removing a somewhat inconsistent cluster state where the safe repository generation points at `RepositoryData` that contains a finished snapshot while it is still in-progress in the cluster state, making it easier to reason about the state machine of upcoming concurrent snapshot operations.	2020-04-21 15:33:17 +02:00
David Turner	be60d50452	Allow searching of snapshot taken while indexing (#55511 ) Today a read-only engine requires a complete history of operations, in the sense that its local checkpoint must equal its maximum sequence number. This is a valid check for read-only engines that were obtained by closing an index since closing an index waits for all in-flight operations to complete. However a snapshot may not have this property if it was taken while indexing was ongoing, but that's ok. This commit weakens the check for a complete history to exclude the case of a searchable snapshot. Relates #50999	2020-04-21 13:21:38 +01:00
Ignacio Vera	e4c65b4388	mute test SSLReloadDuringStartupIntegTests.testReloadDuringStartup (#55525 )	2020-04-21 14:13:13 +02:00
Jim Ferenczi	0b3bdfcc3e	Fix expiration time in async search response (#55435 ) This change ensures that we return the latest expiration time when retrieving the response from the index. This commit also fixes a bug that stops the garbage collection of saved responses if the async search index is deleted.	2020-04-21 14:04:29 +02:00
Przemysław Witek	59d377462f	Apply default timeout in StopDataFrameAnalyticsAction.Request (#55512 ) (#55517 )	2020-04-21 13:05:48 +02:00
Nhat Nguyen	3cc4e0dd09	Retry follow task when remote connection queue full (#55314 ) If more than 100 shard-follow tasks are trying to connect to the remote cluster, then some of them will abort with "connect listener queue is full". This is because we retry on ESRejectedExecutionException, but not on RejectedExecutionException.	2020-04-20 22:43:05 -04:00
Stuart Tettemer	93a2e9b0f9	Test: MockScoreScript can be cacheable. (#55499 ) Backport: 0ed1eb5	2020-04-20 17:09:58 -06:00
Benjamin Trent	cabff65aec	[ML] Fixing inference stats race condition (#55163 ) (#55486 ) `updateAndGet` could actually call the internal method more than once on contention. If I read the JavaDocs, it says: ```* @param updateFunction a side-effect-free function``` So, it could be getting multiple updates on contention, thus having a race condition where stats are double counted. To fix, I am going to use a `ReadWriteLock`. The `LongAdder` objects allows fast thread safe writes in high contention environments. These can be protected by the `ReadWriteLock::readLock`. When stats are persisted, I need to call reset on all these adders. This is NOT thread safe if additions are taking place concurrently. So, I am going to protect with `ReadWriteLock::writeLock`. This should prevent race conditions while allowing high (ish) throughput in the highly contention paths in inference. I did some simple throughput tests and this change is not significantly slower and is simpler to grok (IMO). closes https://github.com/elastic/elasticsearch/issues/54786	2020-04-20 16:21:18 -04:00
Benjamin Trent	24d41eb695	[ML] partitions model definitions into chunks (#55260 ) (#55484 ) This paves the data layer way so that exceptionally large models are partitioned across multiple documents. This change means that nodes before 7.8.0 will not be able to use trained inference models created on nodes on or after 7.8.0. I chose the definition document limit to be 100. This SHOULD be plenty for any large model. One of the largest models that I have created so far had the following stats: ~314MB of inflated JSON, ~66MB when compressed, ~177MB of heap. With the chunking sizes of `16 * 1024 * 1024` its compressed string could be partitioned to 5 documents. Supporting models 20 times this size (compressed) seems adequate for now.	2020-04-20 16:08:54 -04:00
Benjamin Trent	fa0373a19f	[7.x] [ML] Fix log spam and disable ILM/SLM history for native ML tests (#55475 ) * [ML] fix native ML test log spam (#55459) This adds a dependency to ingest common. This removes the log spam resulting from basic plugins being enabled that require the common ingest processors. * removing unnecessary changes * removing unused imports * removing unnecessary java setting	2020-04-20 15:41:30 -04:00
Lee Hinman	9eddd2bcc9	[7.x] Add prefer_v2_templates flag and index setting (#55411 ) (#55476 ) This commit adds a new querystring parameter on the following APIs: - Index - Update - Bulk - Create Index - Rollover These APIs now support a `?prefer_v2_templates=true\|false` flag. This flag changes the preference creation to use either V2 index templates or V1 templates. This flag defaults to `false` and will be changed to `true` for 8.0+ in subsequent work. Additionally, setting this flag internally sets the `index.prefer_v2_templates` index-level setting. This setting is used so that actions that automatically create a new index (things like rollover initiated by ILM) will inherit the preference from the original index. This setting is dynamic so that a transition from v1 to v2 templates can occur for long-running indices grouped by an alias performing periodic rollover. This also adds support for sending this parameter to the High Level Rest Client. Relates to #53101	2020-04-20 12:05:42 -06:00
Armin Braun	a0763d958d	Make RepositoryData Less Memory Heavy (#55293 ) (#55468 ) We don't really need `LinkedHashSet` here. We can assume that all the entries are unique and just use a list and use the list utilities to create the cheapest possible version of the list. Also, this fixes a bug in `addSnapshot` which would mutate the existing linked hash set on the current instance (fortunately this never caused a real world bug) and brings the collection in line with the java docs on its getter that claim immutability.	2020-04-20 18:28:06 +02:00
William Brafford	7817948926	Disable monitoring in ML multinode tests (#55461 ) Removing the deprecated "xpack.monitoring.enabled" setting introduced log spam and potentially some failures in ML tests. It's possible to use a different, non-deprecated setting to disable monitoring, so we do that here.	2020-04-20 10:51:16 -04:00
David Turner	0df329dde7	Use soft deletes for searchable snapshots tests (#55453 ) This allows us to perform some dummy indexing including updates/deletes.	2020-04-20 14:37:51 +01:00
Przemysław Witek	7d5f74e964	Fix and unmute testSetUpgradeMode_ExistingTaskGetsUnassigned (#55368 ) (#55452 )	2020-04-20 13:29:29 +02:00

1 2 3 4 5 ...

5405 Commits