OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jake Landis	a22690c9ca	[7.x] Ensure that the monitoring export exceptions are logged. (#56237 ) (#56251 ) If an exception occurs while flushing a bulk the cause of the exception can be lost. This commit ensures that cause of the exception is carried forward and gets logged.	2020-05-05 19:24:26 -05:00
Julie Tibshirani	49de092b38	Mute RegressionIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet.	2020-05-05 16:25:36 -07:00
Bogdan Pintea	47250b14a4	SQL: Add BigDecimal support to JDBC (#56015 ) (#56220 ) * SQL: Add BigDecimal support to JDBC (#56015) * Introduce BigDecimal support to JDBC -- fetching This commit adds support for the getBigDecimal() methods. * Allow BigDecimal params in double range A prepared statement will now accept a BigDecimal parameter as a proxy for a double, if the conversion is lossless. (cherry picked from commit e9a873ad7f387682e3472110b1d7c0514bd347c9) * Fix compilation error Diamond notation with anonymous inner classes is not available in Java 8.	2020-05-05 23:19:36 +02:00
Bogdan Pintea	f159fd8a20	Fix test on incompatible client versions (#56234 ) (#56241 ) The incompatible client version test is changed to: - iterate on all versions prior to the allowed one_s; - format the exception message just as the server does it. The defect stemmed from the fact that the clients will not send a version's qualifier, but just major.minor.revision, so the raised error/exception_message won't contain it, while the test expected it. (cherry picked from commit 4a81c8f7a1f4573e3be95f346d9fb18772b297ee)	2020-05-05 23:18:29 +02:00
Julie Tibshirani	63062ec7bd	Mute ClassificationIT.testDependentVariableCardinalityTooHighButWithQueryMakesItWithinRange.	2020-05-05 13:48:35 -07:00
Dan Hermann	6674f14fb3	[7.x] Get index includes parent data stream for backing indices (#56238 )	2020-05-05 15:43:42 -05:00
Benjamin Trent	e1c5ca421e	[7.x] [ML] lay ground work for handling >1 result indices (#55892 ) (#56192 ) * [ML] lay ground work for handling >1 result indices (#55892) This commit removes all but one reference to `getInitialResultsIndexName`. This is to support more than one result index for a single job.	2020-05-05 15:54:08 -04:00
Julie Tibshirani	793f265451	Mute SearchableSnapshotDirectoryTests.testIndexSearcher.	2020-05-05 12:29:05 -07:00
Ross Wolf	389082033e	EQL: Add concat function (#55193 ) * EQL: Add concat function * EQL: for loop spacing for concat * EQL: return unresolved arguments to concat early * EQL: Add concat integration tests * EQL: Fix concat query fail test * EQL: Add class for concat function testing * EQL: Add concat integration tests * EQL: Update concat() null behavior	2020-05-05 12:53:34 -06:00
Bogdan Pintea	23c35e32f2	SQL: introduce a query builder for the Rest tests (#55094 ) (#56221 ) * Introduce a query builder for the rest tests The new BaseRestSqlTestCase.RequestObjectBuilder class is a helper class to build REST request objects for the tests. Consequently, "manual" string concatenation to form JSON is done away with. The class mimics SqlQueryRequestBuilder API. (cherry picked from commit c8363f04c029542c233a758e9286d33c51d9c0c4)	2020-05-05 18:55:41 +02:00
Tal Levy	e4f2c3105d	Add geo_shape support for geotile_grid and geohash_grid (#55966 ) (#56228 ) this commit adds aggregation support for the geo_shape field type on geo*_grid aggregations. it introduces a Tiler for both tiles and hashes that enables a new type of ValuesSource to replace the GeoPoint's CellIdSource. This makes it possible for the existing Aggregator to be re-used, so no new implementations of the grid aggregators are added.	2020-05-05 09:54:14 -07:00
Benjamin Trent	641f598364	[Transform] fixes http status code when bad scripts are provided (#56117 ) (#56219 ) Transforms should propagate up the search execution exception if one is returned when it does the test query. this allows transforms to return a `4xx` when the aggs are malformed but parseable. closes https://github.com/elastic/elasticsearch/issues/55994	2020-05-05 12:36:22 -04:00
Bogdan Pintea	0e5632dc3a	SQL: relax version lock between server and clients (#56148 ) (#56223 ) * Relax version lock between ES/SQL and clients Allow older-than-server clients to connect, if these are past or on a certain min release. (cherry picked from commit 108f907297542ce649aa7304060aaf0a504eb699)	2020-05-05 18:27:06 +02:00
William Brafford	3499fa917c	Deprecated xpack "enable" settings should be no-ops (#55416 ) (#56167 ) The following settings are now no-ops: * xpack.flattened.enabled * xpack.logstash.enabled * xpack.rollup.enabled * xpack.slm.enabled * xpack.sql.enabled * xpack.transform.enabled * xpack.vectors.enabled Since these settings no longer need to be checked, we can remove settings parameters from a number of constructors and methods, and do so in this commit. We also update documentation to remove references to these settings.	2020-05-05 10:40:49 -04:00
Tanguy Leroux	b9636713b1	Searchable Snapshots should respect max_restore_bytes_per_sec (#55952 ) (#56199 ) This commit changes searchable snapshots so that it now respects the repository's max_restore_bytes_per_sec setting when it downloads blobs. Backport of #55952 for 7.x	2020-05-05 15:43:06 +02:00
David Roberts	7aa0daaabd	[7.x][ML] More advanced model snapshot retention options (#56194 ) This PR implements the following changes to make ML model snapshot retention more flexible in advance of adding a UI for the feature in an upcoming release. - The default for `model_snapshot_retention_days` for new jobs is now 10 instead of 1 - There is a new job setting, `daily_model_snapshot_retention_after_days`, that defaults to 1 for new jobs and `model_snapshot_retention_days` for pre-7.8 jobs - For days that are older than `model_snapshot_retention_days`, all model snapshots are deleted as before - For days that are in between `daily_model_snapshot_retention_after_days` and `model_snapshot_retention_days` all but the first model snapshot for that day are deleted - The `retain` setting of model snapshots is still respected to allow selected model snapshots to be retained indefinitely Backport of #56125	2020-05-05 14:31:58 +01:00
David Turner	40ea0eabd9	Forbid snapshot access on applier thread (#56044 ) This commit strengthens the assertion about which threads may access a blob store to exclude the cluster applier thread, since we no longer need to do so. Relates #50999	2020-05-05 13:27:55 +01:00
Dimitris Athanasiou	2d7899c83c	[7.x][ML] Adjust DF Analytics process phases (#56107 ) (#56177 ) As of elastic/ml-cpp#1179, the analytics process reports phases depending on the analysis type. This commit adjusts the phases of current analyses from `analyzing` to the following: - outlier_detection: [`computing_outlier`] - regression/classification: [`feature_selection`, `coarse_parameter_search`, `fine_tuning_parameters`, `final_training`] Backport of #56107	2020-05-05 15:00:07 +03:00
Dimitris Athanasiou	75dadb7a6d	[7.x][ML] Add loss_function to regression (#56118 ) (#56187 ) Adds parameters `loss_function` and `loss_function_parameter` to regression. Backport of #56118	2020-05-05 14:59:51 +03:00
Hendrik Muhs	e177a38504	[7.x][Transform] add throttling (#56007 ) (#56184 ) add throttling to transform, throttling will slow down search requests by delaying the execution based on a documents per second metric. fixes #54862	2020-05-05 13:09:02 +02:00
Marios Trivyzas	363e994171	SQL: Fix DATETIME_PARSE behaviour regarding timezones (#56158 ) (#56182 ) Previously, when the timezone was missing from the datetime string and the pattern, UTC was used, instead of the session defined timezone. Moreover, if a timezone was included in the datetime string and the pattern then this timezone was used. To have a consistent behaviour the resulting datetime will always be converted to the session defined timezone, e.g.: ``` SELECT DATETIME_PARSE('2020-05-04 10:20:30.123 +02:00', 'HH:mm:ss dd/MM/uuuu VV') AS datetime; ``` with `time_zone` set to `-03:00` will result in ``` 2020-05-04T05:20:30.123-03:00 ``` Follows: #54960 (cherry picked from commit 8810ed03a209cc8fe1bad309a81e85b56a39da27)	2020-05-05 12:08:39 +02:00
Tanguy Leroux	f717830563	Use workers to warm cache parts (#55793 ) (#56181 ) Today the cache prewarming introduced in #55322 works by enqueuing all the file parts to warm together in the searchable_snapshots thread pool. In order to make this fairer among concurrent warmings, this commit starts workers that concurrently poll file parts to warm from a queue, warm the part and then immediately schedule another warming execution. This should leave more room for concurrent shard warming to sneak in and be executed. Relates #55322	2020-05-05 11:48:06 +02:00
Tanguy Leroux	35622747fd	Add Minio tests for searchable snapshots (#56112 ) (#56179 ) This commit adds QA tests for searchable snapshot on MinIO, similarly to what already exist for S3, GCS and Azure.	2020-05-05 11:40:06 +02:00
Marios Trivyzas	cc21468559	SQL: Fix issue with date range queries and timezone (#56115 ) (#56174 ) Previously, the timezone parameter was not passed to the RangeQuery and as a result queries that use the ES date math notation (now, now-1d, now/d, now/h, now+2h, etc.) were using the UTC timezone and not the one passed through the "timezone"/"time_zone" JDBC/REST params. As a consequence, the date math defined dates were always considered in UTC and possibly led to incorrect results for queries like: ``` SELECT * FROM t WHERE date BETWEEN now-1d/d AND now/d ``` Fixes: #56049 (cherry picked from commit 300f010c0b18ed0f10a41d5e1606466ba0a3088f)	2020-05-05 10:54:23 +02:00
Dimitris Athanasiou	6061aa3db4	[7.x][ML] Fix race condition updating reindexing progress (#56135 ) (#56146 ) In #55763 I thought I could remove the flag that marks reindexing was finished on a data frame analytics task. However, that exposed a race condition. It is possible that between updating reindexing progress to 100 because we have called `DataFrameAnalyticsManager.startAnalytics()` and a call to the _stats API which updates reindexing progress via the method `DataFrameAnalyticsTask.updateReindexTaskProgress()` we end up overwriting the 100 with a lower progress value. This commit fixes this issue by bringing back the help of a `isReindexingFinished` flag as it was prior to #55763. Closes #56128 Backport of #56135	2020-05-05 10:48:42 +03:00
Albert Zaharovits	e8763bad41	Let realms gracefully terminate the authN chain (#55623 ) AuthN realms are ordered as a chain so that the credentials of a given user are verified in succession. Upon the first successful verification, the user is authenticated. Realms do however have the option to cut short this iterative process, when the credentials don't verify and the user cannot exist in any other realm. This mechanism is currently used by the Reserved and the Kerberos realm. This commit improves the early termination operation by allowing realms to gracefully terminate authentication, as if the chain has been tried out completely. Previously, early termination resulted in an authentication error which varies the response body compared to the failed authentication outcome where no realm could verify the credentials successfully. Reserved users are hence denied authentication in exactly the same way as other users are when no realm can validate their credentials.	2020-05-05 10:11:49 +03:00
Martijn van Groningen	2ac32db607	Move includeDataStream flag from IndicesOptions to IndexNameExpressionResolver.Context (#56151 ) Backport of #56034. Move includeDataStream flag from an IndicesOptions to IndexNameExpressionResolver.Context as a dedicated field that callers to IndexNameExpressionResolver can set. Also alter indices stats api to support data streams. The rollover api uses this api, and otherwise rolling over a data stream no longer works. Relates to #53100	2020-05-04 22:38:33 +02:00
Dan Hermann	9892813842	[7.x] Delay warning about missing x-pack (#56142 ) * Delay warning about missing x-pack (#54265) Currently, when monitoring is enabled in a freshly-installed cluster, the non-master nodes log a warning message indicating that master may not have x-pack installed. The message is often printed even when the master does have x-pack installed but takes some time to set up the local exporter for monitoring. This commit adds the local exporter setting `wait_master.timeout` which defaults to 30 seconds. The setting configures the time that the non-master nodes should wait for master to set up monitoring. After the time elapses, they log a message to the user about possible missing x-pack installation on master. The logging of this warning was moved from `resolveBulk()` to `openBulk()` since `resolveBulk()` is called only on cluster updates and the message might not be logged until a new cluster update occurs. Closes #40898	2020-05-04 14:16:18 -05:00
Benjamin Trent	6c26de444d	[ML] reduce InferenceProcessor.Factory log spam by not parsing pipelines (#56020 ) (#56126 ) If there are ill-formed pipelines, or other pipelines are not ready to be parsed, `InferenceProcessor.Factory::accept(ClusterState)` logs warnings. This can be confusing and cause log spam. It might lead folks to think there is an issue with the inference processor. Also, they would see logs for the inference processor even though they might not be using the inference processor, leading to more confusion. Additionally, pipelines might not be parseable in this method as some processors require the new cluster state metadata before construction (e.g. `enrich` requires cluster metadata to be set before creating the processor). closes https://github.com/elastic/elasticsearch/issues/55985	2020-05-04 13:32:01 -04:00
Martijn van Groningen	6d03081560	Add auto create action (#56122 ) Backport of #55858 to 7.x branch. Currently the TransportBulkAction detects whether an index is missing and then decides whether it should be auto created. The coordination of the index creation also happens in the TransportBulkAction on the coordinating node. This change adds a new transport action that the TransportBulkAction delegates to if missing indices need to be created. The reasons for this change: * Auto creation of data streams can't occur on the coordinating node. Based on the index template (v2) either a regular index or a data stream should be created. However if the coordinating node is slow in processing cluster state updates then it may be unaware of the existence of certain index templates, which then can lead to the TransportBulkAction creating an index instead of a data stream. Therefore the coordination of creating an index or data stream should occur on the master node. See #55377 * From a security perspective it is useful to know whether index creation originates from the create index api or from auto creating a new index via the bulk or index api. For example a user would be allowed to auto create an index, but not to use the create index api. The auto create action will allow security to distinguish these two different patterns of index creation. This change adds the following new transport actions: AutoCreateAction, the TransportBulkAction redirects to this action and this action will actually create the index (instead of the TransportCreateIndexAction). Later via #55377, can improve the AutoCreateAction to also determine whether an index or data stream should be created. The create_index index privilege is also modified, so that if this permission is granted then a user is also allowed to auto create indices. This change does not yet add an auto_create index privilege. A future change can introduce this new index privilege or modify an existing index / write index privilege. Relates to #53100	2020-05-04 19:10:09 +02:00
Julie Tibshirani	6b5cf1b031	For constant_keyword, make sure exists query handles missing values. (#55757 ) It's possible for a constant_keyword to have a 'null' value before any documents are seen that contain a value for the field. In this case, no documents have a value for the field, and 'exists' queries should return no documents.	2020-05-04 09:41:52 -07:00
Ross Wolf	6da686c7e0	EQL: Add match function implementation (#55182 ) * EQL: Add Match function * EQL: Add note about character classes * EQL: QueryFolderFailTests.java * EQL: Add match() fail tests * EQL: Add match tests and fix alias * EQL: Add match verifier failure tests * EQL: Reorder query folder fail tests	2020-05-04 09:34:20 -06:00
Dimitris Athanasiou	76fa5a2397	[7.x][ML] Improve cleanup for DF Analytics HLRC tests (#56101 ) (#56109 ) Adds the step of stopping all data frame analytics before deleting them to the cleanup of the corresponding HLRC tests. Closes #56097 Backport of #56101	2020-05-04 16:08:08 +03:00
Andrei Stefan	5d1bc6c89c	EQL: reject queries that use a nested field or a sub-field of a nested field (#56108 ) * Reject queries that act on nested fields or fields with nested field types in their hierarchy (#55721) (cherry picked from commit 2a024461cd9da821112953d4c6e565ea622c678b)	2020-05-04 15:50:31 +03:00
Przemysław Witek	44f5a8ccd3	Use snapshot's latest result time rather than snapshot's creation time when creating an annotation (#56093 ) (#56103 )	2020-05-04 12:36:12 +02:00
Christos Soulios	c65f828cb7	[7.x] Histogram field type support for ValueCount and Avg aggregations (#56099 ) Backports #55933 to 7.x Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 13:23:02 +03:00
Armin Braun	0860d1dc74	Remove Dead Code in SLM Delete Handling (#56081 ) (#56098 ) The delete response is always acknowledged. No need to handle anything else.	2020-05-04 12:22:06 +02:00
Armin Braun	e01b999ef0	Add Functionality to Consistently Read RepositoryData For CS Updates (#55773 ) (#56091 ) Using optimistic locking, add the ability to run a repository state update task with a consistent view of the current repository data. Allows for a follow-up to remove the snapshot INIT state.	2020-05-04 08:13:14 +02:00
Armin Braun	3a64ecb6bf	Allow Deleting Multiple Snapshots at Once (#55474 ) (#56083 ) * Allow Deleting Multiple Snapshots at Once (#55474) Adds deleting multiple snapshots in one go without significantly changing the mechanics of snapshot deletes otherwise. This change does not yet allow mixing snapshot delete and abort. Abort is still only allowed for a single snapshot delete by exact name.	2020-05-03 20:30:58 +02:00
William Brafford	d53c941c41	Make xpack.monitoring.enabled setting a no-op (#55617 ) (#56061 ) * Make xpack.monitoring.enabled setting a no-op This commit turns xpack.monitoring.enabled into a no-op. Mostly, this involved removing the setting from the setup for integration tests. Monitoring may introduce some complexity for test setup and teardown, so we should keep an eye out for turbulence and failures * Docs for making deprecated setting a no-op	2020-05-01 16:42:11 -04:00
Andrei Stefan	fbba65d8b3	SQL: SubSelect unresolved bugfix (#55956 ) (#56055 ) * Resolve the missing refs only after the aggregate tree is resolved (cherry picked from commit 10167b1cf2df6b074a1ba0c8e73c261ff9e9d1db)	2020-05-01 07:48:11 +03:00
Ryan Ernst	52b9d8d15e	Convert remaining license methods to isAllowed (#55908 ) (#55991 ) This commit converts the remaining isXXXAllowed methods to instead use isAllowed with a Feature value. There are a couple other methods that are static, as well as some licensed features that check the license directly, but those will be dealt with in other followups.	2020-04-30 15:52:22 -07:00
Igor Motov	d8f9df771d	Expose agg usage in Feature Usage API (#55732 ) (#56048 ) Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746	2020-04-30 12:53:36 -04:00
Przemko Robakowski	797f63e743	[7.x] Emit deprecation warning if multiple v1 templates match with a new index (#55558 ) (#56038 ) * Emit deprecation warning if multiple v1 templates match with a new index (#55558) * Emit deprecation warning if multiple v1 templates match with a new index * DEPRECATION_LOGGER rename	2020-04-30 17:36:17 +02:00
Luca Cavanna	fc6422ffcc	Consolidate DelayableWriteable (#55932 ) This commit includes a number of minor improvements around `DelayableWriteable`: javadocs were expanded and reworded, `get` was renamed to `expand` and `DelayableWriteable` no longer implements `Supplier`. Also a couple of methods are now private instead of package private.	2020-04-30 17:16:58 +02:00
Benjamin Trent	c36bcb4dd0	[ML] fixing file structure finder multiline merge max for delimited formats (#56023 ) (#56035 ) This commit correctly sets the maxLinesPerRow in the CsvPreference for delimited files given the file structure finder settings. Previously, it was silently ignored.	2020-04-30 10:51:32 -04:00
Benjamin Trent	04b1f6498b	[ML] using new fixed interval in ml tests (#56021 ) (#56031 ) This commit removes deprecated references to DateHistogram.interval from ml tests	2020-04-30 10:26:39 -04:00
Dimitris Athanasiou	17b904def5	[7.x][ML] Decouple DFA progress testing from analyses phases (#55925 ) (#56024 ) This refactors native integ tests to assert progress without expecting explicit phases for analyses. We can test those with yaml tests in a single place. Backport of #55925	2020-04-30 17:05:47 +03:00
William Brafford	273ff6a105	Make xpack.ilm.enabled setting a no-op (#55592 ) (#55980 ) * Make xpack.ilm.enabled setting a no-op * Add watcher setting to not use ILM * Update documentation for no-op setting * Remove NO_ILM ml index templates * Remove unneeded setting from test setup * Inline variable definitions for ML templates * Use identical parameter names in templates * New ILM/watcher setting falls back to old setting * Add fallback unit test for watcher/ilm setting	2020-04-30 09:50:18 -04:00
David Kyle	c204353249	[ML] Wait for model loaded and cached in ModelLoadingServiceTests (#56014 ) Fixes test by exposing the method ModelLoadingService::addModelLoadedListener() so that the test class can be notified when a model is loaded which happens in a background thread	2020-04-30 13:32:07 +01:00

4760 Commits