OpenSearch

Commit Graph

Author	SHA1	Message	Date
Igor Motov	a66988281f	Add histogram field type support to boxplot aggs (#52265 ) Add support for the histogram field type to boxplot aggs. Closes #52233 Relates to #33112	2020-02-13 18:09:26 -05:00
Julie Tibshirani	0d7165a40b	Standardize naming of fetch subphases. (#52171 ) This commit makes the names of fetch subphases more consistent: * Now the names end in just 'Phase', whereas before some ended in 'FetchSubPhase'. This matches the query subphases like AggregationPhase. * Some names include 'fetch' like FetchScorePhase to avoid ambiguity about what they do.	2020-02-13 13:00:46 -08:00
Przemysław Witek	0da3af7581	[7.x] [ML] Add _cat/ml/data_frame/analytics API (#52260 ) (#52312 )	2020-02-13 16:55:47 +01:00
Marios Trivyzas	ea6f0e39bc	[Tests] Update skip version for YAML tests (#52310 ) Update skip versions upper boundary to match the release or intented release version of the feature/fix.	2020-02-13 15:36:31 +01:00
Costin Leau	5373a77fb9	QL: Extract common Failure class (#52281 ) Shared across SQL and EQL (cherry picked from commit 1aeda20d3ec3d6c885de03c6043dd1e8eab9f230)	2020-02-13 14:35:15 +02:00
David Roberts	3ea49557fe	Add cluster:admin/analyze permission to Kibana system role (#52259 ) This is to support the ML categorization wizard. Currently cluster:admin/analyze is only provided with the "manage" cluster privilege, which is an excessive privilege level to provide access to this single feature. It means that the ML categorization wizard only works for extremely highly privileged users. Following this change the Kibana system user will be permitted to run the _analyze endpoint on supplied strings (not on an index). The ML UI will then call the _analyze endpoint as the Kibana system user after first checking that the logged-in user is permitted to create an ML job. This will mean that users with the more reasonable "manage_ml" cluster privilege will be permitted to use the ML categorization wizard. (This is also consistent with the way the ML UI will access _all_ Elasticsearch functionality when the "ML in Spaces" project is completed.) Closes #51391 Relates elastic/kibana#57375	2020-02-13 11:01:27 +00:00
Nik Everett	2dac36de4d	HLRC support for string_stats (#52163 ) (#52297 ) This adds a builder and parsed results for the `string_stats` aggregation directly to the high level rest client. Without this the HLRC can't access the `string_stats` API without the elastic licensed `analytics` module. While I'm in there this adds a few of our usual unit tests and modernizes the parsing.	2020-02-12 19:25:05 -05:00
Julie Tibshirani	f0668cabbc	Adjust the 'skip' version in flattened REST tests. (#52293 ) I forgot to adjust it after backporting the flattened fields feature.	2020-02-12 15:17:44 -08:00
Jay Modi	5bcc6fce5c	Remove DeprecationLogger from route objects (#52285 ) This commit removes the need for DeprecatedRoute and ReplacedRoute to have an instance of a DeprecationLogger. Instead the RestController now has a DeprecationLogger that will be used for all deprecated and replaced route messages. Relates #51950 Backport of #52278	2020-02-12 15:05:41 -07:00
Marios Trivyzas	dac720d7a1	Add a cluster setting to disallow expensive queries (#51385 ) (#52279 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050 (cherry picked from commit a8b39ed842c7770bd9275958c9f747502fd9a3ea)	2020-02-12 22:56:14 +01:00
Bogdan Pintea	5dfe27601e	SQL: supplement input checks on received request parameters (#52229 ) (#52277 ) * Add more checks around parameter conversions This commit adds two necessary verifications on received parameters: - it checks the validity of the parameter's data type: if the declared data type is resolved to an ES or Java type; - it checks if the returned converter is non-null (i.e. a conversion is possible) and generates an appropriate exception otherwise. (cherry picked from commit eda30ac9c69383165324328c599ace39ac064342)	2020-02-12 19:45:12 +01:00
Costin Leau	26900bfb05	EQL: Add infra for planning and query folding (#52065 ) Actual folding not yet in place (TBD) (cherry picked from commit d52b96f273a94c90e475a5035cd57baa086fb0c0)	2020-02-12 18:51:42 +02:00
Hendrik Muhs	5d35eaa1cb	[Transform] improve irrecoverable error detection - part 2 (#52003 ) base error handling on rest status instead of listing individual exception types relates to #51820	2020-02-12 14:38:42 +01:00
James Rodewig	3f151d1d75	[DOCS] Add redirects, update JSON spec to fix docs build (#51747 ) Docs build [#11556][0] broke due to several outdated or incorrect links in the JSON REST spec. This fixes those links where possible and adds redirects. [0]: https://elasticsearch-ci.elastic.co/job/elastic+docs+master+build/11556/	2020-02-12 08:30:59 -05:00
Andrei Stefan	a3ebacfcf3	52169 & 52172 7x backport (#52256 ) * Extract common optimizer tests (#52169) (cherry picked from commit e5ad72bc22e9ec0686ab582195f0032efcb880bf) * Hook in the optimizer rules (#52172) (cherry picked from commit 1f90d8cc56052fbf2af604e72f9f5ca73f5e75d5)	2020-02-12 11:20:03 +02:00
Marios Trivyzas	daab242c75	SQL: Fix ORDER BY on aggregates and GROUPed BY fields (#51894 ) Previously, in the in-memory sorting module `LocalAggregationSorterListener` only the aggregate functions where used (grabbed by the `sortingColumns`). As a consequence, if the ORDER BY was also using columns of the GROUP BY clause, (especially in the case of higher priority - before the aggregate functions) wrong results were produced. E.g.: ``` SELECT gender, MAX(salary) AS max FROM test_emp GROUP BY gender ORDER BY gender, max ``` Add all columns of the ORDER BY to the `sortingColumns` so that the `LocalAggregationSorterListener` can use the correct comparators in the underlying PriorityQueue used to implement the in-memory sorting. Fixes: #50355 (cherry picked from commit be680af11c823292c2d115bff01658f7b75abd76)	2020-02-12 09:38:47 +01:00
Hendrik Muhs	edaf6d1f79	[Transform] maintain a list of unsupported aggregations in transforms (#52190 ) (#52222 ) add a list of unsupported aggs in transforms and create a test that fails if a new aggregation is added. Limitation: works only if a new agg is added to either the core or a known plugin (Analytics, MatrixAggregation).	2020-02-12 07:48:04 +01:00
Benjamin Trent	2a968f4f2b	[ML] job results provider refactoring (#52012 ) (#52238 ) During a bug hunt, I caught a handful of things (unrelated to the bug) that could be potential issues: 1. Needlessly wrapping in exception handling (minor cleanup) 2. Potential of notifying listeners of a failure multiple times + even trying to notify of a success after a failure notification	2020-02-11 17:54:44 -05:00
Gordon Brown	d48ce12920	Convert ILM and SLM histories into hidden indices (#51456 ) Modifies SLM's and ILM's history indices to be hidden indices for added protection against accidental querying and deletion, and improves IndexTemplateRegistry to handle upgrading index templates. Also modifies the REST test cleanup to delete hidden indices.	2020-02-11 14:18:55 -07:00
Albert Zaharovits	cc1fce96ba	Add a new async search security origin (#52141 ) This commit adds a new security origin, and an associated reserved user and role, named `_async_search`, which can be used by internal clients to manage the `.async-search-*` restricted index namespace.	2020-02-11 19:58:06 +02:00
James Rodewig	d68a4ec82e	[7.x] Permit EQL feature flag in release builds (#52201 ) (#52214 ) 7.x backport of #52201 Provides a path to set register the EQL feature flag in release builds. This enables EQL in release builds so that release docs tests pass. Release docs tests do not have infrastructure in place to only register snippets from included portions of the docs, they instead include all docs snippets. Since EQL can not be enabled in release builds, this meant that the EQL snippets fail in the release docs tests. This adds the ability to enable EQL in the release docs tests. This system property will be removed when EQL is ready for release.	2020-02-11 11:49:49 -05:00
Hendrik Muhs	098380e483	Percentiles aggregation validation checks for range (#51871 ) disallow to specify percentile out of range [0,100]. This also fixes a problem in transform by failing validation if an invalid percentile configuration is used.	2020-02-11 17:25:39 +01:00
David Roberts	d1d9c40e71	[ML] Switch poor categorization audit warning to use status field (#52195 ) In #51146 a rudimentary check for poor categorization was added to 7.6. This change replaces that warning based on a Java-side check with a new one based on the categorization_status field that the ML C++ sets. categorization_status was added in 7.7 and above by #51879, so this new warning based on more advanced conditions will also be in 7.7 and above. Closes #50749	2020-02-11 15:33:27 +00:00
David Roberts	473468d763	[ML] Better error when persistent task assignment disabled (#52014 ) Changes the misleading error message when attempting to open a job while the "cluster.persistent_tasks.allocation.enable" setting is set to "none" to a clearer message that names the setting. Closes #51956	2020-02-11 15:23:21 +00:00
Igor Motov	667e1a5225	Add Boxplot Aggregation (#52174 ) Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112	2020-02-11 09:38:17 -05:00
Marios Trivyzas	204d086266	SQL: Fix issue with timezone when paginating (#52101 ) Previously, when the specified (or default) fetchSize led to subsequent HTTP requests and the usage of cursors, those subsequent were no longer using the client timezone specified in the initial SQL query. As a consequence, Even though the query is executed once (with the correct timezone) the processing of the query results by the HitExtractors in the next pages was done using the default timezone Z. This could lead to incorrect results. Fix the issue by correctly using the initially specified timezone, which is found in the deserialisation of the cursor string. Fixes: #51258 (cherry picked from commit 8f7afbdeb9295999b48a6c36db5b31cbe0cee432)	2020-02-11 15:27:56 +01:00
Yang Wang	16ba59e9d1	Expose more authentication info to ingest pipeline (#51305 ) (#52119 ) The changes add more granularity for identiying the data ingestion user. The ingest pipeline can now be configure to record authentication realm and type. It can also record API key name and ID when one is in use. This improves traceability when data are being ingested from multiple agents and will become more relevant with the incoming support of required pipelines (#46847) Resolves: #49106	2020-02-11 23:05:01 +11:00
Tim Vernum	b0b1b13311	Extract class to store Authentication in context (#52183 ) This change extracts the code that previously existed in the "Authentication" class that was responsible for reading and writing authentication objects to/from the ThreadContext. This is needed to support multiple authentication objects under separate keys. This refactoring highlighted that there were a large number of places where we extracted the Authentication/User objects from the thread context, in a variety of ways. These have been consolidated to rely on the SecurityContext object. Backport of: #52032	2020-02-11 20:59:06 +11:00
Dimitris Athanasiou	6086fadf00	[7.x][ML] Prepare to hold additional stats in DF Analytics task (#52134 ) (#52187 ) Refactors `DataFrameAnalyticsTask` to hold a `StatsHolder` object. That just has a `ProgressTracker` for now but this is paving the way to add additional stats like memory usage, analysis stats, etc. Backport #52134	2020-02-11 11:18:45 +02:00
Dimitris Athanasiou	cbebc26f50	[7.x][ML] Retry persisting DF Analytics results (#52048 ) (#52160 ) Employs `ResultsPersisterService` from `DataFrameRowsJoiner` in order to add retries when a data frame analytics job is persisting the results to the destination data frame. Backport of #52048	2020-02-11 09:55:00 +02:00
Andrei Stefan	2f1631d9d0	Telemetry data initial implementation (#51715 ) (#52175 ) (cherry picked from commit f1d1cceacaacf226fcd2459f34689843b822fe4b)	2020-02-11 09:15:47 +02:00
Marios Trivyzas	6b600855a9	SQL: Make parsing of date more lenient (#52137 ) Make the parsing of date more lenient - as an escaped literal: `{d '2020-02-10[[T\| ]10:20[:30][.123456789][tz]]'}` - cast a string to a date: `CAST(2020-02-10[[T\| ]10:20[:30][.123456789][tz]]' AS DATE)` Closes: #49379 (cherry picked from commit 5863b27500d5e7f6cdd8c6c62b09b84e53ca724a)	2020-02-10 21:47:00 +01:00
Julie Tibshirani	28a8db730f	In FieldTypeLookup, factor out flat object field logic. (#52091 ) Currently, the logic for looking up `flattened` field types lives in the top-level `FieldTypeLookup`. This PR moves it into a dedicated class `DynamicKeyFieldTypeLookup`.	2020-02-10 10:44:02 -08:00
Bogdan Pintea	7b58ed0dd7	Fix milliseconds handling in intervals (#51675 ) (#52156 ) This fixes: - the parsing of milliseconds in intervals: everything past the . used to be converted as-is to milliseconds, with no normalisation of the unit; thus, a value of .23 ended up as 23 millis in the interval, instead of 230. - the printing of a trailing .0, in case the interval lacks the fractional part; - tests generating a random millisecond value used to simply print it in the string about to be evaluated without a necessary front-filling of 0[s], where the amount was below 100/10. (The combination of first and last issues above, plus statistical "luck" made the incorrect handling pass the tests.) (cherry picked from commit 4de8c64f63ee37c1bcfdb9b9d3a07d09be243222)	2020-02-10 19:24:26 +01:00
Lee Hinman	37a2e9bac6	[7.x] Allow forcemerge in the hot phase for ILM policies (#520… (#52083 ) * Allow forcemerge in the hot phase for ILM policies This commit changes the `forcemerge` action to also be allowed in the `hot` phase for policies. The forcemerge will occur after a rollover, and allows users to take advantage of higher disk speeds for performing the force merge (on a separate node type, for example). On caveat with this is that a `forcemerge` in the `hot` phase MUST be accompanied by a `rollover` action. ILM validates policies to ensure this is the case. Resolves #43165 * Use anyMatch instead of findAny in validation * Make randomTimeseriesLifecyclePolicy single-pass	2020-02-10 08:54:49 -07:00
Przemysław Witek	c7cc383d33	[7.x] Update persistent state document in the index the document belongs to (#51751 ) (#52145 )	2020-02-10 16:32:34 +01:00
Nhat Nguyen	864e9d875d	Bubble up exception in follow task in ccr tests (#52085 ) It's perfectly fine if a bulk request on the follower hits IndexShardClosedException in some CCR tests because we sometimes close some follower shards while the follow-task is replicating operations. Instead of failing the test immediately, this commit bubbles up that failure to the shard follow task. Closes #52052	2020-02-10 08:27:04 -05:00
Marios Trivyzas	27265f032a	SQL: Enhance timestamp escaped literal parsing (#52097 ) Allow also whitespace ` ` (together with `T`) as a separator between date and time parts of the timestamp string. E.g.: ``` {ts '2020-02-08 12.10.45'} ``` or ``` {ts '2020-02-08T12.10.45'} ``` Fixes: #46069 (cherry picked from commit 07c977023fb8ceab5991c359a6cbfe07beaad9bb)	2020-02-10 11:24:55 +01:00
Tim Vernum	4e4815355a	Mute DocumentSubsetBitsetCacheTests.testCacheUnderConcurrentAccess (#52135 ) Test does not always complete in expected time. Relates: #51914 Backport of: #52122	2020-02-10 21:19:18 +11:00
Andrei Stefan	fa4dcd50d9	Extract common optimization rules for QL (#52054 ) (#52132 ) (cherry picked from commit ee43115531234c2d955193ce0c9c268e1f02ab43)	2020-02-10 11:48:45 +02:00
Ignacio Vera	80e3c97210	Upgrade to lucene-8.5.0-snapshot-d62f6307658 (#52039 ) (#52130 )	2020-02-10 10:13:22 +01:00
David Roberts	1cefafdd14	[ML] Add new categorization stats to model_size_stats (#52009 ) This change adds support for the following new model_size_stats fields: - categorized_doc_count - total_category_count - frequent_category_count - rare_category_count - dead_category_count - categorization_status Backport of #51879	2020-02-10 09:10:50 +00:00
Jay Modi	3edadfefd0	RestHandlers declare handled routes (#52123 ) This commit changes how RestHandlers are registered with the RestController so that a RestHandler no longer needs to register itself with the RestController. Instead the RestHandler interface has new methods which when called provide information about the routes (method and path combinations) that are handled by the handler including any deprecated and/or replaced combinations. This change also makes the publication of RestHandlers safe since they no longer publish a reference to themselves within their constructors. Closes #51622 Co-authored-by: Jason Tedor <jason@tedor.me> Backport of #51950	2020-02-09 22:48:32 -07:00
Ioannis Kakavas	8c0b49cd32	Adjust jarHell and 3rd party audit exclusions (#51733 ) (#51766 ) Now that the FIPS 140 security provider is simply a test dependency we don't need the thirdPartyAudit exceptions, but plugin-cli and transport-netty4 do need jarHell disabled as they use the non fips BouncyCastle security provider as a test dependency too.	2020-02-10 07:38:59 +02:00
Tim Vernum	d5c015062d	Don't allow null User.principal (#52049 ) Some parts of the User class (e.g. equals/hashCode) assumed that principal could never be null, but the constructor didn't enforce that. This adds a null check into the constructor and fixes a few tests that relied on being able to pass in null usernames. Backport of: #51988	2020-02-10 12:23:55 +11:00
Jason Tedor	2b99291187	Add autoscaling feature flag in release REST tests (#52096 ) The REST tests for autoscaling either need to be skipped in a non-snapshot build, or alternatively, the feature flag registered so that autoscaling can be enabled. We prefer the latter approach, as it allows us to also test autoscaling in non-snapshot builds incrementally, instead of at the end of development as autoscaling prepares for release. This commit registers the autoscaling feature flag in REST tests for non-snapshot builds.	2020-02-09 15:49:01 -05:00
Armin Braun	90eb6a020d	Remove Redundant Loading of RepositoryData during Restore (#51977 ) (#52108 ) We can just put the `IndexId` instead of just the index name into the recovery soruce and save one load of `RepositoryData` on each shard restore that way.	2020-02-09 21:44:18 +01:00
Marios Trivyzas	3e7f939f63	SQL: [Tests] Add more tests for aggs and literals (#52086 ) Add some more tests where more than one literal is selected, unaliased and aliased. Follows: #42121 (cherry picked from commit 405271d408a233e697eb2e9ded3005a71f4df5e7)	2020-02-09 18:01:05 +01:00
Costin Leau	214beed90f	QL: move query AST from SQL to QL (#52069 ) (cherry picked from commit 59368968b698652352be1bb2a60d5a357a01b978)	2020-02-08 23:10:51 +02:00
Jason Tedor	8b1d2c5b95	Permit autoscaling feature flag in release builds (#52088 ) This commit provides a path to set register the autoscaling feature flag in release builds, and therefore enabling autoscaling in release builds. The primary reason that we add this is so that our release docs tests can pass. Our release docs tests do not have infrastructure in place to only register snippets from included portions of the docs, they instead include all docs snippets. Since autoscaling can not be enabled in release builds, this meant that the autoscaling snippets would fail in the release docs tests. To address then, we need the ability to enable autoscaling in the release docs tests which we can now do with the system property added here. This system property will be removed when autoscaling is ready for release.	2020-02-07 21:40:51 -05:00

1 2 3 4 5 ...

4122 Commits