OpenSearch

Commit Graph

Author	SHA1	Message	Date
Marios Trivyzas	dac720d7a1	Add a cluster setting to disallow expensive queries (#51385 ) (#52279 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050 (cherry picked from commit a8b39ed842c7770bd9275958c9f747502fd9a3ea)	2020-02-12 22:56:14 +01:00
Lisa Cawley	40b58e612d	[DOCS] Fixes, sorts ML tagged regions (#52283 )	2020-02-12 13:52:34 -08:00
Marios Trivyzas	d9fd6fc90c	SQL: [Docs] Fix typo Add missing closing "`" Follows: `c2e0552537`	2020-02-12 21:50:57 +01:00
Nik Everett	0c1889389a	Update skip for backported fix (#52241 ) Now that #51172 is fully backported we can fix the `skip` clause in the bwc tests for it.	2020-02-12 13:55:47 -05:00
Ryan Ernst	c07f46409c	Fix single newline in logging output stream buffer (#52253 ) The buffer in LoggingOutputStream skips flushing when only a newline appears. However, if a windows newline appeared, the buffer length was not reset. This commit resets the length so the \r does not appear in the next logging message. closes #51838	2020-02-12 10:48:55 -08:00
Bogdan Pintea	5dfe27601e	SQL: supplement input checks on received request parameters (#52229 ) (#52277 ) * Add more checks around parameter conversions This commit adds two necessary verifications on received parameters: - it checks the validity of the parameter's data type: if the declared data type is resolved to an ES or Java type; - it checks if the returned converter is non-null (i.e. a conversion is possible) and generates an appropriate exception otherwise. (cherry picked from commit eda30ac9c69383165324328c599ace39ac064342)	2020-02-12 19:45:12 +01:00
James Rodewig	fc964643bd	[DOCS] Add docs build info to TESTING.asciidoc (#52271 ) Adds a brief section about Elasticsearch docs and how users can test/build them locally.	2020-02-12 13:00:45 -05:00
Armin Braun	6ea3f5ada1	Move EC2 Discovery Tests to Mock Rest API (#50605 ) (#52270 ) Move EC2 discovery tests to using the mock REST API introduced in https://github.com/elastic/elasticsearch/pull/50550 instead of mocking the AWS SDK classes manually. Move the trivial remaining AWS SDK mocks to the single test suit that was using them.	2020-02-12 18:35:50 +01:00
Costin Leau	26900bfb05	EQL: Add infra for planning and query folding (#52065 ) Actual folding not yet in place (TBD) (cherry picked from commit d52b96f273a94c90e475a5035cd57baa086fb0c0)	2020-02-12 18:51:42 +02:00
Nhat Nguyen	e098e837f7	Fix testShouldPeriodicallyFlushAfterMerge (#52243 ) MockRandomMergePolicy randomly determines if a segment should use a compound format. This can cause a force merge performing two merges: (1) merging to a single segment, (2) rewriting the new segment using the compound format. If the second merge completes after we have flushed, then it can flip the flag shouldPeriodicallyFlushAfterBigMerge to true. Closes #52205	2020-02-12 11:25:39 -05:00
Nhat Nguyen	257eb0212c	Mute ‘test user agent processor with non-ECS schema’ Tracked at #52266	2020-02-12 10:27:18 -05:00
James Rodewig	ca34817659	[DOCS] Add EQL limitations page (#52001 ) Documents limitations for EQL in Elasticsearch.	2020-02-12 08:45:43 -05:00
James Rodewig	20453d3ac8	[DOCS] Add basic EQL search tutorial docs (#51574 ) I plan to add additional sections to this page with future PRs: * Specify timestamp and event type fields * Specify a join key field * Filter using query DSL * Paginate a large response See #51057.	2020-02-12 08:42:09 -05:00
Hendrik Muhs	5d35eaa1cb	[Transform] improve irrecoverable error detection - part 2 (#52003 ) base error handling on rest status instead of listing individual exception types relates to #51820	2020-02-12 14:38:42 +01:00
James Rodewig	3f151d1d75	[DOCS] Add redirects, update JSON spec to fix docs build (#51747 ) Docs build [#11556][0] broke due to several outdated or incorrect links in the JSON REST spec. This fixes those links where possible and adds redirects. [0]: https://elasticsearch-ci.elastic.co/job/elastic+docs+master+build/11556/	2020-02-12 08:30:59 -05:00
Marios Trivyzas	c2e0552537	SQL: [Docs] Add limitation for sorting on aggs (#52210 ) Add a section to point out that when ordering by an aggregate only plain aggregate functions are allowed, no scalars/operators can be used on top of them. Fixes: #52204 (cherry picked from commit 78a1185549ff7f3229fd2d036567eb2a4f2cf230)	2020-02-12 12:56:06 +01:00
Andrei Stefan	a3ebacfcf3	52169 & 52172 7x backport (#52256 ) * Extract common optimizer tests (#52169) (cherry picked from commit e5ad72bc22e9ec0686ab582195f0032efcb880bf) * Hook in the optimizer rules (#52172) (cherry picked from commit 1f90d8cc56052fbf2af604e72f9f5ca73f5e75d5)	2020-02-12 11:20:03 +02:00
Marios Trivyzas	daab242c75	SQL: Fix ORDER BY on aggregates and GROUPed BY fields (#51894 ) Previously, in the in-memory sorting module `LocalAggregationSorterListener` only the aggregate functions where used (grabbed by the `sortingColumns`). As a consequence, if the ORDER BY was also using columns of the GROUP BY clause, (especially in the case of higher priority - before the aggregate functions) wrong results were produced. E.g.: ``` SELECT gender, MAX(salary) AS max FROM test_emp GROUP BY gender ORDER BY gender, max ``` Add all columns of the ORDER BY to the `sortingColumns` so that the `LocalAggregationSorterListener` can use the correct comparators in the underlying PriorityQueue used to implement the in-memory sorting. Fixes: #50355 (cherry picked from commit be680af11c823292c2d115bff01658f7b75abd76)	2020-02-12 09:38:47 +01:00
Andrei Stefan	74e7777cbb	Hook in the optimizer rules (#52172 ) (cherry picked from commit 1f90d8cc56052fbf2af604e72f9f5ca73f5e75d5)	2020-02-12 09:32:34 +02:00
Andrei Stefan	a21e2b211a	Extract common optimizer tests (#52169 ) (cherry picked from commit e5ad72bc22e9ec0686ab582195f0032efcb880bf)	2020-02-12 09:32:33 +02:00
Hendrik Muhs	edaf6d1f79	[Transform] maintain a list of unsupported aggregations in transforms (#52190 ) (#52222 ) add a list of unsupported aggs in transforms and create a test that fails if a new aggregation is added. Limitation: works only if a new agg is added to either the core or a known plugin (Analytics, MatrixAggregation).	2020-02-12 07:48:04 +01:00
Lisa Cawley	dd14210689	[DOCS] Clarifies machine learning built-in roles (#51504 )	2020-02-11 18:28:53 -08:00
Jason Tedor	79e5e809b6	Add unit tests for reading JVM options files (#52176 ) This commit adds some unit tests to cover the reading of JVM options files.	2020-02-11 21:02:34 -05:00
Benjamin Trent	2a968f4f2b	[ML] job results provider refactoring (#52012 ) (#52238 ) During a bug hunt, I caught a handful of things (unrelated to the bug) that could be potential issues: 1. Needlessly wrapping in exception handling (minor cleanup) 2. Potential of notifying listeners of a failure multiple times + even trying to notify of a success after a failure notification	2020-02-11 17:54:44 -05:00
Mark Vieira	28c56da754	Don't track absolute path as test input to improve cacheability (#52235 )	2020-02-11 13:32:59 -08:00
Gordon Brown	d48ce12920	Convert ILM and SLM histories into hidden indices (#51456 ) Modifies SLM's and ILM's history indices to be hidden indices for added protection against accidental querying and deletion, and improves IndexTemplateRegistry to handle upgrading index templates. Also modifies the REST test cleanup to delete hidden indices.	2020-02-11 14:18:55 -07:00
Jason Tedor	bb2e04bc16	Use absolute path for temporary directory in tests (#52228 ) We explicitly set the path for the temporary directory to use in test tasks, but today this path is a relative path, relative to the current working directory of the test task. The fact that we are using a relative path here appears to be legacy, simply leftover from the days of the Maven build. An absolute path is preferred here, since it's explicit and we do not have to rely on everyone resolving the path properly relative to the working directory.	2020-02-11 15:17:45 -05:00
Jason Tedor	6ed3311443	Ensure test temporary directory exists (#52227 ) Today we we set the test temporary directory explicitly by controling java.io.tmpdir. Yet, we do not guarantee this directory exists, instead relying on a test base class (LuceneTestCase) to create this directory when it initializes. However, some of our tests do not rely on our test framework, and thus do not have access to LuceneTestCase, instead relying on RandomizedRunner directly. We should not be relying on the temporary directory being implicitly created, instead guaranteeing that it exists before test execution starts. This commit does that by creating the test temporary directory before the test task executes (via a doFirst).	2020-02-11 14:53:16 -05:00
Zachary Tong	0372d6d239	Allow ObjectParsers to specify required sets of fields (#49661 ) ConstructingObjectParser can be used to specify required fields, but it is still difficult to configure "sets" of fields where only one of the set is required (requiring hand-rolled logic in each ConstructingObjectParser, or adding special validation methods to objects that are called after building the object). This commit adds a new method on ObjectParser which allows the parsers to register required sets. E.g. ["foo", "bar"] can be registered, which means "foo", "bar" or both must be configured by the user otherwise an exception is thrown. This pattern crops up in many places in our parsers; a good example are the aggregation "field" and "script" fields. One or both must be configured on all aggregations, omitting both should result in an exception. This was previously handled far downstream resulting in an aggregation exception, when it should be a parse exception.	2020-02-11 13:03:33 -05:00
Nik Everett	86d5211c05	Make sorting by an agg results a real abstraction (#52007 ) (#52212 ) This removes a bunch of `instanceof`s in favor of two new methods on `InernalAggregation`. The default implementations of these methods just throw exceptions explaining that you can't sort on this aggregation. They are overridden by all of the classes that used to have `instanceof` checks against them. I doubt this is really any faster in practice. The real benefit here is that it is a little more obvious that you can sort by the results of an aggregation and it should be much more obvious where to look at how aggregations sort themselves. There are still a bunch more `instanceof`s in left in `AggregationPath` but those will wait for a followup change.	2020-02-11 12:58:40 -05:00
Albert Zaharovits	cc1fce96ba	Add a new async search security origin (#52141 ) This commit adds a new security origin, and an associated reserved user and role, named `_async_search`, which can be used by internal clients to manage the `.async-search-*` restricted index namespace.	2020-02-11 19:58:06 +02:00
James Rodewig	d68a4ec82e	[7.x] Permit EQL feature flag in release builds (#52201 ) (#52214 ) 7.x backport of #52201 Provides a path to set register the EQL feature flag in release builds. This enables EQL in release builds so that release docs tests pass. Release docs tests do not have infrastructure in place to only register snippets from included portions of the docs, they instead include all docs snippets. Since EQL can not be enabled in release builds, this meant that the EQL snippets fail in the release docs tests. This adds the ability to enable EQL in the release docs tests. This system property will be removed when EQL is ready for release.	2020-02-11 11:49:49 -05:00
Hendrik Muhs	098380e483	Percentiles aggregation validation checks for range (#51871 ) disallow to specify percentile out of range [0,100]. This also fixes a problem in transform by failing validation if an invalid percentile configuration is used.	2020-02-11 17:25:39 +01:00
James Rodewig	6fe8f1649b	[DOCS] Include docs on permanently unreleased branches only (#51743 ) Adds the ability to display docs on permanently unreleased branches, such as `master` and `7.x`. Also updates how the autoscaling and EQL docs are included. Currently, these feature-flag docs would display on any unreleased branches that contain the changes, such as 7.7.	2020-02-11 11:24:13 -05:00
David Roberts	d1d9c40e71	[ML] Switch poor categorization audit warning to use status field (#52195 ) In #51146 a rudimentary check for poor categorization was added to 7.6. This change replaces that warning based on a Java-side check with a new one based on the categorization_status field that the ML C++ sets. categorization_status was added in 7.7 and above by #51879, so this new warning based on more advanced conditions will also be in 7.7 and above. Closes #50749	2020-02-11 15:33:27 +00:00
David Roberts	473468d763	[ML] Better error when persistent task assignment disabled (#52014 ) Changes the misleading error message when attempting to open a job while the "cluster.persistent_tasks.allocation.enable" setting is set to "none" to a clearer message that names the setting. Closes #51956	2020-02-11 15:23:21 +00:00
Zachary Tong	87854573e4	Add version constant for 7.6.1	2020-02-11 09:44:43 -05:00
Igor Motov	667e1a5225	Add Boxplot Aggregation (#52174 ) Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112	2020-02-11 09:38:17 -05:00
Marios Trivyzas	204d086266	SQL: Fix issue with timezone when paginating (#52101 ) Previously, when the specified (or default) fetchSize led to subsequent HTTP requests and the usage of cursors, those subsequent were no longer using the client timezone specified in the initial SQL query. As a consequence, Even though the query is executed once (with the correct timezone) the processing of the query results by the HitExtractors in the next pages was done using the default timezone Z. This could lead to incorrect results. Fix the issue by correctly using the initially specified timezone, which is found in the deserialisation of the cursor string. Fixes: #51258 (cherry picked from commit 8f7afbdeb9295999b48a6c36db5b31cbe0cee432)	2020-02-11 15:27:56 +01:00
David Turner	00b9098250	Ignore timeouts with single-node discovery (#52159 ) Today we use `cluster.join.timeout` to prevent nodes from waiting indefinitely if joining a faulty master that is too slow to respond, and `cluster.publish.timeout` to allow a faulty master to detect that it is unable to publish its cluster state updates in a timely fashion. If these timeouts occur then the node restarts the discovery process in an attempt to find a healthier master. In the special case of `discovery.type: single-node` there is no point in looking for another healthier master since the single node in the cluster is all we've got. This commit suppresses these timeouts and instead lets the node wait for joins and publications to succeed no matter how long this might take.	2020-02-11 14:15:01 +00:00
David Roberts	4c88996cd7	[DOCS] Correct important note for xpack.transform.enabled (#52194 ) Because transforms get assigned to an arbitrary data node it is important that the transforms plugin is enabled on every data node.	2020-02-11 13:02:10 +00:00
Yang Wang	16ba59e9d1	Expose more authentication info to ingest pipeline (#51305 ) (#52119 ) The changes add more granularity for identiying the data ingestion user. The ingest pipeline can now be configure to record authentication realm and type. It can also record API key name and ID when one is in use. This improves traceability when data are being ingested from multiple agents and will become more relevant with the incoming support of required pipelines (#46847) Resolves: #49106	2020-02-11 23:05:01 +11:00
David Kyle	343ced42be	Mute LoggingOutputStreamTests.testMaxBuffer (#52193 ) Relates to https://github.com/elastic/elasticsearch/issues/51838	2020-02-11 11:46:17 +00:00
Tim Vernum	b0b1b13311	Extract class to store Authentication in context (#52183 ) This change extracts the code that previously existed in the "Authentication" class that was responsible for reading and writing authentication objects to/from the ThreadContext. This is needed to support multiple authentication objects under separate keys. This refactoring highlighted that there were a large number of places where we extracted the Authentication/User objects from the thread context, in a variety of ways. These have been consolidated to rely on the SecurityContext object. Backport of: #52032	2020-02-11 20:59:06 +11:00
Dimitris Athanasiou	6086fadf00	[7.x][ML] Prepare to hold additional stats in DF Analytics task (#52134 ) (#52187 ) Refactors `DataFrameAnalyticsTask` to hold a `StatsHolder` object. That just has a `ProgressTracker` for now but this is paving the way to add additional stats like memory usage, analysis stats, etc. Backport #52134	2020-02-11 11:18:45 +02:00
Martijn van Groningen	c14e4666df	Wait for watcher to be started prior to rolling upgrade tests. (#52186 ) Backport: #52139 In the rolling upgrade tests, watcher is manually executed, in rare scenarios this happens before watcher is started, resulting in the manual execution to fail. Relates to #33185	2020-02-11 09:39:20 +01:00
Dimitris Athanasiou	cbebc26f50	[7.x][ML] Retry persisting DF Analytics results (#52048 ) (#52160 ) Employs `ResultsPersisterService` from `DataFrameRowsJoiner` in order to add retries when a data frame analytics job is persisting the results to the destination data frame. Backport of #52048	2020-02-11 09:55:00 +02:00
Andrei Stefan	2f1631d9d0	Telemetry data initial implementation (#51715 ) (#52175 ) (cherry picked from commit f1d1cceacaacf226fcd2459f34689843b822fe4b)	2020-02-11 09:15:47 +02:00
Lisa Cawley	c4525f8cca	[DOCS] Adds ml-cpp PRs to release notes (#52158 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-02-10 18:06:01 -08:00
Jason Tedor	91d0996e08	Remove unnecessary method in JvmOptionsParser (#52173 ) Back when the distribution launchers were compiled to target JDK 7, we did not have access to the String#join method to space-delimit JVM options. Since the launchers now target the same minimum JDK as Elasticsearch itself, we now have access to this method and can replace the use of spaceDelimitJvmOptions with String#join. This commit does that.	2020-02-10 20:22:02 -05:00

... 2 3 4 5 6 ...

50122 Commits All Branches Search

50122 Commits

All Branches