OpenSearch

Commit Graph

Author	SHA1	Message	Date
Andras Palinkas	7f7e938a25	{S,E}QL: Fix optimization of `NotEquals` in conjunctions (#65331 ) (#65449 ) * Fix the `CombineBinaryComparisons` optimizer rule, so that semantic equality taken into account during the optimization of `NotEquals` Examples that previously removed the `NotEquals` expressions (leading to incorrect results): ``` double >= 10 AND integer != 9 --> double >= 10 keyword != '2021' AND datetime >= '2020-01-01T00:00:00' --> datetime >= '2020-01-01T00:00:00' ``` With the fix, expressions like the above will not be touched. `NotEquals` will only be eliminated from the `AND` expression if the left side of the `NotEquals` `semanticEquals()` to the left side of the other expressions within the conjunction (comparisons against the same field/expression). * Unit tests and integration tests Close #65322 (cherry-picked from 8b2b7fa)	2020-11-24 13:20:32 -05:00
Andras Palinkas	3d8e17f3bd	SQL: Fix incorrect parameter resolution (#63710 ) (#64615 ) Summary of the issue and the root cause: ``` (1) SELECT 100, 100 -> success (2) SELECT ?, ? (with params: 100, 100) -> success (3) SELECT 100, 100 FROM test -> Unknown output attribute exception for the second 100 (4) SELECT ?, ? FROM test (params: 100, 100) -> Unknown output attribute exception for the second ? (5) SELECT field1 as "x", field1 as "x" FROM test -> Unknown output attribute exception for the second "x" ``` There are two separate issues at play here: 1. Construction of `AttributeMap`s keeps only one of the `Attribute`s with the same name even if the `id`s are different (see the `AttributeMapTests` in this PR). This should be fixed no matter what, we should not overwrite attributes with one another during the construction of the `AttributeMap`. 2. The `id` on the `Alias`es is not the same in case the `Alias`es have the same `name` and same `child` It was considered to simpy fix the second issue by just reassigning the same `id`s to the `Alias`es with the same name and child, but it would not solve the `unknown output attribute exception` (see notes below). This PR covers the fix for the first issue. Relates to #56013	2020-11-04 20:38:00 -05:00
Andrei Stefan	5f3c79d64b	Remove filter from QL's field_caps requests (#63840 ) (#63845 ) (cherry picked from commit f009e6341d0fc0471f212d5a41c91e7aab77e006)	2020-10-17 01:36:26 +03:00
Costin Leau	06eae58d40	EQL: Fix translation of bool fields (#63694 ) This commit fixes two issues in dealing with bool fields in EQL: - avoid simplifications of field == true expressions - adding comparison to clauses on fields missing logic (where bool) Fix #63693 (cherry picked from commit d10a5d0e842bbd4e0031834de948ceb24da3872b) (cherry picked from commit 0227da3a275c7f22ff524d99d53e1a79146f9e28)	2020-10-15 14:38:31 +03:00
Ryland Herrick	7e8769a666	EQL: make allow_no_indices true by default (#63573 ) (#63645 ) * Allow all indices options variants Irrespective of allow_no_indices value, throw VerificationException when there is no index validated Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>	2020-10-14 03:41:04 +03:00
Gordon Brown	5c8b0662df	Deprecate REST access to System Indices (#63274 ) (Original #60945 ) This PR adds deprecation warnings when accessing System Indices via the REST layer. At this time, these warnings are only enabled for Snapshot builds by default, to allow projects external to Elasticsearch additional time to adjust their access patterns. Deprecation warnings will be triggered by all REST requests which access registered System Indices, except for purpose-specific APIs which access System Indices as an implementation detail a few specific APIs which will continue to allow access to system indices by default: - `GET _cluster/health` - `GET {index}/_recovery` - `GET _cluster/allocation/explain` - `GET _cluster/state` - `POST _cluster/reroute` - `GET {index}/_stats` - `GET {index}/_segments` - `GET {index}/_shard_stores` - `GET _cat/[indices,aliases,health,recovery,shards,segments]` Deprecation warnings for accessing system indices take the form: ``` this request accesses system indices: [.some_system_index], but in a future major version, direct access to system indices will be prevented by default ```	2020-10-06 13:41:40 -06:00
Andrei Stefan	76bba601ab	Remove case_sensitive request option (#63218 ) (#63244 ) Make EQL case sensitive by default and adapt some of the string functions Remove the case sensitive option from Between string function Add case_insensitive option to term and wildcard queries usage (cherry picked from commit 7550e0664c8c2f1f13519036c759b1e76345551f)	2020-10-05 22:04:42 +03:00
Costin Leau	b67d2274ae	QL: Optimize regexs without patterns as equality (#63216 ) If a QL regex doesn't contain any pattern, convert it to Equals. Close #63196 (cherry picked from commit e22a843124290aaacd0e80d7ae9b883e5ec2431e)	2020-10-05 14:21:42 +03:00
Costin Leau	ef7a6ce4b2	EQL: Refactor testing infrastructure (#62928 ) Extract reusable methods inside QL TestUtils Rename abstract base classes for clarity Clean-up EQL DataLoader (cherry picked from commit 48db3f285aa8976ead5a9f5d071a9c1046d7bd31)	2020-09-28 14:22:56 +03:00
Costin Leau	71b92f8699	QL: Optimize Like/Rlike all (#62682 ) Replace common Like and RLike queries that match all characters with IsNotNull (exists) queries Fix #62585 (cherry picked from commit 4c23fad0468a9edd7325b06c6a96f7af37625dbf)	2020-09-24 13:44:53 +03:00
Nik Everett	fa13585fae	Fix Eclipse build (#62733 ) (#62786 ) Eclipse was confused for two reasons: 1. `:x-pack:plugin` depended on itself. 2. `ql`, `sql`, and `eql` couldn't see some methods. I fixed problem 1 by only adding the "depends on itself" configuration outside of eclipse. I fixed problem 2 by making a `test` sub-project in `ql` that contains test utilities and depending on those where possible.	2020-09-22 17:44:25 -04:00
Costin Leau	03d2395183	EQL: Use Point In Time inside sequences (#62276 ) Use the newly introduced PIT API to have a consistent view of the data while doing sequence matching, which involves multiple calls, aka repeatable reads and thus avoid race conditions or any in-flight updates on the data. (cherry picked from commit daa72fc3c71fd36afb55278021ff6bbc591ef148)	2020-09-15 15:40:03 +03:00
Jake Landis	d8dad9ab2c	[7.x] Remove integTest task from PluginBuildPlugin (#61879 ) (#62135 ) This commit removes `integTest` task from all es-plugins. Most relevant projects have been converted to use yamlRestTest, javaRestTest, or internalClusterTest in prior PRs. A few projects needed to be adjusted to allow complete removal of this task * x-pack/plugin - converted to use yamlRestTest and javaRestTest * plugins/repository-hdfs - kept the integTest task, but use `rest-test` plugin to define the task * qa/die-with-dignity - convert to javaRestTest * x-pack/qa/security-example-spi-extension - convert to javaRestTest * multiple projects - remove the integTest.enabled = false (yay!) related: #61802 related: #60630 related: #59444 related: #59089 related: #56841 related: #59939 related: #55896	2020-09-09 14:25:41 -05:00
Andrei Stefan	db8788e5a2	QL: wildcard field type support (#58062 ) (#61205 ) (cherry picked from commit c874e6cdd3e051ce599b50c18642de038b84105f)	2020-08-17 18:24:32 +03:00
Andrei Stefan	90e116738e	QL: add filtering query dsl support to IndexResolver (#60514 ) (#61200 ) (cherry picked from commit 7b3635d796be26af9f87d19963a8ed4ab4bbf13f)	2020-08-17 17:59:58 +03:00
Jake Landis	bcb9d06bb6	[7.x] Cleanup xpack build.gradle (#60554 ) (#60603 ) This commit does three things: * Removes all Copyright/license headers for the build.gradle files under x-pack. (implicit Apache license) * Removes evaluationDependsOn(xpackModule('core')) from build.gradle files under x-pack * Removes a place holder test in favor of disabling the test task (in the async plugin)	2020-08-03 13:11:43 -05:00
Rene Groeschke	ed4b70190b	Replace immediate task creations by using task avoidance api (#60071 ) (#60504 ) - Replace immediate task creations by using task avoidance api - One step closer to #56610 - Still many tasks are created during configuration phase. Tackled in separate steps	2020-07-31 13:09:04 +02:00
Bogdan Pintea	4c771485f6	SQL: fix NPE on ambiguous GROUP BY (#59370 ) (#60416 ) * fix npe on ambiguous group by * add tests for aggregates and group by, add quotes to error message * add more cases for Group By ambiguity test * change error messages for field ambiguity * change collection aliases approach * add locations of attributes for ambiguous grouping error * Adress review comments - remove Comparable implementations from Attribute and Location; - add ad-hoc comparator for sorting locations in ambiguity message; - remove added AttributeAlias class with Touple; - add code comment to explain issue with Location overwriting. * Fix c&p error in location ref generation comparator Fix copy&paste error in dedicated comparator used for sorting ambiguity location references. Slightly increase its readability. Co-authored-by: Nikita Verkhovin <verkhovin13@gmail.com> (cherry picked from commit 9ba70a3483f0f4987229bec231cdc004f51b88a5)	2020-07-29 20:44:28 +02:00
Julie Tibshirani	c7bfb5de41	Add search `fields` parameter to support high-level field retrieval. (#60258 ) This feature adds a new `fields` parameter to the search request, which consults both the document `_source` and the mappings to fetch fields in a consistent way. The PR merges the `field-retrieval` feature branch. Addresses #49028 and #55363.	2020-07-28 10:58:20 -07:00
Costin Leau	fe775a315f	EQL: Obey size request parameter (#59014 ) While at it, change the default size to 10 (to align it with the search API defaults). (cherry picked from commit 45795939b277e736a9e4f2f008d1c3f406239075)	2020-07-06 19:14:25 +03:00
Costin Leau	3c81b91474	EQL: Add Head/Tail pipe support (#58536 ) Introduce pipe support, in particular head and tail (which can also be chained). (cherry picked from commit 4521ca3367147d4d6531cf0ab975d8d705f400ea) (cherry picked from commit d6731d659d012c96b19879d13cfc9e1eaf4745a4)	2020-06-27 09:49:14 +03:00
Andrei Stefan	69f73d948b	EQL: code cleanup and further tests (#58458 ) (#58497 ) Add FunctionPipe tests to all functions. Cleanup functions code. (cherry picked from commit 0f83d5799841fe99d8aeaf46e50dd11aa6bf8a57)	2020-06-24 17:38:56 +03:00
Rene Groeschke	01e9126588	Remove deprecated usage of testCompile configuration (#57921 ) (#58083 ) * Remove usage of deprecated testCompile configuration * Replace testCompile usage by testImplementation * Make testImplementation non transitive by default (as we did for testCompile) * Update CONTRIBUTING about using testImplementation for test dependencies * Fail on testCompile configuration usage	2020-06-14 22:30:44 +02:00
Aleksandr Maus	ec60335496	EQL: implement case sensitivity for indexOf and endsWith string functions (#57707 ) (#57908 ) * EQL: implement case sensitivity for indexOf and endsWith string functions	2020-06-10 08:55:49 -04:00
Costin Leau	439205d1ea	EQL: Introduce tie breaker support (#57787 ) Allow a field inside the data to be used as a tie breaker for events that have the same timestamp. The field is optional by default. If used, the tie-breaker always requires a non-null value since it is used inside `search_after` which requires a non-null value. Fix #56824 (cherry picked from commit e5719ecb474b32730d93afdbb6834a32b0b2df8b)	2020-06-09 22:50:19 +03:00
Andrei Stefan	3cc8166946	SQL: handle MIN and MAX functions on dates in Painless scripts (#57605 ) (#57863 ) * Convert to date/datetime the result of numeric aggregations (min, max) in Painless scripts (cherry picked from commit f1de99e2a6fbf3806c4f2b6b809738aa8faa2d75)	2020-06-09 10:09:01 +03:00
Marios Trivyzas	52c555e286	SQL: Make CASTing string to DATETIME more lenient (#57451 ) (#57509 ) Some BI tools (i.e. Tableau) would try to cast strings where the time part is separated from the date part with a whitespace instead of `T`. Adjust type conversion used by CAST to support this. (cherry picked from commit 0e18321e7ad9f779c42855efbf93f171b9128a5e)	2020-06-02 10:54:03 +02:00
Bogdan Pintea	74b2c8a770	Change error message for comp against fields (#57126 ) Change the error message wording for comparisons against fields in filtering (s/variables/fields). (cherry picked from commit d9a1cb50940d0a98fd75b9c0123ca6e1d862f65d)	2020-05-26 17:57:51 +02:00
Andrei Stefan	4d47d63f55	SQL: implement SUM, MIN, MAX, AVG over literals (#56786 ) (#56850 ) * Adds support for MIN, MAX, AVG, SUM aggregates acting on literals. SELECT SUM(1) FROM index and SELECT SUM(1), AVG(2) work both on indices and as local execution. (cherry picked from commit efb72907c0391612c4a2b6256e327060b4167912)	2020-05-16 02:13:55 +03:00
Aleksandr Maus	87a10806ab	EQL: Fix cidrMatch function fails to match when used in scripts (#56246 ) (#56735 ) EQL: Fix cidrMatch function fails to match when used in scripts (#56246) Addresses https://github.com/elastic/elasticsearch/issues/55709	2020-05-13 22:41:24 -04:00
Costin Leau	9f1ecd52eb	EQL: Introduce support for sequences (#56300 ) Initial support for EQL sequences The current algorithm is focused on correctness and does not contain any optimization which is left for the future. The current implementation uses a state machine approach which moves ascending and runs each query one after the other working on computing sequences as the data comes in. For each result, the key and its timestamp are being extracted which are then used for matching/building a sequence. (cherry picked from commit 4f3e18c894a1841d333022361ad9d1fdf1477dc3)	2020-05-13 15:42:31 +03:00
Marios Trivyzas	cbbbd499bf	SQL/EQL: Add support for scalars within LIKE/RLIKE (#56495 ) (#56674 ) - Add support for scalar functions on the field of SQL's LIKE/RLIKE - Add support for scalar functions on the field of EQL's match/matchLite Closes: #55058 (cherry picked from commit 51c14e2dbb7fb29004a23369c449d425b3ac8fe2)	2020-05-13 13:40:24 +02:00
Andrei Stefan	f0074e93a0	QL: case sensitive support in EQL (#56404 ) (#56597 ) * QL: case sensitive support in EQL (#56404) * adds a generic startsWith function to QL * modifies the existent EQL startsWith function to be case sensitive aware * improves the existent EQL startsWith function to use a prefix query when the function is used in a case sensitive context. Same improvement is used in SQL's newly added STARTS_WITH function. * adds case sensitivity to EQL configuration through a case_sensitive parameter in the eql request, as established in #54411. The case_sensitive parameter can be specified when running queries (default is case insensitive) (cherry picked from commit ee5a09ea840167566e34c28c8225dc38bc6a7ae8)	2020-05-12 16:56:18 +03:00
Andrei Stefan	980f175222	EQL: simplify equals/not-equals TRUE/FALSE expressions (#56191 ) (#56306 ) * Simplify equals/not-equals TRUE/FALSE expressions, by returning them as is (TRUE variant) or negating them (FALSE variant) (cherry picked from commit 17858afbe6da5fa0b3ecfc537cabb337e4baaffe)	2020-05-07 03:02:04 +03:00
Ross Wolf	389082033e	EQL: Add concat function (#55193 ) * EQL: Add concat function * EQL: for loop spacing for concat * EQL: return unresolved arguments to concat early * EQL: Add concat integration tests * EQL: Fix concat query fail test * EQL: Add class for concat function testing * EQL: Add concat integration tests * EQL: Update concat() null behavior	2020-05-05 12:53:34 -06:00
Marios Trivyzas	cc21468559	SQL: Fix issue with date range queries and timezone (#56115 ) (#56174 ) Previously, the timezone parameter was not passed to the RangeQuery and as a results queries that use the ES date math notation (now, now-1d, now/d, now/h, now+2h, etc.) were using the UTC timezone and not the one passed through the "timezone"/"time_zone" JDBC/REST params. As a consequence, the date math defined dates were always considered in UTC and possibly led to incorrect results for queries like: ``` SELECT * FROM t WHERE date BETWEEN now-1d/d AND now/d ``` Fixes: #56049 (cherry picked from commit 300f010c0b18ed0f10a41d5e1606466ba0a3088f)	2020-05-05 10:54:23 +02:00
Adrien Grand	58c3bb5ae1	Repurpose `ignore_throttled` to be only about frozen indices. (#55047 ) (#55852 ) This has no practical impact on users since frozen indices are the only throttled indices today. However this has an impact on upcoming features that would use search throttling. Filtering out throttled indices made sense a couple years ago, but as we're now improving support for slow requests with `_async_search` and exploring ways to reduce storage costs, this feature has most likely become a trap, that we'd like to not have with upcoming features that would use search throttling. Relates #54058	2020-04-28 14:31:54 +02:00
Aleksandr Maus	ad54cca823	EQL: implement math functions: add, divide, module, multiply, subtract (#55137 ) (#55737 ) * EQL: implement math functions: add, divide, module, multiply, subtract	2020-04-24 15:52:27 -04:00
Bogdan Pintea	8d6d7b88d8	SQL: drop BASE TABLE type in favour for just TABLE (#54836 ) (#54951 ) * Drop BASE TABLE type in favour for just TABLE This commit drops the table type 'BASE TABLE' and replaces all occurences with just 'TABLE', since his type is wider-used and friendlier to the client applications that query for certain table types in their discovery mode. The 'TABLE' type is also explicitely mentioned by the JDBC and ODBC standards and although other data source-specific types are permitted, older apps will not work well with them. * Refactor table type constants out of IndexType Move SQL_TABLE/_ALIAS out of IndexType, so that they can also be used in that Enum definition. (cherry picked from commit 70241b52697ac2cf71004040042123c1ec050299)	2020-04-08 16:02:12 +02:00
Aleksandr Maus	d02f774cb6	EQL: implement cidrMatch function (#54186 ) (#54928 ) Related to https://github.com/elastic/elasticsearch/issues/54132	2020-04-07 22:07:28 -04:00
Aleksandr Maus	868798e4db	EQL: implement between function (#54277 ) (#54913 )	2020-04-07 16:52:30 -04:00
Costin Leau	99846f47b7	QL: Introduce infrastructure for surrogate functions (#54795 ) Some functions act as shortcuts for more verbose declarations (sometimes with certain constraints). This PR removes the boilerplate around declaring such functions as well as a dedicated rule for the optimizer to perform the actual substitution. Fix #54334 (cherry picked from commit 3231d01b0c583deb89252fafe84db48878da3246)	2020-04-07 00:46:50 +03:00
Ross Wolf	022f829d84	EQL: Add wildcard function (#54020 ) * EQL: Add wildcard function * EQL: Cleanup Wildcard.getArguments * EQL: Cleanup Wildcard and rearrange methods * EQL: Wildcard newline lint * EQL: Make StringUtils function final * EQL: Make Wildcard.asLikes return ScalarFunction * QL: Restore BinaryLogic.java * EQL: Add Wildcard PR feedback * EQL: Add Wildcard verification tests * EQL: Switch wildcard to isFoldable test * EQL: Change wildcard test to numeric field * EQL: Remove Wildcard.get_arguments	2020-04-03 10:15:43 -06:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
Andrei Stefan	977302e46c	EQL: startsWith and endsWith functions implementation (#54504 ) * EQL: startsWith function implementation (#54400) (cherry picked from commit 666719fcfc40f6fc0535609577791369123320ab) * EQL: endsWith function implementation (#54442) (cherry picked from commit 554a4c8ef04b67eed107d29b57185e9af25d9d4f)	2020-03-31 18:06:03 +03:00
Ross Wolf	d11e977b1f	EQL: Use In from QL (#53244 ) * EQL: Use In from QL * EQL: Add more In tests * EQL: Test In duplicates * EQL: Add test for In mixed types * EQL: Copy In translation to QL * SQL: Use InComparisons from QL * EQL: Remove boost checks from QueryFolderOkTests * QL: Add TranslatorHandler.convert	2020-03-30 15:19:23 -06:00
Costin Leau	68f74cf593	EQL: Fix custom scripting for functions (#53935 ) (#54114 ) Improve separation of scripting between EQL and SQL by delegating common methods to QL. The context detection is determined based on the package to avoid having repetitive class hierarchies. The Painless whitelists have been improved so that the declaring class is used instead of the inherited one. Relates #53688 (cherry picked from commit 6d46033e736c64ac9255c5d6964600d2a931430a) EQL: Add Substring function with Python semantics (#53688) Does not reuse substring from SQL due to the difference in semantics and the accepted arguments. Currently it is missing full integration tests as, due to the usage of scripting, requires an actual integration test against a proper cluster (and likely its own QA project). (cherry picked from commit f58680bad33d5ce4139157a69a4d9f5f286bc3c4)	2020-03-24 20:54:19 +02:00
Marios Trivyzas	af03200ad6	SQL: Extend DATE_TRUNC to also operate on intervals(elastic - #46632 ) (#47720 ) (#53972 ) The function is extended to operate on intervals according to the PostgreSQL: https://www.postgresql.org/docs/9.1/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes : #46632 (cherry picked from commit 2dc79505825fa75e0711dcfa8e9c69e8028fc979) Co-authored-by: musteaf <gs_mustea@hotmail.com>	2020-03-23 15:05:16 +01:00
Andrei Stefan	79600eb38b	SQL: add support for index aliases for SYS COLUMNS command (#53525 ) (#53653 ) (cherry picked from commit f65e4d6ff7b2e00eb6f9c985fbe7cb24de00f045)	2020-03-17 12:49:08 +02:00
Andrei Stefan	91ca9c5c33	QL: constant_keyword support (#53241 ) (#53602 ) (cherry picked from commit d6cd4ce7849ba215407c8c5fa815c9b373fb8480)	2020-03-16 18:06:31 +02:00

1 2

74 Commits