OpenSearch

Commit Graph

Author	SHA1	Message	Date
Marios Trivyzas	54e7e4c9de	EQL: [Tests] Adjust README for preserving test data (#65460 ) Adjusted the README file to mention both the option to preserve the test data when simple reproducing/executing the tests, but also when starting the server node manually and issuing the query(ies) against it. Follows: #65400 (cherry picked from commit e3a1910d28d8b0ed20997754c74fa4d4d52cda15)	2020-11-25 14:30:25 +01:00
Mark Vieira	f8f5d27f6b	Add option to preserve data in test clusters (#65400 ) (cherry picked from commit 1ce323e1368cf5231181f1efaba1c4e425066e37)	2020-11-24 11:56:56 -08:00
Jim Ferenczi	9f3e3e2162	Fix "resource not found" exception on existing EQL async search (#65167 ) This change fixes the initialization of the async results service for the EQL get async action. The boolean that differentiates EQL from normal _async_search request is set incorrectly, which results in errors (404) when extending the keep alive of a running EQL search. Fixes #65108	2020-11-18 09:10:31 +01:00
Costin Leau	f089547b20	EQL: Fix aggressive/incorrect until policy in sequences (#65156 ) The current until implementation in sequences is too optimistic, leading to an aggressive match that discards correct data leading to invalid results. This commit addresses this issue and also unifies the until usage inside TumblingWindow. Further more it packs together the UntilGroup with SequenceGroup to minimize memory usage and improve clean-up. (cherry picked from commit de2724e92c732c66436939dbbedef93c9981b435) (cherry picked from commit a60757756aae5f5abb31176fee972a7cdeac3649)	2020-11-18 09:34:33 +02:00
Costin Leau	74fde15833	EQL: Allow null tiebreakers inside ordinals/sequences (#65033 ) Align Ordinal comparator to consider nulls last (higher) in tiebreakers. Add unit tests to Ordinal comparisons and criterion extraction. Fix #64706 (cherry picked from commit 93dc883abd6b8855ff1618a574412b7f773b8ff5) (cherry picked from commit 936e5f1a2cc29c1d5662cb8aa90c629af563a987)	2020-11-16 16:52:55 +02:00
Costin Leau	9551cb3420	EQL: small improvements to the testing base class Extract request settings into dedicated methods for easier adjustments (cherry picked from commit 4f93591cc561c7f8ff7c2f070dd1180f209810b7) (cherry picked from commit ff7e8427345c304f5a37612c870b48555484b692)	2020-11-14 16:40:48 +02:00
Costin Leau	f7cc570c4f	EQL: Re-enable correctness tests (#65041 ) Enable previously disabled tests - only two type of queries remain disabled: one that does pattern matching and another one for case-insensitivity. Fix #63742 (cherry picked from commit 20210cc43b34438c40b8b5aebf0aa2b8161c4104) (cherry picked from commit 95d08f2c8d0aac52cc1ed470fa489c239ee25159)	2020-11-14 16:09:11 +02:00
Costin Leau	76e73fec79	EQL: Add option for returning results from the tail of the stream (#64869 ) (#65040 ) Introduce option for specifying whether the results are returned from the tail (end) of the stream or the head (beginning). Improve sequencing algorithm by significantly eliminating the number of in-flight sequences for spare datasets. Refactor the sequence class by eliminating some of the redundant code. Change matching behavior for tail sequences. Return results based on their first entry ordinal instead of insertion order (which was ordered on the last match ordinal). Randomize results position inside test suite. Close #58646 (cherry picked from commit e85d9d1bbee13ad408e789fd62efb30bc8d223f2) (cherry picked from commit 452c674a10cdc16dced3cde7babf5d5a9d64a6d9)	2020-11-14 13:44:17 +02:00
Marios Trivyzas	0a9481fcaf	EQL: [Tests] enable server side debugging (#64308 ) (#64449 ) Register a new task `runEqlCorrectnessNode` which enables developers to start an ES node in debug mode, properly restore the correctness data and then run queries against it. Assert the index is restored correctly and use new snapshot. (cherry picked from commit fc8c6dd56d602b4a62ee1ff484f00caab92dc6e2)	2020-10-31 11:55:39 +01:00
Andrei Stefan	a6d8319231	* Wrap a verification_exception in case there is no valid index available (#64267 ) Wrap a verification_exception in case there is no valid index available in an index_not_found_exception providing also the original index pattern that may be lost in the chain of filters involving the Security one. (cherry picked from commit 9c9da2f2f9a4ad12704f7d3a273f067e96cd2054)	2020-10-29 10:14:50 +02:00
Costin Leau	6ca0b6ae6d	EQL: Improve request logging (#64206 ) Add logging to multi-search queries Log response count (cherry picked from commit ee9b9d58f68e2d545d5d841e2f683ec4e96f79e6) (cherry picked from commit 02a4c6b83475cebe715311eeba123ad6fc8d6ba1)	2020-10-27 17:23:43 +02:00
Costin Leau	2363c4be4b	EQL: Polish testing infra (#64205 ) Add tie-breaker inside request creation Add configurable timeout (cherry picked from commit ff281d7b6fd7b4cd2f08bac49aa0b354b6812940) (cherry picked from commit 34bd76fc2987b1ad0b6275ac4358e362a0ba7fb0)	2020-10-27 17:23:43 +02:00
Andrei Stefan	5f3c79d64b	Remove filter from QL's field_caps requests (#63840 ) (#63845 ) (cherry picked from commit f009e6341d0fc0471f212d5a41c91e7aab77e006)	2020-10-17 01:36:26 +03:00
Marios Trivyzas	1dbd3a90ae	EQL: [Tests] Use snapshot from 7.10 To be able to run the tests from 7.10 onwards use a snapshot created with 7.10. Follows: #63735	2020-10-15 17:28:52 +02:00
Marios Trivyzas	095f979060	EQL: [Tests] Add correctness integration tests (#63644 ) (#63735 ) Add a new gradle module under eql/qa which runs and validates a set of queries over a 4m event dataset (restored from a snapshot residing in a gcs bucket). The results are providing by running the exact set of queries with Python EQL against the same dataset. Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com> (cherry picked from commit 1cf789e5fcfb0f364f665bfaac021e24a4c2f556) Co-authored-by: Mark Vieira <portugee@gmail.com>	2020-10-15 15:28:26 +02:00
Costin Leau	06eae58d40	EQL: Fix translation of bool fields (#63694 ) This commit fixes two issues in dealing with bool fields in EQL: - avoid simplifications of field == true expressions - adding comparison to clauses on fields missing logic (where bool) Fix #63693 (cherry picked from commit d10a5d0e842bbd4e0031834de948ceb24da3872b) (cherry picked from commit 0227da3a275c7f22ff524d99d53e1a79146f9e28)	2020-10-15 14:38:31 +03:00
Ryland Herrick	7e8769a666	EQL: make allow_no_indices true by default (#63573 ) (#63645 ) * Allow all indices options variants Irrespective of allow_no_indices value, throw VerificationException when there is no index validated Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>	2020-10-14 03:41:04 +03:00
Costin Leau	2ab5f226c4	EQL: Avoid filtering on tiebreakers (#63415 ) Do not filter by tiebreaker while searching sequence matches as it's not monotonic and thus can filter out valid data. Add handling for data 'near' the boundary that has the same timestamp but different tie-breaker and thus can be just outside the window. Fix #62781 Relates #63215 (cherry picked from commit 36f834600d4d9ded0fb7b1440274b2e597733770) (cherry picked from commit 72a2ce825f3bfd13f87423ba7f3c739ea64c57f6)	2020-10-08 13:50:41 +03:00
Costin Leau	d027e24b31	EQL: Remove match functions (#63275 ) Since match (for matching regex) is not currently in use remove it for now. Close #63263 (cherry picked from commit 6abd531cf457f3c5686f59709647bed3276e3c6b)	2020-10-05 23:30:41 +03:00
Costin Leau	6856306dcf	EQL: Remove wildcard functionality from : (#63276 ) Restrict : operator to only case insensitive matching on strings Close #63262 (cherry picked from commit bc02e77150cdd85594dfac4f03d8aeb85aaddbb3)	2020-10-05 23:30:41 +03:00
Andrei Stefan	76bba601ab	Remove case_sensitive request option (#63218 ) (#63244 ) Make EQL case sensitive by default and adapt some of the string functions Remove the case sensitive option from Between string function Add case_insensitive option to term and wildcard queries usage (cherry picked from commit 7550e0664c8c2f1f13519036c759b1e76345551f)	2020-10-05 22:04:42 +03:00
Marios Trivyzas	19650e860a	EQL: [Test] Add a test for `identifier` as eventType (#63227 ) (#63235 ) Add a unit test to verify that an identifier surrounded with backquotes is not a valid syntax for eventType value, as eventType is schemantically a string literal and not a field identifier. Follows: #63169 (cherry picked from commit ff12c1340b3890ac52251f31259fa9a719d9eacc)	2020-10-05 15:23:08 +02:00
Costin Leau	1047d67199	Revert "EQL: Avoid filtering on tiebreakers (#63215 )" This reverts commit `efd2243886`.	2020-10-05 15:55:59 +03:00
Costin Leau	8c4503bcc3	EQL: Change default indices options (#63192 ) Ignore by default unavailable indices (same as ES) and verify that allowNoIndices is set to false since at least one index is required for validating the query. Fix #62986 (cherry picked from commit fd75ac27223cd1b699b8d9c311dc401a39f9e0c8)	2020-10-05 14:21:56 +03:00
Costin Leau	b67d2274ae	QL: Optimize regexs without patterns as equality (#63216 ) If a QL regex doesn't contain any pattern, convert it to Equals. Close #63196 (cherry picked from commit e22a843124290aaacd0e80d7ae9b883e5ec2431e)	2020-10-05 14:21:42 +03:00
Costin Leau	efd2243886	EQL: Avoid filtering on tiebreakers (#63215 ) Do not filter by tiebreaker while searching sequence matches as it's not monotonic and thus can filter out valid data. Fix #62781 (cherry picked from commit 4d62198df70f3b70f8b6e7730e888057652c18a8)	2020-10-05 14:21:30 +03:00
Costin Leau	4f593bdd69	EQL: Make queries using Point-In-Time rely on index filtering (#63161 ) Point-In-Time queries cannot be ran on individual indices but on all. Thus all PIT queries move their index from the request level to a filter so this condition is fulfilled while keeping the query scoped accordingly. Fix #63132 (cherry picked from commit c8eb4f724d5dcc0fcc172c6219ecfbc1dc1fbbae)	2020-10-05 14:21:09 +03:00
Marios Trivyzas	3cac996373	EQL: Fix syntax for event type (#63169 ) (#63194 ) Event type is actually a string value for event.category which can contain any kind of characters, or start with a digit, which currently is not supported, so we introduce the possibility to be able to use the usual syntax of " and """ for strings and raw strings. Make the grammar a bit cleaner by using the identifier only where it's actually an identifier in terms of query scemantics. Fixes: #62933 (cherry picked from commit 306e1d76da3db652db57f11f847705b3995609ff)	2020-10-02 17:28:13 +02:00
Marios Trivyzas	7d74fb8577	EQL: Replace ?"..." with """...""" for unescaped strings (#62539 ) (#63174 ) Use triple double quotes enclosing a string literal to interpret it as unescaped, in order to use `?` for marking query params and avoid user confusion. `?` also usually implies regex expressions. Any character inside the `"""` beginning-closing markings is considered raw and the only thing that is not permitted is the `"""` sequence itself. If a user wants to use that, needs to resort to the normal `"` string literal and use proper escaping. Relates to #61659 (cherry picked from commit d87c2ca2eacab5552bca1e520d33cf71da40bcfd)	2020-10-02 14:58:50 +02:00
Costin Leau	614f4c13a5	EQL: Introduce case-sensitive equality (#63121 ) Introduce : operator for doing case insensitive string comparisons. Recognizes "*" for wildcard matches in string literals. Restricted only to string types. Relates #62941 (cherry picked from commit 201e577e65f26a9b958a6197fe6c7268da39de29)	2020-10-02 00:23:08 +03:00
Marios Trivyzas	3ad4b00c7e	EQL: Clean grammar from `fork` (#63094 ) (#63138 ) Since `fork` is not used, is undocumented in Python EQL and there is no plan at the moment to implement it in the future, removing it from the grammar. User will get parsing exceptions instead of higher level messages about unsupported features which can lead to wrong expectations. (cherry picked from commit f6a0f8f01c1b1893bab86629d1de73e9f9dae8dc)	2020-10-01 21:14:41 +02:00
Costin Leau	c2992ea287	EQL: Fix NPE from incorrect use of ids search (#63032 ) This fixes a bug introduced when moving from mget to ids query. While mget returns all the ids given, id query is a search query and thus by default returns only 10 documents. The fix correctly sets the expected size so all the information is returned inside the response. Fix #63030 (cherry picked from commit 09ba85548a0142a1fe8376efea9cc4e7764a207c)	2020-10-01 13:49:58 +03:00
Marios Trivyzas	0ebaf8a3ec	EQL: Allow escaped backquote in identifiers (#62932 ) (#63082 ) Previously, backquote couldn't not be used inside an escaped identifier, e.g.: ``` `my`identifier` = "some_value" ``` was not allowed. Introduce escaping of the backquote by using a double backquote: ``` `my``identifier` = "some_value" ``` (cherry picked from commit 49514121486f42a58674b3e5901de4021fda5c15)	2020-09-30 19:10:09 +02:00
Costin Leau	a6b903b783	EQL: Remove unused classes from reponse API (#62134 ) Remove Count class and related artifacts since that functionality is not (yet) available. Update parser name for better error reporting. Fix #62131 (cherry picked from commit 060f500346788c4c5d0b3b9c045facec5d677d3d)	2020-09-30 15:45:30 +03:00
Costin Leau	3bee28056f	EQL: Fix bug in sequences with any pattern (#63007 ) Fix query creation inside sequences with any queries due to lacking a clause to combine, which lead to an invalid request being created. Fix #62967 (cherry picked from commit ff59d8823919a6e70928816e5c3687308ebde33f)	2020-09-29 18:19:25 +03:00
Costin Leau	ef7a6ce4b2	EQL: Refactor testing infrastructure (#62928 ) Extract reusable methods inside QL TestUtils Rename abstract base classes for clarity Clean-up EQL DataLoader (cherry picked from commit 48db3f285aa8976ead5a9f5d071a9c1046d7bd31)	2020-09-28 14:22:56 +03:00
Costin Leau	71b92f8699	QL: Optimize Like/Rlike all (#62682 ) Replace common Like and RLike queries that match all characters with IsNotNull (exists) queries Fix #62585 (cherry picked from commit 4c23fad0468a9edd7325b06c6a96f7af37625dbf)	2020-09-24 13:44:53 +03:00
Nhat Nguyen	663b85b98f	Make keep alive optional in PointInTimeBuilder (#62720 ) Remove the keepAlive parameter from the constructor of PointInTimeBuilder as it's optional.	2020-09-22 18:52:54 -04:00
Nik Everett	fa13585fae	Fix Eclipse build (#62733 ) (#62786 ) Eclipse was confused for two reasons: 1. `:x-pack:plugin` depended on itself. 2. `ql`, `sql`, and `eql` couldn't see some methods. I fixed problem 1 by only adding the "depends on itself" configuration outside of eclipse. I fixed problem 2 by making a `test` sub-project in `ql` that contains test utilities and depending on those where possible.	2020-09-22 17:44:25 -04:00
Marios Trivyzas	1e72144847	EQL: Remove support for `=` for comparisons (#62756 ) (#62775 ) Since `=` is rarely used and is undocumented we its support for equality comparisons keeping `==` as the only option. `=` is now only used for assignments like in `maxspan=10m`. Closes: #62650 (cherry picked from commit ad5ae4d887b5c2feca2d0e874d7bdf738e3fd54e)	2020-09-22 20:56:04 +02:00
Marios Trivyzas	b072de4ce0	EQL: Disallow chained comparisons (#62567 ) (#62601 ) Expressions like `1 = 2 = 3 = 4` or `1 < 2 = 3 >= 4` were treated with leftmost priority: ((1 = 2) = 3) = 4 which can lead to confusing results. Since such expressions don't make so much change for EQL filters we disallow them in the parser to prevent unexpected results from their bad usage. Major DBs like PostgreSQL and Oracle also disallow them in their SQL syntax. (counter example would be MySQL which interprets them as we did before with leftmost priority). Fixes: #61654 (cherry picked from commit 8f94981bb093f104228d267b532e0a3d5b7f6a38)	2020-09-18 10:48:14 +02:00
Costin Leau	81f2f84177	EQL: Allow requests with size 0 (#62537 ) The purpose for this change is to allow validation of queries without having to actually execute them. The optimizer already picks up this case. Fix #62494 (cherry picked from commit 675889559b2f96a0c1faa6fc84fd537148ba2cce)	2020-09-18 11:24:39 +03:00
William Brafford	5a0dca2491	Deprecate xpack.eql.enabled setting and make it a no-op (#61375 ) (#62491 ) * Deprecate xpack.eql.enabled and make it a no-op * Remove uses of xpack.eql.enabled	2020-09-17 14:17:27 -04:00
Marios Trivyzas	abce04888f	EQL: Forbid usage of ['] for string literals (#62458 ) (#62496 ) The usage of single quotes to wrap a string literal is forbidden and an error encouraging the user to user double quotes is returned. Tests are properly adjusted. Relates to #61659 (cherry picked from commit 8be400b77370bf4cf68c89f492c2d235f3cce43c)	2020-09-17 11:29:09 +02:00
Costin Leau	ceaf96061c	EQL: Fetch sequence documents using Point-In-Time (#62469 ) To preserve the PIT semantics, the retrieval of results has moved from using multi-get to using an idsQuery. (cherry picked from commit 1c2362fcf2be62ce568b3772924abce7331ef23c)	2020-09-17 00:12:19 +03:00
Costin Leau	03d2395183	EQL: Use Point In Time inside sequences (#62276 ) Use the newly introduced PIT API to have a consistent view of the data while doing sequence matching, which involves multiple calls, aka repeatable reads and thus avoid race conditions or any in-flight updates on the data. (cherry picked from commit daa72fc3c71fd36afb55278021ff6bbc591ef148)	2020-09-15 15:40:03 +03:00
Nhat Nguyen	3d69b5c41e	Introduce point in time APIs in x-pack basic (#61062 ) This commit introduces a new API that manages point-in-times in x-pack basic. Elasticsearch pit (point in time) is a lightweight view into the state of the data as it existed when initiated. A search request by default executes against the most recent point in time. In some cases, it is preferred to perform multiple search requests using the same point in time. For example, if refreshes happen between search_after requests, then the results of those requests might not be consistent as changes happening between searches are only visible to the more recent point in time. A point in time must be opened before being used in search requests. The `keep_alive` parameter tells Elasticsearch how long it should keep a point in time around. ``` POST /my_index/_pit?keep_alive=1m ``` The response from the above request includes a `id`, which should be passed to the `id` of the `pit` parameter of search requests. ``` POST /_search { "query": { "match" : { "title" : "elasticsearch" } }, "pit": { "id": "46ToAwMDaWR4BXV1aWQxAgZub2RlXzEAAAAAAAAAAAEBYQNpZHkFdXVpZDIrBm5vZGVfMwAAAAAAAAAAKgFjA2lkeQV1dWlkMioGbm9kZV8yAAAAAAAAAAAMAWICBXV1aWQyAAAFdXVpZDEAAQltYXRjaF9hbGw_gAAAAA==", "keep_alive": "1m" } } ``` Point-in-times are automatically closed when the `keep_alive` is elapsed. However, keeping point-in-times has a cost; hence, point-in-times should be closed as soon as they are no longer used in search requests. ``` DELETE /_pit { "id" : "46ToAwMDaWR4BXV1aWQxAgZub2RlXzEAAAAAAAAAAAEBYQNpZHkFdXVpZDIrBm5vZGVfMwAAAAAAAAAAKgFjA2lkeQV1dWlkMioGbm9kZV8yAAAAAAAAAAAMAWIBBXV1aWQyAAA=" } ``` #### Notable works in this change: - Move the search state to the coordinating node: #52741 - Allow searches with a specific reader context: #53989 - Add the ability to acquire readers in IndexShard: #54966 Relates #46523 Relates #26472 Co-authored-by: Jim Ferenczi <jimczi@apache.org>	2020-09-10 19:25:47 -04:00
Andrei Stefan	cce6da7d52	EQL: add the wildcard field type to the IT tests (#62166 ) (#62200 ) * Add wildcard field type as an option for randomized testing of IT queries (cherry picked from commit 87b14c409c180c4d53c3c61a30bd69f1b81a2823)	2020-09-10 12:36:36 +03:00
Jake Landis	d8dad9ab2c	[7.x] Remove integTest task from PluginBuildPlugin (#61879 ) (#62135 ) This commit removes `integTest` task from all es-plugins. Most relevant projects have been converted to use yamlRestTest, javaRestTest, or internalClusterTest in prior PRs. A few projects needed to be adjusted to allow complete removal of this task * x-pack/plugin - converted to use yamlRestTest and javaRestTest * plugins/repository-hdfs - kept the integTest task, but use `rest-test` plugin to define the task * qa/die-with-dignity - convert to javaRestTest * x-pack/qa/security-example-spi-extension - convert to javaRestTest * multiple projects - remove the integTest.enabled = false (yay!) related: #61802 related: #60630 related: #59444 related: #59089 related: #56841 related: #59939 related: #55896	2020-09-09 14:25:41 -05:00
Costin Leau	0f9532689f	EQL: Propagate key constraints through the query (#62073 ) Since join keys are common across all queries in a Join/Sequence, any constraint applied on one query needs to be obeyed but all the other queries. This PR enhances the optimizer to propagate such constraints across all queries so they get pushed down to the actual generated ES queries. Fix #58937 (cherry picked from commit 4afa5debc199c132c07015bfae17952c40a21e5d)	2020-09-08 18:40:47 +03:00

1 2 3 4

178 Commits