OpenSearch

Commit Graph

Author	SHA1	Message	Date
Hicham Mallah	5b32d112e1	SQL: Fix issues with GROUP BY queries (#41964 ) Translate to an agg query even if only literals are selected, so that the correct number of rows is returned (number of buckets). Fix issue with key only in GROUP BY (not in select) and WHERE clause: Resolve aggregates and groupings based on the child plan which holds the info for all the fields of the underlying table. Fixes: #41951 Fixes: #41413 (cherry picked from commit 45b85809678b34a448639a420b97e25436ae851f)	2020-02-15 10:38:24 +01:00
Marios Trivyzas	51e74be1bb	SQL: [Tests] Add tests for fixed issues (#52335 ) Add tests to verify behaviour for fixed issues: #33724 & #38306 (cherry picked from commit 89fb6753a9db9484a5622417cd4ffea9af0347ad)	2020-02-14 11:23:30 +01:00
Costin Leau	5373a77fb9	QL: Extract common Failure class (#52281 ) Shared across SQL and EQL (cherry picked from commit 1aeda20d3ec3d6c885de03c6043dd1e8eab9f230)	2020-02-13 14:35:15 +02:00
Bogdan Pintea	5dfe27601e	SQL: supplement input checks on received request parameters (#52229 ) (#52277 ) * Add more checks around parameter conversions This commit adds two necessary verifications on received parameters: - it checks the validity of the parameter's data type: if the declared data type is resolved to an ES or Java type; - it checks if the returned converter is non-null (i.e. a conversion is possible) and generates an appropriate exception otherwise. (cherry picked from commit eda30ac9c69383165324328c599ace39ac064342)	2020-02-12 19:45:12 +01:00
Marios Trivyzas	daab242c75	SQL: Fix ORDER BY on aggregates and GROUPed BY fields (#51894 ) Previously, in the in-memory sorting module `LocalAggregationSorterListener` only the aggregate functions were used (grabbed by the `sortingColumns`). As a consequence, if the ORDER BY was also using columns of the GROUP BY clause (especially in the case of higher priority - before the aggregate functions), wrong results were produced. E.g.: ``` SELECT gender, MAX(salary) AS max FROM test_emp GROUP BY gender ORDER BY gender, max ``` Add all columns of the ORDER BY to the `sortingColumns` so that the `LocalAggregationSorterListener` can use the correct comparators in the underlying PriorityQueue used to implement the in-memory sorting. Fixes: #50355 (cherry picked from commit be680af11c823292c2d115bff01658f7b75abd76)	2020-02-12 09:38:47 +01:00
Marios Trivyzas	204d086266	SQL: Fix issue with timezone when paginating (#52101 ) Previously, when the specified (or default) fetchSize led to subsequent HTTP requests and the usage of cursors, those subsequent requests were no longer using the client timezone specified in the initial SQL query. As a consequence, even though the query is executed once (with the correct timezone), the processing of the query results by the HitExtractors in the next pages was done using the default timezone Z. This could lead to incorrect results. Fix the issue by correctly using the initially specified timezone, which is found in the deserialisation of the cursor string. Fixes: #51258 (cherry picked from commit 8f7afbdeb9295999b48a6c36db5b31cbe0cee432)	2020-02-11 15:27:56 +01:00
Bogdan Pintea	7b58ed0dd7	Fix milliseconds handling in intervals (#51675 ) (#52156 ) This fixes: - the parsing of milliseconds in intervals: everything past the . used to be converted as-is to milliseconds, with no normalisation of the unit; thus, a value of .23 ended up as 23 millis in the interval, instead of 230. - the printing of a trailing .0, in case the interval lacks the fractional part; - tests generating a random millisecond value used to simply print it in the string about to be evaluated without a necessary front-filling of 0[s], where the amount was below 100/10. (The combination of first and last issues above, plus statistical "luck" made the incorrect handling pass the tests.) (cherry picked from commit 4de8c64f63ee37c1bcfdb9b9d3a07d09be243222)	2020-02-10 19:24:26 +01:00
Marios Trivyzas	3e7f939f63	SQL: [Tests] Add more tests for aggs and literals (#52086 ) Add some more tests where more than one literal is selected, unaliased and aliased. Follows: #42121 (cherry picked from commit 405271d408a233e697eb2e9ded3005a71f4df5e7)	2020-02-09 18:01:05 +01:00
Emanuele Sabellico	282e919607	SQL: [Tests] Add integ tests for selecting a literal and an aggregate (#42121 ) The related issue regarding aggregation queries where some literals are also selected together with aggregate function has been fixed with #49570. Add integration tests to verify the behavior. Relates to: #41411 (cherry picked from commit 9f414a8d05c75e1a9f8250084f6dcd634d5d78d8)	2020-02-07 19:00:15 +01:00
Marios Trivyzas	64f9a2089b	SQL: [Tests] add tests for literals and GROUP BY (#51878 ) Add unit and integration tests where literals are SELECTed in combination with GROUP BY and possibly aggregate functions. Relates to #41411 and #34583 which have been fixed. (cherry picked from commit b97f1ca12675d6ea4772c60578922fe1cc2409ee)	2020-02-05 12:55:56 +01:00
Ryan Ernst	21224caeaf	Remove comparison to true for booleans (#51723 ) While we use `== false` as a more visible form of boolean negation (instead of `!`), the true case is implied and the true value does not need to be explicitly checked. This commit converts cases that have slipped into the code checking for `== true`.	2020-01-31 16:35:43 -08:00
Marios Trivyzas	f373020349	SQL: Fix ORDER BY YEAR() function (#51562 ) Previously, if YEAR() was used as an ORDER BY argument without being wrapped with another scalar (e.g. YEAR(birth_date) + 10), no script ordering was used; instead, the underlying field (e.g. birth_date) was used as a performance optimisation. This works correctly if YEAR() is the only ORDER BY arg, but if further args are used as tie breakers for the ordering, wrong results are produced. This is because 2 rows with different birth_date values in the same year are not tied, as the underlying ordering is on birth_date and not on YEAR(birth_date), and the following ORDER BY args are ignored. Remove this optimisation for YEAR() to avoid incorrect results in such cases. As a consequence another bug is revealed: scalar functions on top of nested fields produce scripted sorting/filtering which is not yet supported. In such cases no error was thrown but instead all values for such nested fields were null and were passed to the script implementing the sorting/filtering, producing incorrect results. Detect such cases and throw a validation exception. Fixes: #51224 (cherry picked from commit f41efd6753dc3650a7eabb3e07b02b3b32c5704c)	2020-01-30 15:29:36 +01:00
Jason Tedor	3a7192966a	Check if interface is up for loopback devices only (#51583 ) In the SQL with SSL tests, we need to find the interfaces that are up, are loopback devices, or have a loopback address. If we check if the device is up first, we can run into situations where the device is a virtual ethernet device that might have disappeared between us seeing the device, and checking if it is up. By first checking if the device is a loopback device or it has a loopback address, then we can avoid checking if the device is up except for loopback devices and therefore we can avoid the disappearing virtual ethernet device problem.	2020-01-28 18:38:46 -05:00
Costin Leau	e22f501018	QL: Backport project to 7.x (#51497 ) * Introduce reusable QL plugin for SQL and EQL (#50815) Extract reusable functionality from SQL into its own dedicated project QL. Implemented as a plugin, it provides common components across SQL and the upcoming EQL. While this commit is fairly large, for the most part it's just a big file move from sql package to the newly introduced ql. (cherry picked from commit ec1ac0d463bfa12a02c8174afbcdd6984345e8b4) * SQL: Fix incomplete registration of geo NamedWritables (cherry picked from commit e295763686f9592976e551e504fdad1d2a3a566d) * QL: Extend NodeSubclass to read classes from jars (#50866) As the test classes are spread across more than one project, the Gradle classpath contains not just folders but also jars. This commit allows the test class to explore the archive content and load matching classes from said source. (cherry picked from commit 25ad74928afcbf286dc58f7d430491b0af662f04) * QL: Remove implicit conversion inside Literal (#50962) Literal constructor makes an implicit conversion for each value given which turns out has some subtle side-effects. Improve MathProcessors to preserve numeric type where possible Fix bug on issue compatibility between date and intervals Preserve the source when folding inside the Optimizer (cherry picked from commit 9b73e225b0aa07a23859550fb117bae571a2b672) * QL: Refactor DataType for pluggability (#51328) Change DataType from enum to class Break DataType enums into QL (default) and SQL types Make data type conversion pluggable so that new types can be introduced As part of the process: - static type conversion in QL package (such as Literal) has been removed - several utility classes have been broken into base (QL) and extended (SQL) parts based on type awareness - operators (+,-,/,) are - due to extensibility, serialization of arithmetic operation has been slightly changed and pushed down to the operator executor itself (cherry picked from commit aebda81b30e1563b877a8896309fd50633e0b663) Compilation fixes for 7.x	2020-01-27 22:03:58 +02:00
Andrei Stefan	2908b7e5fc	SQL: add support for passing query parameters in REST API calls (#51029 ) (#51222 ) * REST PreparedStatement-like query parameters are now supported in the form of an array of non-object, non-array values where ES SQL parser will try to infer the data type of the value being passed as parameter. (cherry picked from commit 45b8bf619aecb1c03d7bc0cf06928dcc36005a66)	2020-01-20 16:40:19 +02:00
Nik Everett	fc5fde7950	Add "did you mean" to ObjectParser (#50938 ) (#50985 ) Check it out: ``` $ curl -u elastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_update/foo?pretty -d'{ "dac": {} }' { "error" : { "root_cause" : [ { "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" } ], "type" : "x_content_parse_exception", "reason" : "[2:3] [UpdateRequest] unknown field [dac] did you mean [doc]?" }, "status" : 400 } ``` The tricky thing about implementing this is that x-content doesn't depend on Lucene. So this works by creating an extension point for the error message using SPI. Elasticsearch's server module provides the "spell checking" implementation.	2020-01-14 17:53:41 -05:00
Albert Zaharovits	2b789fa3e6	Make .async-search-* a restricted namespace (#50294 ) Hide the `.async-search-*` in Security by making it a restricted index namespace. The namespace is hard-coded. To grant privileges on restricted indices, one must explicitly toggle the `allow_restricted_indices` flag in the indices permission in the role definition. As is the case with any other index, if a certain user lacks all permissions for an index, that index is effectively nonexistent for that user.	2020-01-13 12:20:54 +02:00
Igor Motov	c77ca98928	Geo: Switch generated WKT to upper case (#50285 ) Switches generated WKT to upper case to conform to the standard recommendation. Relates #49568	2019-12-18 17:29:08 -05:00
Andrei Stefan	c6fdf9ed8a	Handle NULL in ResultSet's getDate() method (#50184 ) (cherry picked from commit 08214eb1338fef5c8082c3f8b84c24dd53224ebe)	2019-12-17 10:03:23 +02:00
Andrei Stefan	e9e2e5fc71	Have COUNT DISTINCT return 0 instead of NULL for no documents matching. (#50037 ) (cherry picked from commit cb94731e6f41bc51c23e4aab495b64eea731a061)	2019-12-12 00:34:04 +02:00
Costin Leau	5b896c5bb5	SQL: Refactor usage of NamedExpression (#49693 ) To recap, Attributes form the properties of a derived table. Each LogicalPlan has Attributes as output since each one can be part of a query and as such its result are sent to its consumer. This change essentially removes the name id comparison so any changes applied to existing expressions should work as long as the said expressions are semantically equivalent. This change enforces the hashCode and equals which has the side-effect of using hashCode as identifiers for each expression. By removing any property from an Attribute, the various components need to look the original source for comparison which, while annoying, should prevent a reference from getting out of sync with its source due to optimizations. Essentially going forward there are only 3 types of NamedExpressions: Alias - user define (implicit or explicit) name FieldAttribute - field backed by Elasticsearch ReferenceAttribute - a reference to another source acting as an Attribute. Typically the Attribute of an Alias. * Remove the usage of NamedExpression as basis for all Expressions. Instead, restrict their use only for named context, such as projections by using Aliasing instead. * Remove different types of Attributes and allow only FieldAttribute, UnresolvedAttribute and ReferenceAttribute. To avoid issues with rewrites, resolve the references inside the QueryContainer so the information always stays on the source. * Side-effect, simplify the rules as the state for InnerAggs doesn't have to be contained anymore. * Improve ResolveMissingRef rule to handle references to named non-singular expression tree against the same expression used up the tree. #49693 backport to 7.x (cherry picked from commit 5d095e2173bcbf120f534a6f2a584185a7879b57)	2019-12-07 11:02:14 +02:00
Andrei Stefan	e2982b2110	SQL: handle NULL arithmetic operations with INTERVALs (#49633 ) (cherry picked from commit ce727615c08cf5ae422feb77f69ea24fb53cd9d1)	2019-12-02 17:31:05 +02:00
Andrei Stefan	34311dd818	Fix NULL handling for FLOOR and CEIL math functions (#49644 ) (cherry picked from commit 034f4cf7b4bd062c157d40f1e7a8760de31de568)	2019-12-02 17:31:04 +02:00
Andrei Stefan	4dc83a7db9	Fix Locate function optional parameter handling (#49666 ) (cherry picked from commit dd3aeb8f5497bec4b050beaaf9d628a179b5454f)	2019-12-02 17:31:03 +02:00
Marios Trivyzas	901a8d1dcc	SQL: Fix issues with WEEK/ISO_WEEK/DATEDIFF (#49405 ) Some extended testing with MS-SQL server and H2 (which agree on results) revealed bugs in the implementation of WEEK related extraction and diff functions. Non-iso WEEK seems to be broken since #48209 because of the replacement of Calendar and the change in the ISO rules. ISO_WEEK failed for some edge cases around January 1st. DATE_DIFF was previously based on non-iso WEEK extraction which seems not to be the case. Fixes: #49376 (cherry picked from commit 54fe7f57289c46bb0905b1418f51a00e8c581560)	2019-11-29 17:07:30 +01:00
Marios Trivyzas	b0cb7bf229	SQL: Fix issue with GROUP BY YEAR() (#49559 ) Grouping by YEAR() is translated to a histogram aggregation, but previously if there was a scalar function involved (e.g.: `YEAR(date + INTERVAL 2 YEARS)`), there was no proper script created and the histogram was applied on a field with name: `date + INTERVAL 2 YEARS` which doesn't make sense, and resulted in a null result. Check the underlying field of YEAR() and if it's a function call `asScript()` to properly get the painless script on which the histogram is applied. Fixes: #49386 (cherry picked from commit 93c37abc943d00d3a14ba08435d118a6d48874c7)	2019-11-26 14:11:11 +01:00
Marios Trivyzas	3c69d4d0bd	SQL: Add TRUNC alias for TRUNCATE (#49571 ) Add TRUNC as alias to already implemented TRUNCATE numeric function which is the flavour supported by Oracle and PostgreSQL. Relates to: #41195 (cherry picked from commit f2aa7f0779bc5cce40cc0c1f5e5cf1a5bb7d84f0)	2019-11-26 12:32:54 +01:00
Marios Trivyzas	5d306ae3b2	SQL: Fix issue with CASE/IIF pre-calculating results (#49553 ) Previously, CaseProcessor was pre-calculating (called `process()`) on all the building elements of a CASE/IIF expression, not only the conditions involved but also the results, as well as the final else result. If one of those results had an erroneous calculation (e.g.: division by zero), it was executed and caused an Exception to be thrown, even if this result was not used because of the condition guarding it. E.g.: ``` SELECT CASE WHEN myField1 = 0 THEN NULL ELSE myField2 / myField1 END FROM test; ``` Fixes: #49388 (cherry picked from commit dbd169afc98686cae1bc72024fad0ca32b272efd)	2019-11-26 10:48:07 +01:00
Bogdan Pintea	8c2ab8bb72	SQL:Docs: add the PIVOT clause to SELECT section (#49129 ) The PR adds the documentation on the PIVOT clause. (cherry picked from commit a55b36065e6496c44b6e3191296931d477a8e5f5)	2019-11-20 18:21:06 +01:00
Marios Trivyzas	fd1bb4a33a	SQL: Fix issue with mins & hours for DATEDIFF (#49252 ) Previously, DATEDIFF for minutes and hours was doing a rounding calculation using all the time fields (secs, msecs/micros/nanos). Instead it should first truncate the 2 dates to the respective field (mins or hours) zeroing out all the more detailed time fields and then make the subtraction. (cherry picked from commit 124cd18e20429e19d52fd8dc383827ea5132d428)	2019-11-19 14:25:28 +01:00
Rory Hunter	c46a0e8708	Apply 2-space indent to all gradle scripts (#49071 ) Backport of #48849. Update `.editorconfig` to make the Java settings the default for all files, and then apply a 2-space indent to all `*.gradle` files. Then reformat all the files.	2019-11-14 11:01:23 +00:00
Mark Vieira	6ab4645f4e	[7.x] Introduce type-safe and consistent pattern for handling build globals (#48818 ) This commit introduces a consistent and type-safe manner for handling global build parameters throughout our build logic. Primarily this replaces the existing usages of extra properties with static accessors. It also introduces an explicit API for initialization and mutation of any such parameters, as well as better error handling for uninitialized or eager access of parameter values. Closes #42042	2019-11-01 11:33:11 -07:00
Andrei Stefan	2c73c7dfe3	SQL: binary communication implementation for drivers and the CLI (#48261 ) * Introduce binary_format request parameter (binary.format for JDBC) to disable binary communication between clients (jdbc/odbc) and server. * for CLI - "binary" command line parameter (or -b) is introduced. Default value is "true". * binary communication (cbor) is enabled by default * disabling request parameter introduced for debugging purposes only (cherry picked from commit f96a5ca61cb9fad9ed59357320af20e669348ce7)	2019-10-31 20:39:41 -04:00
emasab	185e067442	SQL: Failing Group By queries due to different ExpressionIds (#43072 ) Fix an issue that arises from the use of ExpressionIds as keys in a lookup map that helps the QueryTranslator to identify the grouping columns. The issue is that the same expression in different parts of the query (SELECT clause and GROUP BY clause) ends up with different ExpressionIds so the lookup fails. So, instead of ExpressionIds use the hashCode() of NamedExpression. Fixes: #41159 Fixes: #40001 Fixes: #40240 Fixes: #33361 Fixes: #46316 Fixes: #36074 Fixes: #34543 Fixes: #37044 Fixes: #42041 (cherry picked from commit 3c38ea555984fcd2c6bf9e39d0f47a01b09e7c48)	2019-10-31 14:49:16 +01:00
Mark Vieira	e5c6440a4f	Simplify usage of Gradle Shadow plugin (#48478 ) (#48597 ) This commit simplifies and standardizes our usage of the Gradle Shadow plugin to conform more to plugin conventions. The custom "bundle" plugin has been removed as it's not necessary and performs the same function as the Shadow plugin's default behavior with existing configurations. Additionally, this removes unnecessary creation of a "nodeps" artifact, which is unnecessary because by default project dependencies will in fact use the non-shadowed JAR unless explicitly depending on the "shadow" configuration. Finally, we've cleaned up the logic used for unit testing, so we are now correctly testing against the shadow JAR when the plugin is applied. This better represents a real-world scenario for consumers and provides better test coverage for incorrectly declared dependencies. (cherry picked from commit 3698131109c7e78bdd3a3340707e1c7b4740d310)	2019-10-28 12:11:55 -07:00
Marios Trivyzas	124f6d098b	SQL: [Tests] Re-enable CliSecurityIT (#48581 ) Seems that the issue has been fixed with: #48098 Closes: #48117 (cherry picked from commit 470362361ffce794a6a12ce7a81a8029ec7d54de)	2019-10-28 15:08:38 +01:00
Alpar Torok	fe265f0308	Mute CliSecurityIT tracking in #48117	2019-10-21 11:25:56 +03:00
Marios Trivyzas	7fddf198b7	SQL: Implement DATEDIFF function (#47920 ) Implement DATEDIFF/TIMESTAMPDIFF function as per the MS-SQL spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/datediff-transact-sql?view=sql-server-2017 which allows a user to subtract two date/datetime fields and return the difference in the date/time unit specified. Closes: #47919 (cherry picked from commit 745699f38dc8222670ffd65b66df33b5da39040b)	2019-10-15 15:12:11 +02:00
Marios Trivyzas	6589617a51	SQL: Fix arg verification for DateAddProcessor (#48041 ) Previously, the safety check for the 2nd argument of the DateAddProcessor was restricting it to Integer which was wrong since we allow all non-rational numbers, so it's changed to a Number check as it's done in other cases. Enhanced some tests regarding the check for an integer (non-rational argument). (cherry picked from commit 0516b6eaf5eb98fa5bd087c3fece80139a6b118e)	2019-10-15 12:52:11 +02:00
Marios Trivyzas	59b3294bc9	SQL: Implement DATEADD function (#47747 ) Implement DATEADD/TIMESTAMPADD function as per the MS-SQL spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/dateadd-transact-sql?view=sql-server-2017 which allows a user to add/subtract specified number of specified units to/from a date/datetime field/expression. Closes: #47746 (cherry picked from commit e624bc281bebb4bbe0b0c2e0a8cbc712e50097a8)	2019-10-10 16:22:13 +02:00
Costin Leau	dc6f0f9dc7	SQL: Re-enable muted test Close #47080 (cherry picked from commit 63a0aa7b392f565ea01ac478fec1dd91a80202e5)	2019-10-10 15:47:47 +03:00
Andrei Stefan	6a4bf5de2c	SQL: make date/datetime and interval types compatible in conditional functions (#47595 ) (cherry picked from commit 6ff953e6396d7cc90640419aee5d036954e2eae3)	2019-10-10 13:58:35 +03:00
Andrei Stefan	75a7daae73	SQL: use calendar interval of 1y instead of fixed interval for grouping by YEAR and HISTOGRAMs (#47558 ) (cherry picked from commit 55f5463eee4ecea3537df4b34645f1d87472a802)	2019-10-09 11:51:35 +03:00
Alpar Torok	36d018c909	Convert RunTask to use testclusters, remove ClusterFormationTasks (#47572 ) * Convert RunTask to use testclusters, remove ClusterFormationTasks This PR adds a new RunTask and a way for it to start a testclusters cluster out of band and block on it to replace the old RunTask that used ClusterFormationTasks. With this we can now remove ClusterFormationTasks.	2019-10-08 14:43:29 +03:00
Andrei Stefan	a46f312ded	SQL: fix multi full-text functions usage with aggregate functions (#47444 ) * Skip functions involving full-text predicates when replacing multiple aggregate functions with "stats" or "matrix_stats" aggregations. (cherry picked from commit bb14ba83128dfb7a70f825ea08b1524072fb9ad0)	2019-10-04 16:27:22 +03:00
Marios Trivyzas	f792dbf239	SQL: Implement DATE_PART function (#47206 ) DATE_PART(<datetime unit>, <date/datetime>) is a function that allows the user to extract the specified unit from a date/datetime field similar to the EXTRACT (<datetime unit> FROM <date/datetime>) but with different names and aliases for the units and it also provides more options like `DATE_PART('tzoffset', datetimeField)`. Implemented following the SQL server's spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/datepart-transact-sql?view=sql-server-2017 with the difference that the <datetime unit> argument is either a literal single quoted string or gets a value from a table field, whereas in SQL server keywords are used (unquoted identifiers) and it's not possible to use a value coming from a table column. Closes: #46372 (cherry picked from commit ead743d3579eb753fd314d4a58fae205e465d72e)	2019-10-01 16:28:27 +03:00
Marios Trivyzas	fa0b1b641a	SQL: Add examples for muting sql/csv integ tests (#47291 ) Add examples of failures for both sql and csv integration tests and instructions on how to mute them. (cherry picked from commit 591bba46516d770f5fc95a4c536dd7448b74dd49)	2019-10-01 09:12:20 +03:00
Marios Trivyzas	bd2abeef40	SQL: [TESTS] Improve error messages on failures (#47308 ) When an integration test fails before the assertion of the results it's missing information, like the file name and the line in the file where the test resides. (cherry picked from commit 683dc7213311d13c81e06829e08f3f9f80ebf73a)	2019-09-30 22:18:39 +03:00
emasab	87156ad93b	SQL: Fix issue with duplicate columns in SELECT (#42122 ) Previously, if a column (field, scalar, alias) appeared more than once in the SELECT list, the value was returned only once (1st appearance) in each row. Fixes: #41811 (cherry picked from commit 097ea36581a751605fc4f2088319d954ce35b5d1)	2019-09-30 15:56:29 +03:00
Marios Trivyzas	01623f9f1c	SQL: Add alias DATETRUNC to DATE_TRUNC function (#47173 ) To be on the safe side in terms of use cases also add the alias DATETRUNC to the DATE_TRUNC function. Follows: #46473 (cherry picked from commit 9ac223cb1fc66486f86e218fa785a32b61e9bacc)	2019-09-27 15:38:51 +03:00
Costin Leau	b29a2cb360	SQL: Check case where the pivot limit is reached (#47121 ) In some cases, the fetch size affects the way the groups are returned causing the last page to go beyond the limit. Add dedicated check to prevent extra data from being returned. Fix #47002 (cherry picked from commit f4c29646f097bbd29855300342823ef4cef61c05)	2019-09-26 22:32:42 +03:00
Igor Motov	ae202fda21	SQL: Add support for shape type (#46464 ) Enables support for Cartesian geometries shape type. We still need to decide how to handle the distance function since it is currently using the haversine distance formula and returns results in meters, which doesn't make any sense for Cartesian geometries. Closes #46412 Relates to #43644	2019-09-26 09:47:42 -04:00
Yannick Welsch	056ac32738	Mute JdbcCsvSpecIT.testAverageWithOneValueAndLimit Relates to #47080	2019-09-25 10:36:53 +02:00
Dimitris Athanasiou	64bf1b56fe	[7.x] SQL: Mute pivot testAverageWithOneValueAndOrder and testSumWithoutSubquery (#47030 ) (#47033 ) Relates #47002	2019-09-24 19:04:52 +03:00
Costin Leau	a610503783	SQL: Add PIVOT support (#46489 ) Add initial PIVOT support for transforming a regular table into a statistics table around an arbitrary pivoting column: SELECT * FROM (SELECT languages, country, salary FROM mp) PIVOT (AVG(salary) FOR country IN ('NL', 'DE', 'ES', 'RO', 'US')) In the current implementation PIVOT allows only one aggregation; however, this restriction is likely to be lifted in the future. Also not all aggregations are working, in particular MatrixStats are not yet supported. (cherry picked from commit d91263746a222915c570d4a662ec48c1d6b4f583)	2019-09-23 21:04:13 +03:00
Alpar Torok	f3e67bdd17	Add resolution rule to allow resolving all deps (#46768 ) Since the `resolveAllDependencies` task resolves all the configurations it can find, this was not caught by our testing, but it's required to be configured specifically. We should probably cut-over to the new configurations at some point to avoid problems like this. Closes elastic/infra#14580	2019-09-18 11:09:43 +03:00
Costin Leau	92e518e789	SQL: Properly handle indices with no/empty mapping (#46775 ) When encountering only indices with empty mapping, the IndexResolver throws an exception as it expects to find at least one entry. This commit fixes this case so that an empty mapping is returned. Fix #46757 (cherry picked from commit 5f4f5807acb93b5fab36718c092c328977a396b6)	2019-09-17 16:01:22 +03:00
Costin Leau	683b5fdeca	SQL: Support queries with HAVING over SELECT (#46709 ) Handle queries with implicit GROUP BY where the aggregation is not in the projection/SELECT but inside the filter/HAVING such as: SELECT 1 FROM x HAVING COUNT(*) > 0 The engine now properly identifies the case and handles it accordingly. Fix #37051 (cherry picked from commit fa53ca05d8219c27079b50b4a5b7aeb220c7cde2)	2019-09-17 11:14:39 +03:00
Costin Leau	90f4c2379b	SQL: improve ResultSet behavior when no rows are available (#46753 ) Improve the defensive behavior of ResultSet when dealing with incorrect API usage. In particular handle the case of dealing with no row available (either because the cursor is before the first entry or after the last). Fix #46750 (cherry picked from commit 58fa38e4606625962e879265d35eacb0960c6cdb)	2019-09-17 11:14:38 +03:00
Andrei Stefan	40e9353947	SQL: use the correct data type for types conversion (#46574 ) (cherry picked from commit 3e25db2f302c3aafe27e4d8d4fb1743401d85e6d)	2019-09-16 15:36:17 +03:00
Marios Trivyzas	d956509394	SQL: Implement DATE_TRUNC function (#46473 ) DATE_TRUNC(<truncate field>, <date/datetime>) is a function that allows the user to truncate a timestamp to the specified field by zeroing out the rest of the fields. The function is implemented according to the spec from PostgreSQL: https://www.postgresql.org/docs/current/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes: #46319 (cherry picked from commit b37e96712db1aace09f17b574eb02ff6b942a297)	2019-09-11 21:41:02 +03:00
Ryan Ernst	86290cb3d9	Make reuse of sql test code explicit (#45884 ) The sql project uses a common set of security tests, which are run in subprojects. Currently these are shared through a shared directory, but this is not setup correctly to ensure it is built before tests run. This commit changes the test classes to be an artifact of the sql/qa/security project and makes the test runner use the built artifact (a directory of classes) for tests. closes #45866	2019-09-11 10:56:07 -07:00
Andrei Stefan	7b26a8c041	Use `null` schema response for `SYS TABLES` command. (#46386 ) (cherry picked from commit a6152f42a47a1ccd668e5892778c8bd2d3a78c4c)	2019-09-07 09:24:54 +03:00
Andrei Stefan	7cf100ba07	SQL: fix scripting for grouped by datetime functions (#46421 ) * Fix issue with painless scripting not being correctly generated when datetime functions are used for GROUPing of an INTERVAL operation. (cherry picked from commit cb92828e8ec9d9d241bd6189e5835fd99f8b9a44)	2019-09-07 09:24:53 +03:00
Marios Trivyzas	fd0affb503	SQL: Fix issue with IIF function when condition folds (#46290 ) Previously, when the condition (1st argument) of the IIF function could be evaluated (folded) to false, the `IfConditional` was eliminated which caused `IndexOutOfBoundsException` to be thrown when `info()` and `resolveType()` methods were called. Fixes: #46268 (cherry picked from commit 9a885a3ac47bc8f52c07770d1d8d670ce0af1e59)	2019-09-04 10:32:49 +03:00
Armin Braun	bfddaaa2ae	Acknowledge Indices Were Wiped Successfully in REST Tests (#45832 ) (#45842 ) In internal test clusters tests we check that wiping all indices was acknowledged but in REST tests we didn't. This aligns the behavior in both kinds of tests. Relates #45605 which might be caused by unacked deletes that were just slow.	2019-08-22 17:19:51 +02:00
Igor Motov	98c850c08b	Geo: Change order of parameter in Geometries to lon, lat 7.x (#45618 ) Changes the order of parameters in Geometries from lat, lon to lon, lat and moves all Geometry classes to the org.elasticsearch.geometry package. Backport of #45332 Closes #45048	2019-08-16 14:42:02 -04:00
Andrei Stefan	adf8e20021	SQL: adds format parameter to range queries for constant date comparisons (#45503 ) * Add format parameter to the range queries built for CURRENT_* functions used in comparison conditions * Use range queries for date fields equality/non-equality as well. (cherry picked from commit c1e81e90f937ee5a002524d632bfce74d76962f9)	2019-08-13 23:04:30 +03:00
Andrei Stefan	740d58fd46	SQL: Uniquely named inner_hits sections for each nested field condition (#45341 ) * Name each inner_hits section of nested queries differently and extract and combine the multiple values it generates into a single list. This also introduces a limitation (which originates in Elasticsearch itself) on the sorting capabilities when the sorting is based on the nested fields filtered: only one of the conditions applied to nested documents will be used in the nested sorting. (cherry picked from commit cfc5cf68f6e83b07bb9006986d0903d6be418ec6)	2019-08-09 00:22:49 +03:00
Andrei Stefan	2633d11eb7	Switch from using docvalue_fields to extracting values from _source (#44062 ) (#44804 ) * Switch from using docvalue_fields to extracting values from _source where applicable. Doing this means parsing the _source and handling number parsing just as Elasticsearch does when indexing a document. * This also introduces a minor limitation: alias-type fields that are NOT part of a tree of sub-fields can no longer be retrieved. The field_caps API doesn't shed any light on whether a field is an alias, and at _source parsing time there is no way to know whether a root field is an alias. Fields of the type "a.b.c.alias" can be extracted from docvalue_fields only if the field they point to can be extracted from docvalue_fields. Also, not all fields in a hierarchy of fields can be evaluated as being an alias. (cherry picked from commit 8bf8a055e38f00df5f49c8d97f632f69d6e00c2c)	2019-07-25 10:02:41 +03:00
Ryan Ernst	7e06888bae	Convert testclusters to use distro download plugin (#44253 ) (#44362 ) Test clusters currently has its own set of logic for dealing with finding different versions of Elasticsearch, downloading them, and extracting them. This commit converts testclusters to use the DistributionDownloadPlugin.	2019-07-15 17:53:05 -07:00
Andrei Stefan	e9f9f00940	SQL: add pretty printing to JSON format (#43756 ) (#44220 ) (cherry picked from commit cbd9d4c259bf5a541bc49f65f7973174a36df449)	2019-07-11 20:02:24 +03:00
Igor Motov	df2e1fb43e	Geo: add validator that only checks altitude (#43893 ) By default, we don't check ranges while indexing geo_shapes. As a result, it is possible to index geoshapes that contain coordinates outside of the -90 +90 and -180 +180 ranges. Such geoshapes will currently break the SQL and ML retrieval mechanism. This commit removes these restrictions from the validator that is used in SQL and ML retrieval.	2019-07-10 16:55:03 -04:00
Andrei Stefan	9567f337f5	SQL: handle SQL not being available in a more graceful way (#43665 ) * Add test for SQL not being available error message in JDBC. * Add a new qa sub-project that explicitly disables SQL XPack module in Gradle. (cherry picked from commit 8a1ac8a3a88a325ec9b99963e0fa288c18ee0ee5)	2019-07-10 14:36:24 +03:00
Igor Motov	3607876a71	Geo: Makes coordinate validator in libs/geo pluggable (#43657 ) Moves coordinate validation from Geometry constructors into parser. Relates #43644	2019-06-27 19:53:41 -04:00
Alpar Torok	ea44da6069	Testclusters: convert remaining x-pack (#43335 ) Convert x-pack tests	2019-06-24 12:07:42 +03:00
Andrei Stefan	fe0f9055d8	Fix NPE in case of subsequent scrolled requests for a CSV/TSV formatted response (#43365 ) (cherry picked from commit 0ef7bb0f8b07cd0392d37f96ca9360821b19315a)	2019-06-20 11:26:11 +03:00
Jason Tedor	1f1a035def	Remove stale test logging annotations (#43403 ) This commit removes some very old test logging annotations that appeared to be added to investigate test failures that are long since closed. If these are needed, they can be added back on a case-by-case basis with a comment associating them to a test failure.	2019-06-19 22:58:22 -04:00
Igor Motov	9f7d1ff2de	Geo: Add coerce support to libs/geo WKT parser (#43273 ) Adds support for coercing not closed polygons and ignoring Z value to libs/geo WKT parser. Closes #43173	2019-06-18 14:41:01 -04:00
Marios Trivyzas	9cd89c3453	SQL: Increase hard limit for sorting on aggregates (#43220 ) To be consistent with the `search.max_buckets` default setting, set the hard limit of the PriorityQueue used for in memory sorting, when sorting on an aggregate function, to 10000. Fixes: #43168 (cherry picked from commit 079e012fdea68ea0a7daae078359495047e9c407)	2019-06-14 13:51:38 +02:00
Marios Trivyzas	3c73602524	SQL: Fix wrong results when sorting on aggregate (#43154 ) - Previously, when sorting on an aggregate function the bucket processing ended early when the explicit (LIMIT XXX) or the implicit limit of 512 was reached. As a consequence, only a subset of grouping buckets was processed and the results returned didn't reflect the global ordering. - Previously, the priority queue sorting method had an inverse comparison check and the final response from the priority queue was also returned in inverse order because of the calls to the `pop()` method. Fixes: #42851 (cherry picked from commit 19909edcfdf5792b38c1363b07379783ebd0e6c4)	2019-06-13 21:59:20 +02:00
Mark Vieira	e44b8b1e2e	[Backport] Remove dependency substitutions 7.x (#42866 ) * Remove unnecessary usage of Gradle dependency substitution rules (#42773) (cherry picked from commit 12d583dbf6f7d44f00aa365e34fc7e937c3c61f7)	2019-06-04 13:50:23 -07:00
James Rodewig	f51f8ed04c	[DOCS] Remove unneeded options from `[source,sql]` code blocks (#42759 ) In AsciiDoc, `subs="attributes,callouts,macros"` options were required to render `include-tagged::` in a code block. With elastic/docs#827, Elasticsearch Reference documentation migrated from AsciiDoc to Asciidoctor. In Asciidoctor, the `subs="attributes,callouts,macros"` options are no longer needed to render `include-tagged::` in a code block. This commit removes those unneeded options. Resolves #41589	2019-05-31 13:05:13 -04:00
Mark Vieira	c1816354ed	[Backport] Improve build configuration time (#42674 )	2019-05-30 10:29:42 -07:00
Igor Motov	d2f9ccbe18	Geo: Refactor libs/geo parsers (#42549 ) Refactors the WKT and GeoJSON parsers from a utility class into instantiable objects. This is a preliminary step in preparation for moving coordinate validators out of Geometry constructors. This should allow us to make validators pluggable.	2019-05-29 20:07:27 -04:00
Igor Motov	e28a9e99c4	SQL: Moves the JTS-based tests suppression to Before (#42526 ) Moves the test suppression from `ClassRule` to `Before`, where it is properly handled in the CI build. Fixes #42221	2019-05-24 13:58:53 -04:00
Costin Leau	a48125a9f7	Fix FROZEN indices backport	2019-05-23 21:30:41 +03:00
Costin Leau	9fdf4215dd	Docs: Documentation for the upcoming SQL support of frozen indices (#41863 ) (cherry picked from commit a3cc03eb1503df24c1706a721fcc9af38c3b2873) (cherry picked from commit f42dcf2ffd7bd25f3f91aa6127515f393cd1860f)	2019-05-23 21:16:16 +03:00
Costin Leau	d5f04d29c9	SQL: Add support for FROZEN indices (#41558 ) Allow querying of FROZEN indices both through dedicated SQL grammar extension: > SELECT field FROM FROZEN index and also through driver configuration parameter, namely: > index.include.frozen: true/false Fix #39390 Fix #39377 (cherry picked from commit 2445a933915f420c7f51e8505afa0a7978ce6b0f)	2019-05-23 21:16:16 +03:00
Igor Motov	076ca75ea5	SQL: Suppress geo tests failing on tr-TR locale (#42200 ) Due to a bug in the JTS WKT parser, JTS cannot parse most WKT shapes if the shape type is written in lower case. For example, `point (1 2)` causes JTS inside H2GIS to fail on the tr-TR locale as a result of case-insensitive comparison.	2019-05-17 16:00:54 -04:00
Marios Trivyzas	7473742e6e	SQL: Fix issue regarding INTERVAL * number (#42014 ) Interval * integer number is a valid operation which previously was only supported for foldables (literals) and not when a field was involved. That was because: 1. There was no common type returned for that combination 2. The `BinaryArithmeticOperation` was permitting the multiplication (called by fold()) but the BinaryArithmeticProcessor didn't allow it Moreover the error message for invalid arithmetic operations was wrong because of the issue with the overloading methods of `LoggerMessageFormat.format`. Fixes: #41239 Fixes: #41200 (cherry picked from commit 91039bab12d3ef27d6eac9cdc891a3b3ad0c694d)	2019-05-15 16:06:55 -04:00
Igor Motov	70ea3cf847	SQL: Add initial geo support (#42031 ) (#42135 ) Adds an initial, limited implementation of geo features to SQL. This implementation is based on the [OpenGIS® Implementation Standard for Geographic information - Simple feature access](http://www.opengeospatial.org/standards/sfs), which is the current standard for GIS system implementation. This effort is concentrated on the SQL option, AKA ISO 19125-2. Queries that are supported as a result of this initial implementation Metadata commands - `DESCRIBE table` - returns the correct column types `GEOMETRY` for geo shapes and geo points. - `SHOW FUNCTIONS` - returns a list that includes supported `ST_` functions - `SYS TYPES` and `SYS COLUMNS` display correct types `GEO_SHAPE` and `GEO_POINT` for geo shapes and geo points accordingly. Returning geoshapes and geopoints from elasticsearch - `SELECT geom FROM table` - returns the geoshapes and geo_points as libs/geo objects in JDBC or as WKT strings in console. - `SELECT ST_AsWKT(geom) FROM table;` and `SELECT ST_AsText(geom) FROM table;` - return the geoshapes and geopoints in their WKT representation; Using geopoints to elasticsearch - The following functions will be supported for geopoints in queries, sorting and aggregations: `ST_GeomFromText`, `ST_X`, `ST_Y`, `ST_Z`, `ST_GeometryType`, and `ST_Distance`. In most cases when used in queries, sorting and aggregations, these functions are translated into scripts. These functions can be used in the SELECT clause for both geopoints and geoshapes. - `SELECT * FROM table WHERE ST_Distance(ST_GeomFromText('POINT(1 2)'), point) < 10;` - returns all records for which `point` is located within 10m of `POINT(1 2)`. In this case the WHERE clause is translated into a range query. Limitations: Geoshapes cannot be used in queries, sorting and aggregations as part of this initial effort. In order to fully take advantage of geoshapes we would need to have access to geoshape doc values, which is coming in #37206. `ST_Z` cannot be used on geopoints in queries, sorting and aggregations since we don't store altitude in geo_point doc values. Relates to #29872 Backport of #42031	2019-05-14 18:57:12 -05:00
James Rodewig	d548901855	[DOCS] Add space to fix Asciidoctor output (#41579 )	2019-04-26 12:13:33 -04:00
Costin Leau	b288b88ba0	SQL: Use field caps inside DESCRIBE TABLE as well (#41377 ) Thanks to #34071, there is enough information in field caps to infer the table structure and thus use the same API consistently across the IndexResolver. (cherry picked from commit f99946943a3350206b6bca774b2f060f41a787b3)	2019-04-25 23:41:17 +03:00
Marios Trivyzas	e991175776	SQL: Implement IIF(<cond>, <result1>, <result2>) (#41420 ) Implement a more trivial case of the CASE expression which is expressed as a traditional function with 2 or 3 arguments. e.g.: IIF(a = 1, 'one', 'many') IIF(a > 0, 'positive') Closes: #40917 (cherry picked from commit add02f4f553ad472026dcc1eaa84245a0558a4b0)	2019-04-23 16:31:25 +03:00
Marios Trivyzas	67d4e399c2	SQL: Implement CASE... WHEN... THEN... ELSE... END (#41349 ) Implement the ANSI SQL CASE expression which provides the if/else functionality common to most programming languages. The CASE expression can have multiple WHEN branches and becomes a powerful tool for SQL queries as it can be used in SELECT, WHERE, GROUP BY, HAVING and ORDER BY clauses. Closes: #36200 (cherry picked from commit 8b2577406f47ae60d15803058921d128390af0b6)	2019-04-22 19:26:56 +03:00
Marios Trivyzas	7a34ba35f7	SQL: Fix bug with optimization of null related conditionals (#41355 ) The SimplifyConditional rule is removing NULL literals from those functions to simplify their evaluation. This happens in the Optimizer and a new instance of the conditional function is generated. Previously, the dataType was not set properly (defaulted to DataType.NULL) for those new instances and since the resolveType() wasn't called again it resulted in returning always null. E.g.: SELECT COALESCE(null, 'foo', null, 'bar') COALESCE(null, 'foo', null, 'bar') ----------------- null This issue was not visible before because the tests always used an alias for the conditional function which caused the resolveType() to be called which sets the dataType properly. E.g.: SELECT COALESCE(null, 'foo', null, 'bar') as c c ----------------- foo (cherry picked from commit c39980a65dd593363f1d8d1b038b26cb0ce02aaf)	2019-04-19 19:04:32 +03:00
Andrei Stefan	cfed5d65be	SQL: fix SecurityIT tests by covering edge case scenarios when audit file rolls over at midnight (#41328 ) Handle the scenario where assertLogs() is not called from a test method but the audit rolling file rolls over. * Use a local boolean variable instead of the static one to account for assertBusy() code block possibly being called multiple times and having different execution paths. (cherry picked from commit 6f642196cbab90079c610097befc794746170df1)	2019-04-18 21:24:18 +03:00
Mark Vieira	0227ac5690	Fix issue with subproject test task dependencies (#41321 ) (#41351 )	2019-04-18 11:15:34 -07:00
Costin Leau	85912b89fe	SQL: Fix LIMIT bug in agg sorting (#41258 ) When specifying a limit over an agg sorting, the limit will be pushed down to the grouping which affects the custom sorting. This commit fixes that and restricts the limit only to sorting. Fix #40984 (cherry picked from commit da3726528d9011b05c0677ece6d11558994eccd9)	2019-04-16 22:40:41 +03:00

281 Commits