OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-06 04:58:50 +00:00

Author	SHA1	Message	Date
Andrei Stefan	1a5ff05870	SQL: fix LIKE function equality by considering its pattern as well (#40260 ) * Define a equals method for Like function so that the pattern used is considered in the equality check. Whenever the functions are resolved this check should be used. (cherry picked from commit 4e5d5af58a140573b8ee19d57c7839db7b779e3b)	2019-03-21 11:44:57 +02:00
Marios Trivyzas	f37f2b5d39	SQL: Fix issue with optimization on queries with ORDER BY/LIMIT (#40256 ) Previously, when a trival plain `SELECT` or a trivial `SELECT` with aggregations has also an `ORDER BY` or a `LIMIT` or both, then the optimization to convert it to a `LocalRelation` was skipped resulting in exception thrown. E.g.:: ``` SELECT 'foo' FROM test LIMIT 10 ``` or ``` SELECT 'foo' FROM test GROUP BY 1 ORDER BY 1 ``` Fixes: #40211	2019-03-20 23:52:35 +01:00
Marios Trivyzas	bc4c8e53c5	SQL: Fix issue with date columns returned always in UTC (#40163 ) When selecting columns of ES type `date` (SQL's DATETIME) the `FieldHitExtractor` was not using the timezone of the client session but always resorted to UTC. The same behaviour (UTC only) was encountered also for grouping keys (`CompositeKeyExtractor`) and for First/Last functions on dates (`TopHitsAggExtractor`). Fixes: #40152	2019-03-20 20:32:33 +01:00
Andrei Stefan	791814bb47	SQL: fix incorrect ordering of groupings (GROUP BY) based on orderings (ORDER BY) (#40087 ) * Take into consideration aliases that can be used as aggregates and in the ORDER BY element so that the groupings are re-ordered inside the composite aggregation according to the ORDER BY ordering. (cherry picked from commit 110c0b90b9cf2e9344ab3f412cfa8f8cd94ad71f)	2019-03-18 15:37:45 +02:00
Costin Leau	076a68007c	SQL: Add multi_value_field_leniency inside FieldHitExtractor (#40113 ) For cases where fields can have multi values, allow the behavior to be customized through a dedicated configuration field. By default this will be enabled on the drivers so that existing datasets work instead of throwing an exception. For regular SQL usage, the behavior is false so that the user is aware of the underlying data. Fix #39700 (cherry picked from commit 2b351571961f172fd59290ee079126bbd081ceaf)	2019-03-18 14:56:03 +02:00
Igor Motov	a019af7690	SQL: Refactor Literals serialization method (#40058 ) Since other classes besides intervals can be serialized as part of the Cursor, the getNamedWritables method should be moved from Intervals to a more generic class Literals. Relates to #39973	2019-03-15 14:30:28 -04:00
Costin Leau	3960374a6f	SQL: Introduce MAD (MedianAbsoluteDeviation) aggregation (#40048 ) Add Median Absolute Deviation aggregation Fix #39597 (cherry picked from commit 4f09613942a9249d06c74da64ad7e6f362e97f56)	2019-03-15 11:45:15 +02:00
Marios Trivyzas	4e9657f93f	SQL: Fix bug with JDBC timezone setting and DATE type (#39978 ) Previously, JDBC's REST call to the server was always sending UTC instead of the timezone passed through connection string/properties. Moreover the conversion to java.sql.Date was problematic as a calculation on the epoch millis was used to set the time to 00:00:00.000 and the timezone info was lost. This caused the resulting java.sql.Date object which is always using the JVM's timezone (no matter what timezone setting is used in the connection string/properties) to be wrongly created. Fixes: #39915	2019-03-14 13:41:53 +01:00
Andrei Stefan	4d1305b6df	SQL: Extend the multi dot field notation extraction to lists of values (#39823 ) (cherry picked from commit 300ae485dd08373727ca111a4d21276dd47d9a27)	2019-03-14 11:21:53 +02:00
Igor Motov	2f47e3d05a	SQL: values in datetime script aggs should be treated as long (#39773 ) When a query is translated into script terms agg where key has a date type, it should generate a terms agg with value_type long instead of date, otherwise the key gets formatted as a string, which confuses hit extractor. Fixes #37042	2019-03-11 17:41:12 -04:00
Costin Leau	92a87a45bf	SQL: Wrap ZonedDateTime parameters inside scripts (#39911 ) Painless allows ZonedDateTime objects to be passed natively to scripts which creates problematic translate queries as the ZonedDateTime is passed as a string instead. Wrap this with a dedicated method to perform the conversion. Fix #39877 (cherry picked from commit 4957cad5bda77257d10430ac102e93f5e062148a)	2019-03-11 17:44:03 +02:00
Costin Leau	a079b9fd6d	SQL: ConstantProcessor can now handle NamedWriteable (#39876 ) Enhance ConstantProcessor to properly serialize complex objects (Intervals) that have their own custom serialization/deserialization mechanism Fix #39875 (cherry picked from commit ed8a1f9340673e69a44ea7a89679cadb4762e43d)	2019-03-11 12:49:23 +02:00
Marios Trivyzas	c72a7998f5	SQL: Don't allow inexact fields for MIN/MAX (#39563 ) MIN/MAX on strings are supported and are implemented with TopAggs FIRST/LAST respectively, but they cannot operate on `text` fields without underlying `keyword` fields => inexact. Follows: #39427	2019-03-04 15:35:11 +01:00
Costin Leau	e038ccef13	SQL: Fix merging of incompatible multi-fields (#39560 ) Fix bug in IndexResolver that caused conflicts in multi-field types to be ignored up (causing the query to fail later on due to mapping conflicts). The issue was caused by the multi-field which forced the parent creation before checking its validity across mappings Fix #39547 (cherry picked from commit 4e4fe289f90b9b5eae09072d54903701a3128696)	2019-03-02 10:30:02 +02:00
Costin Leau	dfe81b260e	SQL: Enable accurate hit tracking on demand (#39527 ) Queries that require counting of all hits (COUNT(*) on implicit group by), now enable accurate hit tracking. Fix #37971 (cherry picked from commit 265b637cf6df08986a890b8b5daf012c2b0c1699)	2019-03-01 23:09:04 +02:00
Andrei Stefan	06d0e0efad	Removed custom naming for DISTINCT COUNT (#39537 ) (cherry picked from commit 9412a2ee01a60dd6449bbced1273ec0b37b65589)	2019-03-01 15:26:32 +02:00
Andrei Stefan	ba44f28340	SQL: ignore UNSUPPORTED fields for JDBC and ODBC modes in 'SYS COLUMNS' (#39518 ) * SYS COLUMNS will skip UNSUPPORTED field types in ODBC and JDBC, as well. NESTED and OBJECT types were already skipped in ODBC mode, now they are skipped in JDBC mode, as well. (cherry picked from commit 9e0df64b2d36c9069dfa506570468f0522c86417)	2019-03-01 15:26:31 +02:00
Marios Trivyzas	9fb2f670dc	SQL: Enhance checks for inexact fields (#39427 ) For functions: move checks for `text` fields without underlying `keyword` fields or with many of them (ambiguity) to the type resolution stage. For Order By/Group By: move checks to the `Verifier` to catch early before `QueryTranslator` or execution. Closes: #38501 Fixes: #35203	2019-03-01 10:40:57 +01:00
Marios Trivyzas	a2c07b5011	SQL: Use underlying exact field for LIKE/RLIKE (#39443 ) Previously, if a text field had an underlying keyword field the latter was not used instead of the text leading to wrong results returned by queries filtering with LIKE/RLIKE. Fixes: #39442	2019-02-27 14:46:54 +01:00
Andrei Stefan	542e2c55f6	SQL: change the default precision for CURRENT_TIMESTAMP function (#39391 ) (cherry picked from commit dbb93310b083226c96e4bde3eef0079eb01cbca9)	2019-02-27 09:49:42 +02:00
Andrei Stefan	4deb69e9e4	SQL: introduce the columnar option for REST requests (#39287 ) * Add "columnar" option for REST requests (but be lenient for non-"plain" modes) for json, yaml, smile and cbor formats. * Updated documentation (cherry picked from commit 5b7e0de237fb514d14a61a347bc669d4b4adbe56)	2019-02-27 09:37:28 +02:00
Marios Trivyzas	032bcf99d6	SQL: Implement `::` cast operator (#38774 ) `<expression>::<dataType>` is a simplified altenative syntax to `CAST(<expression> AS <dataType> which exists in PostgreSQL and provides an improved user experience and possibly more compact SQL queries. Fixes: #38717	2019-02-12 16:54:14 +02:00
Costin Leau	794ee4fb10	SQL: Prevent grouping over grouping functions (#38649 ) Improve verifier to disallow grouping over grouping functions (e.g. HISTOGRAM over HISTOGRAM). Close #38308 (cherry picked from commit 4e9b1cfd4df38c652bba36b4b4b538ce7c714b6e)	2019-02-09 09:30:06 +02:00
Marios Trivyzas	871036bd21	SQL: Relax StackOverflow circuit breaker for constants (#38572 ) Constant numbers (of any form: integers, decimals, negatives, scientific) and strings shouldn't increase the depth counters as they don't contribute to the increment of the stack depth. Fixes: #38571	2019-02-09 09:18:21 +02:00
Marios Trivyzas	af8a444caa	SQL: Replace joda with java time (#38437 ) Replace remaining usages of joda classes with java time. Fixes: #37703	2019-02-08 22:58:07 +02:00
Marios Trivyzas	f96bd2ad71	SQL: Fix issue with IN not resolving to underlying keyword field (#38440 ) - Add resolution to the exact keyword field (if exists) for text fields. - Add proper verification and error message if underlying keyword doesn'texist. - Move check for field attribute in the comparison list to the `resolveType()` method of `IN`. Fixes: #38424	2019-02-06 16:25:06 +02:00
Costin Leau	1a02445ae1	SQL: Allow look-ahead resolution of aliases for WHERE clause (#38450 ) Aliases defined in SELECT (Project or Aggregate) are now resolved in the following WHERE clause. The Analyzer has been enhanced to identify this rule and replace the field accordingly. Close #29983	2019-02-06 12:08:32 +02:00
Marios Trivyzas	2c30501c74	SQL: Fix esType for DATETIME/DATE and INTERVALS (#38179 ) Since introduction of data types that don't have a corresponding type in ES the `esType` is error-prone when used for `unmappedType()` calls. Moreover since the renaming of `DATE` to `DATETIME` and the introduction of an actual date-only `DATE` the `esType` would return `datetime` which is not a valid type for ES mapping. Fixes: #38051	2019-02-05 23:12:52 +02:00
Marios Trivyzas	c9701be1e8	SQL: Implement CURRENT_DATE (#38175 ) Since DATE data type is now available, this implements the `CURRENT_DATE/CURRENT_DATE()/TODAY()` similar to `CURRENT_TIMESTAMP`. Closes: #38160	2019-02-05 18:15:26 +02:00
Andrei Stefan	cea81b199d	Change the milliseconds precision to 3 digits for intervals. (#38297 )	2019-02-05 12:00:49 +02:00
Costin Leau	75f0750ff7	SQL: Remove exceptions from Analyzer (#38260 ) Instead of throwing an exception, use an unresolved attribute to pass the message to the Verifier. Additionally improve the parser to save the extended source for the Aggregate and OrderBy. Close #38208	2019-02-03 22:32:16 +02:00
Costin Leau	a088155f4d	SQL: Move metrics tracking inside PlanExecutor (#38259 ) Move metrics in one place, from the transport layer inside the PlanExecutor Remove unused class Close #38258	2019-02-03 22:31:35 +02:00
Andrei Stefan	6968f0925b	SQL: Generate relevant error message when grouping functions are not used in GROUP BY (#38017 ) * Add checks for Grouping functions restriction to be placed inside GROUP BY * Fixed bug where GROUP BY HISTOGRAM (not using alias) wasn't recognized properly in the Verifier due to functions equality not working correctly.	2019-02-02 22:05:47 +02:00
Costin Leau	783c9ed372	SQL: Allow sorting of groups by aggregates (#38042 ) Introduce client-side sorting of groups based on aggregate functions. To allow this, the Analyzer has been extended to push down to underlying Aggregate, aggregate function and the Querier has been extended to identify the case and consume the results in order and sort them based on the given columns. The underlying QueryContainer has been slightly modified to allow a view of the underlying values being extracted as the columns used for sorting might not be requested by the user. The PR also adds minor tweaks, mainly related to tree output. Close #35118	2019-02-02 01:38:25 +02:00
Marios Trivyzas	4710a7472f	SQL: Implement FIRST/LAST aggregate functions (#37936 ) FIRST and LAST can be used with one argument and work similarly to MIN and MAX but they are implemented using a Top Hits aggregation and therefore can also operate on keyword fields. When a second argument is provided then they return the first/last value of the first arg when its values are ordered ascending/descending (respectively) by the values of the second argument. Currently because of the usage of a Top Hits aggregation FIRST and LAST cannot be used in the HAVING clause of a GROUP BY query to filter on the results of the aggregation. Closes: #35639	2019-01-31 16:33:05 +02:00
Andrei Stefan	908c8def06	SQL: Skip the nested and object field types in case of an ODBC request (#37948 )	2019-01-30 11:34:47 +02:00
Adrien Grand	c8af0f4bfa	Use mappings to format doc-value fields by default. (#30831 ) Doc-value fields now return a value that is based on the mappings rather than the script implementation by default. This deprecates the special `use_field_mapping` docvalue format which was added in #29639 only to ease the transition to 7.x and it is not necessary anymore in 7.0.	2019-01-30 10:31:51 +01:00
Marios Trivyzas	e9332331a3	SQL: Make error msg for validation of 2nd arg of PERCENTILE[_RANK] consistent (#37937 ) Use `first` and `second` instead of `1st` and `2nd`.	2019-01-29 21:20:09 +02:00
Marios Trivyzas	d1ff450edc	SQL: Fix casting from date to numeric type to use millis (#37869 ) Previously casting from a DATE[TIME] type to a numeric (DOUBLE, LONG, INT, etc. used seconds instead of the epoch millis. Fixes: #37655	2019-01-25 23:29:10 +02:00
Martijn Laarman	dfecb256cb	Exit batch files explictly using ERRORLEVEL (#29583 ) * Exit batch files explictly using ERRORLEVEL This makes sure the exit code is preserved when calling the batch files from different contexts other than DOS Fixes #29582 This also fixes specific error codes being masked by an explict exit /b 1 causing the useful exitcodes from ExitCodes to be lost. * fix line breaks for calling cli to match the bash scripts * indent size of bash files is 2, make sure editorconfig does the same for bat files * update indenting to match bash files * update elasticsearch-keystore.bat indenting * Update elasticsearch-node.bat to exit outside of endlocal	2019-01-25 16:44:33 +01:00
Christoph Büscher	b4b4cd6ebd	Clean codebase from empty statements (#37822 ) * Remove empty statements There are a couple of instances of undocumented empty statements all across the code base. While they are mostly harmless, they make the code hard to read and are potentially error-prone. Removing most of these instances and marking blocks that look empty by intention as such. * Change test, slightly more verbose but less confusing	2019-01-25 14:23:02 +01:00
Jim Ferenczi	787acb14b9	Track total hits up to 10,000 by default (#37466 ) This commit changes the default for the `track_total_hits` option of the search request to `10,000`. This means that by default search requests will accurately track the total hit count up to `10,000` documents, requests that match more than this value will set the `"total.relation"` to `"gte"` (e.g. greater than or equals) and the `"total.value"` to `10,000` in the search response. Scroll queries are not impacted, they will continue to count the total hits accurately. The default is set back to `true` (accurate hit count) if `rest_total_hits_as_int` is set in the search request. I choose `10,000` as the default because that's also the number we use to limit pagination. This means that users will be able to know how far they can jump (up to 10,000) even if the total number of hits is not accurate. Closes #33028	2019-01-25 13:45:39 +01:00
Marios Trivyzas	74b6f308e9	SQL: Fix issue with complex expression as args of PERCENTILE/_RANK (#37102 ) When the arguements of PERCENTILE and PERCENTILE_RANK can be folded, the `ConstantFolding` rule kicks in and calls the `replaceChildren()` method on `InnerAggregate` which is created from the aggregation rules of the `Optimizerz. `InnerAggregate` in turn, cannot implement the method as the logic of creating a new `InnerAggregate` instance from a list of `Expression`s resides in the Optimizer. So, instead, `ConstantFolding` should be applied before any of the aggregations related rules. Fixes: #37099	2019-01-24 18:40:20 +02:00
Marios Trivyzas	9357929309	SQL: Improve handling of invalid args for PERCENTILE/PERCENTILE_RANK (#37803 ) Improve the Exception and the error message returned when 2nd argument of PERCENTILE and PERCENTILE_RANK is not a constant.	2019-01-24 15:03:49 +02:00
Marios Trivyzas	f707fa9e0a	SQL: Introduce SQL DATE data type (#37693 ) * SQL: Introduce SQL DATE data type Support ANSI SQL's DATE type by introducing a runtime-only ES SQL date type. Closes: #37340	2019-01-24 13:41:58 +02:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Andrei Stefan	7507af29fa	SQL: Return Intervals in SQL format for CLI (#37602 ) * Add separate CLI Mode * Use the correct Mode for cursor close requests * Renamed CliFormatter and have different formatting behavior for CLI and "text" format.	2019-01-22 14:55:28 +02:00
Igor Motov	54af8a4e7a	SQL: fix object extraction from sources (#37502 ) Throws an exception if hit extractor tries to retrieve unsupported object. For example, selecting "a" from `{"a": {"b": "c"}}` now throws an exception instead of returning null. Relates to #37364	2019-01-18 14:03:48 -05:00
Marios Trivyzas	1686c32ba9	SQL: Rename SQL type DATE to DATETIME (#37395 ) * SQL: Rename SQL data type DATE to DATETIME SQL data type DATE has only the date part (e.g.: 2019-01-14) without any time information. Previously the SQL type DATE was referring to the ES DATE which contains also the time part along with TZ information. To conform with SQL data types the data type `DATE` is renamed to `DATETIME`, since it includes also the time, as a new runtime SQL `DATE` data type will be introduced down the road, which only contains the date part and meets the SQL standard. Closes: #36440 * Address comments	2019-01-17 10:17:58 +02:00
Marios Trivyzas	ecf0de30ba	SQL: Lowercase the datatypes in validation error msgs (#37524 ) To follow the ES convention display the datatypes in lowercase in error messages thrown during validation if `IN` and conditional functions.	2019-01-16 18:41:10 +02:00

... 2 3 4 5 6 ...

374 Commits