OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-10 15:05:33 +00:00

Author	SHA1	Message	Date
Marios Trivyzas	e1eb683c51	SQL: Fix issue with getting DATE type in JDBC (#40207 ) Previously, calling getDate()/getTime()/getTimestamp() and getObject() with the corresponding java.sql class on a column of SQL DATE type from the JDBC result set would throw an Exception.	2019-03-21 01:48:06 +01:00
Marios Trivyzas	bc4c8e53c5	SQL: Fix issue with date columns returned always in UTC (#40163 ) When selecting columns of ES type `date` (SQL's DATETIME) the `FieldHitExtractor` was not using the timezone of the client session but always resorted to UTC. The same behaviour (UTC only) was encountered also for grouping keys (`CompositeKeyExtractor`) and for First/Last functions on dates (`TopHitsAggExtractor`). Fixes: #40152	2019-03-20 20:32:33 +01:00
Andrei Stefan	791814bb47	SQL: fix incorrect ordering of groupings (GROUP BY) based on orderings (ORDER BY) (#40087 ) * Take into consideration aliases that can be used as aggregates and in the ORDER BY element so that the groupings are re-ordered inside the composite aggregation according to the ORDER BY ordering. (cherry picked from commit 110c0b90b9cf2e9344ab3f412cfa8f8cd94ad71f)	2019-03-18 15:37:45 +02:00
Costin Leau	076a68007c	SQL: Add multi_value_field_leniency inside FieldHitExtractor (#40113 ) For cases where fields can have multi values, allow the behavior to be customized through a dedicated configuration field. By default this will be enabled on the drivers so that existing datasets work instead of throwing an exception. For regular SQL usage, the behavior is false so that the user is aware of the underlying data. Fix #39700 (cherry picked from commit 2b351571961f172fd59290ee079126bbd081ceaf)	2019-03-18 14:56:03 +02:00
Costin Leau	3960374a6f	SQL: Introduce MAD (MedianAbsoluteDeviation) aggregation (#40048 ) Add Median Absolute Deviation aggregation Fix #39597 (cherry picked from commit 4f09613942a9249d06c74da64ad7e6f362e97f56)	2019-03-15 11:45:15 +02:00
Marios Trivyzas	4e9657f93f	SQL: Fix bug with JDBC timezone setting and DATE type (#39978 ) Previously, JDBC's REST call to the server was always sending UTC instead of the timezone passed through connection string/properties. Moreover the conversion to java.sql.Date was problematic as a calculation on the epoch millis was used to set the time to 00:00:00.000 and the timezone info was lost. This caused the resulting java.sql.Date object which is always using the JVM's timezone (no matter what timezone setting is used in the connection string/properties) to be wrongly created. Fixes: #39915	2019-03-14 13:41:53 +01:00
Igor Motov	2f47e3d05a	SQL: values in datetime script aggs should be treated as long (#39773 ) When a query is translated into script terms agg where key has a date type, it should generate a terms agg with value_type long instead of date, otherwise the key gets formatted as a string, which confuses hit extractor. Fixes #37042	2019-03-11 17:41:12 -04:00
Costin Leau	a079b9fd6d	SQL: ConstantProcessor can now handle NamedWriteable (#39876 ) Enhance ConstantProcessor to properly serialize complex objects (Intervals) that have their own custom serialization/deserialization mechanism Fix #39875 (cherry picked from commit ed8a1f9340673e69a44ea7a89679cadb4762e43d)	2019-03-11 12:49:23 +02:00
Ryan Ernst	465343f12a	Bundle java in distributions (#38013 ) * Bundle java in distributions Setting up a jdk is currently a required external step when installing elasticsearch. This is particularly problematic for the rpm/deb packages as installing a jdk in the same package installation command does not guarantee any order, so must be done in separate steps. Additionally, JAVA_HOME must be set and often causes problems in selecting a correct jdk when, for example, the system java is an older unsupported version. This commit bundles platform specific openjdks into each distribution. In addition to eliminating the issues above, it also presents future possible improvements like using jlink to build jdk images only containing modules that elasticsearch uses. closes #31845	2019-03-08 11:04:18 -08:00
Costin Leau	dfe81b260e	SQL: Enable accurate hit tracking on demand (#39527 ) Queries that require counting of all hits (COUNT(*) on implicit group by), now enable accurate hit tracking. Fix #37971 (cherry picked from commit 265b637cf6df08986a890b8b5daf012c2b0c1699)	2019-03-01 23:09:04 +02:00
Andrei Stefan	06d0e0efad	Removed custom naming for DISTINCT COUNT (#39537 ) (cherry picked from commit 9412a2ee01a60dd6449bbced1273ec0b37b65589)	2019-03-01 15:26:32 +02:00
Marios Trivyzas	9fb2f670dc	SQL: Enhance checks for inexact fields (#39427 ) For functions: move checks for `text` fields without underlying `keyword` fields or with many of them (ambiguity) to the type resolution stage. For Order By/Group By: move checks to the `Verifier` to catch early before `QueryTranslator` or execution. Closes: #38501 Fixes: #35203	2019-03-01 10:40:57 +01:00
Marios Trivyzas	a2c07b5011	SQL: Use underlying exact field for LIKE/RLIKE (#39443 ) Previously, if a text field had an underlying keyword field the latter was not used instead of the text leading to wrong results returned by queries filtering with LIKE/RLIKE. Fixes: #39442	2019-02-27 14:46:54 +01:00
Andrei Stefan	4deb69e9e4	SQL: introduce the columnar option for REST requests (#39287 ) * Add "columnar" option for REST requests (but be lenient for non-"plain" modes) for json, yaml, smile and cbor formats. * Updated documentation (cherry picked from commit 5b7e0de237fb514d14a61a347bc669d4b4adbe56)	2019-02-27 09:37:28 +02:00
Tim Vernum	30687cbe7f	Switch internal security index to ".security-7" (#39422 ) This changes the name of the internal security index to ".security-7", but supports indices that were upgraded from earlier versions and use the ".security-6" name. In all cases, both ".security-6" and ".security-7" are considered to be restricted index names regardless of which name is actually in use on the cluster. Backport of: #39337	2019-02-27 12:49:44 +11:00
Andrei Stefan	c1018db404	SQL: enforce JDBC driver - ES server version parity (#38972 ) (cherry picked from commit 822a21f29491f295b22dacd04b747781a69ffa61)	2019-02-20 11:29:02 +02:00
Andrei Stefan	7d78f4641b	SQL: fall back to using the field name for column label (#38842 ) (cherry picked from commit 0567bf24957be477e7649cff94872b0e7dc4d284)	2019-02-14 14:10:59 +02:00
Marios Trivyzas	032bcf99d6	SQL: Implement `::` cast operator (#38774 ) `<expression>::<dataType>` is a simplified altenative syntax to `CAST(<expression> AS <dataType> which exists in PostgreSQL and provides an improved user experience and possibly more compact SQL queries. Fixes: #38717	2019-02-12 16:54:14 +02:00
Andrei Stefan	6359d988f0	Account for a possible rolled over file while reading the audit log file (#34909 ) (cherry picked from commit 75cb6b38ed67dc9d32c9291b0c174ffa94e473bc)	2019-02-08 17:49:00 +02:00
Julie Tibshirani	3ce7d2c9b6	Make sure to reject mappings with type _doc when include_type_name is false. (#38270 ) `CreateIndexRequest#source(Map<String, Object>, ... )`, which is used when deserializing index creation requests, accidentally accepts mappings that are nested twice under the type key (as described in the bug report #38266). This in turn causes us to be too lenient in parsing typeless mappings. In particular, we accept the following index creation request, even though it should not contain the type key `_doc`: ``` PUT index?include_type_name=false { "mappings": { "_doc": { "properties": { ... } } } } ``` There is a similar issue for both 'put templates' and 'put mappings' requests as well. This PR makes the minimal changes to detect and reject these typed mappings in requests. It does not address #38266 generally, or attempt a larger refactor around types in these server-side requests, as I think this should be done at a later time.	2019-02-05 10:52:32 -08:00
Marios Trivyzas	c9701be1e8	SQL: Implement CURRENT_DATE (#38175 ) Since DATE data type is now available, this implements the `CURRENT_DATE/CURRENT_DATE()/TODAY()` similar to `CURRENT_TIMESTAMP`. Closes: #38160	2019-02-05 18:15:26 +02:00
Costin Leau	a088155f4d	SQL: Move metrics tracking inside PlanExecutor (#38259 ) Move metrics in one place, from the transport layer inside the PlanExecutor Remove unused class Close #38258	2019-02-03 22:31:35 +02:00
Andrei Stefan	6968f0925b	SQL: Generate relevant error message when grouping functions are not used in GROUP BY (#38017 ) * Add checks for Grouping functions restriction to be placed inside GROUP BY * Fixed bug where GROUP BY HISTOGRAM (not using alias) wasn't recognized properly in the Verifier due to functions equality not working correctly.	2019-02-02 22:05:47 +02:00
Costin Leau	783c9ed372	SQL: Allow sorting of groups by aggregates (#38042 ) Introduce client-side sorting of groups based on aggregate functions. To allow this, the Analyzer has been extended to push down to underlying Aggregate, aggregate function and the Querier has been extended to identify the case and consume the results in order and sort them based on the given columns. The underlying QueryContainer has been slightly modified to allow a view of the underlying values being extracted as the columns used for sorting might not be requested by the user. The PR also adds minor tweaks, mainly related to tree output. Close #35118	2019-02-02 01:38:25 +02:00
Marios Trivyzas	4710a7472f	SQL: Implement FIRST/LAST aggregate functions (#37936 ) FIRST and LAST can be used with one argument and work similarly to MIN and MAX but they are implemented using a Top Hits aggregation and therefore can also operate on keyword fields. When a second argument is provided then they return the first/last value of the first arg when its values are ordered ascending/descending (respectively) by the values of the second argument. Currently because of the usage of a Top Hits aggregation FIRST and LAST cannot be used in the HAVING clause of a GROUP BY query to filter on the results of the aggregation. Closes: #35639	2019-01-31 16:33:05 +02:00
Josh Soref	0154052335	spelling: java script -- not JavaScript (#37057 )	2019-01-31 14:09:36 +02:00
Adrien Grand	c8af0f4bfa	Use mappings to format doc-value fields by default. (#30831 ) Doc-value fields now return a value that is based on the mappings rather than the script implementation by default. This deprecates the special `use_field_mapping` docvalue format which was added in #29639 only to ease the transition to 7.x and it is not necessary anymore in 7.0.	2019-01-30 10:31:51 +01:00
Przemyslaw Gomulka	4f4113e964	Rename security audit.log to _audit.json (#37916 ) in order to keep json logs consistent the security audit logs are renamed from .log to .json relates #32850	2019-01-29 14:53:55 +01:00
Marios Trivyzas	d1ff450edc	SQL: Fix casting from date to numeric type to use millis (#37869 ) Previously casting from a DATE[TIME] type to a numeric (DOUBLE, LONG, INT, etc. used seconds instead of the epoch millis. Fixes: #37655	2019-01-25 23:29:10 +02:00
Marios Trivyzas	74b6f308e9	SQL: Fix issue with complex expression as args of PERCENTILE/_RANK (#37102 ) When the arguements of PERCENTILE and PERCENTILE_RANK can be folded, the `ConstantFolding` rule kicks in and calls the `replaceChildren()` method on `InnerAggregate` which is created from the aggregation rules of the `Optimizerz. `InnerAggregate` in turn, cannot implement the method as the logic of creating a new `InnerAggregate` instance from a list of `Expression`s resides in the Optimizer. So, instead, `ConstantFolding` should be applied before any of the aggregations related rules. Fixes: #37099	2019-01-24 18:40:20 +02:00
Marios Trivyzas	f707fa9e0a	SQL: Introduce SQL DATE data type (#37693 ) * SQL: Introduce SQL DATE data type Support ANSI SQL's DATE type by introducing a runtime-only ES SQL date type. Closes: #37340	2019-01-24 13:41:58 +02:00
Albert Zaharovits	b6936e3c1e	Remove index audit output type (#37707 ) This commit removes the Index Audit Output type, following its deprecation in 6.7 by 8765a31d4e6770. It also adds the migration notice (settings notice). In general, the problem with the index audit output is that event indexing can be slower than the rate with which audit events are generated, especially during the daily rollovers or the rolling cluster upgrades. In this situation audit events will be lost which is a terrible failure situation for an audit system. Besides of the settings under the `xpack.security.audit.index` namespace, the `xpack.security.audit.outputs` setting has also been deprecated and will be removed in 7. Although explicitly configuring the logfile output does not touch any deprecation bits, this setting is made redundant in 7 so this PR deprecates it as well. Relates #29881	2019-01-24 12:36:10 +02:00
Andrei Stefan	7507af29fa	SQL: Return Intervals in SQL format for CLI (#37602 ) * Add separate CLI Mode * Use the correct Mode for cursor close requests * Renamed CliFormatter and have different formatting behavior for CLI and "text" format.	2019-01-22 14:55:28 +02:00
Andrei Stefan	90ae556d97	Define constants for REST requests endpoints in tests (#37610 )	2019-01-22 10:01:51 +02:00
Marios Trivyzas	1686c32ba9	SQL: Rename SQL type DATE to DATETIME (#37395 ) * SQL: Rename SQL data type DATE to DATETIME SQL data type DATE has only the date part (e.g.: 2019-01-14) without any time information. Previously the SQL type DATE was referring to the ES DATE which contains also the time part along with TZ information. To conform with SQL data types the data type `DATE` is renamed to `DATETIME`, since it includes also the time, as a new runtime SQL `DATE` data type will be introduced down the road, which only contains the date part and meets the SQL standard. Closes: #36440 * Address comments	2019-01-17 10:17:58 +02:00
Marios Trivyzas	2cf4b1863f	SQL: Lowercase es data type (mapping) returned from SQL Commands (#37531 ) To follow the ES convention, convert the es data type, returned as column `mapping` from SQL Commands, to lowercase. Fixes: #37521	2019-01-16 18:08:33 +02:00
Costin Leau	1f76b5fc31	SQL: Describe aliases as views (#37496 ) When reporting metadata, several clients have issues with the 'ALIAS' type. To improve compatibility and be consistent with the ANSI SQL expectations and because they are similar, aliases targets are now reported as views. Close #37422	2019-01-16 17:26:00 +02:00
Andrei Stefan	659326fdd6	SQL: Add protocol tests and remove jdbc_type from drivers response (#37516 )	2019-01-16 16:28:46 +02:00
Costin Leau	023bb2f1e4	SQL: Remove slightly used meta commands (#37506 ) Remove SYS CATALOGS and SYS TABLE TYPES as they are a subset of SYS TABLES (and thus somewhat redundant) and used only by JDBC. Close #37409	2019-01-16 12:36:35 +02:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Jay Modi	f3edbe2911	Security: remove SSL settings fallback (#36846 ) This commit removes the fallback for SSL settings. While this may be seen as a non user friendly change, the intention behind this change is to simplify the reasoning needed to understand what is actually being used for a given SSL configuration. Each configuration now needs to be explicitly specified as there is no global configuration or fallback to some other configuration. Closes #29797	2019-01-14 14:06:22 -07:00
Costin Leau	a4339ec7e9	SQL: Use declared source for error messages (#37161 ) Improve error messages by returning the original SQL statement declaration instead of trying to reproduce it as the casing and whitespaces are not preserved accurately leading to small differences. Close #37161	2019-01-13 01:40:22 +02:00
markharwood	434430506b	Type removal - added deprecation warnings to _bulk apis (#36549 ) Added warnings checks to existing tests Added “defaultTypeIfNull” to DocWriteRequest interface so that Bulk requests can override a null choice of document type with any global custom choice. Related to #35190	2019-01-10 21:35:19 +00:00
Costin Leau	83f7423cd6	SQL: Fix bug regarding alias fields with dots (#37279 ) Field of types aliases that have dots in name are returned without a hierarchy by field_caps, as oppose to the mapping api or field with concrete types, which in turn breaks IndexResolver. This commit fixes this by creating the backing hierarchy similar to the mapping api. Close #37224	2019-01-10 22:18:53 +02:00
Andrei Stefan	4a92de214a	SQL: Proper handling of COUNT(field_name) and COUNT(DISTINCT field_name) (#37254 ) * provide overriden `hashCode` and toString methods to account for `DISTINCT` * change the analyzer for scenarios where `COUNT <field_name>` and `COUNT DISTINCT` have different paths * defined a new `filter` aggregation encapsulating an `exists` query to filter out null or missing values	2019-01-10 09:51:51 +02:00
Alpar Torok	6344e9a3ce	Testing conventions: add support for checking base classes (#36650 )	2019-01-08 13:39:03 +02:00
Andrei Stefan	3fad9d25f6	SQL: fix COUNT DISTINCT filtering (#37176 ) * Use `_count` aggregation value only for not-DISTINCT COUNT function calls * COUNT DISTINCT will use the _exact_ version of a field (the `keyword` sub-field for example), if there is one	2019-01-08 08:47:35 +02:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Andrei Stefan	39a072389c	SQL: add sub-selects to the Limitations page (#37012 )	2019-01-07 10:08:51 +02:00
Marios Trivyzas	e778abaac5	SQL: Improve error message when unable to translate to ES query DSL (#37129 ) Improve error message returned to the client when an SQL statement cannot be translated to a ES query DSL. Cases: 1. WHERE clause evaluates to FALSE => No results returned 1. Missing FROM clause => Local execution, e.g.: SELECT SIN(PI()) 3. Special SQL command => Only valid of SQL iface, e.g.: SHOW TABLES Fixes: #37040	2019-01-07 09:21:23 +02:00

1 2 3 4 5

248 Commits