druid

Author	SHA1	Message	Date
Jonathan Wei	94463b5778	Add missing redirects and fix broken links (#7213 ) * Add missing redirects * Fix zookeeper redirect * Fix broken links	2019-03-09 15:16:23 -08:00
Xue Yu	65118277a3	support sin cos etc trigonometric function in sql (#7182 ) * support triangle function in sql * feedback address	2019-03-04 19:18:22 -08:00
Jonathan Wei	32c418fdd8	Reword 'node' to 'process' (#7172 )	2019-02-28 18:10:39 -08:00
Surekha	2b04e6d0bc	add note on consistency of results for sys.segments queries (#7034 ) * add doc * change docs * PR comments * few more changes	2019-02-19 10:52:37 -08:00
Jonathan Wei	953b96d0a4	Add more sketch aggregator support in Druid SQL (#6951 ) * Add more sketch aggregator support in Druid SQL * Add docs * Tweak module serde register * Fix tests * Checkstyle * Test fix * PR comment * PR comment * PR comments	2019-02-02 22:34:53 -08:00
Surekha	7baa33049c	Introduce published segment cache in broker (#6901 ) * Add published segment cache in broker * Change the DataSegment interner so it's not based on DataSEgment's equals only and size is preserved if set * Added a trueEquals to DataSegment class * Use separate interner for realtime and historical segments * Remove trueEquals as it's not used anymore, change log message * PR comments * PR comments * Fix tests * PR comments * Few more modification to * change the coordinator api * removeall segments at once from MetadataSegmentView in order to serve a more consistent view of published segments * Change the poll behaviour to avoid multiple poll execution at same time * minor changes * PR comments * PR comments * Make the segment cache in broker off by default * Added a config to PlannerConfig * Moved MetadataSegmentView to sql module * Add doc for new planner config * Update documentation * PR comments * some more changes * PR comments * fix test * remove unintentional change, whether to synchronize on lifecycleLock is still in discussion in PR * minor changes * some changes to initialization * use pollPeriodInMS * Add boolean cachePopulated to check if first poll succeeds * Remove poll from start() * take the log message out of condition in stop()	2019-02-02 22:27:13 -08:00
Clint Wylie	7a5827e12e	bloom filter sql aggregator (#6950 ) * adds sql aggregator for bloom filter, adds complex value serde for sql results * fix tests * checkstyle * fix copy-paste	2019-02-01 13:54:46 -08:00
Surekha	4c211ab2b4	update sys table docs (#6955 ) * update sys table docs * Capitalize SQL	2019-01-31 08:51:39 -08:00
Jonathan Wei	82137874ea	Add master/data/query server concepts to docs/packaging (#6916 ) * Add master/data/query server concepts to docs/packaging * PR comments * TOC and markdown fix * Update image legend * PR comment * More PR comments	2019-01-30 19:41:07 -08:00
Dayue Gao	5b8a221713	Add SQL id, request logs, and metrics (#6302 ) * use SqlLifecyle to manage sql execution, add sqlId * add sql request logger * fix UT * rename sqlId to sqlQueryId, sql/time to sqlQuery/time, etc * add docs and more sql request logger impls * add UT for http and jdbc * fix forbidden use of com.google.common.base.Charsets * fix UT in QuantileSqlAggregatorTest, supressed unused warning of getSqlQueryId * do not use default method in QueryMetrics interface * capitalize 'sql' everywhere in the non-property parts of the docs * use RequestLogger interface to log sql query * minor bugfixes and add switching request logger * add filePattern configs for FileRequestLogger * address review comments, adjust sql request log format * fix inspection error * try SuppressWarnings("RedundantThrows") to fix inspection error on ComposingRequestLoggerProvider	2019-01-15 23:12:59 -08:00
Benjamin Hopp	ef80c4e036	Update sql.md (#6821 ) Corrected defaults for druid.sql.avatica.maxStatementsPerConnection and druid.sql.avatica.maxConnections	2019-01-08 10:15:12 -08:00
Clint Wylie	67f832957b	add bloom filter operator to general sql docs (#6785 )	2018-12-31 11:30:33 -08:00
David Lim	f7bbee2e65	Front Matter header needs to be on the first line for md to be rendered properly by jekyll (#6733 )	2018-12-13 11:47:20 -08:00
Vadim Ogievetsky	da4836f38c	Added titles and harmonized docs to improve usability and SEO (#6731 ) * added titles and harmonized docs * manually fixed some titles	2018-12-12 20:42:12 -08:00
Gian Merlino	154b6fbcef	SQL: Add "POSITION" function. (#6596 ) Also add a "fromIndex" argument to the strpos expression function. There are some -1 and +1 adjustment terms due to the fact that the strpos expression behaves like Java indexOf (0-indexed), but the POSITION SQL function is 1-indexed.	2018-11-13 13:39:00 -08:00
David Lim	afb239b17a	add missing license headers, in particular to MD files; clean up RAT … (#6563 ) * add missing license headers, in particular to MD files; clean up RAT exclusions * revert inadvertent doc changes * docs * cr changes * fix modified druid-production.svg	2018-11-13 09:38:37 -08:00
Surekha	3a0a667fe0	Introduce SystemSchema tables (#5989 ) (#6094 ) * Added SystemSchema with following tables (#5989) * SEGMENTS table provides details on served and published segments * SERVERS table provides details on data servers * SERVERSEGMETS table is the JOIN of SEGMENTS and SERVERS * TASKS table provides details on tasks * Add documentation for system schema * Fix static-analysis warnings * Address PR comments Add unit tests Fix a test * Try to fix a test * Fix a bug around replica count * rename io.druid to org.apache.druid * Major change is to make tasks and segment queries streaming * Made tasks/segments stream to calcite instead of storing it in memory * Add num_rows to segments table * Refactor JsonParserIterator * Replace with closeable iterator * Fix docs, make num_rows column nullable, some unit test changes * make num_rows column type long, allow it to be null fix a compile error after merge, add TrafficCop param to InputStreamResponseHandler * Filter null rows for segments table from Linq4j enumerable * change num_replicas datatype to long in segments table * Fix some tests and address comments * Doc updates, other PR comments * Update tests * Address comments * Add auth check * Update docs * Refactoring * Fix teamcity warning, change the getQueryableServer in TimelineServerView * Fix compilation after rebase * Use the stream API from AuthorizationUtils * Added LeaderClient interface and NoopDruidLeaderClient class * Revert "Added LeaderClient interface and NoopDruidLeaderClient class" This reverts commit `100fa46e39`. * Make the naming consistent to server_segments for the join table * Add ForbiddenException on auth check failure * Remove static block from SystemSchema * Try to fix a test in CalciteQueryTest due to rename of server_segments * Fix the json output format in the coordinator API * Add auth check in the segments API * Add null check to avoid NPE * Use annonymous class object instead of mock for DruidLeaderClient in SqlBenchmark * Fix test failures, type long/BIGINT can be nullable * Revert long nullability to fix tests * Fix style for tests * PR comments * Address PR comments * Add the missing BytesAccumulatingResponseHandler class * Use Sequences.withBaggage in DruidPlanner * Fix docs, add comments * Close the iterator if hasNext returns false	2018-10-10 17:17:29 -07:00
Gian Merlino	244046fda5	SQL: Fix too-long headers in http responses. (#6411 ) Fixes #6409 by moving column name info from HTTP headers into the result body.	2018-10-01 18:13:08 -07:00
Dayue Gao	edf0c13807	add a sql option to force user to specify time condition (#6246 ) * add a sql option to force user to specify time condition * rename forceTimeCondition to requireTimeCondition, refine error message	2018-09-17 13:52:24 -07:00
Gian Merlino	4669f0878f	SQL: UNION ALL operator. (#6314 ) * SQL: UNION ALL operator. * Remove unused import.	2018-09-09 22:32:56 -07:00
Jonathan Wei	180e3ccfad	Docs consistency cleanup (#6259 )	2018-09-04 12:54:41 -07:00
Gian Merlino	0172326c62	SQL: Support more result formats, add columns header. (#6191 ) * SQL: Support more result formats, add columns header. - Add result formats for line-based JSON and CSV. - Add X-Druid-Sql-Columns header with a list of all columns that the response will contain. - Add more comprehensive documentation on what callers should expect when making Druid SQL queries. * Fix some tests. * Adjust tests. * Adjust trailer, add types header. * Fix trailers.	2018-08-26 23:00:14 -06:00
Jonathan Wei	efab3b0160	Add concat and textcat SQL functions (#6005 )	2018-07-20 11:21:04 -07:00
Gian Merlino	cd8ea3da8d	SQL: Add server-wide default time zone config. (#5993 ) * SQL: Add server-wide default time zone config. * Switch API.	2018-07-18 13:12:40 -07:00
Arup Malakar	0c4598c1fe	Fix typo in avatica java client code documenation (#5553 )	2018-03-29 16:36:40 -05:00
Gian Merlino	0851f2206c	Expanded documentation for DataSketches aggregators. (#5513 ) Originally written by @AlexanderSaydakov in druid-io/druid-io.github.io#448. I also added redirects and updated links to point to the new datasketches-extension.html landing page for the extension, rather than to the old page about theta sketches.	2018-03-21 18:19:27 -07:00
Gian Merlino	f3796bc81b	SQL: Lower default JDBC frame size. (#5409 ) The previous default of 100,000 was a bit excessive and could easily lead to OOM errors on "select *" style queries.	2018-02-21 10:00:48 -08:00
Gian Merlino	53e3c7d1b2	SQL: Add additional unsupported features to the docs. (#5290 )	2018-01-24 11:27:47 -08:00
Gian Merlino	486159ba8c	SQL: Add TIMESTAMPADD. (#5079 )	2017-11-16 12:00:34 -08:00
Gian Merlino	4fd4444b42	SQL: Add "array" result format, and document result formats. (#5032 ) * SQL: Add "array" result format, and document result formats. * Code style.	2017-11-13 20:24:06 -08:00
Jonathan Wei	6840eabd87	Add Router connection balancers for Avatica queries (#4983 ) * Add Router connection balancers for Avatica queries * PR comments * Adjust test bounds * PR comments * Add doc comments * PR comments * PR comment * Checkstyle fix	2017-11-01 14:01:13 -07:00
Gian Merlino	d5e83f9d50	Fix docs for MOD. (#4971 )	2017-10-18 16:43:28 -07:00
Gian Merlino	f51f346e36	SQL: Fix POWER doc, add test. (#4953 )	2017-10-13 14:38:15 -07:00
Gian Merlino	5cfc7f9ef7	Fix formatting of SQL TRIM docs. (#4951 )	2017-10-13 14:38:06 -07:00
Gian Merlino	b20e3038b6	SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. (#4889 ) * SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. This brings benefits: - Ability to do GROUP BY and ORDER BY with ordinals. - Ability to support IN filters beyond 19 elements (fixes #4203). Some refactoring of druid-sql internals: - Builtin aggregators and operators are implemented as SqlAggregators and SqlOperatorConversions rather being special cases. This simplifies the Expressions and GroupByRules code, which were becoming complex. - SqlAggregator implementations are no longer responsible for filtering. Added new functions: - Expressions: strpos. - SQL: TRUNCATE, TRUNC, LENGTH, CHAR_LENGTH, STRLEN, STRPOS, SUBSTR, and DATE_TRUNC. * Add missing @Override annotation. * Adjustments for forbidden APIs. * Adjustments for forbidden APIs. * Disable GROUP BY alias. * Doc reword.	2017-10-10 12:44:05 -07:00
Gian Merlino	2ce8123bdb	Move scan-query from a contrib extension into core. (#4751 ) * Move scan-query from a contrib extension into core. Based on a proposal at: https://groups.google.com/d/topic/druid-development/ME_OatUDnbk/discussion This patch also adds support for virtual columns to the Scan query, and updates Druid SQL to use Scan instead of Select. This patch also makes some behavioral changes to handling of the __time column. In particular, it is now is returned as "__time" rather than "timestamp"; it is no longer included if you do not specifically ask for it in your "columns"; and it is returned as a long rather than a string. Users can revert time handling to the legacy extension behavior by setting "legacy" : true in their queries, or setting the property druid.query.scan.legacy = true. This is meant to provide a migration path for users that were formerly using the contrib extension. * Adjustments from review. * Add back Select query. * Adjust SQL docs. * Restore SelectQuery link.	2017-09-13 09:51:24 -07:00
Gian Merlino	4909c48b0c	SQL: Full TRIM support. (#4750 ) * SQL: Full TRIM support. - Support trimming arbitrary characters - Support BOTH, LEADING, and TRAILING * Remove unused import. * Fix tests, add RTRIM / LTRIM. * Remove unused imports. * BTRIM and docs. * Replace for with foreach.	2017-09-12 11:49:08 -07:00
Gian Merlino	d4ef0f6d94	Improved SQL support for floats and doubles. (#4598 ) * Improved SQL support for floats and doubles. - Use Druid FLOAT for SQL FLOAT, and Druid DOUBLE for SQL DOUBLE, REAL, and DECIMAL. - Use float* aggregators when appropriate. - Add tests involving both float and double columns. - Adjust documentation accordingly. * CR comments. * Fix braces.	2017-07-25 13:54:44 -07:00
Gian Merlino	16817e408d	SQL + Expressions = Best friends forever. (#4360 ) * SQL + Expressions = Best friends forever. - Use expressions as a projection layer for anything that can't be expressed using traditional Druid extractionFns. Sometimes they're embedded directly (like "expression" filters, builtin aggregators, or "expression" post-aggregators). Sometimes they're referenced through virtual columns (like dimensionSpecs, which can't innately reference functions of more than one column without the virtual column layer). - Add many new functions and operators, taking advantage of the expression capability (see the querying/sql.md doc). - Improve consistency of constant reduction and of casting by using Druid expressions for this instead of Calcite's RexExecutor. * Fix casting bug, and other code review comments. * Fix docs.	2017-07-07 08:48:26 -07:00
Jonathan Wei	3b70995bb3	Configurable row limit for JDBC frames (#4417 )	2017-06-16 17:07:40 -07:00
Gian Merlino	3216134f8c	SQL: Make row extractions extensible and add one for lookups. (#3991 ) This is a reopening of #3989, since that PR was merged to master prematurely and accidentally.	2017-03-13 21:56:16 -07:00
Gian Merlino	960769c583	SQL: Fix example INFORMATION_SCHEMA query. (#4017 )	2017-03-06 16:07:47 -08:00
Gian Merlino	af5a4cce3c	SQL: Clarify approximate distinct count behavior. (#4000 )	2017-03-03 13:42:30 -08:00
Gian Merlino	4a56d7d8a0	SQL: Ability to generate exact distinct count queries. (#3999 )	2017-03-03 23:40:36 +05:30
Gian Merlino	e63eefd7ff	Revert "SQL: Make row extractions extensible and add one for lookups. (#3989 )" The PR was merged to master accidentally. This reverts commit `23927a3c96`.	2017-03-01 17:06:12 -08:00
Gian Merlino	23927a3c96	SQL: Make row extractions extensible and add one for lookups. (#3989 ) * SQL: Make row extractions extensible and add one for lookups. * Fix QuantileSqlAggregatorTest.	2017-03-01 17:03:43 -08:00
Jihoon Son	7200dce112	Atomic merge buffer acquisition for groupBys (#3939 ) * Atomic merge buffer acquisition for groupBys * documentation * documentation * address comments * address comments * fix test failure * Addressed comments - Add InsufficientResourcesException - Renamed GroupByQueryBrokerResource to GroupByQueryResource * addressed comments * Add takeBatch() to BlockingPool	2017-02-22 14:49:37 -06:00
Gian Merlino	e7d01b67b6	Move SQL configs to sql.md. (#3959 ) This puts all the SQL stuff in one place. It also makes life easier by pointing out that configs be made in either common.runtime.properties or the broker runtime.properties.	2017-02-22 08:37:24 -08:00
Gian Merlino	ca6053d045	SQL: Resolve column type conflicts in favor of newer segments. (#3930 ) * SQL: Resolve column type conflicts in favor of newer segments. Helps with schema evolution from e.g. long -> float, which is supported on the query side. * Take columns from highest timestamp instead of max segment id. * Fixes and docs.	2017-02-15 17:48:49 -08:00
Gian Merlino	16ef513c7d	SQL: Add context and contextual functions to planner. (#3919 ) * SQL: Add context and contextual functions to planner. Added support for context parameters specified as JDBC connection properties or a JSON object for SQL-over-JSON-over-HTTP. Also added features that depend on context functionality: - Added CURRENT_DATE, CURRENT_TIME, CURRENT_TIMESTAMP functions. - Added support for time zones other than UTC via a "timeZone" context. - Pass down query context to Druid queries too. Also some bug fixes: - Fix DATE handling, it was largely done incorrectly before. - Fix CAST(__time TO DATE) which should do a floor-to-day. - Fix non-equality comparisons to FLOOR(__time TO X). - Fix maxQueryCount property. * Pass down context to nested queries too.	2017-02-15 14:09:14 -08:00

57 Commits