druid

Commit Graph

Author	SHA1	Message	Date
Vadim Ogievetsky	ac0a45471e	Web console: add sort to tiers list (#10416 ) * add sort to tiers list * update snapshot	2020-09-22 19:00:55 -07:00
Vadim Ogievetsky	7cc0a7be68	Web console: clean up styling imports (#10410 ) * fix styling for importing * fix quotes	2020-09-21 17:30:25 -07:00
Tarun	49a09302f3	Issue fix for CSV loading with header and skip header not parsing well. (#10398 )	2020-09-21 15:14:22 -07:00
Vadim Ogievetsky	6c5c86d800	Web console: fix lookup edit dialog, allow column renaming (#10406 ) * column rename * update licenses file * remove empty file * update license file * move comment	2020-09-20 14:10:05 -07:00
sthetland	ae247b6e63	Document change in results of groupBy queries with subtotalsSpec (#10405 ) * subtotalsSpec results with null values Document the format change in results of a groupBy query with a subtotalsSpec. This update applies to 0.18 and later. * Review catches	2020-09-19 10:51:23 -07:00
Maytas Monsereenusorn	e78d7862a8	Auto-compaction snapshot status API (#10371 ) * Auto-compaction snapshot API * Auto-compaction snapshot API * Auto-compaction snapshot API * Auto-compaction snapshot API * Auto-compaction snapshot API * Auto-compaction snapshot API * Auto-compaction snapshot API * fix when not all compacted segments are iterated * add unit tests * add unit tests * add unit tests * add unit tests * add unit tests * add unit tests * add some tests to make code cov happy * address comments * address comments * address comments * address comments * make code coverage happy * address comments * address comments * address comments * address comments	2020-09-18 16:37:58 -07:00
Igor Dvorzhak	d0ee2e3a48	Upgrade ORC to 1.5.10 version (#10291 )	2020-09-18 13:38:45 -07:00
Dylan Wylie	f3eb0cfb3b	Avoid large limits causing int overflow in buffer size checks (#10356 ) * Avoid large limits causing int overflow in buffer size checks * fix lgtm overflow warning Co-authored-by: Dylan <dwylie@spotx.tv>	2020-09-18 13:08:49 -07:00
belugabehr	74368d95af	Remove JODA Time Dependency from Avro Extensions (#10010 )	2020-09-18 12:41:42 -07:00
Mainak Ghosh	d9beda7f24	Adding the missing sqlQueryContext api (#10368 ) * Adding the missing sqlQueryContext api * Adding a serialization test for DefaultRequestLogEvent * Fixing the unit test failure	2020-09-18 00:46:31 -07:00
Mainak Ghosh	14072d3ab0	Adding more dimensions to the audit log entry (#10373 ) * Adding more dimensions to the audit log entry * Making adding payload in audit metric optional * Changing the name of the parameter to includePayloadAsDimensionInMetric. Adding a unit test * Fixing the intellij code introspection issues	2020-09-17 18:36:28 -07:00
Suneet Saldanha	0b4c897fbe	Vectorized variance aggregators (#10390 ) * wip vectorize * close but not quite * faster * unit tests * fix complex types for variance	2020-09-17 15:05:40 -07:00
Arvin.Z	1b05d6e542	recreate the balancer executor only when needed (#10280 ) * recreate the balancer executor only when needed * fix UT error * shutdown the balancer executor in stopBeingLeader and stop * remove commented code * remove comments	2020-09-16 14:25:57 -05:00
Atul Mohan	94226f1b3d	Disable sending server version in response headers (#9832 ) * Toggle sending of server version * Remove config Co-authored-by: Atul Mohan <atulmohan@yahoo-inc.com>	2020-09-15 22:48:00 -07:00
Atul Mohan	b6ad790dc7	Support combining inputsource for parallel ingestion (#10387 ) * Add combining inputsource * Fix documentation Co-authored-by: Atul Mohan <atulmohan@yahoo-inc.com>	2020-09-15 16:25:35 -07:00
Jihoon Son	8657b23ab2	Integration tests and docs for auto compaction with different partitioning (#10354 ) * Working * add test * doc * fix test * split other integration test * exclude other-index from other tests * doc anchor fix * adjust task slots and number of merge tasks * spell check * reduce maxNumConcurrentSubTasks to 1 * maxNumConcurrentSubtasks for range partitinoing * reduce memory for historical * change group name	2020-09-15 11:28:09 -07:00
Vadim Ogievetsky	e465f05717	Web console: Improve number alignment in tables (#10389 ) * Improve tables * removed unused state interfaces * better copy * one more functional component * updated e2e tests * extract braced text correctly	2020-09-14 19:53:38 -07:00
Chi Cao Minh	5751d0edc1	Skip coverage check for tag builds (#10397 ) The code coverage diff calculation assumes the TRAVIS_BRANCH environment variable is the name of a branch; however, for tag builds it is the name of the tag so the diff calculation fails. Since builds triggered by tags do not have a code diff, the coverage check should be skipped to avoid the error and to save some CI resources.	2020-09-14 19:46:33 -07:00
Suneet Saldanha	f71ba6f2c2	Vectorized ANY aggregators (#10338 ) * WIP vectorized ANY aggregators * tests * fix aggs * cleanup * code review + tests * docs * use NilVectorSelector when needed * fix spellcheck * dont instantiate vectors * cleanup	2020-09-14 19:44:58 -07:00
Clint Wylie	e012d5c41b	allow vectorized query engines to utilize vectorized virtual columns (#10388 ) * allow vectorized query engines to utilize vectorized virtual column implementations * javadoc, refactor, checkstyle * intellij inspection and more javadoc * better * review stuffs * fix incorrect refactor, thanks tests * minor adjustments	2020-09-14 19:29:35 -07:00
Clint Wylie	184b202411	add computed Expr output types (#10370 ) * push down ValueType to ExprType conversion, tidy up * determine expr output type for given input types * revert unintended name change * add nullable * tidy up * fixup * more better * fix signatures * naming things is hard * fix inspection * javadoc * make default implementation of Expr.getOutputType that returns null * rename method * more test * add output for contains expr macro, split operation and function auto conversion	2020-09-14 18:18:56 -07:00
Clint Wylie	084b23deed	benchmark for indexed table experiments (#10327 ) * benchmark for indexed table experiments * fix style * teardown outside of measurement	2020-09-14 15:14:38 -07:00
Abhishek Agarwal	f5e2645bbb	Support SearchQueryDimFilter in sql via new methods (#10350 ) * Support SearchQueryDimFilter in sql via new methods * Contains is a reserved word * revert unnecessary change * Fix toDruidExpression method * rename methods * java docs * Add native functions * revert change in dockerfile * remove changes from dockerfile * More tests * travis fix * Handle null values better	2020-09-14 09:57:54 -07:00
Cheng Pan	3d4b48e0aa	TransformSpecTest should extends InitializedNullHandlingTest (#10392 )	2020-09-14 08:22:24 -07:00
Vadim Ogievetsky	3c8eacb2d4	Web console: improve query manager (convert to React hook) (#10360 ) * Better query running * update licenses * update tests * updated tests v2 * fade in cancel * add exemplary tests * update mkcomp * fix inconsistent state update * remove lastParsedQuery * work if not a valid literal * remove unused params * fix licenses * better state update * get error message * isEmpty tidy * add tests around error message highlighting * pull live query selector into a component * add LiveQueryModeSelector tests * update snapshots	2020-09-11 19:42:50 -07:00
Curt Buechter	e3735602f2	Fix typo (#10385 )	2020-09-11 16:31:36 -07:00
Jihoon Son	8f14ac814e	More structured way to handle parse exceptions (#10336 ) * More structured way to handle parse exceptions * checkstyle; add more tests * forbidden api; test * address comment; new test * address review comments * javadoc for parseException; remove redundant parseException in streaming ingestion * fix tests * unnecessary catch * unused imports * appenderator test * unused import	2020-09-11 16:31:10 -07:00
Cheng Pan	8aea8cf1c6	Unit tests fail due to missing extend InitializedNullHandlingTest (#10382 ) * CsvInputFormatTest should extend InitializedNullHandlingTest * FirehoseFactoryToInputSourceAdaptorTest should extends InitializedNullHandlingTest	2020-09-11 16:23:46 -07:00
Lucas Capistrant	690e070c43	Fix doc for name of dynamic config to pause coordination (#10345 )	2020-09-11 08:40:06 -05:00
Abhishek Agarwal	a5c46dc84b	Add vectorization for druid-histogram extension (#10304 ) * First draft * Remove redundant code from FixedBucketsHistogramAggregator classes * Add test cases for new classes * Fix tests in sql compatible mode * Typo fix * Fix comment * Add spelling * Vectorize only for supported types * Rename internal aggregator files * Fix tests	2020-09-09 13:56:33 -07:00
Joy Kent	e5f0da30ae	Fix stringFirst/stringLast rollup during ingestion (#10332 ) * Add IndexMergerRollupTest This changelist adds a test to merge indexes with StringFirst/StringLast aggregator. * Fix StringFirstAggregateCombiner/StringLastAggregateCombiner The segment-level type for stringFirst/stringLast is SerializablePairLongString, not String. This changelist fixes it. * Fix EarliestLatestAnySqlAggregator to handle COMPLEX type This changelist allows EarliestLatestAnySqlAggregator to accept COMPLEX type as an operand. For its return type, we set it to VARCHAR, since COMPLEX column is only generated by stringFirst/stringLast during ingestion rollup. * Return value with smaller timestamp in StringFirstAggregatorFactory.combine function * Add integration tests for stringFirst/stringLast during ingestion * Use one EarliestLatestReturnTypeInference instance Co-authored-by: Joy Kent <joy@automonic.ai>	2020-09-08 17:36:04 -07:00
Jihoon Son	d32d1e7004	Fix result-level caching (#10341 ) * create baseSequence early * unit test * add comment and a new test	2020-09-08 11:04:00 -07:00
Chi Cao Minh	176b715624	Ignore CVEs from htrace and ambari transitive deps (#10353 ) * Ignore CVEs from htrace and ambari transitive deps htrace CVEs are suppressed for now as addressing them requires updating the hadoop version. ambari CVEs are suppressed for now since ambari is updated to the latest version and is no longer actively maintained. * Fix compilation issue from ambari upgrade * Add missing test coverage	2020-09-04 15:22:26 -07:00
Suneet Saldanha	91a153820e	fix NPE in StringGroupByColumnSelectorStrategy#bufferComparator (#10325 ) * fix NPE in StringGroupByColumnSelectorStrategy#bufferComparator * Add tests * javadocs	2020-09-04 13:23:40 -07:00
Gian Merlino	d7fcff3aba	StringFirstAggregatorFactory: Fix incorrect "combine" method. (#10351 ) * StringFirstAggregatorFactory: Fix incorrect "combine" method. There was a test, but it was wrong. * Fix superclass.	2020-09-03 20:03:26 -07:00
LightGHLi	a3bb6ee4a6	Add missing comma between JSON members in data-formats.md (#10343 )	2020-09-03 20:03:06 -07:00
Suneet Saldanha	a5cd5f1e84	Fix VARIANCE aggregator comparator (#10340 ) * Fix VARIANCE aggregator comparator The comparator for the variance aggregator used to compare values using the count. This is now fixed to compare values using the variance. If the variance is equal, the count and sum are used as tie breakers. * fix tests + sql compatible mode * code review * more tests * fix last test	2020-09-03 17:38:37 -07:00
xiangqiao123	3fc8bc0701	optimize announceHistoricalSegments (#9935 ) * optimize announceHistoricalSegment * optimize announceHistoricalSegment * revert offline SegmentTransactionalInsertAction uses a separate lock * optimize segmentExistsBatch: Avoid too many elements in the in condition * add unit test && Modified according to cr Co-authored-by: xiangqiao <xiangqiao@kuaishou.com>	2020-09-02 13:07:10 -07:00
Clint Wylie	a7924a9dee	add link to Docker quickstart in github README (#10299 ) Per suggestion in comment https://github.com/apache/druid/pull/9262#issuecomment-675732237, I think this should eventually result in the copy mirrored on dockerhub to also be updated, if I understand how things work. Only the github `README.md` has been updated, not the `README.template` used for src and bin packages because presumably if you are reading from either of those you are just going to run locally and so the local quickstart is appropriate.	2020-09-02 01:17:34 -07:00
Vadim Ogievetsky	e81a9df507	Web console: add tile for Azure Event Hubs (via Kafka API) (#10317 ) * Add Azure Event Hubs * better note * update icon	2020-08-31 20:58:52 -07:00
Clint Wylie	475d86a4f7	split up Expr.java (#10333 )	2020-08-31 12:51:53 -07:00
Gian Merlino	8ab1979304	Remove implied profanity from error messages. (#10270 ) i.e. WTF, WTH.	2020-08-28 11:38:50 -07:00
Gian Merlino	5cd7610fb6	SQL support for union datasources. (#10324 ) * SQL support for union datasources. Exposed via the "UNION ALL" operator. This means that there are now two different implementations of UNION ALL: one at the top level of a query that works by concatenating subquery results, and one at the table level that works by creating a UnionDataSource. The SQL documentation is updated to discuss these two use cases and how they behave. Future work could unify these by building support for a native datasource that represents the union of multiple subqueries. (Today, UnionDataSource can only represent the union of tables, not subqueries.) * Fixes. * Error message for sanity check. * Additional test fixes. * Add some error messages.	2020-08-28 07:57:06 -07:00
Jihoon Son	f82fd22fa7	Move tools for indexing to TaskToolbox instead of injecting them in constructor (#10308 ) * Move tools for indexing to TaskToolbox instead of injecting them in constructor * oops, other changes * fix test * unnecessary new file * fix test * fix build	2020-08-26 17:08:12 -07:00
Gian Merlino	21703d81ac	Fix handling of 'join' on top of 'union' datasources. (#10318 ) * Fix handling of 'join' on top of 'union' datasources. The problem is that unions are typically rewritten into a series of individual queries on the underlying tables, but this isn't done when the union is wrapped in a join. The main changes are in UnionQueryRunner: 1) Replace an instanceof UnionQueryRunner check with DataSourceAnalysis. 2) Replace a "query.withDataSource" call with a new function, "Queries.withBaseDataSource". Together, these enable UnionQueryRunner to "see through" a join. * Tests. * Adjust heap sizes for integration tests. * Different approach, more tests. * Tweak. * Styling.	2020-08-26 14:23:54 -07:00
Jihoon Son	b9ff3483ac	Add support for all partitioing schemes for auto compaction (#10307 ) * Add support for all partitioing schemes for auto compaction * annotate last compaction state for multi phase parallel indexing * fix build and tests * test * better home	2020-08-26 13:19:18 -07:00
Fernando	69d8645425	Adding supported compression formats for native batch ingestion (#10306 ) * Adding supported compression formats for native batch ingestion * Update docs/ingestion/native-batch.md Co-authored-by: sthetland <steve.hetland@imply.io> * fix spellcheck Co-authored-by: Suneet Saldanha <suneet@apache.org> Co-authored-by: sthetland <steve.hetland@imply.io>	2020-08-26 12:39:48 -07:00
Abhishek Agarwal	d4ac62f284	Handle internal kinesis sequence numbers when reporting lag (#10315 ) * Handle internal kinesis sequence numbers when reporting lag * add unit test	2020-08-26 11:27:37 -07:00
Clint Wylie	ab60661008	refactor internal type system (#9638 ) * better type tracking: add typed postaggs, finalized types for agg factories * more javadoc * adjustments * transition to getTypeName to be used exclusively for complex types * remove unused fn * adjust * more better * rename getTypeName to getComplexTypeName * setup expression post agg for type inference existing * more javadocs * fixup * oops * more test * more test * more comments/javadoc * nulls * explicitly handle only numeric and complex aggregators for incremental index * checkstyle * more tests * adjust * more tests to showcase difference in behavior * timeseries longsum array	2020-08-26 10:53:44 -07:00
Suneet Saldanha	a9de00d43a	Remove NUMERIC_HASHING_THRESHOLD (#10313 ) * Make NUMERIC_HASHING_THRESHOLD configurable Change the default numeric hashing threshold to 1 and make it configurable. Benchmarks attached to this PR show that binary searches are not more faster than doing a set contains check. The attached flamegraph shows the amount of time a query spent in the binary search. Given the benchmarks, we can expect to see roughly a 2x speed up in this part of the query which works out to ~ a 10% faster query in this instance. * Remove NUMERIC_HASHING_THRESHOLD * Remove stale docs	2020-08-25 20:05:39 -07:00

1 2 3 4 5 ...

10583 Commits All Branches Search

10583 Commits

All Branches