druid

Commit Graph

Author	SHA1	Message	Date
Clint Wylie	2e9548d93d	refactor SeekableStreamSupervisor usage of RecordSupplier (#9819 ) * refactor SeekableStreamSupervisor usage of RecordSupplier to reduce contention between background threads and main thread, refactor KinesisRecordSupplier, refactor Kinesis lag metric collection and emitting * fix style and test * cleanup, refactor, javadocs, test * fixes * keep collecting current offsets and lag if unhealthy in background reporting thread * review stuffs * add comment	2020-05-16 14:09:39 -07:00
Alexander Saydakov	522df300c2	Datasketches 1 3 0 (#9880 ) * use the latest datasketches release * new sketch debug print Co-authored-by: AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com>	2020-05-16 14:09:23 -07:00
Joseph Glanville	793f386d6a	Add support for Avro OCF using InputFormat (#9671 ) * Add AvroOCFInputFormat * Support supplying a reader schema in AvroOCFInputFormat * Add docs for Avro OCF input format * Address review comments * Address second round of review	2020-05-16 14:09:12 -07:00
Jihoon Son	46beaa0640	Fix potential resource leak in ParquetReader (#9852 ) * Fix potential resource leak in ParquetReader * add test * never thrown exception * catch potential exceptions	2020-05-16 09:57:12 -07:00
Maytas Monsereenusorn	0a8bf83bc5	Bad plan for table-lookup-lookup join with filter on first lookup and outer limit (#9773 ) * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * Bad plan for table-lookup-lookup join with filter on first lookup and outer limit * address comments * address comments * fix checkstyle * address comments * address comments	2020-05-14 16:56:40 -07:00
zachjsh	80b212fe43	druid.storage.maxListingLength should default to 1000 for s3 (#9858 ) * druid.storage.maxListingLength should default to 1000 for s3 * * Address review comments * * Address review comments * * Address comments	2020-05-14 07:00:51 -07:00
Chi Cao Minh	41cf826928	Console E2E test docs (#9864 )	2020-05-13 16:41:04 -07:00
Suneet Saldanha	d38d77cb3a	Add back FieldMayBeFinal inspection (#9865 )	2020-05-13 16:32:35 -07:00
Suneet Saldanha	b0167295d7	Fail incorrectly constructed join queries (#9830 ) * Fail incorrectly constructed join queries * wip annotation for equals implementations * Add equals tests * fix tests * Actually fix the tests * Address review comments * prohibit Pattern.hashCode()	2020-05-13 14:23:04 -07:00
Clint Wylie	6bc1d1b33f	fix license registry for com.nimbusds lang-tag (#9860 )	2020-05-13 09:18:18 -07:00
Jihoon Son	c06d3f14b1	Add javadoc for stream ingestion integration tests (#9795 )	2020-05-12 08:56:43 -07:00
awelsh93	6f25a84d2e	Add TaskCountStatsMonitor to config docs (#9447 )	2020-05-11 14:08:46 -07:00
Jonathan Wei	16d293d6e0	Directly rewrite filters on RHS join columns into LHS equivalents (#9818 ) * Directly rewrite filters on RHS join columns into LHS equivalents * PR comments * Fix inspection * Revert unnecessary ExprMacroTable change * Fix build after merge * Address PR comments	2020-05-08 23:45:35 -07:00
mcbrewster	28be107a1c	add flag to flattenSpec to keep null columns (#9814 ) * add flag to flattenSpec to keep null columns * remove changes to inputFormat interface * add comment * change comment message * update web console e2e test * move keepNullColmns to JSONParseSpec * fix merge conflicts * fix tests * set keepNullColumns to false by default * fix lgtm * change Boolean to boolean, add keepNullColumns to hash, add tests for keepKeepNullColumns false + true with no nuulul columns * Add equals verifier tests	2020-05-08 21:53:39 -07:00
Clint Wylie	339876b69d	fill out missing test coverage for druid-stats, druid-momentsketch, druid-tdigestsketch postaggs (#9740 ) * postagg test coverage for druid-stats, druid-momentsketch, druid-tdigestsketch and fixes * style fixes * fix comparator for TDigestQuantilePostAggregator	2020-05-07 13:48:33 -07:00
Clint Wylie	267a6cc175	low hanging fruit - presize hash map for DruidSegmentReader (#9836 )	2020-05-07 12:39:14 -07:00
Clint Wylie	2c0746cfab	increase druid-histogram postagg test coverage (#9732 )	2020-05-07 00:10:29 -07:00
Maytas Monsereenusorn	accd710115	Add equivalent test coverage for all RHS join impls (#9831 ) * Add equivalent test coverage for all RHS join impls * address comments	2020-05-06 16:10:41 -07:00
Jihoon Son	6674d721bc	Avoid sorting values in InDimFilter if possible (#9800 ) * Avoid sorting values in InDimFilter if possible * tests * more tests * fix and and or filters * fix build * false and true vector matchers * fix vector matchers * checkstyle * in filter null handling * remove wrong test * address comments * remove unnecessary null check * redundant separator * address comments * typo * tests	2020-05-06 15:26:36 -07:00
sthetland	ce03f31a73	Clarifying workerThreads and a few other nits (#9804 ) * Update data-formats.md Per Suneet, "Since you're editing this file can you also fix the json on line 177 please - it's missing a comma after the }" * Light text cleanup * Removing discussion of sample data, since it's repeated in the data loading tutorial, and not immediately relevant here. * Clarifying accepted values for URI lookup * Update index.md * original quickstart full first pass * original quickstart full first pass * first pass all the way through * straggler * image touchups and finished old tutorial * a bit of finishing up * druid-caffeine-cache ext previously removed * Sample MaxDirectMemorySize value unrealistic * Review comments * fixing links * spell checking gymnastics * workerThreads desc slightly expanded * typo * Typo * Reversing Kafka config order * Changing order of configs for Kinesis * Trying this again: ioConfig then tuningConfig	2020-05-06 09:05:18 -07:00
Suneet Saldanha	1e857c5303	Ignore druid-processing benchmarks in tests (#9821 )	2020-05-06 08:59:48 -07:00
Jihoon Son	964a1fc9df	Remove ParseSpec.toInputFormat() (#9815 ) * Remove toInputFormat() from ParseSpec * fix test	2020-05-05 11:17:57 -07:00
Jihoon Son	c6caae9a24	Fix filtering on boolean values in transformation (#9812 ) * Fix filter on boolean value in Transform * assert * more descriptive test * remove assert * add assert for cached string; disable tests * typo	2020-05-04 18:47:10 -07:00
Alexander Saydakov	844d626738	added number of bins parameter (#9436 ) * added number of bins parameter * addressed review points * test equals Co-authored-by: AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com>	2020-05-04 16:53:09 -07:00
Jian Wang	85dfbb64cb	Update documention for metricCompression (#9811 )	2020-05-03 12:56:48 -07:00
Aleksey Plekhanov	9341ea828a	Fixed flaky BlockingPoolTest.testConcurrentTakeBatch() (#9692 )	2020-05-03 12:54:27 -07:00
BIGrey	ee9a721acc	fix npe in IncrementalIndexReadBenchmark (#9754 ) Co-authored-by: 黄辉 <huanghui.bigrey@bytedance.com>	2020-05-03 12:52:50 -07:00
Jihoon Son	9ab49b34db	Update notice; fix version of druid-query-toolkit (#9799 )	2020-05-02 20:00:43 -07:00
Clint Wylie	9a293d554d	remove UnionMergeRule rules from SQL planner (#9797 )	2020-05-01 12:50:11 -07:00
Jonathan Wei	61295bd002	More Hadoop integration tests (#9714 ) * More Hadoop integration tests * Add missing s3 instructions * Address PR comments * Address PR comments * PR comments * Fix typo	2020-04-30 14:33:01 -07:00
sthetland	c61365c1e0	Druid Quickstart refactor and update (#9766 ) * Update data-formats.md Per Suneet, "Since you're editing this file can you also fix the json on line 177 please - it's missing a comma after the }" * Light text cleanup * Removing discussion of sample data, since it's repeated in the data loading tutorial, and not immediately relevant here. * Update index.md * original quickstart full first pass * original quickstart full first pass * first pass all the way through * straggler * image touchups and finished old tutorial * a bit of finishing up * Review comments * fixing links * spell checking gymnastics	2020-04-30 12:07:28 -07:00
Jihoon Son	39722bd064	Integration tests for stream ingestion with various data formats (#9783 ) * Integration tests for stream ingestion with various data formats * fix npe * better logging; fix tsv * fix tsv * exclude kinesis from travis * some readme	2020-04-29 13:18:01 -07:00
Suneet Saldanha	7510e6e722	Fix potential NPEs in joins (#9760 ) * Fix potential NPEs in joins intelliJ reported issues with potential NPEs. This was first hit in testing with a filter being pushed down to the left hand table when joining against an indexed table. * More null check cleanup * Optimize filter value rewrite for IndexedTable * Add unit tests for LookupJoinable * Add tests for IndexedTableJoinable * Add non null assert for dimension selector * Supress null warning in LookupJoinMatcher * remove some null checks on hot path	2020-04-29 11:03:13 -07:00
Aleksei Chumagin	0642f778fa	changed Preview to Apply (#9757 )	2020-04-29 09:53:25 -07:00
Maytas Monsereenusorn	6bc64b731f	Improve "waiting for tasks complete" logic in integration tests (#9759 ) * improve waiting for tasks complete logic in integration tests * improve waiting for tasks complete logic in integration tests * fix forbidden check	2020-04-29 08:53:45 -07:00
Maytas Monsereenusorn	a107ee3ed2	Fix problem when running single integration test using -Dit.test= (#9778 ) * fix running single it * fix checksyle	2020-04-29 08:53:25 -07:00
James Dalton	b279e04a31	table fix (#9769 )	2020-04-28 11:23:24 -07:00
Francesco Nidito	e7e41e3a36	Adding support for autoscaling in GCE (#8987 ) * Adding support for autoscaling in GCE * adding extra google deps also in gce pom * fix link in doc * remove unused deps * adding terms to spelling file * version in pom 0.17.0-incubating-SNAPSHOT --> 0.18.0-SNAPSHOT * GCEXyz -> GceXyz in naming for consistency * add preconditions * add VisibleForTesting annotation * typos in comments * use StringUtils.format instead of String.format * use custom exception instead of exit * factorize interval time between retries * making literal value a constant * iter all network interfaces * use provided on google (non api) deps * adding missing dep * removing unneded this and use Objects methods instead o 3-way if in hash and comparison * adding import * adding retries around getRunningInstances and adding limit for operation end waiting * refactor GceEnvironmentConfig.hashCode * 0.18.0-SNAPSHOT -> 0.19.0-SNAPSHOT * removing unused config * adding tests to hash and equals * adding nullable to waitForOperationEnd * adding testTerminate * adding unit tests for createComputeService * increasing retries in unrelated integration-test to prevent sporadic failure (hopefully) * reverting queryResponseTemplate change * adding comment for Compute.Builder.build() returning null	2020-04-28 03:13:39 -07:00
Maytas Monsereenusorn	8b78eebdbd	Test reading from empty kafka/kinesis partitions (#9729 ) * add test for stream sequence number returns null * fix checkstyle * add index test for when stream returns null * retrigger test	2020-04-27 10:23:56 -07:00
Jonathan Wei	fe000a9e4b	Adjust string comparators used for ingestion (#9742 ) * Adjust string comparators used for ingestion * Small tweak * Fix inspection, more javadocs * Address PR comment * Add rollup comment * Add ordering test * Fix IncrementaIndexRowCompTest	2020-04-25 13:47:07 -07:00
Clint Wylie	7711f776a0	fix issue where CloseableIterator.flatMap does not close inner CloseableIterator (#9761 ) * fix issue where CloseableIterator.flatMap does not close inner CloseableIterator * more test * style * clarify test	2020-04-24 13:52:50 -07:00
Clint Wylie	fc5383cd00	revert datasketches-java version to 1.1.0-incubating until new version is released (#9751 ) * revert datasketches-java version to 1.1.0-incubating until fix is in place * fix tests * checkstyle	2020-04-24 12:52:12 -07:00
Jihoon Son	7fa72fbf15	Initialize SettableByteEntityReader only when inputFormat is not null (#9734 ) * Lazy initialization of SettableByteEntityReader to avoid NPE * toInputFormat for tsv * address comments * common code	2020-04-24 10:22:51 -07:00
BIGrey	c5bfe36011	Optimize FileWriteOutBytes to avoid high system cpu usage (#9722 ) * optimize FileWriteOutBytes to avoid high sys cpu * optimize FileWriteOutBytes to avoid high sys cpu -- remove IOException * optimize FileWriteOutBytes to avoid high sys cpu -- remove IOException in writeOutBytes.size * Revert "optimize FileWriteOutBytes to avoid high sys cpu -- remove IOException in writeOutBytes.size" This reverts commit `965f7421` * Revert "optimize FileWriteOutBytes to avoid high sys cpu -- remove IOException" This reverts commit `149e08c0` * optimize FileWriteOutBytes to avoid high sys cpu -- avoid IOEception never thrown check * Fix size counting to handle IOE in FileWriteOutBytes + tests * remove unused throws IOException in WriteOutBytes.size() * Remove redundant throws IOExcpetion clauses * Parameterize IndexMergeBenchmark Co-authored-by: huanghui.bigrey <huanghui.bigrey@bytedance.com> Co-authored-by: Suneet Saldanha <suneet.saldanha@imply.io>	2020-04-23 20:18:42 -07:00
Gian Merlino	4087a015e8	Datasource doc structure adjustments. (#9716 ) - Reorder both the datasource and query-execution page orderings to table, lookup, union, inline, query, join. (Roughly increasing order of conceptual "fanciness".) - Add more crosslinks from datasource page to query-execution page: one per datasource type.	2020-04-23 16:04:59 -07:00
Maytas Monsereenusorn	16f5ae4405	Add integration tests for kafka ingestion (#9724 ) * add kafka admin and kafka writer * refactor kinesis IT * fix typo refactor * parallel * parallel * parallel * parallel works now * add kafka it * add doc to readme * fix tests * fix failing test * test * test * test * test * address comments * addressed comments	2020-04-22 10:43:34 -07:00
Gian Merlino	479c290fb9	Add QueryResource to log4j2 template. (#9735 )	2020-04-22 09:18:45 -07:00
Maytas Monsereenusorn	cff39892ba	Fixes intermittent failure in ITAutoCompactionTest (#9739 ) * fix intermittent failure in ITAutoCompactionTest * fix typo * update javadoc	2020-04-21 20:56:17 -07:00
calvinhkf	b146f8a2a7	Align library version (#9636 ) * align JUnitParams version 1.1.1,1.0.4 to 1.1.1 * aligin junit version 4.8.1,4.12 to 4.12 * exclude explicitly specified version	2020-04-21 20:19:38 -07:00
Abhishek Radhakrishnan	8abcbf671d	Fix numbered list formatting in markdown. (#9664 )	2020-04-21 20:18:12 -07:00

10479 Commits

All Branches