druid

Commit Graph

Author	SHA1	Message	Date
Himanshu	ed322a4beb	remove size from default analysisTypes list for segmentMetadata query (#3773 )	2016-12-13 18:01:21 -08:00
Slim	7b18fb79e0	as per @itaiy suggested will close then try to connect. (#3669 ) * as per @itaiy suggested will close then try to connect. * use close instead of flush * git fix comments * break the loop in case of interrupted	2016-12-13 23:50:28 +05:30
Ninglin Du	469ab21091	[Feature] Thrift support for realtime and batch ingestion (#3418 ) * Thrift ingestion plugin 1. thrift binary is platform dependent, use scrooge to generate java files to avoid style check failure 2. stream and hadoop ingesion are both supported, input format can be sequence file and lzo thrift block file. 3. base64 and protocol aware change header * fix conlicts in pom	2016-12-13 10:05:15 -08:00
John Zhang	48b22e261a	support atomic writes for local deep storage (#3521 ) * Use atomic writes for local deep storage * fix pr issues * use defaultObjMapper for test * move tmp pushes to a intermediate dir * minor refactor	2016-12-13 10:03:22 -08:00
kaijianding	4be3eb0ce7	report message gap, source gap and sink count in RealtimePlumber (#3744 ) * report message gap, source gap and sink count in RealtimePlumber * report message gap, sink count in Appenderator * add ingest/events/sourceGap in metrics.md * remove source gap	2016-12-13 11:23:02 -06:00
Gleb Smirnov	07384d6f40	Update Apache curator to a non-leaky version (see CURATOR-354) (#3769 )	2016-12-12 09:52:40 -08:00
David Lim	0b9dff0bc1	fix worker thread pool exhaustion bug (#3760 ) * fix worker thread pool exhaustion bug * code review changes * code review changes	2016-12-09 15:23:11 -08:00
David Lim	7f087cdd3b	allow Kafka consumer group.id to be overriden by config (#3765 )	2016-12-08 15:53:13 -08:00
Akash Dwivedi	6386e6a4dc	root and java-util pom cleanup (#3764 ) * Remove bytebuffer-collections dependency from the root pom and java-util pom cleanup. * Remove json-smart exclusion from root pom	2016-12-08 11:30:19 -08:00
Jonathan Wei	880a021a7a	Fix missed travis failures from PR 3567 and 2798 (#3761 ) * Fix checkstyle failures from PR 3567 * Fix GranularityPathSpecTest compile failure	2016-12-07 19:07:31 -08:00
Navis Ryu	f794246ec1	Trimming out outside of given interval (#2798 ) * Trimming out outside of given interval (Fix for #2659) * addressed comments	2016-12-07 18:05:50 -08:00
Erik Dubbelboer	bb9e35e1af	Add Greatest and Least post aggregations (#3567 )	2016-12-07 17:58:23 -08:00
Gian Merlino	943982b7b0	Configurable HTTP compression. (#3759 ) * Configurable HTTP compression. * Call real-time nodes real-time processes in docs.	2016-12-07 17:40:39 -08:00
Roman Leventov	d9807835f9	Add BitmapIterationBenchmark and update to JMH 1.17.2 (#3751 )	2016-12-07 15:50:33 -08:00
Roman Leventov	949e65165c	Bitset iteration optimization and improve safety (#3753 ) * Deduplicate looking for bitset.nextSetBit() in BitSetIterator.next() and hasNext() * Add BitmapIterationTest * More elaborate comment on why Roaring is not tested in BitmapIterationTest	2016-12-07 15:49:16 -08:00
Navis Ryu	87c61fa749	Refactor boolean cast code, add tests (#3016 )	2016-12-07 13:10:39 -08:00
Roman Leventov	70e83bea6d	Fix PathChildrenCache's ExecutorService leak (#3726 ) * Fix PathChildrenCache's executorService leak in Announcer, CuratorInventoryManager and RemoteTaskRunner * Use a single ExecutorService for all workerStatusPathChildrenCaches in RemoteTaskRunner	2016-12-07 13:00:10 -08:00
Roman Leventov	dc8f814acc	Optimize Iterator<ImmutableBitmap> implementation inside Filters.matchPredicate() so that it doesn't emit empty bitmap in the end of the iteration, and make it to follow Iterator contract, that is throw NoSuchElementException from next() if there are no more bitmaps (#3754 )	2016-12-07 12:54:09 -08:00
Nishant	361af4c94f	Wait for any pending realtime task to complete before disabling datasource (#3757 ) Noticed this in our internal testing, sometimes realtime index tasks in kafkaIndexing service can get stuck waiting for handoff if datasource is disabled before there task completion. This is a workaround to ensure integration tests do not hit this case until https://github.com/druid-io/druid/issues/1729 is fixed.	2016-12-07 10:17:16 -08:00
Himanshu	06d0ef9c6c	allow and load extensions with absolute paths in druid.extensions.loadList (#3747 )	2016-12-06 17:40:23 -08:00
Jonathan Wei	d1896a2d62	Disable flush after every ObjectMapper write (#3748 )	2016-12-06 16:45:23 -08:00
Gian Merlino	b1bac9f2d3	groupBy v2: Ignore timestamp completely when granularity = all, except for the final merge. (#3740 ) * GroupByBenchmark: Add serde, spilling, all-gran benchmarks. Also use more iterations. * groupBy v2: Ignore timestamp completely when granularity = all, except for the final merge. Specifically: - Remove timestamp from RowBasedKey when not needed - Set timestamp to null in MapBasedRows that are not part of the final merge.	2016-12-06 16:17:32 -08:00
kaijianding	f995b1426f	retry loadSegment with all locations (#3681 )	2016-12-06 12:00:59 -08:00
Himanshu	5440a06b2d	make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured (#3700 ) * make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured * Revert "make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured" This reverts commit 54f5644054626d4a9e2448bb4bd5e6ce9a9fca1d. * make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured	2016-12-06 10:22:04 -08:00
Himanshu	45da7e48f1	groupBy sort results by (dimensions,timestamp) instead of (timestamp,dimension) (#3672 ) * sortByDimsFirst flag for groupBy query * Remove need for KeyType in Grouper<KeyType> to be Comparable<KeyType> * fix review comments * fix review comments regarding removing code duplication of dim/time comparison * move comparator for KeyType object to KeySerdeFactory so that creation of comparator does not need KeySerde * remove unnecessary system.out.println * make access static var NATURAL_NULLS_FIRST directly * further review comments addressing	2016-12-06 09:48:56 -08:00
Niketh Sabbineni	d904c79081	Normalized Cost Balancer (#3632 ) * Normalized Cost Balancer * Adding documentation and renaming to use diskNormalizedCostBalancer * Remove balancer from the strings * Update docs and include random cost balancer * Fix checkstyle issues	2016-12-05 17:18:20 -08:00
Navis Ryu	c74d267f50	Support virtual column for select query (#2511 ) * Support virtual column for select query * Addressed comments	2016-12-05 15:14:35 -08:00
Gian Merlino	4e67dd28c0	RemoteTaskRunnerConfig: Fix Guice error on startup. (#3737 )	2016-12-06 00:19:53 +05:30
Gian Merlino	ff42058453	Expressions: Allow escapes in quoted identifiers. (#3735 )	2016-12-06 00:17:55 +05:30
Gian Merlino	b64e06704e	Fix SingleScanTimeDimSelector when an extractionFn returns null for a timestamp. (#3732 )	2016-12-02 15:27:54 -08:00
Gian Merlino	f4cc8c2b2f	IndexBuilder: Close IncrementalIndex when done. (#3734 )	2016-12-02 16:56:34 -06:00
Gian Merlino	353fee79dd	Add "asMillis" option to "timeFormat" extractionFn. (#3733 ) This is useful for chaining extractionFns that all want to treat time as millis, such as having a javascript extractionFn after a timeFormat.	2016-12-02 13:45:16 -08:00
Gian Merlino	102375d9bb	Add "strlen" extractionFn. (#3731 )	2016-12-02 12:08:51 -08:00
Gian Merlino	4c5d10f8a3	Add DimFilterHavingSpec. (#3727 ) * Add DimFilterHavingSpec. * Add test for DimFilterHavingSpec with extractionFns.	2016-12-02 10:04:30 -08:00
Gian Merlino	a8069f2441	Retry on dataSource metadata CAS failures where retrying might help. (#3728 ) This retries when the start condition is met but SELECT -> INSERT/UPDATE fails, which indicates a race. If the start condition isn't met, there won't be any retrying.	2016-12-01 11:50:15 -07:00
Charles Allen	27ab23ef44	Don't update segment metadata if archive doesn't move anything (#3476 ) * Don't update segment metadata if archive doesn't move anything * Fix restore task to handle potential null values * Don't try to update empty metadata * Address review comments * Move to druid-io java-util	2016-12-01 07:49:28 -08:00
Gian Merlino	68735829ca	Add, fix equals, hashCode, toString on various classes. (#3723 ) * TimeFormatExtractionFn: Add toString. * InDimFilter: Add toString, allow accepting any Collection of values. * DimensionTopNMetricSpec: Fix toString. * InvertedTopNMetricSpec: Add toString. * HyperUniqueFinalizingPostAggregator: Add equals, hashCode, toString.	2016-11-30 19:00:14 -08:00
Gian Merlino	477e0cab7c	Filter fixes and tests (#3724 ) * More robust Filter tests. All Filter tests now exercise the CNF and post-filtering features. * Fixes to RowBasedValueMatcherFactory and to bound filters. - Change Comparables to Strings in ValueMatcher related code. - Break out RowBasedValueMatcherFactory, fix a variety of issues around nulls, and add tests. - Fix bound filters on long columns with non-numeric bounds, and add tests.	2016-11-30 16:10:05 -08:00
Gian Merlino	e4465e63bd	Fix ordering of sections on dimensionspecs.md. (#3722 ) The Filtered and List DimensionSpecs were mixed in with the extraction functions.	2016-11-29 16:28:36 -08:00
Parag Jain	877992fe63	use sync kafka producer for deterministic test (#3721 )	2016-11-29 10:04:51 -08:00
Niketh Sabbineni	2640d170c3	Blacklist workers if they fail for too many times (#3643 ) * Blacklist workers if they fail for too many times * Adding documentation * Changing to timeout to period and updating docs * 1. Add configurable maxPercentageBlacklistWorkers 2. Rename variable * Change maxPercentageBlacklistWorkers to double * Remove thread.sleep	2016-11-29 12:38:56 +05:30
Gian Merlino	6922d684bf	GroupBy: Validation of output names, and a gross hack for v1 subqueries. (#3686 ) v1 subqueries try to use aggregators to "transfer" values from the inner results to an incremental index, but aggregators can't transfer all kinds of values (strings are a common one). This is a workaround that selectively ignores what the outer aggregators ask for and instead assumes that we know best. These are in the same commit because the name validation changed the kinds of errors that were thrown by v1 subqueries.	2016-11-29 12:35:03 +05:30
Erik Dubbelboer	9f7050e221	Fix some grammar and spelling mistakes (#3717 )	2016-11-28 11:49:30 -08:00
Roman Leventov	c070b4a816	Fix concurrency defects, remove unnecessary volatiles (#3701 )	2016-11-22 16:42:28 -08:00
Himanshu	7d37f675ba	fix the documented property name for specifying avro reader schema (#3708 )	2016-11-22 15:02:41 -08:00
Parag Jain	7ee6bb7410	option to reset offest automatically in case of OffsetOutOfRangeException (#3678 ) * option to reset offset automatically in case of OffsetOutOfRangeException if the next offset is less than the earliest available offset for that partition * review comments * refactoring * refactor * review comments	2016-11-21 16:29:46 -06:00
Jonathan Wei	7c63bee7f5	Add mapreduce.job.classloader.system.classes property to 'Other Hadoop Versions' docs (#3706 )	2016-11-18 16:16:50 -08:00
Roman Leventov	7b56cec3b9	Fix resource leaks (#3702 )	2016-11-18 21:21:36 +05:30
Gian Merlino	7e80d1045a	Exercise v2 engine in the groupBy aggregator and multi-value dimension tests. (#3698 ) This also involved some other test changes: - Added a factory.mergeRunners step to AggregationTestHelper's groupBy chain, since the v2 engine does merging there. - Changed test byteBuffer pools from on-heap to off-heap to work around https://github.com/DataSketches/sketches-core/pull/116 for datasketches tests.	2016-11-16 20:02:25 -08:00
Erik Dubbelboer	7d36f540e8	WIP: Add Google Storage support (#2458 ) Also excludes the correct artifacts from #2741	2016-11-16 14:06:45 +05:30

1 2 3 4 5 ...

7582 Commits All Branches Search

7582 Commits

All Branches