druid

Commit Graph

Author	SHA1	Message	Date
David Lim	8eee259629	add documentation on segments generated (#3785 )	2016-12-19 09:41:47 -08:00
Dongkyu Hwangbo	da007ca3c2	Replace caravel with superset (#3780 )	2016-12-16 20:47:52 -08:00
Gian Merlino	dd63f54325	Built-in SQL. (#3682 )	2016-12-16 17:15:59 -08:00
Jonathan Wei	2bfcc8a592	First and Last Aggregator (#3566 ) * add first and last aggregator * add test and fix * moving around * separate aggregator valueType * address PR comment * add finalize inner query and adjust v1 inner indexing * better test and fixes * java-util import fixes * PR comments * Add first/last aggs to ITWikipediaQueryTest	2016-12-16 15:26:40 -08:00
Nishant	93c34d3c3f	Ability to add hadoop config directory via environment variable (#3781 )	2016-12-16 11:19:15 -08:00
Nishant	8cfcb95fbc	Add Filtered and Composing request loggers (#3469 ) * Add Filtered and Composing request loggers Add Filtered and Composite Request loggers - enables users to filter request logs for slow queries. fix test * review comments * review comment * remove unused import	2016-12-16 11:18:32 -08:00
Jihoon Son	5e39578eee	Enable parallel test (#3774 ) * Enable parallel test * Remove unnecessary NotThreadSafe annocation * Randomize the start port when finding available ports * Fix test failure * Change to handle all negatives	2016-12-14 21:05:56 -08:00
Himanshu	ed322a4beb	remove size from default analysisTypes list for segmentMetadata query (#3773 )	2016-12-13 18:01:21 -08:00
Slim	7b18fb79e0	as per @itaiy suggested will close then try to connect. (#3669 ) * as per @itaiy suggested will close then try to connect. * use close instead of flush * git fix comments * break the loop in case of interrupted	2016-12-13 23:50:28 +05:30
Ninglin Du	469ab21091	[Feature] Thrift support for realtime and batch ingestion (#3418 ) * Thrift ingestion plugin 1. thrift binary is platform dependent, use scrooge to generate java files to avoid style check failure 2. stream and hadoop ingesion are both supported, input format can be sequence file and lzo thrift block file. 3. base64 and protocol aware change header * fix conlicts in pom	2016-12-13 10:05:15 -08:00
John Zhang	48b22e261a	support atomic writes for local deep storage (#3521 ) * Use atomic writes for local deep storage * fix pr issues * use defaultObjMapper for test * move tmp pushes to a intermediate dir * minor refactor	2016-12-13 10:03:22 -08:00
kaijianding	4be3eb0ce7	report message gap, source gap and sink count in RealtimePlumber (#3744 ) * report message gap, source gap and sink count in RealtimePlumber * report message gap, sink count in Appenderator * add ingest/events/sourceGap in metrics.md * remove source gap	2016-12-13 11:23:02 -06:00
Gleb Smirnov	07384d6f40	Update Apache curator to a non-leaky version (see CURATOR-354) (#3769 )	2016-12-12 09:52:40 -08:00
David Lim	0b9dff0bc1	fix worker thread pool exhaustion bug (#3760 ) * fix worker thread pool exhaustion bug * code review changes * code review changes	2016-12-09 15:23:11 -08:00
David Lim	7f087cdd3b	allow Kafka consumer group.id to be overriden by config (#3765 )	2016-12-08 15:53:13 -08:00
Akash Dwivedi	6386e6a4dc	root and java-util pom cleanup (#3764 ) * Remove bytebuffer-collections dependency from the root pom and java-util pom cleanup. * Remove json-smart exclusion from root pom	2016-12-08 11:30:19 -08:00
Jonathan Wei	880a021a7a	Fix missed travis failures from PR 3567 and 2798 (#3761 ) * Fix checkstyle failures from PR 3567 * Fix GranularityPathSpecTest compile failure	2016-12-07 19:07:31 -08:00
Navis Ryu	f794246ec1	Trimming out outside of given interval (#2798 ) * Trimming out outside of given interval (Fix for #2659) * addressed comments	2016-12-07 18:05:50 -08:00
Erik Dubbelboer	bb9e35e1af	Add Greatest and Least post aggregations (#3567 )	2016-12-07 17:58:23 -08:00
Gian Merlino	943982b7b0	Configurable HTTP compression. (#3759 ) * Configurable HTTP compression. * Call real-time nodes real-time processes in docs.	2016-12-07 17:40:39 -08:00
Roman Leventov	d9807835f9	Add BitmapIterationBenchmark and update to JMH 1.17.2 (#3751 )	2016-12-07 15:50:33 -08:00
Roman Leventov	949e65165c	Bitset iteration optimization and improve safety (#3753 ) * Deduplicate looking for bitset.nextSetBit() in BitSetIterator.next() and hasNext() * Add BitmapIterationTest * More elaborate comment on why Roaring is not tested in BitmapIterationTest	2016-12-07 15:49:16 -08:00
Navis Ryu	87c61fa749	Refactor boolean cast code, add tests (#3016 )	2016-12-07 13:10:39 -08:00
Roman Leventov	70e83bea6d	Fix PathChildrenCache's ExecutorService leak (#3726 ) * Fix PathChildrenCache's executorService leak in Announcer, CuratorInventoryManager and RemoteTaskRunner * Use a single ExecutorService for all workerStatusPathChildrenCaches in RemoteTaskRunner	2016-12-07 13:00:10 -08:00
Roman Leventov	dc8f814acc	Optimize Iterator<ImmutableBitmap> implementation inside Filters.matchPredicate() so that it doesn't emit empty bitmap in the end of the iteration, and make it to follow Iterator contract, that is throw NoSuchElementException from next() if there are no more bitmaps (#3754 )	2016-12-07 12:54:09 -08:00
Nishant	361af4c94f	Wait for any pending realtime task to complete before disabling datasource (#3757 ) Noticed this in our internal testing, sometimes realtime index tasks in kafkaIndexing service can get stuck waiting for handoff if datasource is disabled before there task completion. This is a workaround to ensure integration tests do not hit this case until https://github.com/druid-io/druid/issues/1729 is fixed.	2016-12-07 10:17:16 -08:00
Himanshu	06d0ef9c6c	allow and load extensions with absolute paths in druid.extensions.loadList (#3747 )	2016-12-06 17:40:23 -08:00
Jonathan Wei	d1896a2d62	Disable flush after every ObjectMapper write (#3748 )	2016-12-06 16:45:23 -08:00
Gian Merlino	b1bac9f2d3	groupBy v2: Ignore timestamp completely when granularity = all, except for the final merge. (#3740 ) * GroupByBenchmark: Add serde, spilling, all-gran benchmarks. Also use more iterations. * groupBy v2: Ignore timestamp completely when granularity = all, except for the final merge. Specifically: - Remove timestamp from RowBasedKey when not needed - Set timestamp to null in MapBasedRows that are not part of the final merge.	2016-12-06 16:17:32 -08:00
kaijianding	f995b1426f	retry loadSegment with all locations (#3681 )	2016-12-06 12:00:59 -08:00
Himanshu	5440a06b2d	make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured (#3700 ) * make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured * Revert "make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured" This reverts commit 54f5644054626d4a9e2448bb4bd5e6ce9a9fca1d. * make sure CliCoordinator initializes and starts DerbyMetadataStorage first if configured	2016-12-06 10:22:04 -08:00
Himanshu	45da7e48f1	groupBy sort results by (dimensions,timestamp) instead of (timestamp,dimension) (#3672 ) * sortByDimsFirst flag for groupBy query * Remove need for KeyType in Grouper<KeyType> to be Comparable<KeyType> * fix review comments * fix review comments regarding removing code duplication of dim/time comparison * move comparator for KeyType object to KeySerdeFactory so that creation of comparator does not need KeySerde * remove unnecessary system.out.println * make access static var NATURAL_NULLS_FIRST directly * further review comments addressing	2016-12-06 09:48:56 -08:00
Niketh Sabbineni	d904c79081	Normalized Cost Balancer (#3632 ) * Normalized Cost Balancer * Adding documentation and renaming to use diskNormalizedCostBalancer * Remove balancer from the strings * Update docs and include random cost balancer * Fix checkstyle issues	2016-12-05 17:18:20 -08:00
Navis Ryu	c74d267f50	Support virtual column for select query (#2511 ) * Support virtual column for select query * Addressed comments	2016-12-05 15:14:35 -08:00
Gian Merlino	4e67dd28c0	RemoteTaskRunnerConfig: Fix Guice error on startup. (#3737 )	2016-12-06 00:19:53 +05:30
Gian Merlino	ff42058453	Expressions: Allow escapes in quoted identifiers. (#3735 )	2016-12-06 00:17:55 +05:30
Gian Merlino	b64e06704e	Fix SingleScanTimeDimSelector when an extractionFn returns null for a timestamp. (#3732 )	2016-12-02 15:27:54 -08:00
Gian Merlino	f4cc8c2b2f	IndexBuilder: Close IncrementalIndex when done. (#3734 )	2016-12-02 16:56:34 -06:00
Gian Merlino	353fee79dd	Add "asMillis" option to "timeFormat" extractionFn. (#3733 ) This is useful for chaining extractionFns that all want to treat time as millis, such as having a javascript extractionFn after a timeFormat.	2016-12-02 13:45:16 -08:00
Gian Merlino	102375d9bb	Add "strlen" extractionFn. (#3731 )	2016-12-02 12:08:51 -08:00
Gian Merlino	4c5d10f8a3	Add DimFilterHavingSpec. (#3727 ) * Add DimFilterHavingSpec. * Add test for DimFilterHavingSpec with extractionFns.	2016-12-02 10:04:30 -08:00
Gian Merlino	a8069f2441	Retry on dataSource metadata CAS failures where retrying might help. (#3728 ) This retries when the start condition is met but SELECT -> INSERT/UPDATE fails, which indicates a race. If the start condition isn't met, there won't be any retrying.	2016-12-01 11:50:15 -07:00
Charles Allen	27ab23ef44	Don't update segment metadata if archive doesn't move anything (#3476 ) * Don't update segment metadata if archive doesn't move anything * Fix restore task to handle potential null values * Don't try to update empty metadata * Address review comments * Move to druid-io java-util	2016-12-01 07:49:28 -08:00
Gian Merlino	68735829ca	Add, fix equals, hashCode, toString on various classes. (#3723 ) * TimeFormatExtractionFn: Add toString. * InDimFilter: Add toString, allow accepting any Collection of values. * DimensionTopNMetricSpec: Fix toString. * InvertedTopNMetricSpec: Add toString. * HyperUniqueFinalizingPostAggregator: Add equals, hashCode, toString.	2016-11-30 19:00:14 -08:00
Gian Merlino	477e0cab7c	Filter fixes and tests (#3724 ) * More robust Filter tests. All Filter tests now exercise the CNF and post-filtering features. * Fixes to RowBasedValueMatcherFactory and to bound filters. - Change Comparables to Strings in ValueMatcher related code. - Break out RowBasedValueMatcherFactory, fix a variety of issues around nulls, and add tests. - Fix bound filters on long columns with non-numeric bounds, and add tests.	2016-11-30 16:10:05 -08:00
Gian Merlino	e4465e63bd	Fix ordering of sections on dimensionspecs.md. (#3722 ) The Filtered and List DimensionSpecs were mixed in with the extraction functions.	2016-11-29 16:28:36 -08:00
Parag Jain	877992fe63	use sync kafka producer for deterministic test (#3721 )	2016-11-29 10:04:51 -08:00
Niketh Sabbineni	2640d170c3	Blacklist workers if they fail for too many times (#3643 ) * Blacklist workers if they fail for too many times * Adding documentation * Changing to timeout to period and updating docs * 1. Add configurable maxPercentageBlacklistWorkers 2. Rename variable * Change maxPercentageBlacklistWorkers to double * Remove thread.sleep	2016-11-29 12:38:56 +05:30
Gian Merlino	6922d684bf	GroupBy: Validation of output names, and a gross hack for v1 subqueries. (#3686 ) v1 subqueries try to use aggregators to "transfer" values from the inner results to an incremental index, but aggregators can't transfer all kinds of values (strings are a common one). This is a workaround that selectively ignores what the outer aggregators ask for and instead assumes that we know best. These are in the same commit because the name validation changed the kinds of errors that were thrown by v1 subqueries.	2016-11-29 12:35:03 +05:30
Erik Dubbelboer	9f7050e221	Fix some grammar and spelling mistakes (#3717 )	2016-11-28 11:49:30 -08:00

7789 Commits