Gian Merlino
7c0b1dde3a
DimensionPredicateFilter: Skip unnecessary dictionary lookup. ( #2850 )
2016-04-18 12:38:25 -07:00
Jonathan Wei
b534f7203c
Fix performance regression from #2753 in IndexMerger ( #2841 )
2016-04-14 21:39:41 -07:00
Jonathan Wei
a26134575b
Fix NPE in TopNLexicographicResultBuilder.addEntry() ( #2835 )
2016-04-13 17:27:16 -07:00
Fangjin Yang
abd951df1a
Document how to use roaring bitmaps ( #2824 )
...
* Document how to use roaring bitmaps
This fixes #2408 .
While not all indexSpec properties are explained, it does explain how roaring bitmaps can be turned on.
* fix
* fix
* fix
* fix
2016-04-12 19:28:02 -07:00
michaelschiff
db35dd7508
fix issue #2744 . Check for null before combining metrics ( #2774 )
2016-04-12 14:46:31 -07:00
Nishant
1bf1dd03a0
Merge pull request #2812 from mrijke/fix-missing-equals-hashcode-filters
...
Add missing equals/hashcode to JS, Regex and SearchQuery DimFilters
2016-04-12 12:00:23 +05:30
Charles Allen
21e406613c
Merge pull request #2809 from metamx/fix2694
...
Fix test for snapshot taker to better check for lookup perist failure
2016-04-11 14:52:47 -07:00
Maarten Rijke
de68d6b7c4
Add missing equals/hashcode to JS, Regex and SearchQuery DimFilters
...
This commits adds missing equals() and hashcode() methods to
the JavascriptDimFilter, RegexDimFilter and the SearchQueryDimFilter.
2016-04-11 12:16:24 +02:00
Nishant
bbb326decf
Merge pull request #2799 from b-slim/fix_snapshot
...
MapLookupFactory need to be Ser/Desr ready.
2016-04-07 13:22:34 +05:30
Slim Bouguerra
bf1eafc4e1
remove all the mock lookupFactory
2016-04-06 15:37:52 -05:00
Slim Bouguerra
59eb2490a0
MapLookupFactory need to be Ser/Desr.
2016-04-06 15:02:18 -05:00
Charles Allen
f915a59138
Merge pull request #2691 from metamx/lookupExtrFn
...
Add ExtractionFn to LookupExtractor bridge
2016-04-06 09:13:08 -07:00
jon-wei
051fd6c0eb
Remove extra println from InFilter
2016-04-05 14:55:49 -07:00
Fangjin Yang
289bb6f885
Merge pull request #2690 from jon-wei/filter_support
...
Allow filters to use extraction functions
2016-04-05 15:40:15 -06:00
jon-wei
0e481d6f93
Allow filters to use extraction functions
2016-04-05 13:24:56 -07:00
Gian Merlino
e060a9f283
Additional ExtractionFn null-handling adjustments.
...
Followup to comments on #2771 .
2016-04-01 18:35:26 -07:00
Fangjin Yang
18b9ea62cf
Merge pull request #2771 from gianm/extractionfn-stuff
...
Various ExtractionFn null handling fixes.
2016-04-01 16:35:46 -07:00
Gian Merlino
23d66e5ff9
Merge pull request #2765 from navis/invalid-encode-nullstring
...
Null string is encoded as "null" in incremental index
2016-04-01 14:43:40 -07:00
Gian Merlino
b6e4d8b2c1
Various ExtractionFn null handling fixes.
...
- JavaScriptExtractionFn shouldn't pass empty strings to its JS functions
- Upper/LowerExtractionFn properly handles null Objects (DimExtractionFn's implementation works here)
- MatchingDimExtractionFn properly returns nulls rather than empties
- RegexDimExtractionFn properly attempts matching on nulls and empties
- SearchQuerySpecDimExtractionFn properly returns nulls when passed empties
2016-04-01 14:34:47 -07:00
Fangjin Yang
eea7a47870
Merge pull request #2576 from navis/paging-from-next
...
Add option for select query to get next page without modifying returned paging identifiers
2016-04-01 13:50:36 -07:00
Fangjin Yang
4eb5a2c4f1
Merge pull request #2715 from navis/stringformat-null-handling
...
stringFormat extractionFn should be able to return null on null values (Fix for #2706 )
2016-04-01 13:45:28 -07:00
Gian Merlino
23364a47fd
BaseFilterTest: Test optimized filters too.
2016-04-01 12:44:59 -07:00
navis.ryu
077522a46f
stringFormat extractionFn should be able to return null on null values (Fix for #2706 )
2016-04-01 13:40:56 +09:00
navis.ryu
f0e55f5d31
Null string is encoded as "null" in incremental index
2016-04-01 09:47:15 +09:00
navis.ryu
29bb00535b
Add option for select query to get next page without modifying returned paging identifiers
2016-04-01 09:03:03 +09:00
Gian Merlino
5f9240fcbc
Merge pull request #2577 from navis/native-in-filter
...
Implement native in filter
2016-03-30 20:02:54 -07:00
Fangjin Yang
3d68da94fe
Merge pull request #2661 from navis/utf8-estimated-length
...
Utility method for length estimation of utf8
2016-03-30 19:56:14 -07:00
navis.ryu
108535fd07
Implement native in filter (Fix for #2577 )
2016-03-31 10:10:57 +09:00
navis.ryu
e0cfd9ee19
Utility method for length estimation of utf8
2016-03-31 10:07:00 +09:00
jon-wei
5503bf1b38
Remove unnecessary type check in TimeAndDimsComp
2016-03-30 17:54:15 -07:00
Fangjin Yang
95733a362f
Merge pull request #2753 from gianm/null-filtering-multi-value-columns
...
More consistent empty-set filtering behavior on multi-value columns.
2016-03-29 18:52:25 -07:00
Charles Allen
95d42cfd9e
Merge pull request #2758 from pjain1/fix_npe_in_filter
...
handle null values in In Filter
2016-03-29 17:53:02 -07:00
Gian Merlino
1853f36e9f
More consistent empty-set filtering behavior on multi-value columns.
...
The behavior is now that filters on "null" will match rows with no
values. The behavior in the past was inconsistent; sometimes these
filters would match and sometimes they wouldn't.
Adds tests for this behavior to SelectorFilterTest and
BoundFilterTest, for query-level filters and filtered aggregates.
Fixes #2750 .
2016-03-29 15:32:13 -07:00
Parag Jain
d892918a3d
handle null values in In Filter
2016-03-29 17:03:26 -05:00
Fangjin Yang
e023df2b92
Merge pull request #2754 from gianm/i-dont-get-it
...
Remove error suppression code from IncrementalIndexAdapter.
2016-03-28 19:29:53 -07:00
Gian Merlino
c7ff0d698e
Remove error suppression code from IncrementalIndexAdapter.
2016-03-28 18:40:27 -07:00
fjy
c418a55638
cleanup distinct count agg
2016-03-28 17:29:41 -07:00
Fangjin Yang
9cb197adec
Merge pull request #2722 from himanshug/fix_hadoop_jar_upload
...
config to explicitly specify classpath for hadoop container during hadoop ingestion
2016-03-28 14:49:03 -07:00
Charles Allen
4a98c4fbac
Fix LookupExtractionFn equals and hashCode
2016-03-28 13:14:43 -07:00
Charles Allen
0ee861d0da
Add ExtractionFn to LookupExtractor bridge
2016-03-28 13:14:43 -07:00
Fangjin Yang
7fe277e6da
Merge pull request #2727 from gianm/optimize-bound-filter
...
BoundFilter optimizations, and related interface changes.
2016-03-26 18:59:05 -07:00
Fangjin Yang
0dae28b6af
Merge pull request #2729 from jon-wei/fix_hyperunique_comparator
...
Fix HyperUniquesAggregatorFactory comparator
2016-03-26 15:39:35 -07:00
Gian Merlino
2970b49adc
BoundFilter optimizations, and related interface changes.
...
BoundFilter:
- For lexicographic bounds, use bitmapIndex.getIndex to find the start and end points,
then union all bitmaps between those points.
- For alphanumeric bounds, iterate through dimValues, and union all bitmaps for values
matching the predicate.
- Change behavior for nulls: it used to be that the BoundFilter would never match nulls,
now it matches nulls if "" is allowed by the lower limit and not excluded by the
upper limit.
Interface changes:
- BitmapIndex: add `int getIndex(value)` to make it possible to get the index for a
value without retrieving the bitmap.
- BitmapIndex: remove `ImmutableBitmap getBitmap(value)`, change callers to `getBitmap(getIndex(value))`.
- BitmapIndexSelector: allow retrieving the underlying BitmapIndex through getBitmapIndex.
- Clarified contract of indexOf in Indexed, GenericIndexed.
Also added tests for SelectorFilter, NotFilter, and BoundFilter.
2016-03-25 14:11:48 -07:00
jon-wei
9afaa2b94a
Fix HyperUniquesAggregatorFactory comparator
2016-03-25 12:36:42 -07:00
Gian Merlino
4ac9e03161
Fix predicate-based ValueMatcher behavior for IncrementalIndex on missing columns.
...
Missing columns should be treated the same as columns containing 100% nulls.
2016-03-25 10:23:59 -07:00
Himanshu Gupta
e78a469fb7
UTs for ExtensionsConfig
2016-03-25 10:51:28 -05:00
Himanshu Gupta
004b00bb96
config to explicitly specify classpath for hadoop container during hadoop ingestion
2016-03-25 10:51:28 -05:00
Nishant
0b03c9405f
Merge pull request #2614 from sirpkt/calendric_gran
...
Support week, month, quarter, and year in query granularity
2016-03-24 16:21:01 -07:00
Himanshu
56343c6cdc
Merge pull request #2704 from navis/simple-optimize
...
optimize single elemented and/or filter
2016-03-24 16:13:48 -05:00
Gian Merlino
713062053c
Filters: Add filter.toFilter method, use that instead of the instanceof chain in Filters.
...
I believe that the instanceof chain in Filters exists because in the past, Filter
and DimFilter were in different packages (DimFilter was in druid-client and Filter
was in druid-processing). And since druid-client didn't depend on druid-processing,
DimFilter couldn't have a toFilter method. But now it can.
2016-03-23 17:03:49 -07:00
Gian Merlino
dd86198902
All Filters should work with FilteredAggregators.
...
This removes Filter.makeMatcher(ColumnSelectorFactory) and adds a
ValueMatcherFactory implementation to FilteredAggregatorFactory so it can
take advantage of existing makeMatcher(ValueMatcherFactory) implementations.
This patch also removes the Bound-based method from ValueMatcherFactory. Its
only user was the SpatialFilter, which could use the Predicate-based method.
Fixes #2604 .
2016-03-23 12:24:01 -07:00
binlijin
57d78d3293
clean tmp file when index merge fail
2016-03-23 10:55:12 +08:00
navis.ryu
91f6be4884
optimize single elemented and/or filter
2016-03-23 09:29:15 +09:00
Gian Merlino
ff25325f3b
Improved docs for multi-value dimensions.
...
- Add central doc for multi-value dimensions, with some content from other docs.
- Link to multi-value dimension doc from topN and groupBy docs.
- Fixes a broken link from dimensionspecs.md, which was presciently already
linking to this nonexistent doc.
- Resolve inconsistent naming in docs & code (sometimes "multi-valued", sometimes
"multi-value") in favor of "multi-value".
2016-03-22 14:40:55 -07:00
jon-wei
a59c9ee1b1
Support use of DimensionSchema class in DimensionsSpec
2016-03-21 13:12:04 -07:00
Keuntae Park
7f29f2ac3b
support week, month, quarter, year in query granularity
2016-03-21 17:41:53 +09:00
Charles Allen
5da9a280b6
Query Time Lookup - Dynamic Configuration
2016-03-18 09:45:05 -07:00
Gian Merlino
738dcd8cd9
Update version to 0.9.1-SNAPSHOT.
...
Fixes #2462
2016-03-17 10:34:20 -07:00
Slim
cf342d8d3c
Merge pull request #2517 from b-slim/adding_lookup_snapshot_utility
...
[QTL][Lookup] lookup module with the snapshot utility
2016-03-17 11:39:47 -05:00
Slim Bouguerra
0c86b29ef0
lookup module with the snapshot utility
2016-03-17 09:20:41 -05:00
Charles Allen
2ac8a22173
Merge pull request #2579 from metamx/closerIsCloser
...
Make CloserRule use guava's Closer
2016-03-14 17:18:19 -07:00
Charles Allen
a64979463f
Make CloserRule use guava's Closer
2016-03-14 15:01:24 -07:00
Fangjin Yang
06813b510a
Merge pull request #2571 from himanshug/gp_by_avoid_sort
...
avoid sort while doing groupBy merging when possible
2016-03-14 14:46:51 -07:00
Fangjin Yang
dbdbacaa18
Merge pull request #2260 from navis/cardinality-for-searchquery
...
Support cardinality for search query
2016-03-14 13:24:40 -07:00
Slim
8cc3582e70
Merge pull request #2644 from metamx/optimize-timeboundary
...
optimize timeboundary for min or max bound
2016-03-13 13:16:24 -05:00
navis.ryu
be341bf4e3
Support cardinality for search query (Fix for #2260 )
2016-03-12 09:51:01 +09:00
Xavier Léauté
6f0d6ef0e9
optimize timeboundary for min or max bound
2016-03-11 14:11:47 -08:00
Gian Merlino
8a11161b20
Plumbers: Move plumber.add out of try/catch for ParseException.
...
The incremental indexes handle that now so it's not necessary.
Also, add debug logging and more detailed exceptions to the incremental
indexes for the case where there are parse exceptions during aggregation.
2016-03-10 16:39:26 -08:00
Himanshu Gupta
dc0214bddb
while GroupBy merging use unsorted facts in IncrementalIndex wherever possible
2016-03-10 16:11:48 -06:00
Himanshu Gupta
02dfd5cd80
update IncrementalIndex to support unsorted facts map that can be used in groupBy merging to improve performance
2016-03-10 16:11:48 -06:00
Xavier Léauté
90d7409e1a
Merge pull request #2611 from himanshug/gp_by_max_limit
...
only allow lowering maxResults and maxIntermediateRows from groupBy query context
2016-03-10 13:44:13 -08:00
Gian Merlino
a2b1652787
Clarify parser docs.
...
- Clarify what parseSpecs are used for.
- Avro, Protobuf should use timeAndDims parseSpecs.
- Hadoop jobs should use hadoopyString string parsers.
2016-03-10 08:45:04 -08:00
Fangjin Yang
68cffe1d91
Merge pull request #2615 from gianm/timeseries-skipEmptyBuckets-cache
...
Fix caching of skipEmptyBuckets for TimeseriesQuery.
2016-03-09 18:45:59 -08:00
Gian Merlino
708bc674fa
Make specifying query context booleans more consistent.
...
Before, some needed to be strings and some needed to be real booleans. Now
they can all be either one.
2016-03-08 19:38:26 -08:00
Gian Merlino
40dad6dff4
Fix caching of skipEmptyBuckets for TimeseriesQuery.
2016-03-08 19:22:12 -08:00
Himanshu Gupta
ca5de3f583
only allow lowering maxResults and maxIntermediateRows from groupBy query context
2016-03-08 15:03:59 -06:00
Himanshu Gupta
099acb4966
allow groupBy max[Intermediate]Rows limit be overridable by context
2016-03-07 15:22:41 -06:00
Himanshu Gupta
c544ebf25e
reintroducing the safety check removed in commit-1d602be so that dim value ids are less than cardinality
2016-03-03 23:34:23 -06:00
Bingkun Guo
4a58462fc7
update querySegmentSpec when passing query to getQueryRunner
...
After finding the FireChief for a specific partition, Druid will need to find the specific queryRunner for each segment being queried by passing the query to FireChief. Currently Druid is passing the original query that contains all the segments need to be queried, it's possible that fireChief.getQueryRunner(query) returns more than 1 queryRunner because query.getIntervals() is not specific to a single segment.
In this patch, for each segment being queried, Druid will update the query with its corresponding SpecificSegmentSpec.
2016-03-02 16:44:56 -06:00
Nishant
31b502773a
Merge pull request #2480 from navis/pagingfail-over-segments
...
Select query cannot span to next segment with paging
2016-03-01 11:42:41 +05:30
Fangjin Yang
e5c25725c0
Merge pull request #2562 from himanshug/fix_2556
...
with nested GpBy query outer query results need to be further merged
2016-02-29 12:17:33 -08:00
Himanshu Gupta
0722ced413
with GpBy query outer query results need to be further merged
2016-02-29 10:16:25 -06:00
navis.ryu
b1ff920831
Lazily initialize predicate for bound filter
2016-02-29 15:35:52 +09:00
navis.ryu
5f1e60324a
Added more complex test case with versioned segments
2016-02-29 14:48:24 +09:00
navis.ryu
2686bfa394
Select query cannot span to next segment with paging
2016-02-29 00:01:46 +09:00
Fangjin Yang
29d29ba98d
Merge pull request #2263 from jon-wei/flex_dims3
...
Allow IncrementalIndex to store Long/Float dimensions
2016-02-25 17:23:02 -08:00
jon-wei
c17ce02467
Allow IncrementalIndex to store Long/Float dimensions
2016-02-24 13:51:57 -08:00
jon-wei
fd3782522c
Rename 'replaceMissingValues...' parameters in RegexExtractionFn
2016-02-24 13:12:56 -08:00
Nishant
fb7eae34ed
Merge pull request #2249 from metamx/workerExpanded
...
Use Worker instead of ZkWorker whenever possible
2016-02-24 13:23:22 +05:30
Charles Allen
ac13a5942a
Use Worker instead of ZkWorker whenver possible
...
* Moves last run task state information to Worker
* Makes WorkerTaskRunner a TaskRunner which has interfaces to help with getting information about a Worker
2016-02-23 15:02:03 -08:00
Gian Merlino
3534483433
Better handling of ParseExceptions.
...
Two changes:
- Allow IncrementalIndex to suppress ParseExceptions on "aggregate".
- Add "reportParseExceptions" option to realtime tuning configs. By default this is "false".
Behavior of the counters should now be:
- processed: Number of rows indexed, including rows where some fields could be parsed and some could not.
- thrownAway: Number of rows thrown away due to rejection policy.
- unparseable: Number of rows thrown away due to being completely unparseable (no fields salvageable at all).
If "reportParseExceptions" is true then "unparseable" will always be zero (because a parse error would
cause an exception to be thrown). In addition, "processed" will only include fully parseable rows
(because even partial parse failures will cause exceptions to be thrown).
Fixes #2510 .
2016-02-23 10:11:43 -08:00
Fangjin Yang
3bdd757024
Merge pull request #1773 from b-slim/log_details
...
Adding downstream source when throwing QueryInterruptedException
2016-02-22 10:16:07 -08:00
Slim Bouguerra
77925cc061
adding downstream source of QueryInterruptedException
2016-02-20 13:05:14 -06:00
Fangjin Yang
8ee81947cd
Merge pull request #2494 from himanshug/fix_timeseries
...
do not drop post-aggs in TimeseriesQueryToolChest.makePreComputeManipulatorFn
2016-02-20 10:37:32 -08:00
Gian Merlino
d25c46cb9f
Add comparator to HyperUniquesFinalizingPostAggregator.
...
This makes it possible to do groupBys with clauses like "HAVING uniques > 10".
Beforehand you couldn't do it with either an aggregator (because it returns
an HLLV1 which the havingSpec can't understand) or a finalized postaggregator
(because it didn't have a comparator).
Now you can at least do it with a finalizing postaggregator. Trying it with
the aggregator alone still doesn't work.
Added some topN and groupBy tests verifying the comparator, and added an
@Ignore test that should pass if havingSpecs are made work on the aggregator
directly.
2016-02-19 08:36:08 -08:00
Himanshu Gupta
11b0117422
do not drop post-aggs in timeseries query tool chest makePreComputeManipulatorFn like other query types
2016-02-17 20:51:35 -06:00
Jaehong Choi
32b9d57b23
handle a failing UT in GroupByQueryRunnerTest after merging into the master
2016-02-16 16:56:57 +09:00
Jaehong Choi
b25bca85bc
Merge branch 'master' of https://github.com/druid-io/druid into support-alphanumeric-dimensional-sort-in-gropu-by
2016-02-16 16:42:05 +09:00
Jaehong Choi
e89afc901b
delete System.out.println() in test code
2016-02-16 15:26:37 +09:00
Navis Ryu
cd315627c9
Merge pull request #2393 from CHOIJAEHONG1/support-alphanumeric-dimensional-sort-in-gropu-by
...
support alphanumeric sorting for dimensional columns in groupby (#2393 )
2016-02-16 14:11:30 +09:00
Slim
16092eb5e2
Merge pull request #2464 from gianm/print-properties
...
Make startup properties logging optional.
2016-02-14 15:11:35 -06:00
Gian Merlino
e0c049c0b0
Make startup properties logging optional.
...
Off by default, but enabled in the example config files. See also #2452 .
2016-02-12 14:12:16 -08:00
Himanshu Gupta
da5fcd0124
before facts get it , indexAndOffsets should already know about it
2016-02-12 13:32:06 -06:00
Jonathan Wei
d63eec65a1
Merge pull request #2208 from navis/metadataquery-minmax
...
Support min/max values for metadata query
2016-02-11 17:28:07 -08:00
Jonathan Wei
e1b022eac9
Merge pull request #2349 from navis/dimensionspec-for-selectquery
...
Support dimension spec for select query
2016-02-11 16:38:16 -08:00
navis.ryu
dd2375477a
Support min/max values for metadata query ( #2208 )
2016-02-12 09:35:58 +09:00
Gian Merlino
2d037ef05e
Merge pull request #2453 from DreamLab/fix/topn_sorting_anomaly
...
Fix for unstable behavior of HyperLogLog comparator
2016-02-11 16:05:34 -08:00
navis.ryu
4d63196535
Support dimension spec for select query
2016-02-12 08:54:28 +09:00
Himanshu
47d48e1e67
Merge pull request #2452 from gianm/print-properties
...
PropertiesModule: Print properties, processors, totalMemory on startup.
2016-02-11 16:49:34 -06:00
turu
f277a54a5c
removed unsafe heuristics from hll compareTo and provided unit test for regression
2016-02-11 23:46:24 +01:00
Slim
368988d187
Merge pull request #2291 from druid-io/lookupManager
...
Promoting LookupExtractor state and LookupExtractorFactory to be a first class druid state object.
2016-02-11 16:07:27 -06:00
Gian Merlino
29f7758e74
PropertiesModule: Print properties, processors, totalMemory on startup.
2016-02-11 13:51:08 -08:00
Slim Bouguerra
4e119b7a24
Adding lookup ref manager and lookup dimension spec impl
2016-02-11 12:11:51 -06:00
Jaehong Choi
2f2e2ff5b9
support alphanumeric sorting for dimensional columns in groupby
2016-02-11 17:31:28 +09:00
Keuntae Park
05a144e39a
fix crash with filtered aggregator at ingestion time
...
- only for selector filter because extraction filter is not supported as
cardinality is not fixed at ingestion time
2016-02-11 11:25:33 +09:00
Fangjin Yang
b1673ee90e
Merge pull request #2409 from gianm/smq-merged-thing
...
SegmentMetadataQuery: Retain segment id when merging, if possible.
2016-02-08 15:43:39 -08:00
Fangjin Yang
c9c20bb7f3
Merge pull request #2395 from metamx/fixExtractionDimFilterNullTest
...
Actually check cache key null checking in ExtractionDimFilterTest
2016-02-08 14:10:52 -08:00
Gian Merlino
bd9c04244f
SegmentMetadataQuery: Retain segment id when merging, if possible.
...
This is helpful on realtime nodes, where two analyses from two different hydrants
are merged together but they are actually from the same segment.
2016-02-08 13:07:02 -08:00
Himanshu Gupta
9fe1b28ee5
provide configuration to enable usage of Off heap merging for groupBy query
2016-02-05 14:18:06 -06:00
Himanshu Gupta
b40c342cd1
make Global stupid pool cache size configurable
2016-02-05 14:18:06 -06:00
Himanshu Gupta
72a1e730a2
OffheapIncrementalIndex updates to do the aggregation merging off-heap
2016-02-05 14:17:05 -06:00
Himanshu Gupta
907dd77483
OffheapIncrementalIndex a copy/paste of OnheapIncrementalIndex
2016-02-05 14:02:31 -06:00
Charles Allen
aac5f9b2c9
Actually check cache key null checking in ExtractionDimFilterTest
2016-02-04 09:44:13 -08:00
fjy
1aa363cea7
new quickstart
2016-02-04 09:37:38 -08:00
Fangjin Yang
da77591129
Merge pull request #2392 from metamx/fix2391
...
Allow ExtractionDimFilter value to be null
2016-02-03 17:47:14 -08:00
Charles Allen
d4f00096ff
Allow ExtractionDimFilter value to be null
...
* Fixes #2391
2016-02-03 15:51:47 -08:00
Himanshu Gupta
6e7d90cf56
UTs for DefaultLimitSpec
2016-02-03 15:59:12 -06:00
Himanshu Gupta
29e0d7f971
lazily create comparators for row columns when needed
2016-02-03 13:38:20 -06:00
navis.ryu
1d602be0f9
Replace string[] with int[] for dimensions
2016-02-03 15:03:22 +09:00
binlijin
a5ef30ff84
optimize topn on particular situation
2016-02-02 14:20:09 +08:00
Himanshu
93c50d8538
Merge pull request #2094 from navis/simplify-index-merge
...
Simplifying dimension merging
2016-01-29 11:23:14 -06:00
navis.ryu
55a888ea2f
time-descending result of select queries
2016-01-29 10:06:05 +09:00
navis.ryu
dd774ef4dd
one-pass merging of dictionary & index
2016-01-29 10:03:53 +09:00
Himanshu
edd7ce58aa
Merge pull request #2348 from AlexanderSaydakov/fix-aggregator-test-helper
...
fixed createIndex
2016-01-28 16:01:36 -06:00
saydakov
e0860661b1
fixed createIndex
2016-01-28 13:20:50 -08:00
Nishant
99017f4518
Merge pull request #2326 from navis/use-reverse-iterator
...
use reverse-iterator if possible
2016-01-28 19:48:38 +05:30
Nishant
3880f54b87
Merge pull request #2332 from himanshug/configurable_partial
...
make populateUncoveredIntervals a configuration in query context
2016-01-28 10:34:35 +05:30
navis.ryu
7324ece8f9
use reverse-iterator if possible
2016-01-28 09:04:55 +09:00
Xavier Léauté
5a3642bb93
Merge pull request #2247 from metamx/pedanticBuild
...
Enable strict building in travis
2016-01-27 10:27:03 -08:00
Xavier Léauté
2e5004095a
Merge pull request #2341 from gianm/smq-test
...
SegmentMetadataQuery: Fix merging of ColumnAnalysis errors.
2016-01-27 09:37:06 -08:00
Charles Allen
508734c8b0
Long constant reformatting in tests `l` --> `L`
2016-01-27 08:59:19 -08:00
Gian Merlino
b1e6c01762
Make LookupExtractor abstract methods public, they have to work across classloaders.
2016-01-26 23:08:03 -08:00
Gian Merlino
795343f7ef
SegmentMetadataQuery: Fix merging of ColumnAnalysis errors.
...
Also add tests for:
- ColumnAnalysis folding
- Mixed mmap/incremental merging
2016-01-26 17:16:26 -08:00
Himanshu Gupta
3719b6e3c8
make populateUncoveredIntervals a configuration in query context
2016-01-26 15:13:45 -06:00
Himanshu
3844658fb5
Merge pull request #2323 from druid-io/update-druidapi
...
Update druid-api to 0.3.16
2016-01-26 13:02:10 -06:00
Himanshu Gupta
09d3678667
adding single threaded indexing and querying test for IncrementalIndex
2016-01-23 00:17:14 -06:00
Charles Allen
0000b9fc62
Remove sorting in ProtoBufInputRowParserTest
...
Due to processing/src/test/java/io/druid/data/input/ProtoBufInputRowParserTest.java
2016-01-22 16:02:25 -08:00
Himanshu Gupta
2f7f5119cf
older segments might not have field bitmapSerdeFactory for dimension columns and we must use appropriate default
2016-01-22 13:28:25 -06:00
binlijin
1d1f4d996d
Merge pull request #2111 from binlijin/optimize-create-inverted-indexes
...
optimize create inverted indexes
2016-01-22 11:36:27 +08:00
binlijin
55f7dd4629
optimize create inverted indexes
2016-01-22 10:40:09 +08:00
Gian Merlino
d416279c14
SegmentMetadataQuery support for returning aggregators.
2016-01-21 17:27:25 -08:00
Fangjin Yang
5a9cd89059
Merge pull request #2305 from gianm/segment-metadata-query-multivalues
...
Add StorageAdapter#getColumnTypeName, and various SegmentMetadataQuery adjustments
2016-01-21 17:22:34 -08:00
Gian Merlino
e5913be90e
Merge pull request #2257 from tubemogul/index-merge-bug
...
Adds support for empty merge metrics. fixes #2256
2016-01-21 16:38:00 -08:00
Gian Merlino
87c8046c6c
Add StorageAdapter#getColumnTypeName, and various SegmentMetadataQuery adjustments.
...
SegmentMetadataQuery stuff:
- Simplify implementation of SegmentAnalyzer.
- Fix type names for realtime complex columns; this used to try to merge a nice type
name (like "hyperUnique") from mmapped segments with the word "COMPLEX" from incremental
index segments, leading to a merge failure. Now it always uses the nice name.
- Add hasMultipleValues to ColumnAnalysis.
- Add tests for both mmapped and incremental index segments.
- Update docs to include errorMessage.
2016-01-21 15:50:33 -08:00
Fangjin Yang
3f998117a6
Merge pull request #2306 from jon-wei/inherit2
...
More specific null/empty str handling in IndexMerger
2016-01-21 14:36:09 -08:00
Michael Schiff
1e44445f06
Adds support for empty merge metrics. fixes #2256
2016-01-21 13:21:37 -08:00
jon-wei
459a236067
More specific null/empty str handling in IndexMerger
2016-01-21 12:24:38 -08:00
Slim
201539260c
Merge pull request #2076 from b-slim/issue_2010_upper_lower_extractionFN
...
adding lower and upper extraction fn
2016-01-21 09:58:07 -06:00
Slim Bouguerra
78feb3a13e
adding lower and upper extraction fn
2016-01-21 08:59:05 -06:00
Gian Merlino
5a932d28c1
Merge pull request #2288 from tubemogul/index-merge-bug2
...
Null check in IncrementalIndexAdapter.getDimValueLookup()
2016-01-20 17:07:15 -08:00
Nishant
59ea186af7
fix reference counting for segments
2016-01-20 17:24:21 +05:30
Michael Schiff
50ceec78a2
null check in IncrementalIndexAdapter.getDimValueLookup()
2016-01-19 23:19:28 -08:00
jon-wei
bc1e9b27c8
Consolidate IndexMergerTest and IndexMergerV9Test
2016-01-19 16:28:35 -08:00
jon-wei
747343e621
Preserve dimension order across indexes during ingestion
2016-01-19 13:34:11 -08:00
Fangjin Yang
0c31f007fc
Merge pull request #1728 from himanshug/aggregators_in_segment_metadata
...
Store AggregatorFactory[] in segment metadata
2016-01-19 12:55:49 -08:00
Himanshu Gupta
a99aef29a1
adding aggregators to segment metadata
2016-01-19 14:23:39 -06:00
Himanshu Gupta
52eb0f04a7
adding a new method getMergingFactory(..) to AggregatorFactory
2016-01-18 22:03:46 -06:00
Himanshu Gupta
77fc86c015
making AggregatorFactory abstract class
2016-01-18 22:03:46 -06:00
Himanshu Gupta
164b0aad7a
removing Map<String,Object> segmentMetadata from methods in Index[Maker/Merger] and using Metadata class
...
instead of a Map to store segment metadata
2016-01-18 22:03:46 -06:00
zhxiaog
3459a202ce
fixed #1873 , add ability to express CONCAT as an extractionFn
2016-01-18 15:03:17 -08:00
Keuntae Park
238dd3be3c
support cascade execution of extraction filters in extraction dimension spec
2016-01-18 11:10:19 +09:00
Fangjin Yang
f6a1a4ae20
Merge pull request #2138 from KurtYoung/feature-build-v9
...
build v9 directly
2016-01-16 13:35:46 -06:00
Kurt Young
82ff98c2bf
add config for build v9 directly and update docs
2016-01-16 11:26:34 +08:00
Kurt Young
1f2168fae5
add IndexMergerV9
...
add unit tests for IndexMergerV9 and fix some bugs
add more unit tests and fix bugs
handle null values and add more tests
minor changes & use LoggingProgressIndicator in IndexGeneratorReducer
make some static class public from IndexMerger
minor changes and add some comments
changes for comments
2016-01-16 11:25:28 +08:00
Kurt Young
bb50d2a2b2
add some streaming writers
2016-01-16 11:25:26 +08:00
Fangjin Yang
e0932ba1c2
Merge pull request #2267 from himanshug/fix_topn_multi_val_filter
...
Remap id's returned in XXXFilteredDimensionSpec.getRow() as per reduced cardinality
2016-01-14 17:06:54 -08:00
Fangjin Yang
7704699b40
Merge pull request #2265 from navis/strlen-dimension-ignored
...
Strlen sort spec ignores dimension
2016-01-14 17:06:33 -08:00
Himanshu Gupta
ae6a111444
fix XXXFilteredDimensionSpec to remap the dictionary encodings as per new cardinality
2016-01-13 22:25:02 -06:00
binlijin
a3140b2548
fix topN filtering on multi-valued dimension bug
2016-01-13 22:25:02 -06:00
navis.ryu
ea9fabdf2f
Strlen sort spec ignores dimension
2016-01-14 11:05:44 +09:00
Fangjin Yang
4c014c1574
Merge pull request #2228 from metamx/incremental-index-mem2
...
Improve heap usage for IncrementalIndex
2016-01-13 14:48:03 -08:00
navis.ryu
18479bb757
time-descending result of timeseries queries
2016-01-13 12:23:01 +09:00
Fangjin Yang
d7ad93debc
Merge pull request #2221 from binlijin/topN_minTopNThreshold
...
Allow change minTopNThreshold per topN query
2016-01-12 16:22:20 -08:00
Nishant
4863e2ca4f
cache metric selectors instead of creating new ones for every metric in each row
...
clear selectors on close.
Add comments about thread safety.
2016-01-13 00:45:23 +05:30
Nishant
dfe6abb721
Merge pull request #2250 from himanshug/agg_test_helper_fix
...
remove redundant registering of json modules in AggregationTestHelper
2016-01-12 11:42:00 +05:30
navis.ryu
976ebc45c0
Simplify information in IncrementalIndex
2016-01-12 10:18:11 +09:00
Himanshu Gupta
b973604bf8
remove redundant registering of json modules in AggregationTestHelper
2016-01-11 19:03:22 -06:00
Xavier Léauté
46a7f2660d
fix casing to be consistent with other classes
2016-01-08 10:19:06 -08:00
Fangjin Yang
d0b10c29d7
Merge pull request #2197 from metamx/clearIncIndexClose
...
Make OnHeapIncrementalIndex clean maps on close()
2016-01-07 15:43:47 -08:00
Gian Merlino
4ecd901a1a
Merge pull request #2219 from himanshug/identity_extraction_fn_singleton
...
make IdentityExtractionFn singleton
2016-01-07 10:08:28 -08:00
Fangjin Yang
aaea95ed1b
Merge pull request #2207 from himanshug/theta_sketch_select_query
...
fix bug for thetaSketch metric not working with select queries
2016-01-07 09:46:09 -08:00
binlijin
010c6e959c
add test
2016-01-07 18:01:46 +08:00
binlijin
a6bfcc5bfd
Allow change minTopNThreshold per topN query
2016-01-07 14:51:00 +08:00
Fangjin Yang
4cc81d3eff
Merge pull request #2096 from b-slim/add_use_case_unapply
...
Add use case unapply
2016-01-06 21:58:12 -08:00
Himanshu Gupta
217079d0c7
make IdentityExtractionFn singleton
2016-01-06 22:29:07 -06:00
Himanshu
902f51433d
Merge pull request #2125 from mangeshpardeshiyahoo/master
...
Add extraction function support for Dimension Selector
2016-01-06 14:22:26 -06:00
Mangesh Pardeshi
75ee952197
Add extraction function support for dimension Selector
2016-01-06 13:47:07 -06:00
Slim Bouguerra
032d3bf6e6
Optimization of extraction filter by reversing the lookup
2016-01-06 11:16:11 -06:00
Himanshu Gupta
3f048f0b15
adding support to execute Select queries in AggregationTestHelper so that Select query based UTs can be written for complex aggregator implementations
2016-01-05 21:54:55 -06:00
Charles Allen
91fc32749b
Make OnHeapIncrementalIndex clean maps on close()
2016-01-04 11:18:16 -08:00
Himanshu Gupta
b47d807738
Add support for filtering at DimensionSpec level so that multivalued dimensions can be filtered correctly
...
also adding UTs for multi-valued dimensions
2015-12-30 17:59:47 -06:00
Himanshu Gupta
fa5c3bb014
adding decorate(DimensionSelector) to DimensionSpec to enable support for arbitrary filtering/transformations to returned dimension values
2015-12-30 15:06:24 -06:00
Nishant
b68265399c
Merge pull request #2168 from druid-io/remove-indexmaker
...
Remove IndexMaker
2015-12-30 12:24:29 +05:30
Fangjin Yang
e14ad74088
Merge pull request #1936 from b-slim/between_range_with_predicat
...
adding Upper/Lower Bound Filter
2015-12-29 10:11:22 -08:00
fjy
faf421726b
remove IndexMaker
2015-12-28 14:19:02 -08:00
Gian Merlino
83f4130b5f
SegmentMetadataQuery merging fixes.
...
- Fix merging when the INTERVALS analysisType is disabled, and add a test.
- Remove transformFn from CombiningSequence, use MappingSequence instead. transformFn did
not work for "accumulate" anyway, which made the tests wrong (the intervals should have
been condensed, but were not).
- Add analysisTypes to the Druids segmentMetadataQuery builder to make testing simpler.
2015-12-22 07:57:10 -08:00
Robin
dded4441d3
for completeness, add unit test for groupby/having with unrecognized type
2015-12-21 12:06:56 -06:00
Himanshu Gupta
e1631967e3
adding comments to explain merge failure in segmentMetadata query
2015-12-19 11:39:24 -06:00
Himanshu Gupta
7ecad1be24
Fix and UT for testing segment analysis merge
2015-12-19 00:24:02 -06:00
Fangjin Yang
7019d3c421
Merge pull request #2107 from jon-wei/fix_smq
...
More efficient SegmentMetadataQuery
2015-12-18 16:40:47 -08:00
Fangjin Yang
14229ba0f2
Merge pull request #1922 from metamx/jsonIgnoresFinalFields
...
Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to
2015-12-18 15:38:32 -08:00
Fangjin Yang
71f554bf80
Merge pull request #2101 from himanshug/fix_extraction_dim_filter_cache_key
...
add extractionFn bytes to cache key in ExtractionDimFilter
2015-12-18 12:05:43 -08:00
Fangjin Yang
9e6874cc7e
Merge pull request #2084 from binlijin/master
...
minor optimize IndexMerger's MMappedIndexRowIterable
2015-12-18 11:42:55 -08:00
Bingkun
cc21a5fac7
Merge pull request #1999 from himanshug/remove_min_max_aggs
...
remove min/max aggregator factory
2015-12-18 13:38:52 -06:00
jon-wei
356b07c6c3
More efficient SegmentMetadataQuery
2015-12-17 12:46:23 -08:00
Jonathan Wei
f8cf84f466
Merge pull request #1995 from himanshug/num_rows_seg_metadata_query
...
add numRows to segment metadata query response
2015-12-17 12:23:46 -08:00
Himanshu Gupta
82ea348003
add extractionFn bytes to cache key in ExtractionDimFilter
2015-12-16 14:00:38 -06:00
Himanshu
628643d80e
Merge pull request #2091 from rasahner/noDefaultForGroupbyHaving
...
take away default for groupBy/having
2015-12-16 01:07:40 -06:00
sahner
3441cf3110
take away default for groupBy/having
2015-12-15 10:32:45 -06:00
Fangjin Yang
e7f06cf61c
Merge pull request #2075 from jon-wei/regex_extract
...
Configurable value replacement on match failure for RegexExtractionFn
2015-12-14 19:10:50 -08:00
jon-wei
c88f75df7c
Configurable value replacement on match failure for RegexExtractionFn
2015-12-14 17:57:41 -08:00
binlijin
362bea1090
minor optimize IndexMerger's MMappedIndexRowIterable
2015-12-11 15:04:46 +08:00
Xavier Léauté
d531e69d1a
Merge pull request #2079 from binlijin/master
...
reduce bytearray copy to minimal optimize VSizeIndexedWriter
2015-12-10 21:30:09 -08:00
Slim Bouguerra
77afdf25e3
adding Bound Filter
2015-12-10 08:47:21 -06:00
Slim Bouguerra
ee1a39801a
adding bulk lookup and reverse lookup
2015-12-10 08:29:41 -06:00
binlijin
0eafbd55b2
reduce bytearray copy to minimal optimize VSizeIndexedWriter
2015-12-10 16:34:39 +08:00
Fangjin Yang
f4ba13a1ac
Merge pull request #2029 from b-slim/add_reverse_fn
...
Adding reverse lookup function to LookupExtractor.
2015-12-09 12:50:13 -08:00
Xavier Léauté
9015a68c03
Merge pull request #2002 from navis/DRUID-2001
...
fixed #2001 GenericIndexed.fromIterable compares all values even when it's not sorted
2015-12-09 08:56:49 -08:00
Slim Bouguerra
85f339b687
introduction and implem of reverse lookup function unApply.
2015-12-09 10:02:57 -06:00
Nishant
6c23d8edb4
Merge pull request #2043 from mangeshpardeshiyahoo/master
...
Add dimension selector support for groupby/having filters
2015-12-08 12:08:53 +05:30
Mangesh Pardeshi
d7ce120929
Add dimension selector support for groupby/having quries
2015-12-08 01:51:11 +00:00
Himanshu Gupta
431469e9c1
remove min/max aggregator factory which are replaced by double[min/max] aggregator factories
2015-12-05 22:36:49 -06:00
Himanshu Gupta
62ba9ade37
unifying license header in all java files
2015-12-05 22:16:23 -06:00
Gian Merlino
d21a640695
Merge pull request #2034 from b-slim/fix_cache_key
...
Fix getCacheKey for DimFilters
2015-12-04 09:13:06 -08:00
Slim Bouguerra
fb4ff3cf54
fix getCacheKey
2015-12-04 08:07:08 -06:00
Charles Allen
9d02f47201
Update IncrementalIndexTest copyright notice
2015-12-03 18:03:08 -08:00
Charles Allen
be8c6fafb0
Merge pull request #2017 from tubemogul/issue/63
...
fixes issue #63
2015-12-03 18:01:11 -08:00
Gian Merlino
045df54404
Merge pull request #1961 from metamx/druidMetricsVersion
...
Add the druid artifact version to metrics when emitted
2015-12-03 17:34:57 -08:00
Michael Schiff
b6cc2428e1
fixes issue #63
2015-12-03 17:30:47 -08:00
Himanshu
0eab8417cb
Merge pull request #2008 from codingwhatever/regex-search-query
...
Regex search query
2015-12-03 09:57:34 -06:00
Sam Groth
596b7ebd9a
Adding RegexSearchQuerySpec
2015-12-03 09:16:02 -06:00
Himanshu
d02be6194d
Merge pull request #1967 from metamx/realtime-metrics-improvements
...
Add datasource and taskId to metrics emitted by peons
2015-12-02 23:48:13 -06:00
Himanshu
00c6027777
Merge pull request #1986 from metamx/substring
...
fixes #1874 adding a substring extraction function, tests, and documentation
2015-12-02 23:45:47 -06:00
Clint Wylie
68ef5f437a
fixes #1874 adding a substring extraction function, tests, and documentation
2015-12-01 23:50:32 -08:00
navis.ryu
87357a0534
fixed #2001 GenericIndexed.fromIterable compares all values even when it's not sorted
2015-12-02 15:11:14 +09:00
Nishant
1eb8211346
Add datasource and taskId to metrics emitted by peons
...
This PR adds the datasource and taskId to the jvm and sys metrics
emitted by the peons.
fix spelling
review comment
review comment
2015-12-01 23:20:59 +05:30
Gian Merlino
cd2cff24ff
Fix serde for FragmentSearchQuerySpec and add some tests.
2015-11-30 17:34:35 -08:00
navis.ryu
c73418c181
fixed #2003 ColumnSelectorBitmapIndexSelector throws NPE for dimension not supporting bitmap
2015-11-24 10:45:36 +09:00
Himanshu Gupta
7a89b2e1a6
add numRows to segment metadata query response
2015-11-20 01:25:02 -06:00
Himanshu
d93640bfcb
Merge pull request #1974 from jon-wei/dim_order_merge
...
Allow IndexMerger to use non-lexicographic dim order when merging indexes
2015-11-18 19:51:34 -06:00
Xavier Léauté
e3e6159336
Merge pull request #1985 from metamx/FixLookupCacheKey
...
Change LookupExtractionFn cache key to be unique
2015-11-18 10:13:55 -08:00
Charles Allen
7abe999418
Change LookupExtractionFn cache key to be unique
2015-11-17 18:02:40 -08:00
jon-wei
4afc62be29
Allow IndexMerger to use non-lexicographic dim order when merging indexes
2015-11-17 13:02:31 -08:00
Xavier Léauté
d7eb2f717e
enable query caching on intermediate realtime persists
2015-11-17 10:58:00 -08:00
Gian Merlino
57f213d536
Better toString for groupBy, segmentMetadata queries.
2015-11-16 12:54:59 -08:00
jon-wei
cdceaf2d26
Fix IncrementalIndexAdapter getRows() Iterable
2015-11-12 13:10:42 -08:00
Charles Allen
af34e9c8cb
Add the druid artifact version to metrics when emitted
2015-11-12 12:11:27 -08:00
binlijin
286b8f8c6f
optimize index merge
2015-11-12 11:08:54 +08:00
Xavier Léauté
fa6142e217
cleanup and remove unused imports
2015-11-11 12:25:21 -08:00
dclim
fd0935ecb9
fix spatial dimension transformer to work with hadoop
2015-11-10 19:16:51 -07:00
Slim Bouguerra
c511273efd
adding in filter
2015-11-06 16:23:24 -06:00
Charles Allen
929b981710
Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to
2015-11-05 18:10:13 -08:00
fjy
8f231fd3e3
cleanup druid codebase
2015-11-04 13:59:53 -08:00
Gian Merlino
8defe29270
Merge pull request #1901 from guobingkun/fix_typo_and_rename
...
Fix metadata typo and rename default extension directory
2015-11-03 14:02:11 -08:00
Bingkun Guo
962f65cc76
fix metadata typo and rename default extension directory
2015-11-03 14:50:42 -06:00
Fangjin Yang
cec09a9967
Merge pull request #1804 from himanshug/objectify_index_creators
...
static to non-static conversion for methods in Index[Merger/Maker/IO]
2015-11-03 11:25:32 -08:00
Himanshu Gupta
8b67417ac8
make methods in Index[Merger,Maker,IO] non-static so that they can have
...
appropriate ObjectMapper injected instead of creating one statically
2015-11-02 23:24:26 -06:00
navis.ryu
e03fc2032f
changed equals/hashCode implementation
2015-11-02 17:21:35 +09:00
navis.ryu
69c86716d6
addressed comments
2015-11-02 14:23:13 +09:00
navis.ryu
032c3e986d
Make 'search' filter have a case sensitive option( #1878 )
2015-10-30 16:38:54 +09:00
Fangjin Yang
25a0eb7ed5
Merge pull request #1799 from dclim/nested-groupby-aggregator-fix
...
Support multiple outer aggregators of same type and provide more help…
2015-10-29 18:01:31 -07:00
Xavier Léauté
59872bd0cd
Merge pull request #1809 from metamx/fifoPriorityExecutorService
...
Make PrioritizedExecutorService optionally FIFO
2015-10-27 15:19:32 -07:00
Charles Allen
060402a216
Merge pull request #1855 from himanshug/fix_having_specs
...
fix [GreaterThan,LessThan,Equals] HavingSpecs
2015-10-27 14:46:04 -07:00
Charles Allen
ecdafa87c5
Make PrioritizedExecutorService optionally FIFO
2015-10-27 14:16:22 -07:00
Himanshu Gupta
a71c7270b9
making [GreaterThan,LessThan,Equals] HavingSpecs more robust by carefully using long vs float for comparison
2015-10-27 13:15:13 -05:00
Fangjin Yang
5a082b2f5e
Merge pull request #1824 from metamx/UniformGranularitySpecHashEquals
...
Add hashCode and equals to UniformGranularitySpec
2015-10-26 09:34:01 -07:00
Fangjin Yang
5f23703216
Merge pull request #1638 from guobingkun/remove_maven_client_code
...
Remove Maven client at runtime + Provide a way to load Druid extensions through local file system
2015-10-26 09:30:05 -07:00
Nishant
7cecc55045
Add segment merge time as a metric
...
Add merge and persist cpu time
Fix typo
review comment
move cpu time measuring to VMUtils
review comments.
2015-10-22 12:28:03 +05:30
Bingkun Guo
4914925d65
New extension loading mechanism
...
1) Remove maven client from downloading extensions at runtime.
2) Provide a way to load Druid extensions and hadoop dependencies through file system.
3) Refactor pull-deps so that it can download extensions into extension directories.
4) Add documents on how to use this new extension loading mechanism.
5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0
are packaged within the Druid tarball.
2015-10-21 14:22:36 -05:00
Xavier Léauté
e4ac78e43d
bump next snapshot to 0.9.0
2015-10-20 13:46:13 -07:00
dclim
46ecdfa757
add comment explaining logic
2015-10-15 16:04:06 -06:00
Xavier Léauté
4c2c7a2c37
update version to 0.8.3
2015-10-14 21:40:55 -07:00
Charles Allen
f432b8e3f9
Add hashCode and equals to UniformGranularitySpec
...
* Also add hashCode != 0 to AllGranularity and NoneGranularity
2015-10-13 16:42:21 -07:00
Gian Merlino
c9d6994040
Merge pull request #1821 from himanshug/storage_adapter_update
...
cache max data timestamp in QueryableIndexStorageAdapter
2015-10-13 10:52:43 -07:00
Himanshu Gupta
490de1f98a
support multiple non-consecutive intervals in outer query of nested group-by
2015-10-13 10:16:06 -05:00
Himanshu Gupta
fbba30eb60
cache max data timestamp in QueryableIndexStorageAdapter so that TimestampCheckingOffset
...
does not have to get it per cursor.
2015-10-12 15:34:22 -05:00
Charles Allen
8ed5d2c06a
Add hashCode and equals to stock lookups
2015-10-12 10:29:39 -07:00
Himanshu Gupta
2737fd83f5
in the IndexSizeExceededException put maxRowCount to confirm if it is correctly picked up from configuration
2015-10-06 15:23:14 -05:00
Himanshu Gupta
8654732ef6
make IndexSizeExceededException constructor take formatString and arguments than just fixed String
...
like ISE, IAE etc
2015-10-06 13:44:22 -05:00
dclim
f4e0a76820
Support multiple outer aggregators of same type and provide more helpful exception when the same inner aggregator is referenced by multiple types of outer aggregators
2015-10-01 15:15:12 -06:00
Gian Merlino
774765dc40
GroupByQueryRunnerTest for hyperUnique finalizing post aggregators
2015-10-01 00:09:29 -04:00
Gian Merlino
e3bb93e8c7
Revert "Merge pull request #1781 from dclim/nested-groupby-multiple-same-aggregator-fix-v2"
...
This reverts commit dae488b7c0
, reversing
changes made to 397be4b897
.
2015-10-01 00:05:59 -04:00
dclim
8e20a1e1f3
Use DoubleSumAggregatorFactory instead of CountAggregatorFactory, add test for non-integers
2015-09-30 17:11:39 -06:00
David Lim
70ae5ca922
Fix failure in nested groupBy with multiple aggregators with same fieldName
...
Version 2 - Throws an exception if an outer query references an
aggregator that doesn't exist in the inner query, and then uses the
inner query aggregator names to form the columns for the intermediate
incremental index.
Also deleted all the getRequiredColumns() methods which are no longer
being used.
We do something wacky by adding an aggregator factory for the post
aggregators when building the intermediate incremental index, otherwise
queries on post aggregate results fail because the data isn't in the
incremental index.
Closes #1419
2015-09-30 15:43:11 -06:00
Charles Allen
8199ecf1a4
Merge pull request #1782 from jon-wei/smq_cachekey
...
Add analysisTypes to SegmentMetadataQuery cache key
2015-09-29 15:51:35 -07:00
jon-wei
41ff271339
Add analysisTypes to SegmentMetadataQuery cache key
2015-09-29 14:33:35 -07:00
Charles Allen
2d847ad654
Merge pull request #1730 from metamx/union-queries-fix
...
fix #1727 - Union bySegment queries fix
2015-09-29 12:23:25 -07:00
Nishant
573aa96bd6
fix #1727 - Union bySegment queries fix
...
Fixes #1727 .
revert to doing merging for results for union queries on broker.
revert unrelated changes
Add test for union query runner
Add test
remove unused imports
fix imports
fix renamed file
fix test
update docs.
2015-09-29 23:32:36 +05:30
Gian Merlino
62d4ced4dd
Separate ListColumnIncluderator cache key parts with nul bytes
2015-09-29 13:59:58 -04:00
jon-wei
e6a6284ebd
Allow SegmentMetadataQuery to skip cardinality and size calculations
2015-09-22 13:51:55 -07:00
Gian Merlino
aaa8a88464
Merge pull request #1739 from jon-wei/segment_realtime
...
Allow SegmentAnalyzer to read columns from StorageAdapter, allow SegmentMetadataQuery to query IncrementalIndexSegments on realtime node
2015-09-17 18:36:53 -07:00
Charles Allen
df4c2bab10
Soften concurrency requirements on IncrementalIndexTest
2015-09-17 15:51:07 -07:00
jon-wei
367c50d4ba
Allow SegmentAnalyzer to read columns from StorageAdapter, allow SegmentMetadataQuery to query IncrementalIndexSegments on realtime node
2015-09-16 18:39:31 -07:00
Charles Allen
6e1eb3b7fe
Add better concurrency testing to IncrementalIndexTest
2015-09-16 14:04:20 -07:00
Gian Merlino
9705c5139b
Merge pull request #1732 from jon-wei/segmentmeta
...
Add support for a configurable default segment history period for segmentMetadata queries and GET /datasources/<datasourceName> lookups
2015-09-16 12:36:25 -07:00
Fangjin Yang
8b071a7230
Merge pull request #1710 from metamx/incrementalIndexConcurrentTestLatching
...
Add some basic latching to concurrency testing in IncrementalIndexTest
2015-09-15 13:55:52 -07:00
jon-wei
193fb4fdfc
Add support for a configurable default segment history period for segmentMetadata queries and GET /datasources/<datasourceName> lookups
2015-09-14 19:41:42 -07:00
Charles Allen
bd605a097e
Merge pull request #1731 from metamx/regex-extraction-npe
...
fix NPE with regex extraction function
2015-09-14 15:55:05 -07:00
Xavier Léauté
08a527d01a
fix NPE with regex extraction function
2015-09-14 14:45:30 -07:00
Charles Allen
e569f4b6a7
Add dimension extraction functionality to SearchQuery
...
* Add IdentityExtractionFn
2015-09-14 11:36:15 -07:00
Himanshu
5ff92664f8
Merge pull request #1696 from metamx/cpuTimeReporting
...
Add CPU time to metrics for segment scanning.
2015-09-14 10:53:55 -05:00
Fangjin Yang
34ef81572d
Merge pull request #1700 from himanshug/update_agg_test_helper
...
update indexing in the helper to use multiple persists and merge
2015-09-14 06:56:29 -07:00
Charles Allen
8d3cdd8572
Don't check for sortedness if we already know GenericIndexedWriter isn't sorted
2015-09-11 16:32:09 -07:00
Charles Allen
d6849805ea
Add some basic latching to concurrency testing in IncrementalIndexTest
2015-09-10 10:06:51 -07:00
Himanshu Gupta
5da58e48e0
use Rule based TemporaryFolder for cleanup of temp directory/files
2015-09-09 11:10:33 -05:00
Himanshu Gupta
44911039c5
update indexing in the helper to use multiple persists and final merge to
...
catch further issues in aggregator implementations
2015-09-09 11:10:33 -05:00
Charles Allen
fcf5cae81d
Add CPU time to metrics for segment scanning.
2015-09-08 13:34:19 -07:00
cheddar
4f61b42f40
Merge pull request #1578 from b-slim/fix_extraction_filter_2
...
Fix UT and documentation to the extraction filter
2015-09-01 10:46:20 -07:00
Himanshu
04ff6cd355
Merge pull request #1685 from gianm/close-loudly
...
Close output streams and channels loudly when creating segments.
2015-08-28 23:32:22 -05:00
Gian Merlino
940e1aa3eb
Replace funky imports with standard ones.
...
1) Lots of Guava imports were not coming from the actual Guava
2) junit.framework.Assert should be org.junit.Assert
2015-08-28 18:02:05 -07:00
Gian Merlino
7d6fa2ba50
Close output streams and channels loudly when creating segments.
2015-08-28 17:14:03 -07:00
Himanshu Gupta
2e0dd1d792
adding UTs and addressing review comments to
...
firehoseV2 addition to Realtime[Manager|Plumber],
essential segment metadata persist support,
kafka-simple-consumer-firehose extension patch
2015-08-27 20:50:46 -05:00
lvjq
2237a8cf0f
kafka 8 simple consumer firehose
2015-08-27 20:50:46 -05:00
Charles Allen
c1388a1685
Merge pull request #1632 from Hailei/fix-subquery-innerquery-demension
...
Inner Query should build on sub query
2015-08-27 10:25:38 -07:00
Gian Merlino
2a866f49df
Downgrade Jackson to 2.4.6.
2015-08-26 18:25:55 -07:00
Charles Allen
24aa762c79
Add test for #1632
2015-08-25 20:50:30 -07:00
Xavier Léauté
51f6a9a2c9
update jackson to 2.6.1
2015-08-25 16:07:01 -07:00
Himanshu Gupta
c57c07f28a
add ability for client code to provide InputStream of input data in addition to File
...
It would be needed when input data file does not reside in the same jar
but you could still use getResourceAsStream() to read the data inside a file
2015-08-20 00:54:58 -05:00
Xavier Léauté
3b2e41e42a
update for next release
2015-08-18 17:16:46 -07:00
Slim Bouguerra
7549f02578
support the case filter value is null
2015-08-17 15:09:37 -05:00
zhanghailei
234a958817
Inner Query should build on sub query
2015-08-17 18:18:26 +08:00
Charles Allen
db19d2d547
Revert "Update to guice 4.0"
2015-08-14 09:26:07 -07:00
Charles Allen
be89105621
Merge pull request #1602 from metamx/more-code-cleanup
...
Some perf Improvements in Broker
2015-08-11 13:51:49 -07:00
Xavier Léauté
fbdb841928
Merge pull request #1603 from metamx/optimize-lexicographic-topN
...
Optimizations for LexicographicTopNs
2015-08-11 13:35:34 -07:00
Nishant
b8d8a8da9e
Optimisations for LexicographicTopNs
...
initial review for perf optimizations for lexicographic TopNs
fix compilation
create map with proper size
review comment
review comment
review comments
2015-08-12 00:37:48 +05:30
Charles Allen
7e61216287
Update to guice 4.0
...
- Mark a lot of `@Provides` methods as final since guice 4.0 disallows overriding them
2015-08-10 13:57:18 -07:00
Slim Bouguerra
f0bc362981
clean code if is not needed anymore
2015-08-07 12:38:41 -05:00
Slim Bouguerra
64d638a386
optimize makeMatcher
2015-08-06 17:04:36 -05:00
Nishant
1a46c4c71c
avoid creating mergeSeqence when not required
2015-08-06 14:25:13 +05:30
Slim Bouguerra
83de5a4716
addressing reviewers comments
2015-08-03 09:03:28 -05:00
Slim Bouguerra
dda0790a60
Fix extractionFilter by implementing make matcher
...
Fix getBitmapIndex to consider the case were dim is null
Unit Test for exractionFn with empty result and null_column
UT for TopN queries with Extraction filter
refactor in Extractiuon fileter makematcher for realtime segment and clean code in b/processing/src/test/java/io/druid/query/groupby/GroupByQueryRunnerTest.java
fix to make sure that empty string are converted to null
2015-08-03 09:02:17 -05:00
Himanshu Gupta
d11d9b6c45
dont waste memory in storing all lines from input
...
CharSource.readLines() reads all lines from input into a in-memory list
Since we need an iterator here, so this wastage can be easily prevented
2015-07-20 21:59:38 -05:00
Fangjin Yang
0481c8ca26
Merge pull request #1406 from zhaown/fix-breaking-while-exceeding-max-intermediate-rows
...
Fix breaking while exceeding max intermediate rows.
2015-07-20 13:41:22 -07:00
Himanshu Gupta
f7a92db332
generic byte[] serde for InputRow
2015-07-20 12:01:53 -05:00
Himanshu Gupta
0439e8ec23
adding serde methods for intermediate aggregation object to ComplexMetricSerde
...
This provides the alternative to using ComplexMetricSerde.getObjectStrategy()
and using the serde methods from ObjectStrategy as that usage pattern is deprecated.
2015-07-20 12:01:53 -05:00
zhaown
524b05f073
Fix breaking while exceeding max intermediate rows.
2015-07-19 10:41:53 +08:00
Fangjin Yang
e21195f987
Merge pull request #1469 from guobingkun/table_config
...
Inconsistent property names for "druid.metadata.storage.tables.xxx"
2015-07-17 07:43:19 -07:00
Himanshu
19af3bc9bc
Merge pull request #1535 from metamx/alphanum-docs-tests
...
Update alphanumeric sort docs + more tests / examples
2015-07-16 22:09:41 -05:00
Xavier Léauté
2c464ad936
correct reference in docs + more tests / examples
2015-07-16 19:50:05 -07:00
Xavier Léauté
9616c10b1d
remove import static
2015-07-16 17:46:21 -07:00
Xavier Léauté
c1308203b8
Merge pull request #1532 from metamx/fixTopNDimExtractionDoubleApply
...
Fix TopN dimension extractions being applied twice
2015-07-16 13:39:02 -07:00
Xavier Léauté
3a0793aaf9
Merge pull request #1533 from metamx/extraCheckGroupByDimExtraction
...
Add more unit tests for group by
2015-07-15 21:09:00 -07:00
Charles Allen
7d0b77c261
Add more unit tests for group by
2015-07-15 20:15:21 -07:00
Xavier Léauté
a15a2c4047
fix histogram aggregator cache key
2015-07-15 17:33:36 -07:00
Charles Allen
9092c665b7
Fix TopN dimension extractions being applied twice
2015-07-15 16:58:15 -07:00
Charles Allen
456ad9ffba
Merge pull request #1529 from metamx/update-versions
...
inrement version
2015-07-15 13:25:31 -07:00
Xavier Léauté
4cfb00bc8a
inrement version
2015-07-15 13:09:05 -07:00
Charles Allen
5eadd395e2
Move lots of executor service creation to Execs
2015-07-14 15:38:49 -07:00
Nishant
184b12bee8
fix groupBy caching to work with renamed aggregators
...
Issue - while storing results in cache we store the event map which
contains aggregator names mapped to values. Now when someone fire same
query after renaming aggs, the cache key will be same but the event
will contain metric values mapped to older names which leads to wrong
results.
Fix - modify cache to not store raw event but the actual list of values
only.
review comments + fix dimension renaming
review comment
2015-07-09 11:48:26 +05:30
Xavier Léauté
9789417612
ModuleList is already part of Initialization
2015-07-01 11:37:40 -07:00
Xavier Léauté
2c463ae435
Merge pull request #1489 from metamx/moveTestPackages
...
Move some test packages
2015-07-01 11:18:09 -07:00
Charles Allen
5e19a615f1
Add coments to DimExtractionTopNAlgorithm
2015-07-01 10:32:45 -07:00
Charles Allen
7a2a8a3d6e
Move extraction tests to more reasonable package
2015-07-01 10:30:50 -07:00
Bingkun Guo
4a0ae7d8d5
Fix inconsistent druid property names for "druid.metadata.storage.tables.xxx" between document and code
2015-06-29 10:12:30 -05:00
Xavier Léauté
28fa1642b9
add node time metrics to DirectDruidClient
2015-06-26 17:57:44 -07:00
Xavier Léauté
36b4453789
Merge pull request #1455 from druid-io/fix-protobuf
...
Fix protobuf impl and docs
2015-06-22 23:15:40 -07:00
nishant
f9cdb0ad61
test for #1120
...
Make the changes described in #1120 to add test for the issue described
there.
2015-06-21 23:34:21 +05:30
fjy
9c74993559
fix protobuf impl and docs
2015-06-20 21:59:38 -07:00
Xavier Léauté
0a5bb909a2
[maven-release-plugin] prepare for next development iteration
2015-06-18 17:35:19 -07:00
Xavier Léauté
59c6b2b279
[maven-release-plugin] prepare release druid-0.8.0-rc1
2015-06-18 17:35:14 -07:00
Charles Allen
6230ac90ae
Use IndexMerger for conversion
2015-06-10 11:34:58 -07:00
Xavier Léauté
395ba79f8b
Merge pull request #1403 from metamx/mergerMakerTests
...
Improvements around resource handling in IndexMerger / IndexIO / QueryableIndex
2015-06-04 15:59:10 -07:00
Charles Allen
ed8eb5c991
Improvements around resource handling in IndexMerger / IndexIO / QueryableIndex
...
* Fix resource leak in `io.druid.segment.IndexIO.DefaultIndexIOHandler#validateTwoSegments(java.io.File, java.io.File)`
* Un-deprecate `close()` in `QueryableIndex` and make it inherit `Closeable`
* Fix resource leaks in various unit tests
* Add `CloserRule` for closing out resources
2015-06-04 14:18:27 -07:00
Himanshu
50ad0e6474
Merge pull request #1412 from pjain1/alphaNumericTopN_NPE_fix
...
NPE fix for TopN query with alphaNumericTopN metric spec
2015-06-04 09:49:31 -05:00
Parag Jain
a7b09e857c
NPE fix for alphaNumericTopN when pervious stop is not specified
2015-06-04 09:30:31 -05:00
Xavier Léauté
35e2fde18e
Merge pull request #1386 from himanshug/aggregation_testing1
...
General class for testing any Aggregation Implementation
2015-06-03 23:43:36 -07:00
Xavier Léauté
92d7316ed8
Merge pull request #1414 from metamx/timeout2TIMEOUT
...
Replace "timeout" with QueryContextKeys.TIMEOUT
2015-06-02 17:11:09 -07:00
Charles Allen
1c4d42bc15
Replace "timeout" with QueryContextKeys.TIMEOUT
2015-06-02 14:49:21 -07:00
Charles Allen
f48db09e35
Add optimizations for ExtractionFn by enabling MANY_TO_ONE vs ONE_TO_ONE codepaths
...
* Also adds LookupExtractionFn and MapLookupExtractor which takes in an explicit mapping of renames
* Add injective to javascript extraction fn
2015-06-02 12:22:56 -07:00
Himanshu Gupta
215c1ab01e
UTs for hyperUnique aggregation
2015-06-01 12:52:40 -05:00
Himanshu Gupta
160d5fe6b7
a general class for testing any [complex] aggregation implementation
2015-06-01 12:52:40 -05:00
Charles Allen
55292bba13
Add more IndexMergerTests
2015-05-28 18:18:20 -07:00
Charles Allen
1ebe622c7d
Add checkin GroupByQuery for null DimensionSpec in dimension list
2015-05-28 14:55:34 -07:00
Xavier Léauté
f9c624c7db
Merge pull request #1361 from mrijke/groupby-limithavingorder-unittest
...
GroupBy Query with Having/Limit/Orderingspec inconsistencies (UnitTest)
2015-05-27 14:49:18 -07:00
Xavier Léauté
1a3f04f0ed
Merge pull request #1354 from metamx/multi-valued-dimension-compression
...
Enabling compression for multiValued dimension
2015-05-26 23:43:53 -07:00
Charles Allen
fd64c24e43
Fix roaring extraction filter on empty values
2015-05-26 13:54:18 -07:00
nishant
81415282aa
Enabling compression for multiValued dimension
...
Add test and refactoring
Add benchmark tests
2015-05-27 00:09:14 +05:30
Charles Allen
e97d22a10a
Fix Extraction Filter cast problems for empty results
2015-05-22 15:20:11 -07:00
Charles Allen
e1399b7ce4
Add unit test to show breaking Dimension Extraction Filter
2015-05-22 15:02:11 -07:00
Xavier Léauté
75c092ccb1
Merge pull request #1375 from metamx/MetricManipulatorFnInstances
...
Modify MetricManipulatorFns to use instanced classes
2015-05-22 15:56:47 -04:00
Charles Allen
042653ebcb
Modify MetricManipulatorFns to use instanced classes
2015-05-22 12:38:38 -07:00
Himanshu Gupta
723df735e9
force eagerness of processing of SegmentMetadata queries on the processing executor by converting the Sequence into List
2015-05-22 13:46:26 -05:00
Himanshu Gupta
5852b64852
adding UT for SegmentMetadata bySegment query which catches following regression caused by commit 55ebf0cfdf
...
it fails when we issue the SegmentMetadataQuery by setting {"bySegment" : true} in context with exception -
java.lang.ClassCastException: io.druid.query.Result cannot be cast to io.druid.query.metadata.metadata.SegmentAnalysis
at io.druid.query.metadata.SegmentMetadataQueryQueryToolChest$4.compare(SegmentMetadataQueryQueryToolChest.java:222) ~[druid-processing-0.7.3-SNAPSHOT.jar:0.7.3-SNAPSHOT]
at com.google.common.collect.NullsFirstOrdering.compare(NullsFirstOrdering.java:44) ~[guava-16.0.1.jar:?]
at com.metamx.common.guava.MergeIterator$1.compare(MergeIterator.java:46) ~[java-util-0.27.0.jar:?]
at com.metamx.common.guava.MergeIterator$1.compare(MergeIterator.java:42) ~[java-util-0.27.0.jar:?]
at java.util.PriorityQueue.siftUpUsingComparator(PriorityQueue.java:649) ~[?:1.7.0_80]
2015-05-22 13:45:54 -05:00
Himanshu Gupta
da0cc32bc8
Revert commit 55ebf0cfdf
...
which caused following regression
it fails when we issue the SegmentMetadataQuery by setting {"bySegment" : true} in context with exception -
java.lang.ClassCastException: io.druid.query.Result cannot be cast to io.druid.query.metadata.metadata.SegmentAnalysis
at io.druid.query.metadata.SegmentMetadataQueryQueryToolChest$4.compare(SegmentMetadataQueryQueryToolChest.java:222) ~[druid-processing-0.7.3-SNAPSHOT.jar:0.7.3-SNAPSHOT]
at com.google.common.collect.NullsFirstOrdering.compare(NullsFirstOrdering.java:44) ~[guava-16.0.1.jar:?]
at com.metamx.common.guava.MergeIterator$1.compare(MergeIterator.java:46) ~[java-util-0.27.0.jar:?]
at com.metamx.common.guava.MergeIterator$1.compare(MergeIterator.java:42) ~[java-util-0.27.0.jar:?]
at java.util.PriorityQueue.siftUpUsingComparator(PriorityQueue.java:649) ~[?:1.7.0_80]
2015-05-22 13:39:34 -05:00
Maarten Rijke
82da479464
Fix for GroupBy with Having+Limit+Orderspec
...
* Inverted function arguments to compose postProcFn for GroupBy queries
with havingspec + limitspec.
* Replaced query.getLimitSpec() with null in GroupByQueryToolChest's
mergeGroupByResults
* Added unittest to verify functionality
2015-05-19 18:35:48 +02:00
Himanshu Gupta
2fd3e9e8e5
return size = 0 in ColumnAnalysis if its unknown
...
that is if complex agg did not implement inputSizeFn() so
that segment metadata query shows atleast some information.
also instead of COMPLEX, return type of data stored.
2015-05-15 20:11:56 -05:00
Xavier Léauté
3c3db7229c
Merge pull request #1355 from himanshug/long_max_min_aggregators
...
Long max/min aggregators
2015-05-13 12:08:11 -07:00
Himanshu Gupta
cebb550796
additional UTs for [DoubleMax/DoubleMin] aggregation
2015-05-13 09:25:41 -05:00
Himanshu Gupta
d0ec945129
adding aliases doubleMax and doubleMin for max and min respectively
...
renamed all [Max/Min]*.java to [DoubleMax/DoubleMin]*.java and created [Max/Min]AggregatorFactory.java which can be removed when we dont need the min/max aggregator type backward compatibility
2015-05-13 09:25:41 -05:00
Himanshu Gupta
2de38f7d29
UTs for long[Max/Min] aggregation
2015-05-13 09:25:22 -05:00
Himanshu Gupta
00436f93e2
long max/min aggregators implementation
2015-05-13 09:25:22 -05:00
fjy
7a6acf5c1b
update pom to 0.8
2015-05-11 19:41:58 -06:00
Xavier Léauté
33265d63e1
Merge pull request #1262 from metamx/fix-null-dimension
...
fix handling of dimension having only null values
2015-05-06 13:51:26 -07:00
nishant
34be1e96fa
fix NPE
...
review comments
Add test
fix test for java8
2015-05-05 23:11:13 +05:30
Neo
8f8400e24e
fix handling of dimension having only null values
...
fixes #1211
fix value matcher
more improvements
more fixes for partial null column
fix handling of dimension having only null values
fixes #1211
fix value matcher
more improvements
more fixes for partial null column
review comment
IndexMaker speedups
* About 15% speedup
Conflicts:
processing/src/main/java/io/druid/segment/IndexMaker.java
fix handling of dimension having only null values
fixes #1211
fix value matcher
more improvements
more fixes for partial null column
fix handling of dimension having only null values
fixes #1211
fix value matcher
more improvements
more fixes for partial null column
review comment
review comments
review comment
fix failing tests
review comment
fix compilation
2015-05-04 22:07:45 +05:30
nishant
50158357ff
fixes #1330
...
fixes #1330 ,
Avoid creating Period instance as creating a Period from Long.MAX_VALUE
throws arithmetic exception.
After this query metric will emit duration in seconds instead of
minutes.
2015-05-04 20:34:28 +05:30
Xavier Léauté
721505c017
Merge pull request #1208 from druid-io/rework-metrics
...
Schemaless metrics + additional metrics for things we care about
2015-04-27 15:04:54 -07:00
fjy
963e5765bf
Schemaless metrics + additional metrics for things we care about
2015-04-27 13:39:40 -07:00
Charles Allen
27016c0289
Fix IndexIO segment validator to account for timestamp mismatches.
2015-04-27 12:42:16 -07:00
Charles Allen
633fdb029e
Add option to ConvertSegmentTask to skip validation
...
* Validation is enabled by default
2015-04-27 08:37:55 -07:00
Charles Allen
303727e6a9
IndexMaker speedups
...
* About 15% speedup
Conflicts:
processing/src/main/java/io/druid/segment/IndexMaker.java
2015-04-23 13:19:21 -07:00
Charles Allen
f2300430d1
Cleanup some code in index creation.
...
* Add some unit tests
* Add io.druid.segment.IndexMerger.reprocess for quick re-indexing of data
* Add dim-value validation to validation checker (instead of ONLY index #)
* General code refactoring to make things a little easier to read
2015-04-23 12:41:42 -07:00
Xavier Léauté
7939f43681
Merge pull request #1296 from druid-io/limit-test
...
Add test for order by metric and limit across multiple days
2015-04-22 11:28:06 -07:00
fjy
97d87a06d0
Add another test for limit across multiple days
2015-04-22 11:27:37 -07:00
Fangjin Yang
28f69d6bd3
Merge pull request #1299 from metamx/improve-filter-datasource-metadata
...
Improve filtering of segments for dataSourceMetadataQuery
2015-04-22 11:07:35 -07:00
Xavier Léauté
a0a28de551
fix serde issue when pulling timestamps from cache
2015-04-22 11:03:26 -07:00
Xavier Léauté
2b4406671e
Merge pull request #1301 from druid-io/fix-type
...
fix count agg factory type
2015-04-21 09:24:20 -07:00
fjy
7805357ab1
fix count agg factory type
2015-04-21 09:23:04 -07:00
nishant
bb8c0cb50b
Improve filtering of segments for dataSourceMetadataQuery
...
dataSourceMetadataQuery only needs to be executed on latest segments at
present, modify filterSegments and add test.
2015-04-21 09:31:13 +05:30
Xavier Léauté
f73f14ab91
Merge pull request #1297 from metamx/versionConverterTaskUpdates
...
Update VersionConverterTask for IndexSpec and allowing Forced updates
2015-04-20 16:44:35 -07:00
Charles Allen
7479ac9012
Update VersionConverterTask for IndexSepc and allowing Forced updates
2015-04-20 16:17:06 -07:00
fjy
d260515a43
update druid-api version
2015-04-17 14:58:35 -07:00
Bingkun Guo
cf155e4eba
Fix an issue that after broker forwards GroupByQuery to historical, havingSpec is still applied
...
on postAggregations which are removed in the forwarded query.
Add a unit test to replicate the issue.
Add a query that can replicate this issue into integration test.
2015-04-17 13:00:41 -05:00
fjy
f0a19349bf
fix up some comments for contributed test
2015-04-16 15:07:09 -07:00
Fangjin Yang
90b17a5259
Merge pull request #1285 from venkateshk/limitspec-tests
...
Unit test to surface bug with limit-spec order by over specific query intervals
2015-04-16 13:52:58 -07:00
Xavier Léauté
1d153674b6
remove overzealous check for backwards compatibility
2015-04-15 22:11:55 -07:00
Xavier Léauté
ea5572d001
Merge pull request #1271 from metamx/strictErrorChecking
...
Add stricter checking for potential coding errors
2015-04-15 15:21:41 -07:00
Charles Allen
abdeaa0746
Add stricter checking for potential coding errors
...
Can use via `mvn clean compile test-compile -P strict'
2015-04-15 14:52:25 -07:00
vkavuluri
a2ba5b6183
Unit test to surface bug with limit-spec order by over specific query intervals
2015-04-15 06:31:22 -07:00
Xavier Léauté
3a3046ccf3
add support for dimension compression
...
- compression for single-value dimensions using CompressedVSizeIntsIndexedSupplier
- makes dimension compression configurable via IndexSpec
- IndexSpec also enables configuring bitmap and metric compression
2015-04-14 10:44:18 -07:00
Xavier Léauté
bafc5114b4
add toString, equals, and hashCode to BitmapSerdeFactory
2015-04-14 10:44:18 -07:00
Xavier Léauté
d20128b89b
add compressed variable-size ints column type
2015-04-14 10:44:18 -07:00
Xavier Léauté
ce928d9636
add compressed ints column type
2015-04-14 10:44:17 -07:00
Xavier Léauté
5c23679238
add WritableSupplier and IndexedMultivalue
2015-04-14 10:44:17 -07:00
Xavier Léauté
1abb9cce7c
make IndexedInts closeable + add fill method
2015-04-14 10:44:17 -07:00
Xavier Léauté
ed0d49933e
fix memory leak in CompressedXXXIndexedSupplierTest
2015-04-14 10:44:16 -07:00
Xavier Léauté
6790e6cf0f
add fromList to CompressedLongsIndexedSupplier
2015-04-14 10:44:16 -07:00
Eric Tschetter
7517f0d0f0
Add some javadoc to the two Query processing interfaces to help aid in implementations of new Queries.
...
Also, remove some comments that did not have enough context to actually make sense to anyone but the original author (at least, I hope they make sense to the author, I definitely don't know what was being said).
2015-04-09 18:11:42 -07:00
Fangjin Yang
208e307915
Merge pull request #1251 from metamx/uriSegmentLoaders
...
Revert "Revert "Overhaul of SegmentPullers to add consistency and retries""
2015-03-30 17:43:51 -07:00
fjy
aea7f9d192
[maven-release-plugin] prepare for next development iteration
2015-03-30 16:35:24 -07:00
fjy
060d7aef03
[maven-release-plugin] prepare release druid-0.7.1
2015-03-30 16:35:20 -07:00
Charles Allen
1c6cbea89c
Revert "Revert "Overhaul of SegmentPullers to add consistency and retries""
...
This reverts commit f904bc7858
.
2015-03-30 13:40:04 -07:00
Fangjin Yang
f904bc7858
Revert "Overhaul of SegmentPullers to add consistency and retries"
2015-03-30 13:15:50 -07:00
Charles Allen
6d407e8677
Add URI handling to SegmentPullers
...
* Requires https://github.com/druid-io/druid-api/pull/37
* Requires https://github.com/metamx/java-util/pull/22
* Moves the puller logic to use a more standard workflow going through java-util helpers instead of re-writing the handlers for each impl
* General workflow goes like this: 1) LoadSpec makes sure the correct Puller is called with the correct parameters. 2) The Puller sets up general information like how to make an InputStream, how to find a file name (for .gz files for example), and when to retry. 3) CompressionUtils does most of the heavy lifting when it can
2015-03-30 12:33:23 -07:00
Fangjin Yang
e5653f0752
Merge pull request #1190 from vigiglobe/master
...
Fix NPE when partionNumber 0 does not exist.
2015-03-26 13:25:39 -07:00
Xavier Léauté
389ea4c32f
Merge pull request #1245 from b-slim/fix_injector_plus_ut
...
Bug fix @DruidSecondaryModule plus unit test
2015-03-26 10:04:44 -07:00
Fangjin Yang
a9c47de571
Merge pull request #1243 from metamx/fix-union-timeline-lookup
...
fixes TimeboundaryQuery and DataSourceMetadata queries returning wrong values for union queries
2015-03-26 10:02:56 -07:00
Slim Bouguerra
1e6be7796e
bug fix @DruidSecondaryModule plus unit test
2015-03-26 10:44:52 -05:00
nishantmonu51
638bf9d4e9
return sorted List of TimeLineObjectHolder
2015-03-26 11:51:09 +05:30
msprunck
942c17a2aa
Remove timeline chunk count assumptions.
...
* Replace with generic iterables
2015-03-24 22:40:49 +01:00
Prajwal Tuladhar
9983216871
use https maven repo URL to download dependencies
2015-03-20 14:09:07 -04:00
fjy
b389cfe404
[maven-release-plugin] prepare for next development iteration
2015-03-19 12:38:17 -07:00
fjy
60e7d543cc
[maven-release-plugin] prepare release druid-0.7.1-rc1
2015-03-19 12:38:13 -07:00
nishantmonu51
39e60b3405
fix race in groupByParallelQueryRunner
...
add UT and use a queue for better concurrency
2015-03-17 20:57:05 +05:30
Xavier Léauté
127b6fd857
Merge pull request #1172 from himanshug/segment_metadata_eager
...
force eager the processing of segment metadata query on the processing executor
2015-03-12 10:19:48 -07:00
Xavier Léauté
0a5a3fe2dc
fix file missing from rebase
2015-03-11 17:30:11 -07:00
Xavier Léauté
e01ed16030
serde tests + equals/hashCode fixes for extraction functions
2015-03-11 16:48:28 -07:00
Xavier Léauté
d3f5bddc5c
Add ability to apply extraction functions to the time dimension
...
- Moves DimExtractionFn under a more generic ExtractionFn interface to
support extracting dimension values other than strings
- pushes down extractionFn to the storage adapter from query engine
- 'dimExtractionFn' parameter has been deprecated in favor of 'extractionFn'
- adds a TimeFormatExtractionFn, allowing to project the '__time' dimension
- JavascriptDimExtractionFn renamed to JavascriptExtractionFn, adding
support for any dimension value types that map directly to Javascript
- update documentation for time column extraction and related changes
2015-03-11 16:45:42 -07:00
Himanshu Gupta
55ebf0cfdf
force eager the processing of segment metadata query on the processing threadpool by using ChainedExecutionQueryRunner in SegmentMetadataQueryRunnerFactory.mergeRunners(..)
2015-03-11 12:58:58 -05:00
Xavier Léauté
217e674063
Handling aggregators and post aggregators with duplicate names
...
* add test for same-name groupBy hyperUniques post-agg
* add test for same-name post-agg in groupby with approx histogram
* Fixes https://github.com/druid-io/druid/issues/1045
* Throws an error if post aggs and aggs do not have unique names
* Add more groupBy tests for Having filters
2015-03-10 17:10:43 -07:00
Fangjin Yang
0b467624ec
Merge pull request #694 from druid-io/arithmetic-op-strategies
...
normal division & configurable ordering for ArithmeticPostAggregator
2015-03-10 13:48:27 -07:00
Fangjin Yang
2abdce1dc0
Merge pull request #1180 from metamx/logging-groupBy-NPE
...
add null check early to catch root cause for groupBy NPE while running bySegment query
2015-03-09 09:16:33 -07:00
nishantmonu51
6e935cca0a
add null check early to catch root cause
2015-03-09 21:10:28 +05:30
Xavier Léauté
0d47c0c36d
normal division and configurable ordering for ArithmeticPostAggregator
...
Fixes #510
2015-03-04 12:44:24 -08:00
Fangjin Yang
d685e2ab04
Merge pull request #1165 from friedhardware/fix-NPerror-select
...
Added null check for the pagingSpec on a Select Query.
2015-03-02 14:17:06 -08:00
Fangjin Yang
e8605c63a9
Merge pull request #1150 from himanshug/broker-parallel-chunk-process
...
interval chunk query runner now processes individual chunk in a threadpool
2015-03-02 13:50:23 -08:00
Himanshu Gupta
29039fd541
interval chunk query runner now processes individual chunk in a thread pool and prints metrics query/time per chunk
2015-03-02 15:45:09 -06:00
Joshua Schumacher
e6130e0fdc
Added null check for the pagingSpec on a Select Query.
2015-03-02 12:41:59 -08:00
Fangjin Yang
005f4da2c0
Merge pull request #1143 from metamx/update-rhino-1.7rc5
...
Update Rhino to 1.7RC5
2015-02-25 12:50:23 -08:00
Xavier Léauté
b167dcf82c
[maven-release-plugin] prepare for next development iteration
2015-02-23 14:28:06 -08:00
Xavier Léauté
e81ac2ba43
[maven-release-plugin] prepare release druid-0.7.0
2015-02-23 14:27:58 -08:00
James Estes
562de6c621
Update docs and examples for log4j2 usage.
...
- Put configs early in classpath in examples so log4j2.xml will get picked up properly
- Add an example log4j2.xml file.
- Update Logging doc.
2015-02-19 11:40:56 -07:00
Xavier Léauté
c4d721fffd
update Rhino to 1.7RC5
2015-02-19 09:48:18 -08:00
Xavier Léauté
78df7f6165
Move Druid release artifacts to Sonatype
...
- Switch to using Druid parent POM
- Add required fields for Sonatype
- Common plugin versions and settings have been moved to the parent pom
- Cleanup artifacts and POMs for consistent formatting
- Remove org.hyperic.sigar dependency and update docs to reflect necessary jars to add at runtime when sigar is needed
2015-02-13 14:26:31 -08:00
fjy
d29740ed9f
[maven-release-plugin] prepare for next development iteration
2015-02-12 16:16:00 -08:00
fjy
211fd15b7e
[maven-release-plugin] prepare release druid-0.7.0-rc3
2015-02-12 16:15:56 -08:00
Fangjin Yang
90bc62eb5c
Merge pull request #1108 from metamx/improve-groupby-perf
...
Improve groupby by removing conversion to case insensitive row
2015-02-12 11:45:20 -08:00
nishantmonu51
15cf432b74
remove conversion to case insensitive row
...
this is not required after death to casing in 0.7
2015-02-11 19:40:36 +05:30
Xavier Léauté
c5e99bf6ec
Merge pull request #1105 from metamx/fixEmptyExtractionFilter
...
Fix empty results on ExtractionFilter.
2015-02-10 14:25:58 -08:00
Charles Allen
b9cb311a52
Fix empty results on ExtractionFilter.
...
* Now returns empty results rather than erroring out
* Added unit tests for multiples case
2015-02-10 14:04:38 -08:00
fjy
708759e1e0
Update http-client to 1.0.0
2015-02-10 13:36:47 -08:00
Xavier Léauté
a7dcaffb53
fix `__time` column selector for incremental index
...
- also adds tests for selecting the time column
2015-02-06 12:06:05 -08:00
Fangjin Yang
42e902b6e3
Merge pull request #1090 from metamx/alphanum-attribution
...
update code attribution
2015-02-04 15:51:34 -08:00
Xavier Léauté
0fbc6071c9
update code attribution
2015-02-04 15:28:44 -08:00
Fangjin Yang
25cf15824b
Merge pull request #1085 from gianm/dsmrv-fix
...
DataSourceMetadataResultValue fixes and JodaUtils adjustments.
2015-02-03 17:51:33 -08:00
Gian Merlino
085ad8d345
Fix DataSourceMetadataResultValue serde.
2015-02-03 17:39:42 -08:00
fjy
1f12c5b2f1
[maven-release-plugin] prepare for next development iteration
2015-02-03 12:06:49 -08:00
fjy
e82d431be7
[maven-release-plugin] prepare release druid-0.7.0-rc2
2015-02-03 12:06:41 -08:00
Xavier Léauté
4eff269536
Merge pull request #1079 from druid-io/cleanup-deps
...
Remove non friendly dependencies from Druid
2015-02-03 11:56:41 -08:00
fjy
3e5d338c8e
Remove non friendly dependencies from Druid
2015-02-03 11:36:08 -08:00
Fangjin Yang
71b4c5fa86
Merge pull request #1076 from metamx/remove-threadlocals
...
remove thread-locals in GenericIndexed in favor of wrapped objects
2015-02-02 20:02:33 -08:00
Xavier Léauté
cb2e300eba
remove thread-locals in GenericIndexed in favor of wrapped objects to reduce GC pressure
2015-02-02 15:59:30 -08:00
Eric Tschetter
42eba986ce
Towards consistent null handling
...
This commit also includes
1) the addition of a context parameter on timeseries queries that allows it to ignore empty buckets instead of generating results for them
2) A cleanup of an unused method on an interface
2015-02-02 12:53:07 -08:00
Fangjin Yang
92e616de11
Merge pull request #1077 from metamx/remove-unused-imports
...
remove unused imports
2015-02-02 10:45:27 -08:00
nishantmonu51
ba932bb1f2
remove unused imports
2015-02-02 21:53:39 +05:30
fjy
d05032b98a
towards a community led druid
2015-01-31 20:57:36 -08:00
Xavier Léauté
f24a89a22a
fix NPE for topN over missing hyperUniques column
2015-01-27 16:12:41 -08:00
Charles Allen
226dd91a31
Add a hash map for storing groupBy partition index
...
* Improves groupBy performance by approx 15%
2015-01-26 08:42:02 -08:00
fjy
1f94de22c6
[maven-release-plugin] prepare for next development iteration
2015-01-20 14:23:55 -08:00
fjy
17476edc31
[maven-release-plugin] prepare release druid-0.7.0-rc1
2015-01-20 14:23:51 -08:00