druid

Commit Graph

Author	SHA1	Message	Date
Himanshu	45da7e48f1	groupBy sort results by (dimensions,timestamp) instead of (timestamp,dimension) (#3672 ) * sortByDimsFirst flag for groupBy query * Remove need for KeyType in Grouper<KeyType> to be Comparable<KeyType> * fix review comments * fix review comments regarding removing code duplication of dim/time comparison * move comparator for KeyType object to KeySerdeFactory so that creation of comparator does not need KeySerde * remove unnecessary system.out.println * make access static var NATURAL_NULLS_FIRST directly * further review comments addressing	2016-12-06 09:48:56 -08:00
Gian Merlino	353fee79dd	Add "asMillis" option to "timeFormat" extractionFn. (#3733 ) This is useful for chaining extractionFns that all want to treat time as millis, such as having a javascript extractionFn after a timeFormat.	2016-12-02 13:45:16 -08:00
Gian Merlino	102375d9bb	Add "strlen" extractionFn. (#3731 )	2016-12-02 12:08:51 -08:00
Gian Merlino	4c5d10f8a3	Add DimFilterHavingSpec. (#3727 ) * Add DimFilterHavingSpec. * Add test for DimFilterHavingSpec with extractionFns.	2016-12-02 10:04:30 -08:00
Gian Merlino	e4465e63bd	Fix ordering of sections on dimensionspecs.md. (#3722 ) The Filtered and List DimensionSpecs were mixed in with the extraction functions.	2016-11-29 16:28:36 -08:00
Joan Viladrosa	2df98bcaa6	Fixed Missing commas in json example of Lookup (#3680 )	2016-11-15 14:56:18 +09:00
Gian Merlino	2c504b6258	Add "like" filter. (#3642 ) * Add "like" filter. * Addressed some PR comments. * Slight simplifications to LikeFilter. * Additional simplifications. * Fix comment in LikeFilter. * Clarify comment in LikeFilter. * Simplify LikeMatcher a bit. * No use going through the optimized path if prefix is empty. * Add more tests.	2016-11-04 23:25:03 +05:30
Gian Merlino	f8d71fc602	groupBy: Fix maxMergingDictionarySize config. (#3488 )	2016-09-22 10:02:33 -07:00
Gian Merlino	e0e28866ee	JavaScript docs: Fix links and typos, add to TOC. (#3457 )	2016-09-13 15:26:44 -07:00
Gian Merlino	76a24054e3	JavaScript docs, including docs for globals. (#3454 )	2016-09-13 13:46:55 -07:00
Gian Merlino	1344e3c3af	Clearer filter docs. (#3448 )	2016-09-09 13:47:13 -07:00
Gian Merlino	1e3f94237e	groupBy v2: Configurable load factor. (#3437 ) Also change defaults: - bufferGrouperMaxLoadFactor from 0.75 to 0.7. - maxMergingDictionarySize to 100MB from 25MB, should be more appropriate for most heaps.	2016-09-07 14:14:59 -05:00
Gian Merlino	e9050c2b4c	TimeFormatExtractionFn: Allow null formats (equivalent to ISO8601) and granular bucketing. (#3411 )	2016-08-31 20:58:53 +05:30
Jonathan Wei	4e91330a17	Use DimensionSpec in CardinalityAggregatorFactory (#3406 ) * Use DimensionSpec in CardinalityAggregatorFactory * Address PR comments * Fix requiredFields()	2016-08-30 15:54:02 -07:00
rajk-tetration	362b9266f8	Adding filters for TimeBoundary on backend (#3168 ) * Adding filters for TimeBoundary on backend Signed-off-by: Balachandar Kesavan <raj.ksvn@gmail.com> * updating TimeBoundaryQuery constructor in QueryHostFinderTest * add filter helpers * update filterSegments + test * Conditional filterSegment depending on whether a filter exists * Style changes * Trigger rebuild * Adding documentation for timeboundaryquery filtering * added filter serialization to timeboundaryquery cache * code style changes	2016-08-15 10:25:24 -07:00
Gian Merlino	8899affe48	Introduce standardized "Resource limit exceeded" error. (#3338 ) Fixes #3336.	2016-08-09 10:50:56 -07:00
Gian Merlino	21bce96c4c	More useful query errors. (#3335 ) Follow-up to #1773, which meant to add more useful query errors but did not actually do so. Since that patch, any error other than interrupt/cancel/timeout was reported as `{"error":"Unknown exception"}`. With this patch, the error fields are: - error, one of the specific strings "Query interrupted", "Query timeout", "Query cancelled", or "Unknown exception" (same behavior as before). - errorMessage, the message of the topmost non-QueryInterruptedException in the causality chain. - errorClass, the class of the topmost non-QueryInterruptedException in the causality chain. - host, the host that failed the query.	2016-08-09 16:14:52 +08:00
Jonathan Wei	decefb7477	Add time interval dim filter and retention analysis example (#3315 ) * Add time interval dim filter and retention analysis example * Use closed-open matching for intervals, update cache key generation * Fix time filtering tests for interval boundary change	2016-08-05 07:25:04 -07:00
kaijianding	50d52a24fc	ability to not rollup at index time, make pre aggregation an option (#3020 ) * ability to not rollup at index time, make pre aggregation an option * rename getRowIndexForRollup to getPriorIndex * fix doc misspelling * test query using no-rollup indexes * fix benchmark fail due to jmh bug	2016-08-02 11:13:05 -07:00
Dave Li	bc20658239	groupBy nested query using v2 strategy (#3269 ) * changed v2 nested query strategy * add test for #3239 * update for new ValueMatcher interface and add benchmarks * enable time filtering * address PR comments * add failing test for outer filter aggregator * add helper class for sharing code * update nested groupby doc * move temporary storage instantiation * address PR comment * address PR comment 2	2016-08-01 18:30:39 -07:00
Jonathan Wei	a6105cbb86	Add numeric StringComparator (#3270 ) * Add numeric StringComparator * Only use direct long comparison for numeric ordering in BoundFilter, add time filtering benchmark query * Address PR comments, add multithreaded BoundDimFilter test * Add comment on strlen tie handling * Add timeseries interval filter benchmark * Adjust docs * Use jackson for StringComparator, address PR comments * Add new TopNMetricSpec and SearchSortSpec with tests (WIP) * More TopNMetricSpec and SearchSortSpec tests * Fix NewSearchSortSpec serde * Update docs for new DimensionTopNMetricSpec * Delete NumericDimensionTopNMetricSpec * Delete old SearchSortSpec * Rename NewSearchSortSpec to SearchSortSpec * Add TopN numeric comparator benchmark, address PR comments * Refactor OrderByColumnSpec * Add null checks to NumericComparator and String->BigDecimal conversion function * Add more OrderByColumnSpec serde tests	2016-07-29 15:44:16 -07:00
Gian Merlino	2553997200	Associate groupBy v2 resources with the Sequence lifecycle. (#3296 ) This fixes a potential issue where groupBy resources could be allocated to create a Sequence, but then the Sequence is never used, and thus the resources are never freed. Also simplifies how groupBy handles config overrides (this made the new unit test easier to write).	2016-07-27 18:44:19 -07:00
kaijianding	3dc2974894	Add timestampSpec to metadata.drd and SegmentMetadataQuery (#3227 ) * save TimestampSpec in metadata.drd * add timestampSpec info in SegmentMetadataQuery	2016-07-25 15:45:30 -07:00
Jonathan Wei	a42ccb6d19	Support filtering on long columns (including __time) (#3180 ) * Support filtering on __time column * Rename DruidPredicate * Add docs for ValueMatcherFactory, add comment on getColumnCapabilities * Combine ValueMatcherFactory predicate methods to accept DruidCompositePredicate * Address PR comments (support filter on all long columns) * Use predicate factory instead of composite predicate * Address PR comments * Lazily initialize long handling in selector/in filter * Move long value parsing from InFilter to InDimFilter, make long value parsing thread-safe * Add multithreaded selector/in filter test * Fix non-final lock object in SelectorDimFilter	2016-07-20 17:08:49 -07:00
Gian Merlino	3ab4a4efbc	Fix formatting in granularities doc. (#3229 )	2016-07-08 09:29:58 -07:00
Gian Merlino	fdc7e88a7d	Allow queries with no aggregators. (#3216 ) This is actually reasonable for a groupBy or lexicographic topNs that is being used to do a "COUNT DISTINCT" kind of query. No aggregators are needed for that query, and including a dummy aggregator wastes 8 bytes per row. It's kind of silly for timeseries, but why not.	2016-07-06 20:38:54 +05:30
jaehong choi	efbcbf5315	Support alphanumeric sort in search query (#2593 ) * support alphanumeric sort in search query * address a comment about handling equals() and hashCode() * address comments * add Ut for string comparators * address a comment about space indentations.	2016-06-28 15:06:18 -07:00
Gian Merlino	4cc39b2ee7	Alternative groupBy strategy. (#2998 ) This patch introduces a GroupByStrategy concept and two strategies: "v1" is the current groupBy strategy and "v2" is a new one. It also introduces a merge buffers concept in DruidProcessingModule, to try to better manage memory used for merging. Both of these are described in more detail in #2987. There are two goals of this patch: 1. Make it possible for historical/realtime nodes to return larger groupBy result sets, faster, with better memory management. 2. Make it possible for brokers to merge streams when there are no order-by columns, avoiding materialization. This patch does not do anything to help with memory management on the broker when there are order-by columns or when there are nested queries. That could potentially be done in a future patch.	2016-06-24 18:06:09 -07:00
Dave Li	12be1c0a4b	Add bucket extraction function (#3033 ) * add bucket extraction function * add doc and header * updated doc and test	2016-06-17 09:24:27 -07:00
linbo.jin	8c76fe7b97	docs: change OR to AND inside query docs about multi-value dims (#3162 ) * docs: replace OR by AND inside topnquery docs about multi value dimensions * docs: replace OR by AND inside groupby docs about multi value dimensions	2016-06-17 08:54:18 -07:00
Kirill Kozlov	9f93be448e	Fix logical operator in example (#3093 )	2016-06-07 10:44:18 -07:00
Gian Merlino	99ee3f4dc3	Fixups, clarifications to lookup docs. (#3060 )	2016-06-07 10:43:35 -07:00
Charles Allen	fa41a6466a	Cleanup the base lookup cluster wide config docs (#3061 ) * Cleanup the base lookup cluster wide config docs * Add better examples in lookups-cached-global.md * Use actual valid stock lookups * Fixed maps with : * Add mix of lookups * Better examples in extension * Remove unneeded namespace requirement * Add extra line space * Add link to lookup tiers * Renamed header	2016-06-07 10:42:41 -07:00
Charles Allen	8cac710546	Async lookups-cached-global by default (#3074 ) * Async lookups-cached-global by default * Also better lookup docs * Fix test timeouts * Fix timing of deserialized test * Fix problem with 0 wait failing immediately	2016-06-03 15:58:10 -05:00
Gian Merlino	603fbbcc20	Fix docs for "contains" search spec. (#3066 )	2016-06-02 19:03:40 -07:00
Vadim Ogievetsky	13c267bfee	Added new line for site formatting (#3059 )	2016-06-02 11:36:45 -07:00
Vadim Ogievetsky	767190d5db	Clear up confusing wording (#3052 ) There is no such thing as a "Java aggregator" in Druid from a user's point of view, there are just native aggregator that happen to be implemented in Java.	2016-06-01 15:41:50 -07:00
Charles Allen	eaaad01de7	[QTL] Datasource as lookupTier (#2955 ) * Datasource as lookup tier * Adds an option to let indexing service tasks pull their lookup tier from the datasource they are working for. * Fix bad docs for lookups lookupTier * Add Datasource name holder * Move task and datasource to be pulled from Task file * Make LookupModule pull from bound dataSource * Fix test * Fix code style on imports * Fix formatting * Make naming better * Address code comments about naming	2016-05-17 15:44:42 -07:00
Parag Jain	e3ea842cd3	add available query granularity strings (#2960 )	2016-05-12 18:49:31 -07:00
Himanshu	8e2742b7e8	adding QueryGranularity to segment metadata and optionally expose same from segmentMetadata query (#2873 )	2016-05-03 11:31:10 -07:00
Slim	55785267e4	postAgg filedName must match name of AGG (#2874 )	2016-04-22 11:11:54 -07:00
Himanshu	3cfd9c64c9	make singleThreaded groupBy query config overridable at query time (#2828 ) * make isSingleThreaded groupBy query processing overridable at query time * refactor code in GroupByMergedQueryRunner to make processing of single threaded and parallel merging of runners consistent	2016-04-21 17:12:58 -07:00
Slim	984a518c9f	Merge pull request #2734 from b-slim/LookupIntrospection2 [QTL][Lookup] adding introspection endpoint	2016-04-21 12:15:57 -05:00
Gian Merlino	e320d13385	Fix various broken links in the docs. (#2833 )	2016-04-13 13:30:01 -07:00
Charles Allen	2b99f717e4	Move lookup config doc to proper location	2016-04-08 08:15:38 -07:00
Charles Allen	f915a59138	Merge pull request #2691 from metamx/lookupExtrFn Add ExtractionFn to LookupExtractor bridge	2016-04-06 09:13:08 -07:00
jon-wei	0e481d6f93	Allow filters to use extraction functions	2016-04-05 13:24:56 -07:00
Fangjin Yang	eea7a47870	Merge pull request #2576 from navis/paging-from-next Add option for select query to get next page without modifying returned paging identifiers	2016-04-01 13:50:36 -07:00
navis.ryu	077522a46f	stringFormat extractionFn should be able to return null on null values (Fix for #2706 )	2016-04-01 13:40:56 +09:00
navis.ryu	29bb00535b	Add option for select query to get next page without modifying returned paging identifiers	2016-04-01 09:03:03 +09:00
Gian Merlino	1853f36e9f	More consistent empty-set filtering behavior on multi-value columns. The behavior is now that filters on "null" will match rows with no values. The behavior in the past was inconsistent; sometimes these filters would match and sometimes they wouldn't. Adds tests for this behavior to SelectorFilterTest and BoundFilterTest, for query-level filters and filtered aggregates. Fixes #2750.	2016-03-29 15:32:13 -07:00
Charles Allen	4764e86409	Add docs for RegisteredDimensionExtractionFn	2016-03-28 13:27:49 -07:00
Charles Allen	ab324e4ac0	Move lookup docs that are in druid-proper back into lookups.md	2016-03-25 10:46:50 -07:00
Fangjin Yang	a5d5529749	Merge pull request #2711 from gianm/filtered-aggregator-impls All Filters should work with FilteredAggregators.	2016-03-23 13:37:21 -07:00
Gian Merlino	dd86198902	All Filters should work with FilteredAggregators. This removes Filter.makeMatcher(ColumnSelectorFactory) and adds a ValueMatcherFactory implementation to FilteredAggregatorFactory so it can take advantage of existing makeMatcher(ValueMatcherFactory) implementations. This patch also removes the Bound-based method from ValueMatcherFactory. Its only user was the SpatialFilter, which could use the Predicate-based method. Fixes #2604.	2016-03-23 12:24:01 -07:00
Gian Merlino	2dfd3877c0	Fix a bunch of broken links in the docs.	2016-03-23 10:21:28 -07:00
Fangjin Yang	d1f8f2b2fd	Merge pull request #2698 from druid-io/fix-ext-docs refactor extensions into their own docs	2016-03-22 22:04:12 -07:00
fjy	943cbe6e76	refactor extensions into their own docs	2016-03-22 18:54:10 -07:00
Fangjin Yang	041350c31b	Merge pull request #2701 from gianm/mvd-docs Improved docs for multi-value dimensions.	2016-03-22 18:09:37 -07:00
Gian Merlino	451c0bc6d8	Merge pull request #2702 from pjain1/improve_docs how to query in the querying section, correct default for select strategy, formatting	2016-03-22 16:40:35 -07:00
Parag Jain	39ecb9929d	how to query, correct default for select strategy, formatting	2016-03-22 17:06:15 -05:00
Gian Merlino	ff25325f3b	Improved docs for multi-value dimensions. - Add central doc for multi-value dimensions, with some content from other docs. - Link to multi-value dimension doc from topN and groupBy docs. - Fixes a broken link from dimensionspecs.md, which was presciently already linking to this nonexistent doc. - Resolve inconsistent naming in docs & code (sometimes "multi-valued", sometimes "multi-value") in favor of "multi-value".	2016-03-22 14:40:55 -07:00
Nishant	11b8d1ed70	Merge pull request #2686 from gianm/fix-analysistypes-docs Fix analysisTypes docs for SegmentMetadataQuery.	2016-03-18 16:15:38 -07:00
Gian Merlino	76ae30604e	Fix analysisTypes docs for SegmentMetadataQuery.	2016-03-18 13:17:33 -07:00
Charles Allen	5da9a280b6	Query Time Lookup - Dynamic Configuration	2016-03-18 09:45:05 -07:00
Slim	cf342d8d3c	Merge pull request #2517 from b-slim/adding_lookup_snapshot_utility [QTL][Lookup] lookup module with the snapshot utility	2016-03-17 11:39:47 -05:00
Slim Bouguerra	0c86b29ef0	lookup module with the snapshot utility	2016-03-17 09:20:41 -05:00
Fangjin Yang	8cea85816d	Merge pull request #2668 from navis/fix-document-selectquery Document for search query was not updated properly (Fix for #2662)	2016-03-15 20:34:27 -07:00
navis.ryu	71ee9e2aac	Document for search query is not updated properly (Fix for #2662 )	2016-03-16 09:22:26 +09:00
dclim	553b677971	caching doc fix	2016-03-15 17:09:33 -06:00
Fangjin Yang	dbdbacaa18	Merge pull request #2260 from navis/cardinality-for-searchquery Support cardinality for search query	2016-03-14 13:24:40 -07:00
navis.ryu	be341bf4e3	Support cardinality for search query (Fix for #2260 )	2016-03-12 09:51:01 +09:00
Himanshu Gupta	ca5de3f583	only allow lowering maxResults and maxIntermediateRows from groupBy query context	2016-03-08 15:03:59 -06:00
Himanshu Gupta	099acb4966	allow groupBy max[Intermediate]Rows limit be overridable by context	2016-03-07 15:22:41 -06:00
jon-wei	fd3782522c	Rename 'replaceMissingValues...' parameters in RegexExtractionFn	2016-02-24 13:12:56 -08:00
Fangjin Yang	083f019a48	Merge pull request #2465 from druid-io/more-doc-fix more doc fixes	2016-02-17 11:00:38 -08:00
fjy	7da6594bfe	more doc fixes	2016-02-17 09:43:47 -08:00
Jonathan Wei	d63eec65a1	Merge pull request #2208 from navis/metadataquery-minmax Support min/max values for metadata query	2016-02-11 17:28:07 -08:00
navis.ryu	dd2375477a	Support min/max values for metadata query (#2208 )	2016-02-12 09:35:58 +09:00
navis.ryu	4d63196535	Support dimension spec for select query	2016-02-12 08:54:28 +09:00
Slim	368988d187	Merge pull request #2291 from druid-io/lookupManager Promoting LookupExtractor state and LookupExtractorFactory to be a first class druid state object.	2016-02-11 16:07:27 -06:00
Slim Bouguerra	438a4a9970	fix docs about search query limit	2016-02-11 13:20:59 -06:00
Slim Bouguerra	4e119b7a24	Adding lookup ref manager and lookup dimension spec impl	2016-02-11 12:11:51 -06:00
fjy	003f54e268	add doc rendering	2016-02-04 14:21:59 -08:00
fjy	1aa363cea7	new quickstart	2016-02-04 09:37:38 -08:00
navis.ryu	55a888ea2f	time-descending result of select queries	2016-01-29 10:06:05 +09:00
Rafael Abbondanza	145c65c72d	Updates number of parts in a topN query This threw me off a bit, so I'm sure it may throw others off, too. Updating from 10 to 11 parts.	2016-01-25 10:29:25 -05:00
Gian Merlino	d416279c14	SegmentMetadataQuery support for returning aggregators.	2016-01-21 17:27:25 -08:00
Gian Merlino	87c8046c6c	Add StorageAdapter#getColumnTypeName, and various SegmentMetadataQuery adjustments. SegmentMetadataQuery stuff: - Simplify implementation of SegmentAnalyzer. - Fix type names for realtime complex columns; this used to try to merge a nice type name (like "hyperUnique") from mmapped segments with the word "COMPLEX" from incremental index segments, leading to a merge failure. Now it always uses the nice name. - Add hasMultipleValues to ColumnAnalysis. - Add tests for both mmapped and incremental index segments. - Update docs to include errorMessage.	2016-01-21 15:50:33 -08:00
Slim Bouguerra	78feb3a13e	adding lower and upper extraction fn	2016-01-21 08:59:05 -06:00
zhxiaog	3459a202ce	fixed #1873 , add ability to express CONCAT as an extractionFn	2016-01-18 15:03:17 -08:00
Keuntae Park	238dd3be3c	support cascade execution of extraction filters in extraction dimension spec	2016-01-18 11:10:19 +09:00
navis.ryu	18479bb757	time-descending result of timeseries queries	2016-01-13 12:23:01 +09:00
binlijin	2751f785f8	add doc	2016-01-12 11:25:11 +08:00
Fangjin Yang	4cc81d3eff	Merge pull request #2096 from b-slim/add_use_case_unapply Add use case unapply	2016-01-06 21:58:12 -08:00
Slim Bouguerra	032d3bf6e6	Optimization of extraction filter by reversing the lookup	2016-01-06 11:16:11 -06:00
fjy	88f6b9b5ad	Multiple improvements for docs	2016-01-02 21:54:54 -08:00
Himanshu Gupta	e1ea93b6fc	documenting querying behavior on multi-valued dimensions	2015-12-30 18:14:25 -06:00
Himanshu Gupta	b47d807738	Add support for filtering at DimensionSpec level so that multivalued dimensions can be filtered correctly also adding UTs for multi-valued dimensions	2015-12-30 17:59:47 -06:00
Fangjin Yang	e14ad74088	Merge pull request #1936 from b-slim/between_range_with_predicat adding Upper/Lower Bound Filter	2015-12-29 10:11:22 -08:00

1 2 3 4

193 Commits