druid

Commit Graph

Author	SHA1	Message	Date
Clint Wylie	1181411901	small optimization in timeseries if 'skipEmptyBuckets' is true and cursor completed (#5178 )	2017-12-19 16:47:00 -06:00
Roman Leventov	f18eba50ee	Remove Aggregator.reset() (#5177 )	2017-12-19 14:09:17 -08:00
Roman Leventov	5787d04fad	Bump Druid version to 0.12.0 (#5138 )	2017-12-15 07:37:01 -08:00
Roman Leventov	64848c7ebf	DataSegment memory optimizations (#5094 ) * Deduplicate DataSegments contents (loadSpec's keys, dimensions and metrics lists as a whole) more aggressively; use ArrayMap instead of default LinkedHashMap for DataSegment.loadSpec, because they have only 3 entries on average; prune DataSegment.loadSpec on brokers * Fix DataSegmentTest * Refinements * Try to fix * Fix the second DataSegmentTest * Nullability * Fix tests * Fix tests, unify to use TestHelper.getJsonMapper() * Revert TestUtil as ServerTestHelper, fix tests * Add newline * Fix indexing tests * Fix s3 tests * Try to fix tests, remove lazy caching of ObjectMapper in TestHelper, rename TestHelper.getJsonMapper() to makeJsonMapper() * Fix HDFS tests * Fix HdfsDataSegmentPusherTest * Capitalize constant names	2017-12-12 11:41:40 -08:00
Charles Allen	4365390310	Remove duplicate fastutil dependency in processing pom.xml (#5142 )	2017-12-07 08:54:48 +09:00
Alexander Saydakov	45f91a241e	numeric quantiles sketch aggregator (#5002 ) * numeric quantiles sketch aggregator * it seems that we need to synchronize all methods, which modify the state * Seems like a false positive with -Pstrict * code style fix * code style fix * use sketches-core-0.10.3 * moved cache ids to the central place * better class names * support large columns * explained autodetection, added exception * added comments regarding sketches moving on heap * support reindexing * implemented suggestions from jihoonson * style fix * use max(k, other.k) for better accuracy * check for NilColumnValueSelector instead of null * throw exceptions instead of providing no-op comparators	2017-12-06 08:18:08 +09:00
Roman Leventov	a7a6a0487e	Replace IOPeon with SegmentWriteOutMedium; Improve buffer compression (#4762 ) * Replace IOPeon with OutputMedium; Improve compression * Fix test * Cleanup CompressionStrategy * Javadocs * Add OutputBytesTest * Address comments * Random access in OutputBytes and GenericIndexedWriter * Fix bugs * Fixes * Test OutputBytes.readFully() * Address comments * Rename OutputMedium to SegmentWriteOutMedium and OutputBytes to WriteOutBytes * Add comments to ByteBufferInputStream * Remove unused declarations	2017-12-04 18:04:27 -08:00
Parag Jain	7c01f77b04	Parse Batch support (#5081 ) * add parseBatch and deprecate parse method in InputRowParser add addAll method, skip max rows in memory check for it remove parse method from implemetations transform transformers add string multiplier input row parser fix withParseSpec fix kafka batch indexing fix isPersistRequired comments * add unit test * make persist async * review comments	2017-12-04 16:06:16 -06:00
Roman Leventov	aacc57131b	Don't use HistoricalXxxPrototypes in PooledTopNAlgorithm when Cursor's offset is FilteredOffset (#5133 ) * Don't use HistoricalXxxPrototypes in PooledTopNAlgorithm when Cursor's offset is FilteredOffset, because it doens't support cloning * Add test	2017-12-01 17:04:22 -08:00
Gian Merlino	5f6bdd940b	SQL: Improve translation of time floor expressions. (#5107 ) * SQL: Improve translation of time floor expressions. The main change is to TimeFloorOperatorConversion.applyTimestampFloor. - Prefer timestamp_floor expressions to timeFormat extractionFns, to avoid turning things into strings when it isn't necessary. - Collapse CAST(FLOOR(X TO Y) AS DATE) to FLOOR(X TO Y) if appropriate. * Fix tests.	2017-11-29 12:06:03 -08:00
Chuanlei Ni	368d03146b	assign granularity.all to SelectQuery by default (#5091 )	2017-11-21 17:10:19 -08:00
zhangxinyu1	590633c595	fix bug in method reset of ByteBufferHashTable.java (#5100 )	2017-11-17 13:19:16 -03:00
Jonathan Wei	ec6774039e	Fix numLookupLoadingThreads default value (#5097 )	2017-11-16 15:13:52 -08:00
Gian Merlino	77df5e0673	ExpressionSelectors: Add optimized selectors. (#5048 ) * ExpressionSelectors: Add caching selectors. - SingleLongInputCaching selector for expressions on the __time column, using a similar optimization to SingleScanTimeDimSelector - SingleStringInputDimensionSelector for expressions on string columns that return strings, using a similar optimization to ExtractionFn based DimensionSelectors. - SingleStringInputCaching selector for expressions on string columns that return primitives. Also, in the SQL planner, prefer expressions for time operations rather than extractionFns. * Code review comments.	2017-11-13 20:24:24 -08:00
Akash Dwivedi	c1538f29fc	maxQueryTimeout property in runtime properties. (#4852 ) * maxQueryTimeout property in runtime properties. * extra line * move withTimeoutAndMaxScatterGatherBytes method to QueryLifeCycle. * Fix initialize method. * remove unused import. * doc update. * some more details in doc about query failure.. * minor fix. * decorating QueryRunner to set and verify context. Added by servers. * remove whitespace.	2017-11-13 19:23:11 -06:00
Gian Merlino	9444da5038	SQL: Improved behavior when implicitly casting strings to date/time literals. (#5023 ) * SQL: Improved behavior when implicitly casting strings to date/time literals. - Handle all flavors of ISO8601 and SQL literals. - Throw errors on other literals instead of silently transforming them to 0. * Respect timeZone when format is null.	2017-11-10 17:43:22 +09:00
Roman Leventov	3541b7544b	Prohibit and remove unused declarations in the processing module (#4930 ) * Prohibit and remove unused declarations in the processing module * Fix tests * Fix integration tests * Suppress unused * Try to remove SuppressWarnings unused in VirtualColumn * Remove reset 'false positives' * Annotate CliCommandCreator as ExtensionPoint * Unused import warning instead of error in IntelliJ * Fixes * Add comment * Fix AzureBlob * Fix CloudFilesBlob * Address comments * Add Project SDK section to INTELLIJ_SETUP.md * Fix image	2017-11-09 09:27:27 -08:00
Roman Leventov	a8dc056c09	Add retries for coordinator fetch and lookup start in LookupReferencesManager (#5029 ) * Add retries for coordinator fetch and lookup start in LookupReferencesManager * Fix LookupConfigTest * Address comments * Address more comments * And address more comments * Address comms * Recognize 'not found' lookups in LookupReferencesManager.tryGetLookupListFromCoordinator(), by @egor-ryashin	2017-11-09 02:30:36 -03:00
Jihoon Son	5f3c863d5e	Add compaction task (#4985 ) * Add compaction task * added doc * use combining aggregators * address comments * add support for dimensionsSpec * fix getUniqueDims and getUniqueMetics * find unique dimensionsSpec * fix compilation * add unit test * fix test * fix test * test for different dimension orderings and types, and doc for type and ordering * add control for custom ordering and type * update doc * fix compile * fix compile * add segments param * fix serde error * fix build	2017-11-03 21:55:27 -06:00
Roman Leventov	5eb08c27cb	Add Emitter monitoring (#4973 ) * Add Emitter monitoring * Fix typo * Fixes * testing new emitter * Fix failed test (#71) * testing new emitter * fix on failed test * Remove emitter's readTimeout from docs * Update docs * Add HttpEmittingMonitor * Update java-util to 1.3.2	2017-11-03 21:27:57 -06:00
Gian Merlino	6c725a7e06	Fix havingSpec on complex aggregators. (#5024 ) * Fix havingSpec on complex aggregators. - Uses the technique from #4883 on DimFilterHavingSpec too. - Also uses Transformers from #4890, necessitating a move of that and other related classes from druid-server to druid-processing. They probably make more sense there anyway. - Adds a SQL query test. Fixes #4957. * Remove unused import.	2017-11-01 12:58:08 -04:00
Gian Merlino	1df458b35e	Fix improper handling of empty arrays in StringDimensionIndexer. (#5012 ) * Fix improper handling of empty arrays in StringDimensionIndexer. This bug was able to introduce data errors: if the input rows to an IncrementalIndex contained entirely empty arrays and single values, then upon persisting to disk, the empty arrays would be replaced with the lexicographically smallest single value, rather than nulls like they should have been. * Style fix. * Add tests for bitmap indexes too.	2017-10-27 14:21:48 -07:00
Jihoon Son	d7024f22e1	Upgrade fastutil to 8.1.0 (#4988 ) * Upgrade failutil to 8.1.0 * unused import	2017-10-19 23:37:43 -05:00
Jihoon Son	52d7f74226	Add streaming aggregation as the last step of ConcurrentGrouper if data are spilled (#4704 ) * Add steaming grouper * Fix doc * Use a single dictionary while combining * Revert GroupByBenchmark * Removed unused code * More cleanup * Remove unused config * Fix some typos and bugs * Refactor Groupers.mergeIterators() * Add comments for combining tree * Refactor buildCombineTree * Refactor iterator * Add ParallelCombiner * Add ParallelCombinerTest * Handle InterruptedException * use AbstractPrioritizedCallable * Address comments * [maven-release-plugin] prepare release druid-0.11.0-sg * [maven-release-plugin] prepare for next development iteration * Address comments * Revert "[maven-release-plugin] prepare for next development iteration" This reverts commit `5c6b31e488`. * Revert "[maven-release-plugin] prepare release druid-0.11.0-sg" This reverts commit `0f5c3a8b82`. * Fix build failure * Change list to array * rename sortableIds * Address comments * change to foreach loop * Fix comment * Revert keyEquals() * Remove loop * Address comments * Fix build fail * Address comments * Remove unused imports * Fix method name * Split intermediate and leaf combine degrees * Add comments to StreamingMergeSortedGrouper * Add more comments and fix overflow * Address comments * ConcurrentGrouperTest cleanup * add thread number configuration for parallel combining * improve doc * address comments * fix build	2017-10-17 23:24:08 -07:00
Slim	af2bc5f814	Make float default representation for DoubleSum/Min/Max aggregators (#4944 ) * Introduce System wide property to select how to store double. Set the default to store as float Change-Id: Id85cca04ed0e7ecbce78624168c586dcc2adafaa * fix tests Change-Id: Ib42db724b8a8f032d204b58c366caaeabdd0d939 * Change the property name Change-Id: I3ed69f79fc56e3735bc8f3a097f52a9f932b4734 * add tests and make default distribution store doubles as 64bits Change-Id: I237b07829117ac61e247a6124423b03992f550f2 * adding mvn argument to parallel-test profile Change-Id: Iae5d1328f901c4876b133894fa37e0d9a4162b05 * move property name and helper function to io.druid.segment.column.Column Change-Id: I62ea903d332515de2b7ca45c02587a1b015cb065 * fix docs and clean style Change-Id: I726abb8f52d25dc9dc62ad98814c5feda5e4d065 * fix docs Change-Id: If10f4cf1e51a58285a301af4107ea17fe5e09b6d	2017-10-16 17:17:22 -07:00
Himanshu	a7e802c9d4	greater-than/less-than/equal-to havingSpec to call AggregatorFactory.finalizeComputation(..) (#4883 ) * greater-than/less-than/equal-to havingSpec to call AggregatorFactory.finalizeComputation(..) * fix the unit test and expect having to work on hyperUnique agg * test fix * fix style errors	2017-10-16 12:02:30 -07:00
Roman Leventov	dc7cb117a1	Refactor ColumnSelectorFactory; Rely on ColumnValueSelector's polymorphism (#4886 ) * Refactor ColumnSelectorFactory; Rely on ColumnValueSelector's polymorphism * Fix MapVirtualColumn.makeColumnValueSelector() * Minor fixes * Fix IndexGeneratorCombinerTest * DimensionSelector to return zeros when treated as numeric ColumnValueSelector * Fix IncrementalIndexTest * Fix IncrementalIndex.makeColumnSelectorFactory() * Optimize MapBasedRow.getMetric() * Fix VarianceAggregatorTest * Simplify IncrementalIndex.makeColumnSelectorFactory() * Address comments * More comments * Test	2017-10-13 21:44:17 -05:00
Jihoon Son	8d9902831e	Refactoring PrefetchableTextFilesFirehoseFactory (#4836 ) * Refactoring prefetchable firehose * Fix to read cache when prefetch is disabled * More tests * Cleanup codes * Add Fetcher * Fix test failure * Count file size * Fix test * rename generic parameter * address comments * address comments * reuse buffer * move Execs to java-util * use execs * Fix build	2017-10-13 21:39:28 -05:00
Jihoon Son	675c6c00dd	Add checkstyle and intellij rule to prohibit unnecessary qualifiers in interfaces (#4958 ) * add checkstyle and intellij rule * fix tc fail	2017-10-13 07:56:19 -07:00
Atul Mohan	c07678b143	Synchronization of lookups during startup of druid processes (#4758 ) * Changes for lookup synchronization * Refactor of Lookup classes * Minor refactors and doc update * Change coordinator instance to be retrieved by DruidLeaderClient * Wait before thread shutdown * Make disablelookups flag true by default * Update docs * Rename flag * Move executorservice shutdown to finally block * Update LookupConfig * Refactoring and doc changes * Remove lookup config constructor * Revert Lookupconfig constructor changes * Add tests to LookupConfig * Make executorservice local * Update LRM * Move ListeningScheduledExecutorService to ExecutorCompletionService * Move exception to outer block * Remove check to see future is done * Remove unnecessary assignment * Add logging	2017-10-12 21:22:24 -05:00
Gian Merlino	32f36beaae	QueryableIndexStorageAdapter: Lift column cache to Cursor sequence. (#4950 ) * QueryableIndexStorageAdapter: Lift column cache to Cursor sequence. This is where it was before #4710, when its was moved to the individual Cursors, leading to higher than expected memory usage. It could be extreme for finer query granularities like "second". * Comment.	2017-10-12 16:44:33 -05:00
Jihoon Son	56fb11ce0b	Lazy initialization for JavaScript functions (#4871 ) * Lazy initialization of JavaScript functions * Fix test failure * Fix thread-safety and postpone js conf check * Fix test fail * Fix test * Fix KafkaIndexTaskTest * Move config check	2017-10-10 21:52:42 -07:00
Jonathan Wei	18635a19b3	Remove unused limitFn in GroupByQuery (#4935 ) * Remove unused limitFn in GroupByQuery * Remove unused limitFn creation logic	2017-10-10 15:56:30 -07:00
Gian Merlino	b20e3038b6	SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. (#4889 ) * SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. This brings benefits: - Ability to do GROUP BY and ORDER BY with ordinals. - Ability to support IN filters beyond 19 elements (fixes #4203). Some refactoring of druid-sql internals: - Builtin aggregators and operators are implemented as SqlAggregators and SqlOperatorConversions rather being special cases. This simplifies the Expressions and GroupByRules code, which were becoming complex. - SqlAggregator implementations are no longer responsible for filtering. Added new functions: - Expressions: strpos. - SQL: TRUNCATE, TRUNC, LENGTH, CHAR_LENGTH, STRLEN, STRPOS, SUBSTR, and DATE_TRUNC. * Add missing @Override annotation. * Adjustments for forbidden APIs. * Adjustments for forbidden APIs. * Disable GROUP BY alias. * Doc reword.	2017-10-10 12:44:05 -07:00
chunghochen	0614b92df1	adding new post aggregators for test statistics to druid-stats extension (#4532 ) * adding new post aggregators of test stats to druid-stats extension * changes to address code review comments * fix checkstyle violations using druid_intellij_formatting.xml after merge upstream/master * add @Override annotation per CI log * make changes per review comments/discussions * remove some blocks per review comments	2017-10-09 23:43:27 -07:00
Akash Dwivedi	716a5ec1a8	Add identity to DefaultSearchQueryMetrics and DefaultSelectQueryMetrics. (#4906 )	2017-10-04 20:28:23 -05:00
Akash Dwivedi	2ee32399ff	granularity method in QueryMetrics. (#4570 ) * granularity method in QueryMetrics. PR to emit granularity dimension for timeseries, search, groupBy, select and topN queries. * QueryMetricsFactory classes for search and select queries. * Empty implementation for Granularity() method. * Review comment changes. * Remove unused import. * empty query() method. * checkstyle fix. * Import fix.	2017-10-04 09:42:52 -07:00
Jonathan Wei	5fbec5b435	Fix limit push down comparator bug (#4868 )	2017-10-02 11:44:23 -07:00
Gian Merlino	1f2074c247	Bump versions in master to 0.11.1-SNAPSHOT. (#4878 ) * Bump versions in master to 0.11.1-SNAPSHOT. * Missed a few.	2017-09-28 17:09:51 -05:00
Gian Merlino	a19f22b5bb	Add identity to query metrics, logs. (#4862 ) * Add identity to query metrics, logs. Also fix a bug where unauthorized requests would not emit any logs or metrics, and instead would log a "Tried to emit logs and metrics twice" warning. Also rename QueryResource's "getServer" to "cancelQuery", because that's what it does. * Do not emit identity by default.	2017-09-28 11:45:23 -07:00
Goh Wei Xiang	2c30d5ba55	Add org.joda.time.DateTime.parse() to forbidden APIs (#4857 ) * Added org.joda.time.DateTime#(java.lang.String) to forbidden API. * Added org.joda.time.DateTime#(java.lang.String, org.joda.time.format.DateTimeFormatter) to forbidden API. * Add additional APIs that may create DateTime with default time zone * Add helper function that accepts formatter to parse String. * Add additional forbidden APIs * Replace existing usage of forbidden APIs * Use wrapper class to enforce Chronology on DateTimeFormatter. * Creates constant UtcFormatter for constant ISODateTimeFormat.	2017-09-27 17:46:44 -05:00
Roman Leventov	9c126e2aa9	Forbid MapMaker (#4845 ) * Forbid MapMaker * Shorter syntax * Forbid Maps.newConcurrentMap()	2017-09-27 06:49:47 -07:00
Roman Leventov	e267f3901b	Enforce Indentation with Checkstyle (#4799 )	2017-09-21 13:06:48 -07:00
Roman Leventov	a9d8539802	Remove IndexedInts.iterator() (#4811 ) * Remove IndexedInts.iterator() * Retain IndexedInts.iterator(), but don't extend Iterable * Add BitmapValues * Fix tests	2017-09-20 21:25:52 -07:00
Roman Leventov	88e9a80636	Rename ObjectValueSelector.get() to getObject(); Add getObject() and classOfObject() to ColumnValueSelector (#4801 )	2017-09-19 14:47:20 -05:00
Roman Leventov	24646ac76a	LZ4 decompression forward compatibility (#4824 )	2017-09-19 10:18:37 -07:00
Charles Allen	00d39ce7a5	Move checks for bitmap size == 0 to isEmpty (#4820 )	2017-09-19 21:45:16 +05:30
Jonathan Wei	c2a0e753b6	Extension points for authentication/authorization (#4271 ) * Extension points for authentication/authorization * Address some PR comments * Authorization result caching * Add unit tests for SecuritySanityCheckFilter and PreResponseAuthorizationCheckFilter * Use Set for auth caching, close outputstreams in filters * Don't close output stream on success in sanity check filter * Add ConfigResourceFilter to coordinator lookups * Fix filtering authorization check for empty resource list * HttpClient users must explicitly escalate the client * Remove response modification from PreResponseAuthorizationCheckFilter * Remove extraneous pom.xml * Fix unit test * Better lifecycle management * Rename AuthorizationManager to Authorizer * Fix authorization denials for empty supervisor list * Address some PR comments * Address more PR comments * Small cleanup * Add Jetty HttpClient wrapper to Authenticator * Remove Authorizer start/stop * Restore immutable context map in DruidConnection, UT fix * Fix/update docs * Add authorization checks to EventReceiverFirehose * Fix router authorization check failure, restore PreResponseAuthorizationFilter changes * Compile fixes * Test fixes * Update Authenticator/Authorizer doc comments * Merge fixes * PR comments * Fix test * Fix IT * More PR comments * PR comments * SSL fix	2017-09-15 23:45:48 -07:00
Roman Leventov	3f92184dd8	Inspection fixes (#4809 )	2017-09-15 17:48:29 -07:00
Roman Leventov	b61248fdb1	Replace HistoricalFloatColumnSelector with more generic HistoricalColumnSelector (#4796 )	2017-09-14 13:52:06 -07:00
Akash Dwivedi	a17e48fe69	search package name correction. (#4785 ) * search package name correction. * Refactor search.search pkg to search. * remove unused import.	2017-09-14 13:50:23 -07:00
Gian Merlino	2ce8123bdb	Move scan-query from a contrib extension into core. (#4751 ) * Move scan-query from a contrib extension into core. Based on a proposal at: https://groups.google.com/d/topic/druid-development/ME_OatUDnbk/discussion This patch also adds support for virtual columns to the Scan query, and updates Druid SQL to use Scan instead of Select. This patch also makes some behavioral changes to handling of the __time column. In particular, it is now is returned as "__time" rather than "timestamp"; it is no longer included if you do not specifically ask for it in your "columns"; and it is returned as a long rather than a string. Users can revert time handling to the legacy extension behavior by setting "legacy" : true in their queries, or setting the property druid.query.scan.legacy = true. This is meant to provide a migration path for users that were formerly using the contrib extension. * Adjustments from review. * Add back Select query. * Adjust SQL docs. * Restore SelectQuery link.	2017-09-13 09:51:24 -07:00
Gian Merlino	c3a1ce6933	SQL: Fix toTimeseriesQuery and toTopNQuery. (#4780 ) The former would sometimes eat limits, and the latter would sometimes use the wrong dimension comparator.	2017-09-12 14:37:27 -07:00
Jonathan Wei	3a29521273	Fix GroupBy limit push down error when buffer is too small (#4745 ) * Fix GroupBy limit push down error when buffer is too small * Address PR comments	2017-09-12 12:34:50 -07:00
Roman Leventov	832cc293ef	Refactoring of ReferenceCountingSegment and FireHydrant (#4154 ) * Refactoring of ReferenceCountingSegment and FireHydrant * Address comment * Fix FireHydrant.closeSegment() * Address comment * Added comments to ReferenceCountingSegment	2017-09-12 14:28:35 -05:00
Gian Merlino	4909c48b0c	SQL: Full TRIM support. (#4750 ) * SQL: Full TRIM support. - Support trimming arbitrary characters - Support BOTH, LEADING, and TRAILING * Remove unused import. * Fix tests, add RTRIM / LTRIM. * Remove unused imports. * BTRIM and docs. * Replace for with foreach.	2017-09-12 11:49:08 -07:00
Gian Merlino	23c0357816	BufferHashGrouperTest: Better behavior with regard to large buffers. (#4779 ) * BufferHashGrouperTest: Better behavior with regard to large buffers. 1) Free buffers after each test 2) Avoid mmaping past the end of a file * Use CloserRule.	2017-09-11 12:10:31 -07:00
Andy Sloane	706747cc8c	Fix for sort order in select/topN query cache (#4766 ) When historical caching is enabled, and a select or topN query is issued, and then a following query with "descending": true is set, the cached query returns the ascending result (or vice versa), often resulting in invalid paging identifiers. The CacheKey for these queries doesn't include the "descending" flag; this change adds it, and fixes the problem.	2017-09-10 11:33:00 +09:00
Charles Allen	bdfc6fe25e	Move common TypeReference into JacksonUtils (#4738 )	2017-08-31 13:40:16 -07:00
Niketh Sabbineni	beecb9e210	Fix failing build, remove unused import (#4726 ) LGTM	2017-08-29 14:46:38 +09:00
Roman Leventov	4d109a358a	Refactoring of Storage Adapters (#4710 ) * Factor QueryableIndexColumnSelectorFactory and IncrementalIndexColumnSelectorFactory out of QueryableIndexStorageAdapter and IncrementalIndexStorageAdapter; Add Offset.getBaseReadableOffset(); Remove OffsetHolder interface; Replace Cursor extends ColumnSelectorFactory with composition; Reduce indirection in ColumnValueSelectors created by QueryableIndexColumnSelectorFactory * Don't override clone() in FilteredOffset (the prev. implementation was broken); Some warnings fixed * Simplify Cursors in QueryableIndexStorageAdapter * Address comments * Remove unused and unimplemented methods from GenericColumn interface * Comments	2017-08-28 18:07:31 -07:00
Gian Merlino	daf3c5f927	Add "round" option to cardinality and hyperUnique aggregators. (#4720 ) * Add "round" option to cardinality and hyperUnique aggregators. Also turn it on by default in SQL, to make math on distinct counts work more as expected. * Fix some compile errors. * Fix test. * Formatting.	2017-08-28 14:52:11 -07:00
Gian Merlino	9fbfc1be32	Add @ExtensionPoint and @PublicApi annotations. (#4433 ) * Add @ExtensionPoint and @PublicApi annotations. * Clean up wording. * Remove unused import. * Remove unused imports. * Only types can be extension points. * Adjust annotations some more. * Remove unused import. * Make ServletFilterHolder an extension point. * Add a couple extension points, and update docs.	2017-08-28 14:50:58 -07:00
Gian Merlino	43488df975	Fix dimension selectors with extractionFns on missing columns. (#4717 ) * Fix dimension selectors with extractionFns on missing columns. This patch properly applies the requested extractionFn to missing columns. It's important when the extractionFn maps null to something other than null. * Extract helper method. * Change contracts of VirtualColumns and VirtualColumn methods based on review comments. * Remove unused import. * Remove unused method. * Adjust helper function. * Adjustments	2017-08-25 18:34:42 -05:00
Roman Leventov	598cc46bae	Replace HashMap with Obj2IntMap in StringDimensionIndexer; Small optimization in StringDimensionMergerV9 (#4721 )	2017-08-25 12:30:39 -07:00
Roman Leventov	326a85a9a4	Add Offset.reset() and remove unused Offset implementations (#4706 ) * Add Offset.reset() and remove unused Offset implementations * Fix BitmapOffset * Address comments	2017-08-22 17:43:29 -07:00
Roman Leventov	cacf63b007	Add AggregateCombiners (#4676 ) * Add MetricCombiners * Rename MetricCombiner to AggregateCombiner * Spelling * Fix TimestampAggregatorFactory.combine() and add makeAggregateCombiner() implementation * Rename AggregateCombiner.combine() to fold()	2017-08-21 16:45:29 -07:00
Roman Leventov	cbd1902db8	Add forbidden-apis plugin; prohibit using system time zone (#4611 ) * Forbidden APIs WIP * Remove some tests * Restore io.druid.math.expr.Function * Integration tests fix * Add comments * Fix in SimpleWorkerProvisioningStrategy * Formatting * Replace String.format() with StringUtils.format() in RemoteTaskRunnerTest * Address comments * Fix GroupByMultiSegmentTest	2017-08-21 13:02:42 -07:00
Roman Leventov	fa87eaa6e8	Remove IndexedInts.fill() (#4705 )	2017-08-21 13:01:34 -07:00
Asif Mansoor Amanullah	37f85b08d2	move row up/down for null metric ordering (#4681 ) * move row up/down for null metric ordering * addressed comments * addressed changes	2017-08-17 11:36:19 -05:00
Jonathan Wei	ab28dc3b97	free() dictionary merging buffers in IndexMerger (#4684 ) * free() dictionary merging buffers in IndexMerger * Use close() for dictionary merge iterators * Add comments on buffer free	2017-08-15 10:11:29 -07:00
Jonathan Wei	e91d4d1b80	Remove makeObjectColumnSelector() from DimensionIndexer (#4679 )	2017-08-11 14:39:00 -07:00
Jonathan Wei	1bddfc089c	Additional docs/log for direct memory usage (#4631 ) * Additional docs/log for direct memory usage * Tweak docs * Doc rewording	2017-08-10 23:33:20 -07:00
Roman Leventov	bf28d0775b	Remove QueryRunner.run(Query, responseContext) and related legacy methods (#4482 ) * Remove QueryRunner.run(Query, responseContext) and related legacy methods * Remove local var	2017-08-11 09:12:38 +09:00
Jihoon Son	65c1d6c797	Add IntGrouper to avoid unnecessary boxing/unboxing in array-based aggregation (#4668 ) * Add IntGrouper * Fix build * Address comments * Add a benchmark query	2017-08-10 07:41:39 -07:00
solimant	de9ba97d54	Move equals() from Float[Sum\|Min\|Max]AggregatorFactory to SimpleFloat... (#4675 ) Addresses #4671	2017-08-10 07:22:22 -07:00
Gian Merlino	7c89e12ca9	Replace Guava Enum.getIfPresent with builtin version. (#4659 ) * Replace Guava Enum.getIfPresent with builtin version. This is useful for running in Hadoop environments that use Guava 11. Some code is also simplified. * Code review	2017-08-09 17:20:00 -07:00
Jihoon Son	fe3421032b	Parallel sort for ConcurrentGrouper (#4660 ) * Multi-thread sort * Address comments	2017-08-09 16:24:36 -07:00
Goh Wei Xiang	42569e65e2	Minor fix in ExpressionSelectors to avoid potential NPE. (#4669 )	2017-08-09 10:13:31 -07:00
Roman Leventov	7454fd86a0	Polymorphic numeric getters for ColumnValueSelector (#4623 ) * Add methods getFloat(), getDouble() and getLong() to ColumnValueSelector * Fix copy-paste mistake in docs * Spelling	2017-08-08 18:38:06 -07:00
Roman Leventov	f5d4171459	Prohibit for loops which could be foreach with IntelliJ (#4653 ) * Replace for with foreach * Replace for with for-each in GroupByQueryEngineV2 * Remove io.druid.collections.IntList	2017-08-08 18:05:33 -07:00
Roman Leventov	aa7e4ae5e4	Enforce correct spacing with Checkstyle (#4651 )	2017-08-05 10:18:25 -07:00
Jonathan Wei	aa8d75004c	More informative QueryInterruptedException toString() (#4642 ) * More informative QueryInterruptedException toString() * Use StringUtils.format	2017-08-04 16:00:20 -07:00
Jonathan Wei	9650d80f3e	Don't use limit push down with having spec (#4630 ) * Don't use limit push down with having spec * Throw exception when forcing limit push down with having * Tests for having and limit push down * Fix pool sizes in unit test	2017-08-04 15:13:29 -07:00
Roman Leventov	7a005088d9	Add HistoricalCursor.getReadableOffset() to access unwrapped offset in selectors (#4633 ) * Add HistoricalCursor.getReadableOffset() to access unwrapped offset in selectors, when the 'main' offset if FilteredOffset (fixes #4628) * Stack overflow test	2017-08-04 12:51:48 -07:00
Jihoon Son	f3f2cd35e1	Array-based aggregation for groupBy query (#4576 ) * Array-based aggregation * Fix handling missing grouping key * Handle invalid offset * Fix compilation * Add cardinality check * Fix cardinality check * Address comments * Address comments * Address comments * Address comments * Cleanup GroupByQueryEngineV2.process * Change to Byte.SIZE * Add flatMap	2017-08-03 20:04:54 +03:00
Niketh Sabbineni	da43f68e95	NPE thrown when empty/null is passes to TimeDimExtractionFn (#4601 ) * NPE thrown when empty/null is passes to TimeDimExtractionFn * Add @Nullable where ever applicable * Add @Nullable to SearchQuerySpec.apply() * Remove unused	2017-07-26 21:02:08 -05:00
Akash Dwivedi	c372d2ecc1	Default implementation for getDouble(). (#4595 ) * Default implementation for getDouble(). * use getFloat for default implementation. * addressed comment. * new line.	2017-07-25 19:06:27 -05:00
Gian Merlino	3d6f409fc8	Fix groupBy on double dimensions. (#4596 ) * Fix groupBy on double dimensions. * Fix tests. * Fix tests. * Fix Scan tests.	2017-07-24 23:18:06 -07:00
Gian Merlino	8a4185897e	Add filter tests for both floats and doubles. (#4597 )	2017-07-24 17:02:02 -07:00
Atul Mohan	4bd0f174ba	Changes for deduplication (#4581 )	2017-07-24 11:12:23 -05:00
Roman Leventov	7408a7c4ed	Refactor CachingClusteredClient.run() (#4489 ) * Refactor CachingClusteredClient * Comments * Refactoring * Readability fixes	2017-07-23 23:10:36 +09:00
Roman Leventov	c0beb78ffd	Enforce brace formatting with Checkstyle (#4564 )	2017-07-21 10:26:59 -05:00
Slim	71e7a4c054	Adding double colums supports (#4491 ) * add double columns support * Fix numbers and expected results in UTs * adding float aggregators * fix IT expected test results * fix comments * more fixes * fix comp * fix test * refactor double and float aggregator factories * fix * fix UTs * fix comments * clean unused code * fix more comments * undo unnecessary changes * fix null issue * refactor TopNColumnSelectorStrategyFactory * fix docs * refactor NumericTopNColumnSelectorStrategy * fix return * fix comments * handle the null case in DimesionIndexer * more null fixing * cosmetic changes	2017-07-20 10:14:14 +03:00
Roman Leventov	ae86323dbd	Remove unnecessary qualifier (#4565 )	2017-07-18 17:40:54 +09:00
Roman Leventov	60cdf94677	Add PMD and prohibit unnecessary fully qualified class names in code (#4350 ) * Add PMD and prohibit unnecessary fully qualified class names in code * Extra fixes * Remove extra unnecessary fully-qualified names * Remove qualifiers * Remove qualifier	2017-07-17 22:22:29 +09:00
Chris Gavin	960cb07ea6	Fix some unnecessary use of boxed types and incorrect format strings spotted by lgtm. (#4474 ) * Remove some unnecessary use of boxed types. * Fix some incorrect format strings. * Enable IDEA's MalformedFormatString inspection. * Add a Checkstyle check for finding uses of incorrect logging packages. * Fix some incorrect usages of the metamx logger. * Bypass incorrect logger Checkstyle check where using the correct logger is not simple. * Fix some more places where the wrong number of arguments are provided to format strings. * Suppress `MalformedFormatString` inspection on legacy logging test. * Use @SuppressWarnings rather than a noinspection suppression comment. * Fix some more incorrect format strings. * Suppress some more incorrect format string warnings where the incorrect string is intentional. * Log the aggregator when closing it fails. * Remove some unneeded log lines.	2017-07-13 12:15:32 -07:00
Roman Leventov	b2865b7c7b	Make possible to start Peon without DI loading of any querying-related stuff (#4516 ) * Make QueryRunnerFactoryConglomerate injection lazy in TaskToolbox/TaskToolboxFactory * Extract QueryablePeonModule and add druid.modules.excludeList config * Typo	2017-07-12 13:18:25 -05:00
Goh Wei Xiang	53e6b5cb9b	Removal of TopNResultMerger because it is vestigial. (#4520 )	2017-07-12 13:24:07 +03:00
Roman Leventov	ad76f7a1ab	Make Filter.getBitmapResult() abstract (#4481 )	2017-07-11 12:39:32 -07:00
Akash Dwivedi	a108d05f76	Use GenericIndexed v2 supported read() during deserializeColumn (#4463 )	2017-07-11 10:18:25 -05:00
Roman Leventov	d168a4271e	Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY (#4496 ) * Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY instead of Double.MIN_VALUE and Double.MAX_VALUE, same for Float * Replace usages in comments * Fix RTree * Remove commented code * Add tests	2017-07-07 09:10:13 -06:00
Roman Leventov	fc4fe24dd5	Incorrect use of Long.TYPE and Float.TYPE as return type of ObjectColumnSelector.classOfObject() (#4501 )	2017-07-05 08:54:06 -07:00
Jonathan Wei	97a79f4478	Fix GroupBy type cast when ChainedExecutionQueryRunner merges results (#4488 ) * Fix GroupBy type cast error when ChainedExecutionQueryRunner merges multiple runners * Move conversion step to separate method * Remove unnecessary comment * Use compute to update map	2017-06-30 17:33:03 -07:00
Roman Leventov	9ae457f7ad	Avoid using the default system Locale and printing to System.out in production code (#4409 ) * Avoid usages of Default system Locale and printing to System.out or System.err in production code * Fix Charset in DruidKerberosUtil * Remove redundant string format in GenericIndexed * Rename StringUtils.safeFormat() to unimportantSafeFormat(); add StringUtils.format() which fails as well as String.format() * Fix testSafeFormat() * More fixes of redundant StringUtils.format() inside ISE * Rename unimportantSafeFormat() to nonStrictFormat()	2017-06-29 14:06:19 -07:00
Roman Leventov	ae900a4934	Update versions to 0.11.0-SNAPSHOT (#4483 )	2017-06-28 17:05:58 -07:00
Roman Leventov	6173570425	Add ExtensionsConfig.excludeModules (#4438 ) * Add ExtensionsConfig.excludeModules * Add branch * Refactor Initialization.getFromExtensions() * excludeModules -> moduleExcludeList * Initialization.getFromExtensions() and getLoadedModules() should return Collection, not Set * Fix doc	2017-06-28 14:01:31 -07:00
Gian Merlino	4c33d0a00f	Add some new expression functions and macros. (#4442 ) * Add some new expression functions and macros. See misc/math-expr.md for the list of added functions, except for "like", which previously existed but was not documented. * Add easymock to datasketches tests. * Add easymock to distinctcount tests. * Add easymock to virtual-columns tests. * Code review comments. * Clean up code a bit. * Add easymock to scan-query tests. * Rework ExprMacros that have multiple impls. * Improve test coverage.	2017-06-28 10:15:58 -07:00
Roman Leventov	2fa4b10145	More fine-grained DI for management node types. Don't allocate processing resources on Router (#4429 ) * Remove DruidProcessingModule, QueryableModule and QueryRunnerFactoryModule from DI for coordinator, overlord, middle-manager. Add RouterDruidProcessing not to allocate processing resources on router * Fix examples * Fixes * Revert Peon configs and add comments * Remove qualifier	2017-06-27 22:58:01 -07:00
Roman Leventov	05d58689ad	Remove the ability to create segments in v8 format (#4420 ) * Remove ability to create segments in v8 format * Fix IndexGeneratorJobTest * Fix parameterized test name in IndexMergerTest * Remove extra legacy merging stuff * Remove legacy serializer builders * Remove ConciseBitmapIndexMergerTest and RoaringBitmapIndexMergerTest	2017-06-26 13:21:39 -07:00
Jihoon Son	3e60c9125d	Increase timeout of GroupByQueryMergeBufferTest and AppenderatorDriverTest (#4441 )	2017-06-22 06:09:52 -07:00
Jihoon Son	3a5c480405	Split IndexMergerTest and ImmutableConciseSetTest (#4427 )	2017-06-21 20:55:51 -07:00
Gian Merlino	34d2f9ebfe	Queries: Restore old prepareAggregations method. (#4432 ) For backwards compatibility, post-#4394.	2017-06-21 05:36:32 -07:00
Gian Merlino	679cf277c0	Add ExpressionFilter. (#4405 ) * Add ExpressionFilter. The expression filter expects a single argument, "expression", and matches rows where that expression is true. * Code review comments. * CR comment. * Fix logic. * Fix test. * Remove unused import.	2017-06-20 12:42:26 -07:00
Gian Merlino	22aad08a59	ExpressionPostAggregator: Automatically finalize inputs. (#4406 ) * ExpressionPostAggregator: Automatically finalize inputs. Raw HyperLogLogCollectors and such aren't very useful. When writing expressions like `x / y` users will expect `x` and `y` to be finalized. * Fix un-merge. * Code review comments. * Remove unnecessary ImmutableMap.copyOf.	2017-06-17 13:22:47 -07:00
Goh Wei Xiang	f68a0693f3	Allow use of non-threadsafe ObjectCachingColumnSelectorFactory (#4397 ) * Adding a flag to indicate when ObjectCachingColumnSelectorFactory need not be threadsafe. * - Use of computeIfAbsent over putIfAbsent - Replace Maps.newXXXMap() with normal instantiation - Documentations on when is thread-safe required. - Use Builders for On/OffheapIncrementalIndex * - Optimization on computeIfAbsent - Constant EMPTY DimensionsSpec - Improvement on IncrementalIndexSchema.Builder - Remove setting of default values - Use var args for metrics - Correction on On/OffheapIncrementalIndex Builders - Combine On/OffheapIncrementalIndex Builders * - Removing unused imports. * - Helper method for testing with IncrementalIndex.Builder * - Correction on javadoc. * Style fix	2017-06-16 16:04:19 -05:00
Gian Merlino	054cf8a183	Limit random access in compressed column tests. (#4414 ) * Limit random access in compressed column tests. Random access leads to lots of block decompressions for reading single elements, which is time prohibitive for the large column tests. For those tests, limit the number of randomly accessed elements to 1000. * Random -> ThreadLocalRandom	2017-06-15 14:48:06 -07:00
Jonathan Wei	7fe295009e	Faster ByteBufferMinMaxOffsetHeapTest (#4404 )	2017-06-15 13:14:29 -05:00
Gian Merlino	6edee7f434	Expressions work better with strings. (#4394 ) * Expressions work better with strings. - ExpressionObjectSelector able to read from string columns, and able to return strings. - ExpressionVirtualColumn able to offer string (and long for that matter) as its native type. - ExpressionPostAggregator able to return strings. - groupBy, topN: Allow post-aggregators to accept dimensions as inputs, making ExpressionPostAggregator more useful. - topN: Use DimExtractionTopNAlgorithm for STRING columns that do not have dictionaries, allowing it to work with STRING-type expression virtual columns. - Adjusts null handling to better match the rest of Druid: null and empty string treated the same; nulls implicitly treated as zeroes in numeric context. * Code review comments. * More code review. * Fix test. * Adjust annotations.	2017-06-14 14:50:18 -07:00
Roman Leventov	113b8007b7	Increase timeout for QueryGranularityTest.testDeadLock() (#4395 )	2017-06-12 13:28:21 -07:00
Gian Merlino	1f2afccdf8	Expressions: Add ExprMacros. (#4365 ) * Expressions: Add ExprMacros, which have the same syntax as functions, but can convert themselves to any kind of Expr at parse-time. ExprMacroTable is an extension point for adding new ExprMacros. Anything that might need to parse expressions needs an ExprMacroTable, which can be injected through Guice. * Address code review comments.	2017-06-08 09:32:10 -04:00
Jonathan Wei	9ae04b7375	Remove queryMetricsFactory from GroupByQueryConfig (#4383 )	2017-06-07 21:35:26 -05:00
Roman Leventov	63a897c278	Enable most IntelliJ 'Probable bugs' inspections (#4353 ) * Enable most IntelliJ 'Probable bugs' inspections * Fix in RemoteTestNG * Fix IndexSpec's equals() and hashCode() to include longEncoding * Fix inspection errors * Extract global isntance of natural().nullsFirst(); address comments * Fix * Use noinspection comments instead of SuppressWarnings on method for IntelliJ-specific inspections * Prohibit Ordering.natural().nullsFirst() using Checkstyle	2017-06-07 09:54:25 -07:00
Roman Leventov	b487fa355b	More methods in QueryMetrics and TopNQueryMetrics (the last part of #3798 ) (#4284 ) * Add more methods to QueryMetrics and TopNQueryMetrics, add BitmapResultFactory * Add implementor expectations section to BitmapResultFactory javadoc	2017-06-07 09:49:08 -07:00
kaijianding	551a89bd67	serialize DateTime As Long to improve json serde performance (#4038 )	2017-06-06 10:08:51 -07:00
Gian Merlino	d22db30db4	VirtualColumns: Block virtual columns with empty names. (#4367 ) * VirtualColumns: Block virtual columns with empty names. * Spelling.	2017-06-06 08:05:47 -07:00
Roman Leventov	31d33b333e	Make using implicit system Charset an error (#4326 ) * Make using implicit system charset an error * Use StringUtils.toUtf8() and fromUtf8() instead of String.getBytes() and new String() * Use English locale in StringUtils.safeFormat() * Restore comment	2017-06-05 23:57:25 -07:00
Jonathan Wei	b90c28e861	Support limit push down for GroupBy (#3873 ) * Support limit push down for GroupBy V2 * Use orderBy spec ordering when applying limit push down * PR Comments * Remove unused var * Checkstyle fixes * Fix test * Add comment on non-final variables, fix checkstyle * Address PR comments * PR comments * Remove unnecessary buffer reset * Fix missing @JsonProperty annotation	2017-06-02 15:39:04 -07:00
praveev	290ed3ab9d	Make DateTime timezone aware (#4343 ) * Make DateTime timezone aware * Change unit tests to make DateTime timezone aware for PeriodGranularity	2017-06-02 12:45:52 -07:00
kaijianding	0efd18247b	explicitly unmap hydrant files when abandonSegment to recycle mmap memory (#4341 ) * fix TestKafkaExtractionCluster fail due to port already used * explicitly unmap hydrant files when abandonSegment to recyle mmap memory * address the comments * apply to AppenderatorImpl	2017-06-01 18:15:30 -05:00
Roman Leventov	50e72c6aea	Fix bugs (core) (#4339 ) * Fix bugs * Add test for GoogleDataSegmentPusher.buildPath() * Exclude extension changes * Address comments * Brace	2017-06-02 06:47:59 +09:00
Roman Leventov	78179ef74d	Inject QueryMetrics factories via PolyBind (#4336 )	2017-05-31 09:07:03 -07:00
Kenji Noguchi	3400f601db	Protobuf extension (#4039 ) * move ProtoBufInputRowParser from processing module to protobuf extensions * Ported PR #3509 * add DynamicMessage * fix local test stuff that slipped in * add license header * removed redundant type name * removed commented code * fix code style * rename ProtoBuf -> Protobuf * pom.xml: shade protobuf classes, handle .desc resource file as binary file * clean up error messages * pick first message type from descriptor if not specified * fix protoMessageType null check. add test case * move protobuf-extension from contrib to core * document: add new configuration keys, and descriptions * update document. add examples * move protobuf-extension from contrib to core (2nd try) * touch * include protobuf extensions in the distribution * fix whitespace * include protobuf example in the distribution * example: create new pb obj everytime * document: use properly quoted json * fix whitespace * bump parent version to 0.10.1-SNAPSHOT * ignore Override check * touch	2017-05-30 13:11:58 -07:00
Kamal Gurala	dcb07d6958	Option to configure default analysis types in SegmentMetadataQuery (#4259 ) * Option to configure default analysis types * Updated Docs and renamed * Added serde tests and Null handling * Fixed Documentation * Updated implementation * Updated implementation * Updated implementation * Added usingDefaultIntervals in Builder * Updated implementation * Updated implementation and added failing test * filterSegments implementation updated * Updated imlementation * Padding * Add missing Override * Updated implementation * Fixed a naming bug * Fixed bug * Removed comment	2017-05-26 12:12:39 -07:00
Gian Merlino	1eaa7887bd	Fix integer overflow in BufferGrouper. (#4333 ) Would have led to out of bounds buffer access with large buffers. Also added tests using large buffers.	2017-05-25 23:30:20 -07:00
Gian Merlino	2bd4c0930f	Fix "quarter" granularity serialization. (#4316 )	2017-05-23 10:06:17 -07:00
Gian Merlino	9283807ad7	GroupByQuery: Fix type-spanning comparisons. (#4317 ) Jackson deserializes integers sometimes as int and sometimes as long, depending on how big they are. This leads to ClassCastException when comparing deserialized values as part of groupBy merging on the broker.	2017-05-23 10:06:04 -07:00
Gian Merlino	22e5f52d00	Workaround for non-thread-safe use of CardinalityAggregator. (#4304 )	2017-05-23 10:33:03 +09:00
Roman Leventov	8ec3a29af0	Don't pass QueryMetrics down in concurrent and async QueryRunners (fixes #4279 ) (#4288 ) * Don't pass QueryMetrics down in concurrent and async QueryRunners * Rename QueryPlus.threadSafe() to withoutThreadUnsafeState(); Update QueryPlus.withQueryMetrics() Javadocs; Fix generics in MetricsEmittingQueryRunner and CpuTimeMetricQueryRunner; Make DefaultQueryMetrics to fail fast on modifications from concurrent threads	2017-05-22 13:42:09 -05:00
Maksim Logvinenko	d45dad2b44	Remove boxing/unboxing in indexer (#4269 ) * Remove boxing/unboxing in indexer * Fix rowIndex visibility * Cleanup	2017-05-17 19:13:53 -05:00
Roman Leventov	d9f423f55d	Make QueryMetrics factories configurable (#4268 ) * Ensure QueryMetrics factories accept Json ObjectMapper; Make QueryMetrics factories configurable * Update QueryMetrics Javadocs * Add javadocs to QueryMetrics factories * Move queryMetricsFactory defaults to getter methods of config classes	2017-05-17 08:41:59 -07:00
Gian Merlino	ddc2e68998	Remove cache keys from HavingSpecs. (#4280 ) * Remove cache keys from HavingSpecs. They weren't used, since they aren't part of the groupBy cache key. Also, it's good that they weren't used, since many of them had value truncation bugs. * Fix imports. * Fix test.	2017-05-16 22:13:02 -07:00
Roman Leventov	d400f23791	Monomorphic processing of TopN queries with simple double aggregators over historical segments (part of #3798 ) (#4079 ) * Monomorphic processing of topN queries with simple double aggregators and historical segments * Add CalledFromHotLoop annocations to specialized methods in SimpleDoubleBufferAggregator * Fix a bug in Historical1SimpleDoubleAggPooledTopNScannerPrototype * Fix a bug in SpecializationService * In SpecializationService, emit maxSpecializations warning only once * Make GenericIndexed.theBuffer final * Address comments * Newline * Reapply `439c906` (Make GenericIndexed.theBuffer final) * Remove extra PooledTopNAlgorithm.capabilities field * Improve CachingIndexed.inspectRuntimeShape() * Fix CompressedVSizeIntsIndexedSupplier.inspectRuntimeShape() * Don't override inspectRuntimeShape() in subclasses of CompressedVSizeIndexedInts * Annotate methods in specializations of DimensionSelector and FloatColumnSelector with @CalledFromHotLoop * Make ValueMatcher to implement HotLoopCallee * Doc fix * Fix inspectRuntimeShape() impl in ExpressionSelectors * INFO logging of specialization events * Remove modificator * Fix OrFilter * Fix AndFilter * Refactor PooledTopNAlgorithm.scanAndAggregate() * Small refactoring * Add 'nothing to inspect' messages in empty HotLoopCallee.inspectRuntimeShape() implementations * Don't care about runtime shape in tests * Fix accessor bugs in Historical1SimpleDoubleAggPooledTopNScannerPrototype and HistoricalSingleValueDimSelector1SimpleDoubleAggPooledTopNScannerPrototype, cover them with tests * Doc wording * Address comments * Remove MagicAccessorBridge and ensure Offset subclasses are public * Attach error message to element	2017-05-16 16:19:55 -07:00
Roman Leventov	b7a52286e8	Make @Override annotation obligatory (#4274 ) * Make MissingOverride an error * Make travis stript to fail fast * Add missing Override annotations * Comment	2017-05-16 13:30:30 -05:00
Himanshu	136b2fae72	improve query timeout handling and limit max scatter-gather bytes (#4229 ) * improve query timeout handling and limit max scatter-gather bytes * address review comments	2017-05-16 12:47:32 -05:00
Benedict Jin	e823085866	Improve `collection` related things that reusing a immutable object instead of creating a new object (#4135 )	2017-05-17 01:38:51 +09:00
Jihoon Son	50a4ec2b0b	Add support for headers and skipping thereof for CSV and TSV (#4254 ) * initial commit * small fixes * fix bug * fix bug * address code review * more cr * more cr * more cr * fix * Skip head rows for CSV and TSV * Move checking skipHeadRows to FileIteratingFirehose * Remove checking null iterators * Remove unused imports * Address comments * Fix compilation error * Address comments * Add more tests * Add a comment to ReplayableFirehose * Addressing comments * Add docs and fix typos	2017-05-15 22:57:31 -07:00
Roman Leventov	1ebfa22955	Update Error prone configuration; Fix bugs (#4252 ) * Make Errorprone the default compiler * Address comments * Make Error Prone's ClassCanBeStatic rule a error * Preconditions allow only %s pattern * Fix DruidCoordinatorBalancerTester * Try to give the compiler more memory * Remove distribution module activation on jdk 1.8 because only jdk 1.8 is used now * Don't show compiler warnings * Try different travis script * Fix travis.yml * Make Error Prone optional again * For error-prone compiler * Increase compiler's maxmem * Don't run Error Prone for benchmarks because of OOM * Skip install step in Travis * Remove MetricHolder.writeToChannel() * In travis.yml, check compilation before tests, because it may fail faster	2017-05-12 15:55:17 +09:00
Roman Leventov	e09e892477	Refactor QueryRunner to accept QueryPlus: Query + QueryMetrics (part of #3798 ) (#4184 ) * Add QueryPlus. Add QueryRunner.run(QueryPlus, Map) method with default implementation, to replace QueryRunner.run(Query, Map). * Fix GroupByMergingQueryRunnerV2 * Fix QueryResourceTest * Expand the comment to Query.run(walker, context) * Remove legacy version of BySegmentSkippingQueryRunner.doRun() * Add LegacyApiQueryRunnerTest and be more specific about legacy API removal plans in Druid 0.11 in Javadocs	2017-05-10 12:25:00 -07:00
Himanshu	462f6482df	optionally add extensions to explicitly specified hadoopContainerClassPath (#4230 ) * optionally add extensions to explicitly specified hadoopContainerClassPath * note extensions always pushed in hadoop container when druid.extensions.hadoopContainerDruidClasspath is not provided explicitly	2017-05-08 14:24:14 -05:00
Roman Leventov	8277284d67	Add Checkstyle rule to force comments to classes and methods to be Javadoc comments (#4239 )	2017-05-04 11:14:41 -07:00
Roman Leventov	5e85fcc0f5	Restore BaseQuery.computeOverridenContext() for compatibility (#4241 )	2017-05-02 10:22:02 -07:00
Himanshu	5a5a2749cd	improvements to coordinator lookups management (#3855 ) * coordinator lookups mgmt improvements * revert replaces removal, deprecate it instead * convert and use older specs stored in db * more tests and updates * review comments * add behavior for 0.10.0 to 0.9.2 downgrade * incorporating more review comments * remove explicit lock and use LifecycleLock in LookupReferencesManager. use LifecycleLock in LookupCoordinatorManager as well * wip on LookupCoordinatorManager * lifecycle lock * refactor thread creation into utility method * more review comments addressed * support smooth roll back of lookup snapshots from 0.10.0 to 0.9.2 * correctly use LifecycleLock in LookupCoordinatorManager and remove synchronization from start/stop * run lookup mgmt on leader coordinator only * wip: changes to do multiple start() and stop() on LookupCoordinatorManager * lifecycleLock fix usage in LookupReferencesManagerTest * add LifecycleLock back * fix license hdr * some fixes * make LookupReferencesManager.getAllLookupsState() consistent while still being lockless * address review comments * addressing leventov's comments * address charle's comments * add IOE.java * for safety in LookupReferencesManager mainThread check for lifecycle started state on each loop in addition to interrupt * move thread creation utility method to Execs * fix names * add tests for LookupCoordinatorManager.lookupManagementLoop() * add further tests for figuring out toBeLoaded and toBeDropped on LookupCoordinatorManager * address leventov comments * remove LookupsStateWithMap and parameterize LookupsState * address review comments * address more review comments * misc fixes	2017-04-28 08:41:38 -05:00
Roman Leventov	b9fd30e90a	Add Checkstyle check to prohibit IntelliJ-style commented code lines (#4220 ) * Add Checkstyle check to prohibit IntelliJ-style commented code lines * Address comment * Restore issue link	2017-04-27 18:11:25 -07:00
kaijianding	c47cfed0ec	Significantly improve LongEncodingStrategy.AUTO build performance (#4215 ) * Significantly improve LongEncodingStrategy.AUTO build performance * use numInserted instead of tempIn.available * fix bug	2017-04-27 15:11:07 +03:00
Roman Leventov	ee9b5a619a	Fix bugs in query builders and in TimeBoundaryQuery.getFilter() (#4131 ) * Add queryMetrics property to Query interface; Fix bugs and removed unused code in Druids * Fix a bug in TimeBoundaryQuery.getFilter() and remove TimeBoundaryQuery.getDimensionsFilter() * Don't reassign query's queryMetrics if already present in CPUTimeMetricQueryRunner and MetricsEmittingQueryRunner * Add compatibility constructor to BaseQuery * Remove Query.queryMetrics property * Move nullToNoopLimitSpec() method to LimitSpec interface * Rename GroupByQuery.applyLimit() to postProcess(); Fix inconsistencies in GroupByQuery.Builder	2017-04-25 16:32:02 -05:00
kaijianding	336089563d	skip rows which are added after cursor created (#4049 ) * fix can't get dim value via IncrementalIndexStorageAdapter cursor * address the comment * add ut * address ut comments * fix bug and fix ut	2017-04-26 03:26:46 +09:00
Jonathan Wei	723a855ab9	Fix nested groupBys with outer exfns on inner numeric columns (#4182 )	2017-04-21 19:47:46 -07:00
Gian Merlino	2ca7b00346	Update versions to 0.10.1-SNAPSHOT. (#4191 )	2017-04-20 18:12:28 -07:00
Gian Merlino	60caa641f3	Restore backwards compatibility of Query. (#4185 )	2017-04-19 19:47:50 +03:00
Jihoon Son	5b69f2eff2	Make timeout behavior consistent to document (#4134 ) * Make timeout behavior consistent to document * Refactoring BlockingPool and add more methods to QueryContexts * remove unused imports * Addressed comments * Address comments * remove unused method * Make default query timeout configurable * Fix test failure * Change timeout from period to millis	2017-04-19 09:47:53 +09:00
Gian Merlino	b2954d5fea	Better groupBy error messages and docs around resource limits. (#4162 ) * Better groupBy error messages and docs around resource limits. * Fix BufferGrouper test from datasketches. * Further clarify.	2017-04-13 10:38:53 -07:00
Ram iyer	2e9589215e	removing unused var (#4163 )	2017-04-13 04:03:41 +09:00
kaijianding	676af79044	don't do postAgg in TimeseriesQueryQueryToolChest when not necessary (#4155 ) * don't do postAgg in TimeseriesQueryQueryToolChest when not necessary * set postAggs to empty list in TimeseriesQueryQueryToolChest instead of checking finalizing fn * fix ut * fix ut again	2017-04-12 15:46:46 +05:30
Roman Leventov	15f3a94474	Copy closer into Druid codebase (fixes #3652 ) (#4153 )	2017-04-10 09:38:45 +09:00
Roman Leventov	73d9b31664	GenericIndexed minor bug fixes, optimizations and refactoring (#3951 ) * Minor bug fixes in GenericIndexed; Refactor and optimize GenericIndexed; Remove some unnecessary ByteBuffer duplications in some deserialization paths; Add ZeroCopyByteArrayOutputStream * Fixes * Move GenericIndexedWriter.writeLongValueToOutputStream() and writeIntValueToOutputStream() to SerializerUtils * Move constructors * Add GenericIndexedBenchmark * Comments * Typo * Note in Javadoc that IntermediateLongSupplierSerializer, LongColumnSerializer and LongMetricColumnSerializer are thread-unsafe * Use primitive collections in IntermediateLongSupplierSerializer instead of BiMap * Optimize TableLongEncodingWriter * Add checks to SerializerUtils methods * Don't restrict byte order in SerializerUtils.writeLongToOutputStream() and writeIntToOutputStream() * Update GenericIndexedBenchmark * SerializerUtils.writeIntToOutputStream() and writeLongToOutputStream() separate for big-endian and native-endian * Add GenericIndexedBenchmark.indexOf() * More checks in methods in SerializerUtils * Use helperBuffer.arrayOffset() * Optimizations in SerializerUtils	2017-03-27 14:17:31 -05:00
Gian Merlino	dd6c0ab509	Add SQL REGEXP_EXTRACT function; add "index" to "regex" extractionFn. (#4055 ) * Add SQL REGEXP_EXTRACT function; add "index" to "regex" extractionFn. * Fix tests.	2017-03-24 17:38:36 -07:00
Erik Dubbelboer	2cbc4764f8	Comparing dimensions to each other in a filter (#3928 ) Comparing dimensions to each other using a select filter	2017-03-23 18:23:46 -07:00
Roman Leventov	4b5ae31207	QueryMetrics: abstraction layer of query metrics emitting (part of #3798 ) (#3954 ) * QueryMetrics: abstraction layer of query metrics emitting * Minor fixes * QueryMetrics.emit() for bulk emit and improve Javadoc * Fixes * Fix * Javadoc fixes * Typo * Use DefaultObjectMapper * Add tests * Address PR comments * Remove QueryMetrics.userDimensions(); Rename QueryMetric.register() to report() * Dedicated TopNQueryMetricsFactory, GroupByQueryMetricsFactory and TimeseriesQueryMetricsFactory * Typo * More elaborate Javadoc of QueryMetrics * Formatting * Replace QueryMetric enum with lambdas * Add comments and VisibleForTesting annotations	2017-03-23 17:23:59 -07:00
Jonathan Wei	79f1a1d7f0	Allow float parameters for Bound/Selector/In filters on long columns (#4074 ) * Allow float parameters for long filters * Use BigDecimal intermediate form for string->long conversions * PR comments * PR comments	2017-03-23 14:18:05 -07:00
Akash Dwivedi	ff7f90b02d	relocate method in BufferAggregator. (#4071 ) * relocate method in BufferAggregator. * Unused import. * Detailed javadoc. * using Int2ObjectMap. * batch relocate. * Revert batch relocate. * Unused import. * code comments. * code comment.	2017-03-23 13:07:59 -07:00
David Lim	f68ba4128f	Exclude pagingIdentifiers that don't apply to a datasource (#4078 ) * exclude pagingIdentifiers that don't apply to a datasource to support union datasources * code review changes * code review changes	2017-03-22 12:32:27 -07:00
Gian Merlino	1f48198607	Fix some query cache key collisions. (#4094 ) The query caches generally store dimensions and aggregators positionally, so appendCacheablesIgnoringOrder could lead to incorrect results being pulled from the cache.	2017-03-22 11:08:48 -07:00
Gian Merlino	77b6213222	Remove unused Filters.getLongValueMatcher method. (#4086 )	2017-03-21 13:46:07 -06:00
Gian Merlino	ad477cb454	Fix topNs with extractionFns but no aggregators. (#4070 ) The result sets were empty because of an aggs.length > 0 check. I'm not sure if it was there for any good reason, but there didn't seem to be one.	2017-03-20 11:31:30 -07:00
Roman Leventov	84fe91ba0b	Monomorphic processing of TopN queries with 1 and 2 aggregators (key part of #3798 ) (#3889 ) * Monomorphic processing: add HotLoopCallee, CalledFromHotLoop, RuntimeShapeInspector, SpecializationService. Specialize topN queries with 1 or 2 aggregators. Add Cursor.advanceUninterruptibly() and isDoneOrInterrupted() for exception-free query processing. * Use Execs.singleThreaded() * RuntimeShapeInspector to support nullable fields * Make CalledFromHotLoop annotation Inherited * Remove unnecessary conversion of array of ColumnSelectorPluses to list and back to array in CardinalityAggregatorFactory * Close InputStream in SpecializationService * Formatting * Test specialized PooledTopNScanners * Set flags in PooledTopNAlgorithm directly * Fix tests, dependent on CountAggragatorFactory toString() form * Fix * Revert CountAggregatorFactory changes * Implement inspectRuntimeShape() for LongWrappingDimensionSelector and FloatWrappingDimensionSelector * Remove duplicate RoaringBitmap dependency in the extendedset pom.xml * Fix * Treat ByteBuffers specially in StringRuntimeShape * Doc fix * Annotate BufferAggregator.init() with CalledFromHotLoop * Make triggerSpecializationIterationsThreshold an int * Remove SpecializationService.PerPrototypeClassState.of() * Add comments * Limit the amount of specializations that SpecializationService could make * Add default implementation for BufferAggregator.inspectRuntimeShape(), for compatibility with extensions * Use more efficient ConcurrentMap's idioms in SpecializationService	2017-03-17 14:44:36 -05:00
Gian Merlino	3ec1877887	Fix BucketExtractionFn on objects that are strings. (#4072 )	2017-03-16 22:59:11 -07:00
Charles Allen	805d85afda	Allow compilation as Java8 source and target (#3328 ) * Allow compilation as Java8 source and target for everything except API * Remove conditions in tests which assume that we may run with Java 7 * Update easymock to 3.4 * Make Animal Sniffer to check Java 1.8 usage; remove redundant druid-caffeine-cache configuration * Use try-with-resources in LargeColumnSupportedComplexColumnSerializerTest.testSanity() * Remove java7 special for druid-api	2017-03-14 22:23:47 -06:00
Gian Merlino	e5c0dab12c	groupBy v2: Better error message when resources are exhausted. (#4046 ) * groupBy v2: Better error message when resources are exhausted. Fixes #4043. * Fix tests.	2017-03-15 00:37:49 +05:30
Jihoon Son	dfe4bda7fd	add doc (#4030 )	2017-03-10 12:49:20 -08:00
Gian Merlino	a5170666b6	groupBy v2: Always merge queries. (#4023 ) This fixes #4020 because it means the timestamp will always be included for outermost queries. Historicals receiving queries from older brokers will think they're outermost (because CTX_KEY_OUTERMOST isn't set to "false"), so they'll include a timestamp, so the older brokers will be OK.	2017-03-08 12:47:46 -06:00
Gian Merlino	4ca5270e88	Ignore chunkPeriod for groupBy v2, fix chunkPeriod for irregular periods. (#4004 ) * Ignore chunkPeriod for groupBy v2, fix chunkPeriod for irregular periods. Includes two fixes: - groupBy v2 now ignores chunkPeriod, since it wouldn't have helped anyway (its mergeResults returns a lazy sequence) and it generates incorrect results. - Fix chunkPeriod handling for periods of irregular length, like "P1M" or "P1Y". Also includes doc and test fixes: - groupBy v1 was no longer being tested by GroupByQueryRunnerTest since #3953, now it is once again. - chunkPeriod documentation was misleading due to its checkered past. Updated it to be more accurate. * Remove unused import. * Restore buffer size.	2017-03-06 12:27:02 -06:00
Gian Merlino	7b9e6c29cd	Fix float, long dimension indexer object selectors. (#4012 ) Their "convertUnsortedEncodedKeyComponentToActualArrayOrList" methods didn't respect the contract, which says they should return single values (not array/list) if there is only a single value to return. This affects the behavior of ObjectColumnSelectors on realtime segments.	2017-03-06 10:01:30 -08:00
Gian Merlino	337f3870d8	Fix TimeFormatExtractionFn getCacheKey when tz, locale are not provided. (#4007 ) * Fix TimeFormatExtractionFn getCacheKey when tz, locale are not provided. * Remove unused import. * Use defaults in cache key.	2017-03-04 17:41:59 -08:00
praveev	67d0ae3271	Let toDateTime call fall through for Duration Granularity (#4001 ) * Let toDateTime call fall through for Duration Granularity Added test for the same. * Add duration granularity test to GroupByQueryRunnerTest	2017-03-03 13:27:22 -06:00
Himanshu	e7e3c2dc5a	support singleThreaded flag for groupBy-v2 as well (#3992 )	2017-03-03 23:43:06 +05:30
Roman Leventov	81a5f9851f	TmpFileIOPeons to create files under the merging output directory, instead of java.io.tmpdir (#3990 ) * In IndexMerger and IndexMergerV9, create temporary files under the output directory/tmpPeonFiles, instead of java.io.tmpdir * Use FileUtils.forceMkdir() across the codebase and remove some unused code * Fix test * Fix PullDependencies.run() * Unused import	2017-03-02 14:05:12 -08:00
Jonathan Wei	5fb1638534	Add default configuration for select query 'fromNext' parameter (#3986 ) * Add default configuration for select query 'fromNext' parameter * PR comments * Fix PagingSpec config injection * Injection fix for test	2017-03-01 17:05:35 -08:00
Himanshu	8316b4f48f	fix TimeDimExtractionFn.apply() under concurrency (#3984 )	2017-03-01 13:07:12 -08:00
kaijianding	772de66e79	add filenameBase to log when exceed file size limit to indicate which column it is (#3982 )	2017-03-01 13:05:07 -08:00
Gian Merlino	cc20133e70	Checkstyle rule to outlaw tabs. (#3988 ) Tabs are the worst.	2017-02-28 23:52:53 -08:00
Akash Dwivedi	91344cbe57	Enable GenericIndexed V2 for built-in(druid-io managed) complex columns. (#3987 ) * Enable GenericIndexed V2 for complex columns. * SerializerBuilder to use GenericColumnSerializer.	2017-02-28 22:06:54 -08:00
Jonathan Wei	a08660a9ca	Support ingestion of long/float dimensions (#3966 ) * Support ingestion for long/float dimensions * Allow non-arrays for key components in indexing type strategy interfaces * Add numeric index merge test, fixes * Docs for numeric dims at ingestion * Remove unused import * Adjust docs, add aggregate on numeric dims tests * remove unused imports * Throw exception for bitmap method on numerics * Move typed selector creation to DimensionIndexer interface * unused imports * Fix * Remove unused DimensionSpec from indexer methods, check for dims first in inc index storage adapter * Remove spaces	2017-02-28 19:04:41 -08:00
praveev	5ccfdcc48b	Fix testDeadlock timeout delay (#3979 ) * No more singleton. Reduce iterations * Granularities * Fix the delay in the test * Add license header * Remove unused imports * Lot more unused imports from all the rearranging * CR feedback * Move javadoc to constructor	2017-02-28 12:51:41 -06:00
praveev	c3bf40108d	One granularity (#3850 ) * Refactor Segment Granularity * Beginning of one granularity * Copy the fix for custom periods in segment-grunalrity over here. * Remove the custom serialization for now. * Compilation cleanup * Reformat code * Fixing unit tests * Unify to use a single iterable * Backward compatibility for rolling upgrade * Minor check style. Cosmetic changes. * Rename length and millis to duration * CR feedback * Minor changes.	2017-02-25 01:02:29 -06:00
Jonathan Wei	58b704c3b4	Don't allow '__time' as a GroupBy output field name (#3967 ) * Don't allow '__time' as a GroupBy column field name * Tweak exception message	2017-02-23 14:39:17 -08:00
kaijianding	7ce05d58bc	fix NPE in search query when dimension contains null value (#3968 ) * fix NPE when dimension contains null value in search query * add ut * search with not existed dimension should always return empty result	2017-02-23 08:07:59 -08:00
Gian Merlino	372b84991c	Add virtual columns to timeseries, topN, and groupBy. (#3941 ) * Add virtual columns to timeseries, topN, and groupBy. * Fix GroupByTimeseriesQueryRunnerTest. * Updates from review comments.	2017-02-22 13:16:48 -08:00
Jihoon Son	7200dce112	Atomic merge buffer acquisition for groupBys (#3939 ) * Atomic merge buffer acquisition for groupBys * documentation * documentation * address comments * address comments * fix test failure * Addressed comments - Add InsufficientResourcesException - Renamed GroupByQueryBrokerResource to GroupByQueryResource * addressed comments * Add takeBatch() to BlockingPool	2017-02-22 14:49:37 -06:00
Gian Merlino	985203b634	Finalize fields in postaggs (#3957 ) * initial commits for finalizeFieldAccess #2433 * fix some bugs to run a query * change name of method Queries.verifyAggregations to Queries.prepareAggregations * add Uts * fix Ut failures * rebased to master * address comments and add a Ut for arithmetic post aggregators * rebased to the master * address the comment of injection within arithmetic post aggregator * address comments and introduce decorate() in the PostAggregator interface. * Address comments. 1. Implements getComparator in FinalizingFieldAccessPostAggregator and add Uts for it 2. Some minor changes like renaming a method name. * Fix a code style mismatch. * Rebased to the master	2017-02-21 16:32:14 -08:00

... 2 3 4 5 6 ...

2014 Commits