druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	ddc2e68998	Remove cache keys from HavingSpecs. (#4280 ) * Remove cache keys from HavingSpecs. They weren't used, since they aren't part of the groupBy cache key. Also, it's good that they weren't used, since many of them had value truncation bugs. * Fix imports. * Fix test.	2017-05-16 22:13:02 -07:00
Gian Merlino	22f20f2207	IngestSegmentFirehoseTest: Add more tests for reindexing. (#4285 ) * IngestSegmentFirehoseTest: Add more tests for reindexing. * Nix unused imports.	2017-05-16 22:12:26 -07:00
Gian Merlino	51872fd310	Log max memory on startup too, in case Xmx and Xms are different. (#4283 )	2017-05-16 20:06:34 -05:00
Roman Leventov	d400f23791	Monomorphic processing of TopN queries with simple double aggregators over historical segments (part of #3798 ) (#4079 ) * Monomorphic processing of topN queries with simple double aggregators and historical segments * Add CalledFromHotLoop annocations to specialized methods in SimpleDoubleBufferAggregator * Fix a bug in Historical1SimpleDoubleAggPooledTopNScannerPrototype * Fix a bug in SpecializationService * In SpecializationService, emit maxSpecializations warning only once * Make GenericIndexed.theBuffer final * Address comments * Newline * Reapply `439c906` (Make GenericIndexed.theBuffer final) * Remove extra PooledTopNAlgorithm.capabilities field * Improve CachingIndexed.inspectRuntimeShape() * Fix CompressedVSizeIntsIndexedSupplier.inspectRuntimeShape() * Don't override inspectRuntimeShape() in subclasses of CompressedVSizeIndexedInts * Annotate methods in specializations of DimensionSelector and FloatColumnSelector with @CalledFromHotLoop * Make ValueMatcher to implement HotLoopCallee * Doc fix * Fix inspectRuntimeShape() impl in ExpressionSelectors * INFO logging of specialization events * Remove modificator * Fix OrFilter * Fix AndFilter * Refactor PooledTopNAlgorithm.scanAndAggregate() * Small refactoring * Add 'nothing to inspect' messages in empty HotLoopCallee.inspectRuntimeShape() implementations * Don't care about runtime shape in tests * Fix accessor bugs in Historical1SimpleDoubleAggPooledTopNScannerPrototype and HistoricalSingleValueDimSelector1SimpleDoubleAggPooledTopNScannerPrototype, cover them with tests * Doc wording * Address comments * Remove MagicAccessorBridge and ensure Offset subclasses are public * Attach error message to element	2017-05-16 16:19:55 -07:00
Roman Leventov	b7a52286e8	Make @Override annotation obligatory (#4274 ) * Make MissingOverride an error * Make travis stript to fail fast * Add missing Override annotations * Comment	2017-05-16 13:30:30 -05:00
David Lim	8333043b7b	add skipOffsetGaps flag (#4256 )	2017-05-16 12:19:28 -06:00
Himanshu	136b2fae72	improve query timeout handling and limit max scatter-gather bytes (#4229 ) * improve query timeout handling and limit max scatter-gather bytes * address review comments	2017-05-16 12:47:32 -05:00
Benedict Jin	e823085866	Improve `collection` related things that reusing a immutable object instead of creating a new object (#4135 )	2017-05-17 01:38:51 +09:00
Charles Allen	e4add598f0	Add INTELLIJ_SETUP.md (#4261 ) * Add INTELLIJ_SETUP.md * Fix `.idea/runConfigurations` * Update INTELLIJ_SETUP.md * Update INTELLIJ_SETUP.md * Address Comments	2017-05-17 01:26:16 +09:00
Yuya Fujiwara	8010d7f28d	fix typo: seraching -> searching (#4273 )	2017-05-17 01:25:16 +09:00
Jihoon Son	50a4ec2b0b	Add support for headers and skipping thereof for CSV and TSV (#4254 ) * initial commit * small fixes * fix bug * fix bug * address code review * more cr * more cr * more cr * fix * Skip head rows for CSV and TSV * Move checking skipHeadRows to FileIteratingFirehose * Remove checking null iterators * Remove unused imports * Address comments * Fix compilation error * Address comments * Add more tests * Add a comment to ReplayableFirehose * Addressing comments * Add docs and fix typos	2017-05-15 22:57:31 -07:00
Fokko Driesprong	5ca67644e7	Remove slf4j as dependencies (#4233 ) From the kafka-schema-registry-client in the avro extension slf4j will be packaged into the distribution. We don't want this as it will conflict and throw a slf4j multiple bindings warning. This will cause slf4j to fall back to no-operation (NOP) binding.	2017-05-12 15:59:14 +09:00
satishbhor	5e6539fec6	Average Server Percent Used: NaN% Error when server startup is in progress (fixes #4214 ) (#4240 ) * Fix lz4 library incompatibility in kafka-indexing-service extension #3266 * Bumped Kafka version to 0.10.2.0 for : Fix lz4 library incompatibility in kafka-indexing-service extension #3266 * Replaced Lists.newArrayList() with Collections.singletonList() For Fix lz4 library incompatibility in kafka-indexing-service extension #4115 * Fixed: Average Server Percent Used: NaN% Error when server startup is in progress #4214	2017-05-12 15:56:17 +09:00
Roman Leventov	1ebfa22955	Update Error prone configuration; Fix bugs (#4252 ) * Make Errorprone the default compiler * Address comments * Make Error Prone's ClassCanBeStatic rule a error * Preconditions allow only %s pattern * Fix DruidCoordinatorBalancerTester * Try to give the compiler more memory * Remove distribution module activation on jdk 1.8 because only jdk 1.8 is used now * Don't show compiler warnings * Try different travis script * Fix travis.yml * Make Error Prone optional again * For error-prone compiler * Increase compiler's maxmem * Don't run Error Prone for benchmarks because of OOM * Skip install step in Travis * Remove MetricHolder.writeToChannel() * In travis.yml, check compilation before tests, because it may fail faster	2017-05-12 15:55:17 +09:00
Roman Leventov	e09e892477	Refactor QueryRunner to accept QueryPlus: Query + QueryMetrics (part of #3798 ) (#4184 ) * Add QueryPlus. Add QueryRunner.run(QueryPlus, Map) method with default implementation, to replace QueryRunner.run(Query, Map). * Fix GroupByMergingQueryRunnerV2 * Fix QueryResourceTest * Expand the comment to Query.run(walker, context) * Remove legacy version of BySegmentSkippingQueryRunner.doRun() * Add LegacyApiQueryRunnerTest and be more specific about legacy API removal plans in Druid 0.11 in Javadocs	2017-05-10 12:25:00 -07:00
David Lim	11538e2ece	kafka generated shards were wrongly being marked as overshadowed if they extended a NumberedShardSpec with a non-zero number of total shards (#4257 )	2017-05-09 12:19:24 +09:00
Parag Jain	1fd177039d	fix auto reset - pause task instead of putting thread to sleep (#4244 )	2017-05-08 15:08:25 -07:00
Parag Jain	eb8e1b0a97	Prevent interrupted exception from polluting log during supervisor shutdown (#4253 ) * Prevent interrupted exception from polluting log during supervisor shutdown * do nothing in case of InterruptedException	2017-05-08 15:05:25 -07:00
Himanshu	e02f783e82	do not use --clean option when using bundle-contrib-exts profile so that core extensions are not wiped out (#4223 )	2017-05-08 14:26:16 -05:00
Himanshu	462f6482df	optionally add extensions to explicitly specified hadoopContainerClassPath (#4230 ) * optionally add extensions to explicitly specified hadoopContainerClassPath * note extensions always pushed in hadoop container when druid.extensions.hadoopContainerDruidClasspath is not provided explicitly	2017-05-08 14:24:14 -05:00
Pierre	bba31e0c8b	close aggregators in indexing-hadoop mappers (#4251 )	2017-05-05 08:29:13 -07:00
Pierre	e9872f0695	do not flush on closed stream (#4250 )	2017-05-05 09:19:20 +09:00
Himanshu	417714d228	additional lookup status discovery http endpoints at coordinator (#4228 ) * additional lookup status discovery http endpoints at coordinator * more changes * jsonize the error msgs as well * fix tests	2017-05-04 11:15:30 -07:00
Roman Leventov	8277284d67	Add Checkstyle rule to force comments to classes and methods to be Javadoc comments (#4239 )	2017-05-04 11:14:41 -07:00
Gian Merlino	f0fd8ba191	Add supervisors to overlord console. (#4248 )	2017-05-04 11:13:12 -07:00
Gian Merlino	d0f89e969a	Ignore misnamed segment cache info files. (#4245 ) * Ignore misnamed segment cache info files. Fixes a bug where historical nodes could announce the same segment twice, ultimately leading to historicals and their watchers (like coordinators and brokers) being out of sync about which segments are served. This could be caused if Druid is switched from local time to UTC, because that causes the same segment descriptors to lead to different identifiers (an identifier with local time interval before the switch, and UTC interval after the switch). In turn this causes that segment descriptor to be written to multiple segment cache info files and potentially get announced twice. Later, if the historical receives a drop request, it drops the segment and unannounces it once, but the other announcement would stick around in an ephemeral znode forever, confusing coordinators and brokers. * Only alert once.	2017-05-03 22:02:37 -06:00
Parag Jain	4502c207af	fix injection bug and documentation (#4243 )	2017-05-03 15:07:43 -05:00
Parag Jain	f9a61ea2ba	Kafka lag emitter - Kafka Indexing Service (#4194 ) * Kafka lag emitter * enforce minimum emit period to a minute * fixed comment	2017-05-02 17:30:07 -06:00
Roman Leventov	5e85fcc0f5	Restore BaseQuery.computeOverridenContext() for compatibility (#4241 )	2017-05-02 10:22:02 -07:00
Roman Leventov	0bc18e7906	Make UpdateCounter proof to update count overflow (#4138 ) * Make UpdateCounter proof to update count overflow. * Fix	2017-05-01 09:59:49 -07:00
hzy001	0c464f4a84	Fix docs (#4225 ) * Fix one typo Signed-off-by: Hao Ziyu <haoziyu@qiyi.com> * Fix deprecated links Signed-off-by: Hao Ziyu <haoziyu@qiyi.com>	2017-05-01 09:55:43 -07:00
Jihoon Son	7411b18df9	Add BroadcastDistributionRule (#4077 ) * Add BroadcastDistributionRule * Add missing null check * Rename variable 'colocateDataSource' to 'colocatedDatasource' * Address comments * Document for broadcast rules * Drop segments which are not co-located anymore * Remove duplicated segment loading and dropping * Add caveat * address comments	2017-05-01 09:55:17 -07:00
Himanshu	5a5a2749cd	improvements to coordinator lookups management (#3855 ) * coordinator lookups mgmt improvements * revert replaces removal, deprecate it instead * convert and use older specs stored in db * more tests and updates * review comments * add behavior for 0.10.0 to 0.9.2 downgrade * incorporating more review comments * remove explicit lock and use LifecycleLock in LookupReferencesManager. use LifecycleLock in LookupCoordinatorManager as well * wip on LookupCoordinatorManager * lifecycle lock * refactor thread creation into utility method * more review comments addressed * support smooth roll back of lookup snapshots from 0.10.0 to 0.9.2 * correctly use LifecycleLock in LookupCoordinatorManager and remove synchronization from start/stop * run lookup mgmt on leader coordinator only * wip: changes to do multiple start() and stop() on LookupCoordinatorManager * lifecycleLock fix usage in LookupReferencesManagerTest * add LifecycleLock back * fix license hdr * some fixes * make LookupReferencesManager.getAllLookupsState() consistent while still being lockless * address review comments * addressing leventov's comments * address charle's comments * add IOE.java * for safety in LookupReferencesManager mainThread check for lifecycle started state on each loop in addition to interrupt * move thread creation utility method to Execs * fix names * add tests for LookupCoordinatorManager.lookupManagementLoop() * add further tests for figuring out toBeLoaded and toBeDropped on LookupCoordinatorManager * address leventov comments * remove LookupsStateWithMap and parameterize LookupsState * address review comments * address more review comments * misc fixes	2017-04-28 08:41:38 -05:00
Roman Leventov	b9fd30e90a	Add Checkstyle check to prohibit IntelliJ-style commented code lines (#4220 ) * Add Checkstyle check to prohibit IntelliJ-style commented code lines * Address comment * Restore issue link	2017-04-27 18:11:25 -07:00
Gian Merlino	631068b099	Fix broken DataSketches link. (#4221 ) * Fix broken DataSketches link. * Better fixed link.	2017-04-27 17:37:12 -07:00
kaijianding	c47cfed0ec	Significantly improve LongEncodingStrategy.AUTO build performance (#4215 ) * Significantly improve LongEncodingStrategy.AUTO build performance * use numInserted instead of tempIn.available * fix bug	2017-04-27 15:11:07 +03:00
Fokko Driesprong	13143f9376	Update to Parquet 1.8.2 (#4210 ) Hi guys, Since Spark 2.x uses Parquet 1.8.2, we would like to update Druid's parquet library from 1.8.1 to 1.8.2 as well. It includes a lot of patches, performance improvements and better compatibility: `4aba4da...c652278` Cheers, Fokko	2017-04-27 15:34:30 +09:00
Himanshu	40057570f3	doc update on overlord console url when coordinator is acting as overlord (#4213 )	2017-04-26 15:03:54 -07:00
Himanshu	9b9e1cfecb	coordinator dynamic config POST to update only explicitly specified fields (#4141 ) * coordinator dynamic config POST to update only explicitly specified fields instead of resetting everything else to zeros * address review comments	2017-04-26 14:59:20 -07:00
Bas van Schaik	54463941b9	Fix two alerts from lgtm.com: comparing two boxed primitive values using (#4212 ) the == or != operator compares object identity, which may not be intended Details: `013566ade9/files/extensions-core/datasketches/src/main/java/io/druid/query/aggregation/datasketches/theta/SketchEstimatePostAggregator.java (V144)` `013566ade9/files/extensions-core/datasketches/src/main/java/io/druid/query/aggregation/datasketches/theta/SketchMergeAggregatorFactory.java (V164)`	2017-04-26 14:56:25 -07:00
David Lim	52f7bb091d	suppress warn message if metricsSpec is absent when using no-rollup ingestion (#4211 )	2017-04-25 22:52:49 -06:00
Roman Leventov	ee9b5a619a	Fix bugs in query builders and in TimeBoundaryQuery.getFilter() (#4131 ) * Add queryMetrics property to Query interface; Fix bugs and removed unused code in Druids * Fix a bug in TimeBoundaryQuery.getFilter() and remove TimeBoundaryQuery.getDimensionsFilter() * Don't reassign query's queryMetrics if already present in CPUTimeMetricQueryRunner and MetricsEmittingQueryRunner * Add compatibility constructor to BaseQuery * Remove Query.queryMetrics property * Move nullToNoopLimitSpec() method to LimitSpec interface * Rename GroupByQuery.applyLimit() to postProcess(); Fix inconsistencies in GroupByQuery.Builder	2017-04-25 16:32:02 -05:00
Akash Dwivedi	a2419654ea	Allow hadoop configurations using runtime properties. (#4189 )	2017-04-26 00:05:27 +05:30
Gian Merlino	3b92220015	Reduce log spam from Avro decoders. (#4205 ) These objects get constructed semi-frequently (any time a parser is deserialized) and so info logs are spammy. They'll still appear in task logs at least once, since they're part of the task definition and will get logged due to that.	2017-04-25 23:59:59 +05:30
kaijianding	336089563d	skip rows which are added after cursor created (#4049 ) * fix can't get dim value via IncrementalIndexStorageAdapter cursor * address the comment * add ut * address ut comments * fix bug and fix ut	2017-04-26 03:26:46 +09:00
Himanshu	4d3745d6c9	log the exception on failure to send query response (#4179 )	2017-04-25 10:27:20 -07:00
Gian Merlino	97ddb38d75	DatasourceInputSplit: Serialize with write instead of writeUTF. (#4195 ) writeUTF has a limit of 64KB, making it difficult to write out splits that read a large number of descriptors for small segments.	2017-04-25 10:26:44 -07:00
asrayousuf	e4fbc2bc5b	Updating the description of useCache (#4200 ) Updating the description of useCache Updating query-context doc based on Gian's comment Updating query-context doc based on Gian's comment Updating query-context doc based on Gian's comment Updating query-context doc based on Gian's comment	2017-04-25 10:26:15 -07:00
Gian Merlino	809112cd5f	DirectDruidClient: Fix division by zero. (#4206 ) * DirectDruidClient: Fix division by zero. Introduced in #3954 when some floating math was changed to integer math. This patch restores the old math. * Added comment.	2017-04-25 13:03:00 +09:00
Benedict Jin	de815da942	Some code refactor for better performance of `Avro-Extension` (#4092 ) * 1. Collections.singletonList instand of Arrays.asList; 2. close FSDataInputStream/ByteBufferInputStream for releasing resource; 3. convert com.google.common.base.Function into java.util.function.Function; 4. others code refactor * Put each param on its own line for code style * Revert GenericRecordAsMap back about `Function`	2017-04-25 12:46:32 +09:00

... 6 7 8 9 10 ...

8192 Commits All Branches Search

8192 Commits

All Branches