* Specialize LoadBalancingPool as MemcacheClientPool, reduce locking and don't override Object.finalize()
* Remove locking and don't override Object.finalize() in ReferenceCountingResourceHolder
* Add leak counts in ReferenceCountingResourceHolder and MemcacheClientPool. Add tests for ReferenceCountingResourceHolder and MemcacheClientPool
* Fix a race condition in ReferenceCountingResourceHolder.increment()
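The increment() race is a classic check-then-act window: a caller can bump a count that has already reached zero and resurrect a released resource. A minimal sketch of the compare-and-set fix, not the actual io.druid.collections code:

```java
import java.util.concurrent.atomic.AtomicInteger;

// Illustrative stand-in for ReferenceCountingResourceHolder's counter.
class RefCounter
{
  private final AtomicInteger refCount = new AtomicInteger(1);

  /** Refuses to move up from zero instead of resurrecting a released resource. */
  boolean increment()
  {
    while (true) {
      int count = refCount.get();
      if (count <= 0) {
        return false; // already released; an unguarded ++ here was the race
      }
      if (refCount.compareAndSet(count, count + 1)) {
        return true;
      }
    }
  }

  void decrement()
  {
    if (refCount.decrementAndGet() == 0) {
      // last reference gone: release the underlying resource here
    }
  }
}
```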
* Add virtual column types, holder serde, and safety features.
Virtual columns:
- add long, float, dimension selectors
- put cache IDs in VirtualColumnCacheHelper
- adjust serde so VirtualColumns can be the holder object for Jackson
- add fail-fast validation for cycle detection and duplicates (sketched after this list)
- add expression virtual column in core
Storage adapters:
- move virtual column hooks before checking base columns, to prevent surprises
when a new base column is added that happens to have the same name as a
virtual column.
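A minimal sketch of what the fail-fast validation can look like, assuming a trimmed-down VirtualColumn with getOutputName() and requiredColumns(); the real interface may differ:

```java
import java.util.*;

interface VirtualColumn
{
  String getOutputName();
  List<String> requiredColumns();
}

final class VirtualColumnSanity
{
  static void validate(List<VirtualColumn> columns)
  {
    Map<String, VirtualColumn> byName = new HashMap<>();
    for (VirtualColumn vc : columns) {
      if (byName.put(vc.getOutputName(), vc) != null) {
        throw new IllegalArgumentException("Duplicate virtual column: " + vc.getOutputName());
      }
    }
    for (VirtualColumn vc : columns) {
      visit(vc.getOutputName(), byName, new LinkedHashSet<String>());
    }
  }

  private static void visit(String name, Map<String, VirtualColumn> byName, Set<String> path)
  {
    if (!path.add(name)) {
      throw new IllegalArgumentException("Self-referential virtual columns: " + path);
    }
    VirtualColumn vc = byName.get(name); // null means a base column, so the walk stops
    if (vc != null) {
      for (String required : vc.requiredColumns()) {
        visit(required, byName, path);
      }
    }
    path.remove(name);
  }
}
```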
* Fix ExtractionDimensionSpecs with virtual dimensions.
* Fix unused imports.
* CR comments
* Merge one more time, with feeling.
* Remove unused code from the io.druid.java.util.common.guava package; fix #3563 (more consistent and paranoid resource handling in the Sequences subsystem); add Sequences.wrap() for DRY in MetricsEmittingQueryRunner, CPUTimeMetricQueryRunner, and SpecificSegmentQueryRunner; catch MissingSegmentsException in SpecificSegmentQueryRunner's yielder.next() method (follow-up on #3617)
* Make Sequences.withEffect() execute the effect if the wrapped sequence throws exception from close()
* Fix strange code in MetricsEmittingQueryRunner
* Add comment on why YieldingSequenceBase is used in Sequences.withEffect()
* Use Closer in OrderedMergeSequence and MergeSequence to close multiple yielders
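The Closer change follows Guava's standard pattern; a sketch, with the yielder list purely illustrative:

```java
import com.google.common.io.Closer;
import java.io.Closeable;
import java.io.IOException;
import java.util.List;

final class Yielders
{
  static void closeAll(List<? extends Closeable> yielders) throws IOException
  {
    Closer closer = Closer.create();
    for (Closeable yielder : yielders) {
      closer.register(yielder); // closed in reverse registration order
    }
    closer.close(); // propagates the first failure, suppresses the rest
  }
}
```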
* Filters: Use ColumnSelectorFactory directly for building row-based matchers.
* Adjustments based on code review.
- BoundDimFilter: fewer volatiles, rename matchesAnything to !matchesNothing.
- HavingSpecs: Clarify that they are not thread-safe, and make DimFilterHavingSpec
not thread safe.
- Renamed rowType to rowSignature.
- Added specializations for time-based vs non-time-based DimensionSelector in RBCSF.
- Added convenience method DimensionHanderUtils.createColumnSelectorPlus.
- Added singleton ZeroIndexedInts.
- Added test cases for DimFilterHavingSpec.
* Make ValueMatcherColumnSelectorStrategy actually use the associated selector.
* Add RangeIndexedInts (see the IndexedInts sketch below).
* DimFilterHavingSpec: Fix concurrent usage guard on jdk7.
* Add assertion to ZeroIndexedInts.
* Rename no-longer-volatile members.
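A sketch of the two IndexedInts helpers named above, with the interface trimmed to the methods the sketch needs; the real Druid interface has more:

```java
interface IndexedInts
{
  int size();
  int get(int index);
}

// Singleton for selectors that always report row value 0.
enum ZeroIndexedInts implements IndexedInts
{
  INSTANCE;

  @Override public int size() { return 1; }

  @Override public int get(int index)
  {
    assert index == 0 : "expected index 0, got " + index;
    return 0;
  }
}

// Identity mapping over 0..size-1.
final class RangeIndexedInts implements IndexedInts
{
  private final int size;

  RangeIndexedInts(int size) { this.size = size; }

  @Override public int size() { return size; }

  @Override public int get(int index) { return index; }
}
```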
* add first and last aggregator
* add test and fix
* moving around
* separate aggregator valueType
* address PR comment
* add finalize inner query and adjust v1 inner indexing
* better test and fixes
* java-util import fixes
* PR comments
* Add first/last aggs to ITWikipediaQueryTest
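The core of a "first" aggregator is keeping the value paired with the earliest timestamp seen; "last" mirrors it. A sketch with illustrative names, not the actual aggregator classes:

```java
final class LongFirstState
{
  private long firstTime = Long.MAX_VALUE;
  private long firstValue;

  void offer(long timestamp, long value)
  {
    if (timestamp < firstTime) {
      firstTime = timestamp;
      firstValue = value;
    }
  }

  /** Combining partial states keeps whichever saw the earlier row. */
  void combine(LongFirstState other)
  {
    if (other.firstTime < firstTime) {
      firstTime = other.firstTime;
      firstValue = other.firstValue;
    }
  }

  long get() { return firstValue; }
}
```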
* Enable parallel test
* Remove unnecessary NotThreadSafe annotation
* Randomize the start port when finding available ports
* Fix test failure
* Change to handle all negatives
Excludes tests from AvoidStaticImport, since static imports are used often there and
I didn't want to make this changeset too large. Production-code usage was minimal,
and I switched those to non-static imports.
* Support string type in math expression
addressed comments
* Updated math function document
* Addressed comments
* Support expression-parameterized aggregators (squashed and rebased on master); also includes the math post-aggregator (was #2820)
* Addressed comments
* addressed comments
This is useful for the insert-segment-to-db tool, which would otherwise
potentially insert a lot of overshadowed segments as "used", causing
load and drop churn in the cluster.
1. Wrap temporaryStorage in a resource holder, to avoid spurious "Closed"
errors from already-running processing tasks.
2. Exit early from the merging accumulator if the query is cancelled.
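A sketch of the early exit in item 2: check a cancellation flag between rows rather than grinding through the whole input. Names are illustrative:

```java
import java.util.Iterator;
import java.util.concurrent.CancellationException;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Consumer;

final class CancellableMerge
{
  static <T> void accumulateAll(Iterator<T> rows, AtomicBoolean cancelled, Consumer<T> accumulator)
  {
    while (rows.hasNext()) {
      if (cancelled.get()) {
        throw new CancellationException("query cancelled; aborting merge early");
      }
      accumulator.accept(rows.next());
    }
  }
}
```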
* fix ConcurrentModificationException in CachingClusteredClient.run()
* Obtain a new copy of PartitionHolder to avoid a potential multi-threaded read/write issue
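The copy fix is the usual defensive-snapshot move; a sketch, with PartitionHolder standing in for any structure mutated while being read:

```java
import com.google.common.collect.ImmutableList;
import java.util.List;

final class Snapshots
{
  /** Take the copy under whatever lock guards `live`, then iterate the snapshot freely. */
  static <T> List<T> snapshot(Iterable<T> live)
  {
    return ImmutableList.copyOf(live);
  }
}
```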
Refcounting prevents releasing the merge buffer, or closing the concurrent
grouper, before the processing threads have all finished. The better
error handling prevents an avalanche of per-runner exceptions when grouping
resources are exhausted, by grouping those all up into a single merged
exception.
This patch introduces a GroupByStrategy concept and two strategies: "v1"
is the current groupBy strategy and "v2" is a new one. It also introduces
a merge buffers concept in DruidProcessingModule, to try to better
manage memory used for merging.
Both of these are described in more detail in #2987.
There are two goals of this patch:
1. Make it possible for historical/realtime nodes to return larger groupBy
result sets, faster, with better memory management.
2. Make it possible for brokers to merge streams when there are no order-by
columns, avoiding materialization.
This patch does not do anything to help with memory management on the broker
when there are order-by columns or when there are nested queries. That could
potentially be done in a future patch.
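A sketch of the strategy-selection shape this introduces; the interface and config plumbing here are illustrative stand-ins, not the real signatures:

```java
interface GroupByStrategy
{
  <T> Iterable<T> mergeResults(Iterable<T> results);
}

final class GroupByStrategySelector
{
  private final GroupByStrategy v1;
  private final GroupByStrategy v2;
  private final String defaultStrategy; // e.g. taken from runtime configuration

  GroupByStrategySelector(GroupByStrategy v1, GroupByStrategy v2, String defaultStrategy)
  {
    this.v1 = v1;
    this.v2 = v2;
    this.defaultStrategy = defaultStrategy;
  }

  GroupByStrategy strategize(String requested)
  {
    String name = requested != null ? requested : defaultStrategy;
    return "v2".equals(name) ? v2 : v1;
  }
}
```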
* Fix parsing failure of segment ids with underscored datasources (fix for #2786)
* addressed comment
* renamed and moved code into the api module; added log4j dependency for tests
* addressed comments
* fixed test fails
* Make URI Extraction Namespace take more sane arguments
* Fixes https://github.com/druid-io/druid/issues/2669
* Update docs
* Rename error message
* Undo overzealous deletion of docs
* Explain caching mechanism a bit more in docs
Geared towards supporting transactional inserts of new segments. This involves an
interface "DataSourceMetadata" that allows combining of partially specified metadata
(useful for partitioned ingestion).
DataSource metadata is stored in a new "dataSource" table.
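A sketch of the combining idea: partially specified metadata (say, offsets for only some partitions) merges with what is already stored. The Kafka-style offset map is an illustrative example, not the real interface:

```java
import java.util.HashMap;
import java.util.Map;

final class PartitionOffsets
{
  final Map<Integer, Long> offsets; // partition -> committed offset

  PartitionOffsets(Map<Integer, Long> offsets) { this.offsets = offsets; }

  /** Entries from `other` win; partitions it does not mention carry over unchanged. */
  PartitionOffsets plus(PartitionOffsets other)
  {
    Map<Integer, Long> merged = new HashMap<>(offsets);
    merged.putAll(other.offsets);
    return new PartitionOffsets(merged);
  }
}
```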
* Eliminate exclusion groups from pull-deps
* Only consider dependency nodes in pull-deps if they are not in the following scopes
* provided
* test
* system
* Fix a bunch of `<scope>provided</scope>` missing tags
* Better exclusions for a couple of problematic libs
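The scope filter above amounts to a set-membership check; a sketch, with the accept() helper an illustrative stand-in for the real pull-deps code:

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

final class ScopeFilter
{
  private static final Set<String> EXCLUDED =
      new HashSet<>(Arrays.asList("provided", "test", "system"));

  static boolean accept(String scope)
  {
    return scope == null || !EXCLUDED.contains(scope); // absent scope defaults to compile
  }
}
```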
* Defaults the thread priority to java.lang.Thread.NORM_PRIORITY in io.druid.indexing.common.task.AbstractTask
* Each exec service has its own task factory which assigns a priority to spawned tasks; therefore each priority class has a unique exec service
* Added priority to tasks as taskPriority in the task context. <0 means low, 0 means take default, >0 means high. It is up to any particular implementation to determine how to handle these numbers
* Add options to ForkingTaskRunner
* Add "-XX:+UseThreadPriorities" default option
* Add "-XX:ThreadPriorityPolicy=42" default option
* AbstractTask - Removed unneeded @JsonIgnore on priority
* Added priority to RealtimePlumber executors. All sub-executors (non query runners) get Thread.MIN_PRIORITY
* Add persistThreadPriority and mergeThreadPriority to realtime tuning config
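One way to map the task-context priority (<0 low, 0 default, >0 high) onto java.lang.Thread priorities; the clamping helper is a sketch, not the shipped code:

```java
final class TaskThreadPriorities
{
  static int toThreadPriority(int taskPriority)
  {
    if (taskPriority == 0) {
      return Thread.NORM_PRIORITY; // 0 means "take default"
    }
    int raw = Thread.NORM_PRIORITY + taskPriority; // <0 shifts down, >0 shifts up
    return Math.max(Thread.MIN_PRIORITY, Math.min(Thread.MAX_PRIORITY, raw));
  }
}
```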
- Fix merging when the INTERVALS analysisType is disabled, and add a test.
- Remove transformFn from CombiningSequence, use MappingSequence instead. transformFn did
not work for "accumulate" anyway, which made the tests wrong (the intervals should have
been condensed, but were not).
- Add analysisTypes to the Druids segmentMetadataQuery builder to make testing simpler.
- fixes #1970
- extracted out segment handoff callbacks in SegmentHandoffNotifier
which is responsible for tracking segment handoffs and doing callbacks
when handoff is complete.
- Coordinator now maintains a view of segments in the cluster; this
will affect the JVM heap requirements of the overlord for large
clusters.
realtime index tasks and nodes now use HTTP endpoints exposed by the
coordinator to get the serverView
review comment
fix realtime node Guice injection
review comments
make test not rely on scheduled exec
fix compilation
fix import
review comment
introduce immutableSegmentLoadInfo
fix JSON reading
remove unnecessary logging
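A sketch of the callback shape extracted into SegmentHandoffNotifier; the method signature here is a plausible reading of the description above, not the exact one:

```java
import java.util.concurrent.Executor;

interface SegmentHandoffNotifier
{
  /**
   * Runs handoffCallback on exec once the coordinator's view shows the
   * identified segment as loaded by a historical, i.e. handed off.
   */
  void registerSegmentHandoffCallback(String segmentId, Executor exec, Runnable handoffCallback);

  void start();

  void close();
}
```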
This is a feature meant to allow realtime tasks to work without being told upfront
what shardSpec they should use (so we can potentially publish a variable number
of segments per interval).
The idea is that there is a "pendingSegments" table in the metadata store that
tracks allocated segments. Each one has a segment id (the same segment id we know
and love) and is also part of a sequence.
The sequences are an idea from @cheddar that offers a way of doing replication.
If there are N tasks reading exactly the same data with exactly the same logic
(think Kafka tasks reading a fixed range of offsets) then you can place them
in the same sequence, and they will generate the same sequence of segments.
Fixes #1727.
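A toy sketch of the sequence idea: allocation keyed by (sequenceName, previousSegmentId), so N replicas asking the same question get the same answer. The in-memory map stands in for the pendingSegments metadata table:

```java
import java.util.HashMap;
import java.util.Map;

final class PendingSegments
{
  private final Map<String, String> allocations = new HashMap<>();
  private int counter;

  /** Same sequence + same previous segment => same segment id, which is what enables replication. */
  synchronized String allocate(String sequenceName, String previousSegmentId)
  {
    String key = sequenceName + "|" + previousSegmentId;
    return allocations.computeIfAbsent(key, k -> "segment_" + counter++);
  }
}
```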
revert to merging results for union queries on the broker.
revert unrelated changes
Add test for union query runner
Add test
remove unused imports
fix imports
fix renamed file
fix test
update docs.
* Adds Kafka, URI, and JDBC namespace definitions
* Add the ability to explicitly rename values using a "namespace": a particular data collection that is loaded on all realtime nodes, historical nodes, and brokers. If any of these nodes has the namespace extension, ALL nodes must have it.
* Add namespace caching and populating (can be on heap or off heap)
* Add NamespaceExtractionCacheManager for handling caches
* Added ExtractionNamespace for handling metadata on the extraction namespaces
* Added ExtractionNamespaceUpdate for handling metadata related to updates
* Add extension which caches renames from a kafka stream (requires kafka8)
* Added README.md for the namespace kafka extension
* Added docs
* Added namespace/size, namespace/count, namespace/deltaTasksStarted metrics
Add static config for namespaces via `druid.query.extraction.namespace`
* This is a rebase of https://github.com/b-slim/druid/tree/static_config_only
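A sketch of the on-heap cache flavor: one map per namespace holding key -> renamed value, swapped atomically when a populate run finishes. Names are illustrative, not the NamespaceExtractionCacheManager API:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicReference;

final class OnHeapNamespaceCache
{
  private final AtomicReference<Map<String, String>> current =
      new AtomicReference<>(new ConcurrentHashMap<String, String>());

  /** Lookup path used by the extraction function. */
  String apply(String key)
  {
    return current.get().get(key);
  }

  /** Called after a populate run; readers switch to the new map atomically. */
  void swap(Map<String, String> freshlyPopulated)
  {
    current.set(freshlyPopulated);
  }
}
```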
Fixes #1330: avoid creating a Period instance, as creating a Period from
Long.MAX_VALUE throws an ArithmeticException.
After this change, the query metric will emit duration in seconds instead of
minutes.
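The underlying issue is that Joda-Time Periods store fields as ints, so packing Long.MAX_VALUE milliseconds into them overflows. A sketch of the overflow-free replacement:

```java
import java.util.concurrent.TimeUnit;

final class QueryMetricDurations
{
  static long durationSeconds(long durationMillis)
  {
    // new org.joda.time.Period(Long.MAX_VALUE) throws ArithmeticException while
    // converting millis into int-sized fields; plain unit math cannot overflow here.
    return TimeUnit.MILLISECONDS.toSeconds(durationMillis);
  }
}
```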