druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	d8702ebece	Filters: Use ColumnSelectorFactory directly for building row-based matchers. (#3797 ) * Filters: Use ColumnSelectorFactory directly for building row-based matchers. * Adjustments based on code review. - BoundDimFilter: fewer volatiles, rename matchesAnything to !matchesNothing. - HavingSpecs: Clarify that they are not thread-safe, and make DimFilterHavingSpec not thread safe. - Renamed rowType to rowSignature. - Added specializations for time-based vs non-time-based DimensionSelector in RBCSF. - Added convenience method DimensionHanderUtils.createColumnSelectorPlus. - Added singleton ZeroIndexedInts. - Added test cases for DimFilterHavingSpec. * Make ValueMatcherColumnSelectorStrategy actually use the associated selector. * Add RangeIndexedInts. * DimFilterHavingSpec: Fix concurrent usage guard on jdk7. * Add assertion to ZeroIndexedInts. * Rename no-longer-volatile members.	2017-01-03 14:30:22 -08:00
Roman Leventov	33800122ad	Don't return leaked Objects back to StupidPool, because this is dangerous. Reuse Cleaners in StupidPool. Make StupidPools named. Add StupidPool.leakedObjectCount(). Minor fixes (#3631 )	2016-12-26 00:35:35 -06:00
Himanshu	c5df30d813	fix JodaUtils.condenseIntervals(..) to correctly take end or current/next interval on overlap (#3793 ) * remove unused duplicate JodaUtils.java * fix JodaUtils.condenseIntervals(..) to correctly take end or current/next interval on overlap	2016-12-20 12:07:23 -08:00
Jonathan Wei	2bfcc8a592	First and Last Aggregator (#3566 ) * add first and last aggregator * add test and fix * moving around * separate aggregator valueType * address PR comment * add finalize inner query and adjust v1 inner indexing * better test and fixes * java-util import fixes * PR comments * Add first/last aggs to ITWikipediaQueryTest	2016-12-16 15:26:40 -08:00
Jihoon Son	5e39578eee	Enable parallel test (#3774 ) * Enable parallel test * Remove unnecessary NotThreadSafe annocation * Randomize the start port when finding available ports * Fix test failure * Change to handle all negatives	2016-12-14 21:05:56 -08:00
Navis Ryu	87c61fa749	Refactor boolean cast code, add tests (#3016 )	2016-12-07 13:10:39 -08:00
Gian Merlino	ff42058453	Expressions: Allow escapes in quoted identifiers. (#3735 )	2016-12-06 00:17:55 +05:30
Roman Leventov	c070b4a816	Fix concurrency defects, remove unnecessary volatiles (#3701 )	2016-11-22 16:42:28 -08:00
Navis Ryu	bb26636289	Constant flattening in math expression (#3090 ) * Constant flatteing in math expression * Addressed comments and fixed some bugs * Addressed comments	2016-11-14 14:14:10 -08:00
Himanshu	b76b3f8d85	reset-cluster command to clean up druid state stored on metadata and deep storage (#3670 )	2016-11-09 11:07:01 -06:00
Gian Merlino	657e4512d2	Checkstyle checks for AvoidStaticImport, UnusedImports. (#3660 ) Excludes tests from AvoidStaticImport, since those are used often there and I didn't want to make this changeset too large. Production code use was minimal and I switched those to non-static imports.	2016-11-05 11:34:36 -07:00
Navis Ryu	e10def32f2	Support string type in math expression (#2836 ) * Support string type in math expression addressed comments addressed comments Addressed comments * Updated math function document * Addressed comments	2016-11-02 21:10:48 -06:00
Gian Merlino	45940d6e40	Math expressions support for missing columns. (#3630 ) Also add SchemaEvolutionTest to help test this kind of thing. Fixes #3627 and includes test for #3625.	2016-11-01 09:40:25 -07:00
Himanshu	32c5494e97	eagerly allocate the intermediate computation buffers (#3628 )	2016-10-31 15:24:07 -07:00
Navis Ryu	898c1c21af	More best-effort parse long (#3603 ) * More best-effort parse long * addressed comments	2016-10-25 10:31:51 -07:00
Himanshu	641469fc38	manage overshadowing efficiently at coordinator (#3584 ) * manage overshadowing efficiently at coordinator * take readlock in VersionedIntervalTimeline.isOvershadowed()	2016-10-24 22:49:08 +05:30
Akash Dwivedi	4b3bd8bd63	Migrating java-util from Metamarkets. (#3585 ) * Migrating java-util from Metamarkets. * checkstyle and updated license on java-util files. * Removed unused imports from whole project. * cherry pick metamx/java-util@826021f. * Copyright changes on java-util pom, address review comments.	2016-10-21 14:57:07 -07:00
Navis Ryu	8b7ff4409a	Math expressional parameters for aggregator (#2783 ) * Supports expression-paramed aggregator (squashed and rebased on master) also includes math post aggregator (was #2820) * Addressed comments * addressed comments	2016-10-19 13:58:35 -05:00
David Lim	c2ae734848	KafkaIndexTask: Allow run thread to stop gracefully instead of interrupting (#3534 ) * allow run thread to gracefully complete instead of interrupting when stopGracefully() is called * add comments	2016-10-17 10:52:19 -04:00
Gian Merlino	ddc856214d	When inserting segments, mark unused if already overshadowed. (#3499 ) This is useful for the insert-segment-to-db tool, which would otherwise potentially insert a lot of overshadowed segments as "used", causing load and drop churn in the cluster.	2016-10-10 18:10:18 -07:00
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
jianran	18af480017	Rename fields in OrderedMergeIterator (#3149 ) * code readable * fix the pre middle manager peon no stop * Revert "fix the pre middle manager peon no stop" This reverts commit `6cef4980bf`.	2016-08-11 09:42:12 -07:00
Nishant	8035c73409	Implement EnvironmentVariablePasswordProvider (#3329 ) * Implement EnvironmentVariablePasswordProvider * Review Comment : rename passwordKey to passwordVariable * add docs * improve doc layout * review comment: rename property for variable	2016-08-10 05:33:51 +08:00
Gian Merlino	1aae5bd67d	Nicer handling for cancelled groupBy v2 queries. (#3330 ) 1. Wrap temporaryStorage in a resource holder, to avoid spurious "Closed" errors from already-running processing tasks. 2. Exit early from the merging accumulator if the query is cancelled.	2016-08-05 14:48:06 -07:00
kaijianding	1fa681934c	fix ConcurrentModificationException in CachingClusteredClient.run() (#3278 ) * fix ConcurrentModificationException in CachingClusteredClient.run() * obtain new copy of PartitionHolder to avoid potential multi-threads read/write issue	2016-07-28 19:52:50 -07:00
Gian Merlino	9b5523add3	Reference counting, better error handling for resources in groupBy v2. (#3268 ) Refcounting prevents releasing the merge buffer, or closing the concurrent grouper, before the processing threads have all finished. The better error handling prevents an avalanche of per-runner exceptions when grouping resources are exhausted, by grouping those all up into a single merged exception.	2016-07-27 01:59:02 +05:30
Gian Merlino	4cc39b2ee7	Alternative groupBy strategy. (#2998 ) This patch introduces a GroupByStrategy concept and two strategies: "v1" is the current groupBy strategy and "v2" is a new one. It also introduces a merge buffers concept in DruidProcessingModule, to try to better manage memory used for merging. Both of these are described in more detail in #2987. There are two goals of this patch: 1. Make it possible for historical/realtime nodes to return larger groupBy result sets, faster, with better memory management. 2. Make it possible for brokers to merge streams when there are no order-by columns, avoiding materialization. This patch does not do anything to help with memory management on the broker when there are order-by columns or when there are nested queries. That could potentially be done in a future patch.	2016-06-24 18:06:09 -07:00
sainath batthala	d552a5c034	Documented getAuditTime, getPayload methods in AuditEntry.java (#3045 ) * Documented getAuditTime, getPayload methods in AuditEntry.java * author tag removed from documentation	2016-06-02 08:20:33 -07:00
Himanshu	7e67397b5a	fix-3010: look through all versions to find the set with complete partitions (#3013 )	2016-05-25 11:01:22 -07:00
Gian Merlino	a54381a084	Fix CombiningSequence.close on single element sequences. (#2969 ) Regression introduced by #2892.	2016-05-13 23:12:30 -07:00
David Lim	b489f63698	Supervisor for KafkaIndexTask (#2656 ) * supervisor for kafka indexing tasks * cr changes	2016-05-04 23:13:13 -07:00
Navis Ryu	45a3a26ef7	Add more math functions (#2822 ) * Add more math functions * added function list	2016-05-03 10:55:13 -07:00
Navis Ryu	2729fea84d	Fix parsing fail of segment id with datasource containing underscore (#2797 ) * Fix parsing fail of segment id with underscored datasource (Fix for #2786) * addressed comment * renamed and moved code into api. added log4 dependency for tests * addressed comments * fixed test fails	2016-05-02 22:37:28 -07:00
Charles Allen	6b957aa072	[QTL] Make URI Exctraction Namespace take more sane arguments (#2738 ) * Make URI Exctraction Namespace take more sane arguments * Fixes https://github.com/druid-io/druid/issues/2669 * Update docs * Rename error message * Undo overzealous deletion of docs * Explain caching mechanism a bit more in docs	2016-05-02 12:54:34 -07:00
Gian Merlino	488d12d592	CombiningSequence: Delay making next yielder on creation until it is actually asked for. (#2892 ) This fixes the behavior of limited combining sequences (otherwise limit = 1 would actually yield 2 elements).	2016-04-29 11:12:58 -07:00
Himanshu Gupta	308211cc18	math expression language with parser/lexer generated using ANTLR	2016-04-08 11:40:29 -05:00
Himanshu Gupta	36ccfbd20e	math expression language with hand written parser/lexer	2016-04-08 11:40:29 -05:00
navis.ryu	e0cfd9ee19	Utility method for length estimation of utf8	2016-03-31 10:07:00 +09:00
Charles Allen	5da9a280b6	Query Time Lookup - Dynamic Configuration	2016-03-18 09:45:05 -07:00
Gian Merlino	187569e702	DataSource metadata. Geared towards supporting transactional inserts of new segments. This involves an interface "DataSourceMetadata" that allows combining of partially specified metadata (useful for partitioned ingestion). DataSource metadata is stored in a new "dataSource" table.	2016-03-10 17:41:50 -08:00
Xavier Léauté	163e536415	Merge pull request #2601 from navis/fix-combine-sequence Relay final value to yielder in CombineSequence (Fix for #2586)	2016-03-08 15:59:08 -08:00
Charles Allen	908eb7eb4d	Add LogTest to show bad log behavior	2016-03-08 09:46:26 -08:00
navis.ryu	1b3fd8a8aa	added more tests and fixed concat+combine	2016-03-09 02:08:52 +09:00
navis.ryu	4ff1620131	Relay final value to yielder in CombineSequence (Fix for #2586 )	2016-03-08 10:31:15 +09:00
Himanshu Gupta	b40c342cd1	make Global stupid pool cache size configurable	2016-02-05 14:18:06 -06:00
Charles Allen	4282eac067	Add missing timeline test for VersionedIntervalTimelineTest	2016-01-27 08:49:08 -08:00
Charles Allen	2e1d6aaf3d	Use thread priorities. (aka set `nice` values for background-like tasks) * Defaults the thread priority to java.util.Thread.NORM_PRIORITY in io.druid.indexing.common.task.AbstractTask * Each exec service has its own Task Factory which is assigned a priority for spawned task. Therefore each priority class has a unique exec service * Added priority to tasks as taskPriority in the task context. <0 means low, 0 means take default, >0 means high. It is up to any particular implementation to determine how to handle these numbers * Add options to ForkingTaskRunner * Add "-XX:+UseThreadPriorities" default option * Add "-XX:ThreadPriorityPolicy=42" default option * AbstractTask - Removed unneded @JsonIgnore on priority * Added priority to RealtimePlumber executors. All sub-executors (non query runners) get Thread.MIN_PRIORITY * Add persistThreadPriority and mergeThreadPriority to realtime tuning config	2016-01-20 14:00:31 -08:00
navis.ryu	443ce2db9d	callbacks registered to Log4jShutdown is not executed when stop is called	2016-01-05 08:37:54 +09:00
Gian Merlino	83f4130b5f	SegmentMetadataQuery merging fixes. - Fix merging when the INTERVALS analysisType is disabled, and add a test. - Remove transformFn from CombiningSequence, use MappingSequence instead. transformFn did not work for "accumulate" anyway, which made the tests wrong (the intervals should have been condensed, but were not). - Add analysisTypes to the Druids segmentMetadataQuery builder to make testing simpler.	2015-12-22 07:57:10 -08:00
jon-wei	356b07c6c3	More efficient SegmentMetadataQuery	2015-12-17 12:46:23 -08:00
Nishant	9491e8de3b	Remove ServerView from RealtimeIndexTasks and use coordinator http endpoint for handoffs - fixes #1970 - extracted out segment handoff callbacks in SegmentHandoffNotifier which is responsible for tracking segment handoffs and doing callbacks when handoff is complete. - Coordinator now maintains a view of segments in the cluster, this will affect the jam heap requirements for the overlord for large clusters. realtime index task and nodes now use HTTP end points exposed by the coordinator to get serverView review comment fix realtime node guide injection review comments make test not rely on scheduled exec fix compilation fix import review comment introduce immutableSegmentLoadInfo fix son reading remove unnecessary logging	2015-12-09 01:54:09 +05:30
Himanshu Gupta	62ba9ade37	unifying license header in all java files	2015-12-05 22:16:23 -06:00
Fangjin Yang	21c84b5ff7	Merge pull request #1896 from gianm/allocate-segment SegmentAllocateAction (fixes #1515)	2015-11-18 21:05:46 -08:00
Fangjin Yang	4f46d457f1	Merge pull request #1947 from noddi/feature/count-parameter-history-endpoints Add count parameter to history endpoints	2015-11-12 10:23:44 -08:00
Gian Merlino	e4e5f0375b	SegmentAllocateAction (fixes #1515 ) This is a feature meant to allow realtime tasks to work without being told upfront what shardSpec they should use (so we can potentially publish a variable number of segments per interval). The idea is that there is a "pendingSegments" table in the metadata store that tracks allocated segments. Each one has a segment id (the same segment id we know and love) and is also part of a sequence. The sequences are an idea from @cheddar that offers a way of doing replication. If there are N tasks reading exactly the same data with exactly the same logic (think Kafka tasks reading a fixed range of offsets) then you can place them in the same sequence, and they will generate the same sequence of segments.	2015-11-11 16:54:35 -08:00
Bartosz Ługowski	6e5d2c6745	Add count parameter to history endpoints.	2015-11-11 23:03:57 +01:00
Xavier Léauté	fa6142e217	cleanup and remove unused imports	2015-11-11 12:25:21 -08:00
Xavier Léauté	a57cbfd2c3	Merge pull request #1387 from metamx/enableShutdownLogging Add special handler to allow logger messages during shutdown	2015-11-09 17:20:09 -08:00
Charles Allen	1df4baf489	Move Jackson Guice adapters into io.druid * Removes access to protected methods in com.fasterxml * Eliminates druid-common's use of foreign package com.fasterxml	2015-11-09 10:50:45 -08:00
Fangjin Yang	f90ddfdb89	Merge pull request #1745 from himanshug/numbered_to_elastic making NumberedShardSpec elastic	2015-10-30 16:01:03 -07:00
Charles Allen	7a2ceef690	Add special handler to allow logger messages during shutdown * Adds a special PropertyChecker interface which is ONLY for setting string properties at the very start of psvm	2015-10-27 14:33:36 -07:00
Xavier Léauté	72c408cf2d	Merge pull request #1770 from metamx/merge-time Add segment merge time as a metric	2015-10-22 22:03:41 -07:00
Nishant	7cecc55045	Add segment merge time as a metric Add merge and persist cpu time Fix typo review comment move cpu time measuring to VMUtils review comments.	2015-10-22 12:28:03 +05:30
Charles Allen	1cad571354	Add more verbose logging to SQLMetadataRuleManager	2015-10-21 16:11:40 -07:00
Nishant	573aa96bd6	fix #1727 - Union bySegment queries fix Fixes #1727. revert to doing merging for results for union queries on broker. revert unrelated changes Add test for union query runner Add test remove unused imports fix imports fix renamed file fix test update docs.	2015-09-29 23:32:36 +05:30
Xavier Léauté	1f897257b5	add simple load balancing pool	2015-09-18 09:43:14 -07:00
Himanshu Gupta	a7b1cacbbf	make NumberedShardSpec "elastic" and UTs	2015-09-17 08:38:48 -05:00
Himanshu Gupta	22dfa8ecf7	change ParitionHolder.isComplete() to accept chunks beyond the "end"	2015-09-16 21:33:11 -05:00
nishant	0835e12f2a	add endpoint to fetch rule history for all datasources. review comments Review comment fix compilation	2015-08-31 14:05:52 +05:30
Gian Merlino	940e1aa3eb	Replace funky imports with standard ones. 1) Lots of Guava imports were not coming from the actual Guava 2) junit.framework.Assert should be org.junit.Assert	2015-08-28 18:02:05 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
Gian Merlino	2bf9a70bfa	Consolidate SQL retrying by moving logic into the connectors. Also change boolean removeLock to void addLock in MetadataStorageActionHandler.	2015-08-25 12:42:29 -07:00
Charles Allen	db19d2d547	Revert "Update to guice 4.0"	2015-08-14 09:26:07 -07:00
Charles Allen	7e61216287	Update to guice 4.0 - Mark a lot of `@Provides` methods as final since guice 4.0 disallows overriding them	2015-08-10 13:57:18 -07:00
Charles Allen	7fe8562980	Remove locks from StupidPool	2015-08-05 19:24:56 -07:00
Charles Allen	86ede702b1	Add namespaced lookups as extensions * Adds kafka, URI, and JDBC namespace defintions * Add ability to explicitly rename using a "namespace" which is a particular data collection that is loaded on all realtime, historic nodes, and brokers. If any of these nodes has the namespace extension, ALL nodes have the namespace extension. * Add namespace caching and populating (can be on heap or off heap) * Add NamespaceExtractionCacheManager for handling caches * Added ExtractionNamespace for handling metadata on the extraction namespaces * Added ExtractionNamespaceUpdate for handling metadata related to updates * Add extension which caches renames from a kafka stream (requires kafka8) * Added README.md for the namespace kafka extension * Added docs * Added namespace/size, namespace/count, namespace/deltaTasksStarted metrics Add static config for namespaces via `druid.query.extraction.namespace` * This is a rebase of https://github.com/b-slim/druid/tree/static_config_only	2015-07-28 11:14:14 -07:00
Charles Allen	a37d631cea	Add log4j2 for debugging which can be specified with `-Dlog4j.configurationFile=/some/path/druid/common/src/main/resources/log4j2.debug.xml` during testing and development	2015-07-01 10:29:22 -07:00
nishant	50158357ff	fixes #1330 fixes #1330, Avoid creating Period instance as creating a Period from Long.MAX_VALUE throws arithmetic exception. After this query metric will emit duration in seconds instead of minutes.	2015-05-04 20:34:28 +05:30
Xavier Léauté	f0726f4d94	fix typing on StupidResourceHolder	2015-04-13 21:28:56 -07:00
Charles Allen	1c6cbea89c	Revert "Revert "Overhaul of SegmentPullers to add consistency and retries"" This reverts commit `f904bc7858`.	2015-03-30 13:40:04 -07:00
Fangjin Yang	f904bc7858	Revert "Overhaul of SegmentPullers to add consistency and retries"	2015-03-30 13:15:50 -07:00
Charles Allen	6d407e8677	Add URI handling to SegmentPullers * Requires https://github.com/druid-io/druid-api/pull/37 * Requires https://github.com/metamx/java-util/pull/22 * Moves the puller logic to use a more standard workflow going through java-util helpers instead of re-writing the handlers for each impl * General workflow goes like this: 1) LoadSpec makes sure the correct Puller is called with the correct parameters. 2) The Puller sets up general information like how to make an InputStream, how to find a file name (for .gz files for example), and when to retry. 3) CompressionUtils does most of the heavy lifting when it can	2015-03-30 12:33:23 -07:00
Xavier Léauté	389ea4c32f	Merge pull request #1245 from b-slim/fix_injector_plus_ut Bug fix @DruidSecondaryModule plus unit test	2015-03-26 10:04:44 -07:00
Slim Bouguerra	1e6be7796e	bug fix @DruidSecondaryModule plus unit test	2015-03-26 10:44:52 -05:00
nishantmonu51	638bf9d4e9	return sorted List of TimeLineObjectHolder	2015-03-26 11:51:09 +05:30
Fangjin Yang	6bcdf7f60f	Merge pull request #1223 from metamx/AnnounceIdUUID Add consistency to UUIDUtils handling	2015-03-17 16:09:58 -07:00
Charles Allen	868100c79c	Add consistency to UUIDUtils handling * Use "_" instead of "-"	2015-03-17 16:05:46 -07:00
Fangjin Yang	8d0dfd9af1	Merge pull request #1221 from metamx/AnnounceIdUUID Change announcement ID to a UUID instead of Timestamp	2015-03-17 15:38:53 -07:00
Charles Allen	79b1443cc3	Change announcement ID to a UUID instead of Timestamp * Also add a UUIDUtils to the common package	2015-03-17 15:34:04 -07:00
nishantmonu51	f9821d242f	also log author ip address in audit log	2015-03-17 23:15:15 +05:30
nishantmonu51	3ec4a30ab5	initial commit review comments more refactoring and cleaning of redundant code add UT + docs + more refactoring fixes + review comments more cleanup end points to fetch history review comments remove unnecessary changes review comments rename header name review comments + add test for MetadataRulesManager review comments docs	2015-03-12 22:50:29 +05:30
Himanshu Gupta	e6ee98e2d2	UTs update for common	2015-02-25 15:40:26 -08:00
Himanshu Gupta	126262edce	support for PasswordProvider interface to enable writing druid extension which can get metadata store password from secured location or anywhere instead of plain text properties file	2015-02-25 14:05:19 -06:00
nishantmonu51	e5fa8e0cb4	Fix NPE for union queries filter non-existing datasources for union queries. handle review comment	2015-02-10 23:21:58 +05:30
Fangjin Yang	25cf15824b	Merge pull request #1085 from gianm/dsmrv-fix DataSourceMetadataResultValue fixes and JodaUtils adjustments.	2015-02-03 17:51:33 -08:00
Gian Merlino	1298b4abd3	JodaUtils: Bring in MIN/MAX instants a bit. This prevents the difference overflowing a long.	2015-02-03 17:34:33 -08:00
Eric Tschetter	42eba986ce	Towards consistent null handling This commit also includes 1) the addition of a context parameter on timeseries queries that allows it to ignore empty buckets instead of generating results for them 2) A cleanup of an unused method on an interface	2015-02-02 12:53:07 -08:00
Fangjin Yang	92e616de11	Merge pull request #1077 from metamx/remove-unused-imports remove unused imports	2015-02-02 10:45:27 -08:00
nishantmonu51	ba932bb1f2	remove unused imports	2015-02-02 21:53:39 +05:30

338 Commits