druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	45940d6e40	Math expressions support for missing columns. (#3630 ) Also add SchemaEvolutionTest to help test this kind of thing. Fixes #3627 and includes test for #3625.	2016-11-01 09:40:25 -07:00
Himanshu	32c5494e97	eagerly allocate the intermediate computation buffers (#3628 )	2016-10-31 15:24:07 -07:00
Navis Ryu	898c1c21af	More best-effort parse long (#3603 ) * More best-effort parse long * addressed comments	2016-10-25 10:31:51 -07:00
Himanshu	641469fc38	manage overshadowing efficiently at coordinator (#3584 ) * manage overshadowing efficiently at coordinator * take readlock in VersionedIntervalTimeline.isOvershadowed()	2016-10-24 22:49:08 +05:30
Akash Dwivedi	4b3bd8bd63	Migrating java-util from Metamarkets. (#3585 ) * Migrating java-util from Metamarkets. * checkstyle and updated license on java-util files. * Removed unused imports from whole project. * cherry pick metamx/java-util@826021f. * Copyright changes on java-util pom, address review comments.	2016-10-21 14:57:07 -07:00
Navis Ryu	8b7ff4409a	Math expressional parameters for aggregator (#2783 ) * Supports expression-paramed aggregator (squashed and rebased on master) also includes math post aggregator (was #2820) * Addressed comments * addressed comments	2016-10-19 13:58:35 -05:00
David Lim	c2ae734848	KafkaIndexTask: Allow run thread to stop gracefully instead of interrupting (#3534 ) * allow run thread to gracefully complete instead of interrupting when stopGracefully() is called * add comments	2016-10-17 10:52:19 -04:00
Gian Merlino	ddc856214d	When inserting segments, mark unused if already overshadowed. (#3499 ) This is useful for the insert-segment-to-db tool, which would otherwise potentially insert a lot of overshadowed segments as "used", causing load and drop churn in the cluster.	2016-10-10 18:10:18 -07:00
Gian Merlino	40f2fe7893	Bump versions to 0.9.3-SNAPSHOT (#3524 )	2016-09-29 13:53:32 -07:00
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
jianran	18af480017	Rename fields in OrderedMergeIterator (#3149 ) * code readable * fix the pre middle manager peon no stop * Revert "fix the pre middle manager peon no stop" This reverts commit `6cef4980bf`.	2016-08-11 09:42:12 -07:00
Nishant	8035c73409	Implement EnvironmentVariablePasswordProvider (#3329 ) * Implement EnvironmentVariablePasswordProvider * Review Comment : rename passwordKey to passwordVariable * add docs * improve doc layout * review comment: rename property for variable	2016-08-10 05:33:51 +08:00
Gian Merlino	1aae5bd67d	Nicer handling for cancelled groupBy v2 queries. (#3330 ) 1. Wrap temporaryStorage in a resource holder, to avoid spurious "Closed" errors from already-running processing tasks. 2. Exit early from the merging accumulator if the query is cancelled.	2016-08-05 14:48:06 -07:00
kaijianding	1fa681934c	fix ConcurrentModificationException in CachingClusteredClient.run() (#3278 ) * fix ConcurrentModificationException in CachingClusteredClient.run() * obtain new copy of PartitionHolder to avoid potential multi-threads read/write issue	2016-07-28 19:52:50 -07:00
Gian Merlino	9b5523add3	Reference counting, better error handling for resources in groupBy v2. (#3268 ) Refcounting prevents releasing the merge buffer, or closing the concurrent grouper, before the processing threads have all finished. The better error handling prevents an avalanche of per-runner exceptions when grouping resources are exhausted, by grouping those all up into a single merged exception.	2016-07-27 01:59:02 +05:30
Gian Merlino	4cc39b2ee7	Alternative groupBy strategy. (#2998 ) This patch introduces a GroupByStrategy concept and two strategies: "v1" is the current groupBy strategy and "v2" is a new one. It also introduces a merge buffers concept in DruidProcessingModule, to try to better manage memory used for merging. Both of these are described in more detail in #2987. There are two goals of this patch: 1. Make it possible for historical/realtime nodes to return larger groupBy result sets, faster, with better memory management. 2. Make it possible for brokers to merge streams when there are no order-by columns, avoiding materialization. This patch does not do anything to help with memory management on the broker when there are order-by columns or when there are nested queries. That could potentially be done in a future patch.	2016-06-24 18:06:09 -07:00
Gian Merlino	ebf890fe79	Update master version to 0.9.2-SNAPSHOT. (#3133 )	2016-06-13 13:10:38 -07:00
sainath batthala	d552a5c034	Documented getAuditTime, getPayload methods in AuditEntry.java (#3045 ) * Documented getAuditTime, getPayload methods in AuditEntry.java * author tag removed from documentation	2016-06-02 08:20:33 -07:00
Himanshu	7e67397b5a	fix-3010: look through all versions to find the set with complete partitions (#3013 )	2016-05-25 11:01:22 -07:00
Gian Merlino	a54381a084	Fix CombiningSequence.close on single element sequences. (#2969 ) Regression introduced by #2892.	2016-05-13 23:12:30 -07:00
David Lim	b489f63698	Supervisor for KafkaIndexTask (#2656 ) * supervisor for kafka indexing tasks * cr changes	2016-05-04 23:13:13 -07:00
Navis Ryu	45a3a26ef7	Add more math functions (#2822 ) * Add more math functions * added function list	2016-05-03 10:55:13 -07:00
Navis Ryu	2729fea84d	Fix parsing fail of segment id with datasource containing underscore (#2797 ) * Fix parsing fail of segment id with underscored datasource (Fix for #2786) * addressed comment * renamed and moved code into api. added log4 dependency for tests * addressed comments * fixed test fails	2016-05-02 22:37:28 -07:00
Charles Allen	6b957aa072	[QTL] Make URI Exctraction Namespace take more sane arguments (#2738 ) * Make URI Exctraction Namespace take more sane arguments * Fixes https://github.com/druid-io/druid/issues/2669 * Update docs * Rename error message * Undo overzealous deletion of docs * Explain caching mechanism a bit more in docs	2016-05-02 12:54:34 -07:00
Gian Merlino	488d12d592	CombiningSequence: Delay making next yielder on creation until it is actually asked for. (#2892 ) This fixes the behavior of limited combining sequences (otherwise limit = 1 would actually yield 2 elements).	2016-04-29 11:12:58 -07:00
Himanshu Gupta	308211cc18	math expression language with parser/lexer generated using ANTLR	2016-04-08 11:40:29 -05:00
Himanshu Gupta	36ccfbd20e	math expression language with hand written parser/lexer	2016-04-08 11:40:29 -05:00
navis.ryu	e0cfd9ee19	Utility method for length estimation of utf8	2016-03-31 10:07:00 +09:00
Gian Merlino	7e7a886f65	Move druid-api into the druid repo. This is from druid-api-0.3.17, as of commit 51884f1d05d5512cacaf62cedfbb28c6ab2535cf in the druid-api repo.	2016-03-24 11:04:34 -07:00
Charles Allen	5da9a280b6	Query Time Lookup - Dynamic Configuration	2016-03-18 09:45:05 -07:00
Gian Merlino	738dcd8cd9	Update version to 0.9.1-SNAPSHOT. Fixes #2462	2016-03-17 10:34:20 -07:00
Gian Merlino	187569e702	DataSource metadata. Geared towards supporting transactional inserts of new segments. This involves an interface "DataSourceMetadata" that allows combining of partially specified metadata (useful for partitioned ingestion). DataSource metadata is stored in a new "dataSource" table.	2016-03-10 17:41:50 -08:00
Nishant	ba1185963b	Fix a bunch of dependencies * Eliminate exclusion groups from pull-deps * Only consider dependency nodes in pull-deps if they are not in the following scopes * provided * test * system * Fix a bunch of `<scope>provided</scope>` missing tags * Better exclusions for a couple of problematic libs	2016-03-10 10:18:08 -08:00
Xavier Léauté	163e536415	Merge pull request #2601 from navis/fix-combine-sequence Relay final value to yielder in CombineSequence (Fix for #2586)	2016-03-08 15:59:08 -08:00
Charles Allen	908eb7eb4d	Add LogTest to show bad log behavior	2016-03-08 09:46:26 -08:00
navis.ryu	1b3fd8a8aa	added more tests and fixed concat+combine	2016-03-09 02:08:52 +09:00
navis.ryu	4ff1620131	Relay final value to yielder in CombineSequence (Fix for #2586 )	2016-03-08 10:31:15 +09:00
Himanshu Gupta	b40c342cd1	make Global stupid pool cache size configurable	2016-02-05 14:18:06 -06:00
Charles Allen	4282eac067	Add missing timeline test for VersionedIntervalTimelineTest	2016-01-27 08:49:08 -08:00
Charles Allen	2e1d6aaf3d	Use thread priorities. (aka set `nice` values for background-like tasks) * Defaults the thread priority to java.util.Thread.NORM_PRIORITY in io.druid.indexing.common.task.AbstractTask * Each exec service has its own Task Factory which is assigned a priority for spawned task. Therefore each priority class has a unique exec service * Added priority to tasks as taskPriority in the task context. <0 means low, 0 means take default, >0 means high. It is up to any particular implementation to determine how to handle these numbers * Add options to ForkingTaskRunner * Add "-XX:+UseThreadPriorities" default option * Add "-XX:ThreadPriorityPolicy=42" default option * AbstractTask - Removed unneded @JsonIgnore on priority * Added priority to RealtimePlumber executors. All sub-executors (non query runners) get Thread.MIN_PRIORITY * Add persistThreadPriority and mergeThreadPriority to realtime tuning config	2016-01-20 14:00:31 -08:00
navis.ryu	443ce2db9d	callbacks registered to Log4jShutdown is not executed when stop is called	2016-01-05 08:37:54 +09:00
Gian Merlino	83f4130b5f	SegmentMetadataQuery merging fixes. - Fix merging when the INTERVALS analysisType is disabled, and add a test. - Remove transformFn from CombiningSequence, use MappingSequence instead. transformFn did not work for "accumulate" anyway, which made the tests wrong (the intervals should have been condensed, but were not). - Add analysisTypes to the Druids segmentMetadataQuery builder to make testing simpler.	2015-12-22 07:57:10 -08:00
jon-wei	356b07c6c3	More efficient SegmentMetadataQuery	2015-12-17 12:46:23 -08:00
Nishant	9491e8de3b	Remove ServerView from RealtimeIndexTasks and use coordinator http endpoint for handoffs - fixes #1970 - extracted out segment handoff callbacks in SegmentHandoffNotifier which is responsible for tracking segment handoffs and doing callbacks when handoff is complete. - Coordinator now maintains a view of segments in the cluster, this will affect the jam heap requirements for the overlord for large clusters. realtime index task and nodes now use HTTP end points exposed by the coordinator to get serverView review comment fix realtime node guide injection review comments make test not rely on scheduled exec fix compilation fix import review comment introduce immutableSegmentLoadInfo fix son reading remove unnecessary logging	2015-12-09 01:54:09 +05:30
Himanshu Gupta	62ba9ade37	unifying license header in all java files	2015-12-05 22:16:23 -06:00
Fangjin Yang	21c84b5ff7	Merge pull request #1896 from gianm/allocate-segment SegmentAllocateAction (fixes #1515)	2015-11-18 21:05:46 -08:00
Fangjin Yang	4f46d457f1	Merge pull request #1947 from noddi/feature/count-parameter-history-endpoints Add count parameter to history endpoints	2015-11-12 10:23:44 -08:00
Gian Merlino	e4e5f0375b	SegmentAllocateAction (fixes #1515 ) This is a feature meant to allow realtime tasks to work without being told upfront what shardSpec they should use (so we can potentially publish a variable number of segments per interval). The idea is that there is a "pendingSegments" table in the metadata store that tracks allocated segments. Each one has a segment id (the same segment id we know and love) and is also part of a sequence. The sequences are an idea from @cheddar that offers a way of doing replication. If there are N tasks reading exactly the same data with exactly the same logic (think Kafka tasks reading a fixed range of offsets) then you can place them in the same sequence, and they will generate the same sequence of segments.	2015-11-11 16:54:35 -08:00
Bartosz Ługowski	6e5d2c6745	Add count parameter to history endpoints.	2015-11-11 23:03:57 +01:00
Xavier Léauté	fa6142e217	cleanup and remove unused imports	2015-11-11 12:25:21 -08:00
Xavier Léauté	a57cbfd2c3	Merge pull request #1387 from metamx/enableShutdownLogging Add special handler to allow logger messages during shutdown	2015-11-09 17:20:09 -08:00
Charles Allen	1df4baf489	Move Jackson Guice adapters into io.druid * Removes access to protected methods in com.fasterxml * Eliminates druid-common's use of foreign package com.fasterxml	2015-11-09 10:50:45 -08:00
Fangjin Yang	f90ddfdb89	Merge pull request #1745 from himanshug/numbered_to_elastic making NumberedShardSpec elastic	2015-10-30 16:01:03 -07:00
Charles Allen	7a2ceef690	Add special handler to allow logger messages during shutdown * Adds a special PropertyChecker interface which is ONLY for setting string properties at the very start of psvm	2015-10-27 14:33:36 -07:00
Xavier Léauté	72c408cf2d	Merge pull request #1770 from metamx/merge-time Add segment merge time as a metric	2015-10-22 22:03:41 -07:00
Nishant	7cecc55045	Add segment merge time as a metric Add merge and persist cpu time Fix typo review comment move cpu time measuring to VMUtils review comments.	2015-10-22 12:28:03 +05:30
Charles Allen	1cad571354	Add more verbose logging to SQLMetadataRuleManager	2015-10-21 16:11:40 -07:00
Xavier Léauté	e4ac78e43d	bump next snapshot to 0.9.0	2015-10-20 13:46:13 -07:00
Xavier Léauté	4c2c7a2c37	update version to 0.8.3	2015-10-14 21:40:55 -07:00
Nishant	573aa96bd6	fix #1727 - Union bySegment queries fix Fixes #1727. revert to doing merging for results for union queries on broker. revert unrelated changes Add test for union query runner Add test remove unused imports fix imports fix renamed file fix test update docs.	2015-09-29 23:32:36 +05:30
Xavier Léauté	1f897257b5	add simple load balancing pool	2015-09-18 09:43:14 -07:00
Himanshu Gupta	a7b1cacbbf	make NumberedShardSpec "elastic" and UTs	2015-09-17 08:38:48 -05:00
Himanshu Gupta	22dfa8ecf7	change ParitionHolder.isComplete() to accept chunks beyond the "end"	2015-09-16 21:33:11 -05:00
nishant	0835e12f2a	add endpoint to fetch rule history for all datasources. review comments Review comment fix compilation	2015-08-31 14:05:52 +05:30
Gian Merlino	940e1aa3eb	Replace funky imports with standard ones. 1) Lots of Guava imports were not coming from the actual Guava 2) junit.framework.Assert should be org.junit.Assert	2015-08-28 18:02:05 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
Gian Merlino	2bf9a70bfa	Consolidate SQL retrying by moving logic into the connectors. Also change boolean removeLock to void addLock in MetadataStorageActionHandler.	2015-08-25 12:42:29 -07:00
Xavier Léauté	3b2e41e42a	update for next release	2015-08-18 17:16:46 -07:00
Charles Allen	db19d2d547	Revert "Update to guice 4.0"	2015-08-14 09:26:07 -07:00
Charles Allen	7e61216287	Update to guice 4.0 - Mark a lot of `@Provides` methods as final since guice 4.0 disallows overriding them	2015-08-10 13:57:18 -07:00
Charles Allen	7fe8562980	Remove locks from StupidPool	2015-08-05 19:24:56 -07:00
Charles Allen	86ede702b1	Add namespaced lookups as extensions * Adds kafka, URI, and JDBC namespace defintions * Add ability to explicitly rename using a "namespace" which is a particular data collection that is loaded on all realtime, historic nodes, and brokers. If any of these nodes has the namespace extension, ALL nodes have the namespace extension. * Add namespace caching and populating (can be on heap or off heap) * Add NamespaceExtractionCacheManager for handling caches * Added ExtractionNamespace for handling metadata on the extraction namespaces * Added ExtractionNamespaceUpdate for handling metadata related to updates * Add extension which caches renames from a kafka stream (requires kafka8) * Added README.md for the namespace kafka extension * Added docs * Added namespace/size, namespace/count, namespace/deltaTasksStarted metrics Add static config for namespaces via `druid.query.extraction.namespace` * This is a rebase of https://github.com/b-slim/druid/tree/static_config_only	2015-07-28 11:14:14 -07:00
Xavier Léauté	4cfb00bc8a	inrement version	2015-07-15 13:09:05 -07:00
Charles Allen	a37d631cea	Add log4j2 for debugging which can be specified with `-Dlog4j.configurationFile=/some/path/druid/common/src/main/resources/log4j2.debug.xml` during testing and development	2015-07-01 10:29:22 -07:00
Xavier Léauté	0a5bb909a2	[maven-release-plugin] prepare for next development iteration	2015-06-18 17:35:19 -07:00
Xavier Léauté	59c6b2b279	[maven-release-plugin] prepare release druid-0.8.0-rc1	2015-06-18 17:35:14 -07:00
fjy	7a6acf5c1b	update pom to 0.8	2015-05-11 19:41:58 -06:00
nishant	50158357ff	fixes #1330 fixes #1330, Avoid creating Period instance as creating a Period from Long.MAX_VALUE throws arithmetic exception. After this query metric will emit duration in seconds instead of minutes.	2015-05-04 20:34:28 +05:30
Xavier Léauté	f0726f4d94	fix typing on StupidResourceHolder	2015-04-13 21:28:56 -07:00
Fangjin Yang	208e307915	Merge pull request #1251 from metamx/uriSegmentLoaders Revert "Revert "Overhaul of SegmentPullers to add consistency and retries""	2015-03-30 17:43:51 -07:00
fjy	aea7f9d192	[maven-release-plugin] prepare for next development iteration	2015-03-30 16:35:24 -07:00
fjy	060d7aef03	[maven-release-plugin] prepare release druid-0.7.1	2015-03-30 16:35:20 -07:00
Charles Allen	1c6cbea89c	Revert "Revert "Overhaul of SegmentPullers to add consistency and retries"" This reverts commit `f904bc7858`.	2015-03-30 13:40:04 -07:00
Fangjin Yang	f904bc7858	Revert "Overhaul of SegmentPullers to add consistency and retries"	2015-03-30 13:15:50 -07:00
Charles Allen	6d407e8677	Add URI handling to SegmentPullers * Requires https://github.com/druid-io/druid-api/pull/37 * Requires https://github.com/metamx/java-util/pull/22 * Moves the puller logic to use a more standard workflow going through java-util helpers instead of re-writing the handlers for each impl * General workflow goes like this: 1) LoadSpec makes sure the correct Puller is called with the correct parameters. 2) The Puller sets up general information like how to make an InputStream, how to find a file name (for .gz files for example), and when to retry. 3) CompressionUtils does most of the heavy lifting when it can	2015-03-30 12:33:23 -07:00
Xavier Léauté	2c58e355fd	Merge pull request #1239 from metamx/log4j2forRealz Exclude log4j from curator dependencies in favor of log4j-1.2-api	2015-03-26 13:21:13 -07:00
Charles Allen	9cd6c08e96	Exclude log4j from curator dependencies in favor of log4j-1.2-api	2015-03-26 13:05:12 -07:00
Xavier Léauté	389ea4c32f	Merge pull request #1245 from b-slim/fix_injector_plus_ut Bug fix @DruidSecondaryModule plus unit test	2015-03-26 10:04:44 -07:00
Slim Bouguerra	1e6be7796e	bug fix @DruidSecondaryModule plus unit test	2015-03-26 10:44:52 -05:00
nishantmonu51	638bf9d4e9	return sorted List of TimeLineObjectHolder	2015-03-26 11:51:09 +05:30
fjy	b389cfe404	[maven-release-plugin] prepare for next development iteration	2015-03-19 12:38:17 -07:00
fjy	60e7d543cc	[maven-release-plugin] prepare release druid-0.7.1-rc1	2015-03-19 12:38:13 -07:00
Fangjin Yang	6bcdf7f60f	Merge pull request #1223 from metamx/AnnounceIdUUID Add consistency to UUIDUtils handling	2015-03-17 16:09:58 -07:00
Charles Allen	868100c79c	Add consistency to UUIDUtils handling * Use "_" instead of "-"	2015-03-17 16:05:46 -07:00
Fangjin Yang	8d0dfd9af1	Merge pull request #1221 from metamx/AnnounceIdUUID Change announcement ID to a UUID instead of Timestamp	2015-03-17 15:38:53 -07:00
Charles Allen	79b1443cc3	Change announcement ID to a UUID instead of Timestamp * Also add a UUIDUtils to the common package	2015-03-17 15:34:04 -07:00
nishantmonu51	f9821d242f	also log author ip address in audit log	2015-03-17 23:15:15 +05:30
Xavier Léauté	7fb4b1d2bb	Merge pull request #1144 from metamx/javaLoggingManager Add log4j2 hooks to other loggings	2015-03-12 14:38:51 -07:00
Charles Allen	edfcea18d8	Add log4j2 hooks to standard java logging	2015-03-12 14:29:41 -07:00

1 2 3 4 5 ...

1054 Commits