druid

Commit Graph

Author	SHA1	Message	Date
Nishant	f576a0ff14	Contrib Extension for Ambari Metrics Emitter (#3767 ) * Contrib Extension for Ambari Metrics Emitter extension to enable druid to send metrics to ambari metrics server (https://cwiki.apache.org/confluence/display/AMBARI/Metrics) review comments switch to public repo * review comments * add docs * fix pom version * Add link for doc page in extensions.md * remove unused imports * review comments review comments remove unused dependency review comment	2016-12-19 11:12:47 -08:00
Gian Merlino	dd63f54325	Built-in SQL. (#3682 )	2016-12-16 17:15:59 -08:00
Jihoon Son	5e39578eee	Enable parallel test (#3774 ) * Enable parallel test * Remove unnecessary NotThreadSafe annocation * Randomize the start port when finding available ports * Fix test failure * Change to handle all negatives	2016-12-14 21:05:56 -08:00
Ninglin Du	469ab21091	[Feature] Thrift support for realtime and batch ingestion (#3418 ) * Thrift ingestion plugin 1. thrift binary is platform dependent, use scrooge to generate java files to avoid style check failure 2. stream and hadoop ingesion are both supported, input format can be sequence file and lzo thrift block file. 3. base64 and protocol aware change header * fix conlicts in pom	2016-12-13 10:05:15 -08:00
Gleb Smirnov	07384d6f40	Update Apache curator to a non-leaky version (see CURATOR-354) (#3769 )	2016-12-12 09:52:40 -08:00
Akash Dwivedi	6386e6a4dc	root and java-util pom cleanup (#3764 ) * Remove bytebuffer-collections dependency from the root pom and java-util pom cleanup. * Remove json-smart exclusion from root pom	2016-12-08 11:30:19 -08:00
Gian Merlino	943982b7b0	Configurable HTTP compression. (#3759 ) * Configurable HTTP compression. * Call real-time nodes real-time processes in docs.	2016-12-07 17:40:39 -08:00
Roman Leventov	949e65165c	Bitset iteration optimization and improve safety (#3753 ) * Deduplicate looking for bitset.nextSetBit() in BitSetIterator.next() and hasNext() * Add BitmapIterationTest * More elaborate comment on why Roaring is not tested in BitmapIterationTest	2016-12-07 15:49:16 -08:00
Navis Ryu	c74d267f50	Support virtual column for select query (#2511 ) * Support virtual column for select query * Addressed comments	2016-12-05 15:14:35 -08:00
Erik Dubbelboer	7d36f540e8	WIP: Add Google Storage support (#2458 ) Also excludes the correct artifacts from #2741	2016-11-16 14:06:45 +05:30
Keuntae Park	094f5b851b	Support Min/Max for Timestamp (#3299 ) * Min/Max aggregator for Timestamp * remove unused imports and method * rebase and zip the test data * add docs	2016-11-14 23:00:21 -08:00
Roman Leventov	fbbb55f867	Update emitter dependency to 0.4.0 and emit "version" dimension for all druid metrics (#3679 ) * Update emitter dependency to 0.4.0 and emit "version" dimension for all druid metrics, not only query metrics * Remove unused imports * Use empty string instead of "testing-version" as a version placeholder	2016-11-11 17:17:27 -06:00
Akash Dwivedi	3e408497b3	Migrating bytebuffercollections from Metamarkets. (#3647 ) * Migrating bytebuffercollections from Metamarkets. * resolving code conflicts and removing <p> from bytebuffer-collections.	2016-11-11 10:51:07 -08:00
Gian Merlino	657e4512d2	Checkstyle checks for AvoidStaticImport, UnusedImports. (#3660 ) Excludes tests from AvoidStaticImport, since those are used often there and I didn't want to make this changeset too large. Production code use was minimal and I switched those to non-static imports.	2016-11-05 11:34:36 -07:00
Akash Dwivedi	4b3bd8bd63	Migrating java-util from Metamarkets. (#3585 ) * Migrating java-util from Metamarkets. * checkstyle and updated license on java-util files. * Removed unused imports from whole project. * cherry pick metamx/java-util@826021f. * Copyright changes on java-util pom, address review comments.	2016-10-21 14:57:07 -07:00
Roman Leventov	5dc95389f7	Add Checkstyle framework (#3551 ) * Add Checkstyle framework * Avoid star import * Need braces for control flow statements * Redundant imports * Add NewLineAtEndOfFile check	2016-10-13 13:37:47 -07:00
Roman Leventov	85ac8eff90	Improve performance of IndexMergerV9 (#3440 ) * Improve performance of StringDimensionMergerV9 and StringDimensionMergerLegacy by avoiding primitive int boxing by using IntIterator in IndexedInts instead of Iterator<Integer>; Extract some common logic for V9 and Legacy mergers; Minor improvements to resource handling in StringDimensionMergerV9 * Don't mask index in MergeIntIterator.makeQueueElement() * DRY conversion RoaringBitmap's IntIterator to fastutil's IntIterator * Do implement skip(n) in IntIterators extending AbstractIntIterator because original implementation is not reliable * Use Test(expected=Exception.class) instead of try { } catch (Exception e) { /* ignore */ }	2016-10-13 08:28:46 -07:00
Gian Merlino	40f2fe7893	Bump versions to 0.9.3-SNAPSHOT (#3524 )	2016-09-29 13:53:32 -07:00
John Zhang	78b06a7d7e	make global http client worker threads configurable (#3514 )	2016-09-28 23:18:51 -07:00
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
Gian Merlino	76fcbd8fc5	Update Curator, ZK to latest stable versions. (#3461 )	2016-09-16 09:16:14 -07:00
Gian Merlino	2613e68477	Update java-util to 0.27.10. (#3337 )	2016-08-09 13:37:30 +05:30
Navis Ryu	5b3f0ccb1f	Support variance and standard deviation (#2525 ) * Support variance and standard deviation * addressed comments	2016-08-04 17:32:58 -07:00
Keuntae Park	95a58097e2	Hadoop InputRowParser for Orc file (#3019 ) * InputRowParser to decode OrcStruct from OrcNewInputFormat * add unit test for orc hadoop indexing * update docs and fix test code bug * doc updated * resove maven dependency conflict * remove unused imports * fix returning array type from Object[] to correct primitive array type * fix to support getDimension() of MapBasedRow : changing return type of orc list from array to list * rebase and updated based on comments * updated based on comments * on reflecting review comments * fix bug in typeStringFromParseSpec() and add unit test * add license header	2016-07-26 09:42:56 -07:00
Nishant	47894c4eff	add comment for default hadoop coordinates (#3257 ) 1) Modify CliHadoopIndexer to share constant from `TaskConfig.DEFAULT_DEFAULT_HADOOP_COORDINATES` 2) add comment to pom.xml as discussed in https://github.com/druid-io/druid/pull/3044 fix name	2016-07-18 15:23:11 -07:00
Gian Merlino	13d8d96bc6	Update to guice-4.1.0. (#3222 )	2016-07-18 08:08:43 -07:00
Charles Allen	3f1681c16c	Caffeine cache extension (#3028 ) * Initial commit of caffeine cache * Address code comments * Move and fixup README.md a bit * Improve caffeine readme information * Cleanup caffeine pom * Address review comments * Bump caffeine to 2.3.1 * Bump druid version to 0.9.2-SNAPSHOT * Make test not fail randomly. See https://github.com/ben-manes/caffeine/pull/93#issuecomment-227617998 for an explanation * Fix distribution and documentation * Add caffeine to extensions.md * Fix links in extensions.md * Lexicographic	2016-07-06 15:42:54 -07:00
Gian Merlino	ebf890fe79	Update master version to 0.9.2-SNAPSHOT. (#3133 )	2016-06-13 13:10:38 -07:00
Charles Allen	aa2982ee31	Update bytebuffer-collections to 0.2.5 (#3117 )	2016-06-13 08:41:20 -07:00
Fangjin Yang	53886a677c	include avro in the druid tarball (#3123 )	2016-06-13 16:58:21 +05:30
David Lim	6d38dde2f8	exclude slf4j-log4j12 (#3075 )	2016-06-03 11:39:23 -07:00
Charles Allen	8024b915e2	[QTL] Implement LookupExtractorFactory of namespaced lookup (#2926 ) * support LookupReferencesManager registration of namespaced lookup and eliminate static configurations for lookup from namespecd lookup extensions - druid-namespace-lookup and druid-kafka-extraction-namespace are modified - However, druid-namespace-lookup still has configuration about ON/OFF HEAP cache manager selection, which is not namespace wide configuration but node wide configuration as multiple namespace shares the same cache manager * update KafkaExtractionNamespaceTest to reflect argument signature changes * Add more synchronization functionality to NamespaceLookupExtractorFactory * Remove old way of using extraction namespaces * resolve compile error by supporting LookupIntrospectHandler * Remove kafka lookups * Remove unused stuff * Fix start and stop behavior to be consistent with new javadocs * Remove unused strings * Add timeout option * Address comments on configurations and improve docs * Add more options and update hash key and replaces * Move monitoring to the overriding classes * Add better start/stop logging * Remove old docs about namespace names * Fix bad comma * Add `@JsonIgnore` to lookup factory * Address code review comments * Remove ExtractionNamespace from module json registration * Fix problems with naming and initialization. Add tests * Optimize imports / reformat * Fix future not being properly cancelled on failed initial scheduling * Fix delete returns * Add more docs about whole introspection * Add `/version` introspection point for lookups * Add more tests and address comments * Add StaticMap extraction namespace for testing. Also add a bunch of tests * Move cache system property to `druid.lookup.namespace.cache.type` * Make VERSION lower case * Change poll period to 0ms for StaticMap * Move cache key to bytebuffer * Change hashCode and equals on static map extraction fn * Add more comments on StaticMap * Address comments * Make scheduleAndWait use a latch * Sanity renames and fix imports * Remove extra info in docs * Fix review comments * Strengthen failure on start from warn to error * Address comments * Rename namespace-lookup to lookups-cached-global * Fix injective mis-naming * Also add serde test	2016-05-24 10:56:40 -07:00
Xavier Léauté	e79284da59	new interval based cost function (#2972 ) * new interval based cost function Addresses issues with balancing of segments in the existing cost function - `gapPenalty` led to clusters of segments ~30 days apart - `recencyPenalty` caused imbalance among recent segments - size-based cost could be skewed by compression New cost function is purely based on segment intervals: - assumes each time-slice of a partition is a constant cost - cost is additive, i.e. cost(A, B union C) = cost(A, B) + cost(A, C) - cost decays exponentially based on distance between time-slices * comments and formatting * add more comments to explain the calculation	2016-05-17 09:56:00 -07:00
michaelschiff	2203a812bc	statsd-emitter (#2410 )	2016-04-28 18:41:02 -07:00
Xavier Léauté	fc91120b54	Merge pull request #2857 from metamx/upgrade-zk upgrade zookeeper client dependency to 3.4.8	2016-04-20 10:36:07 +05:30
Xavier Léauté	838768c632	upgrade curator, fixes #2829 (#2849 )	2016-04-18 13:17:36 -07:00
Himanshu Gupta	308211cc18	math expression language with parser/lexer generated using ANTLR	2016-04-08 11:40:29 -05:00
DuNinglin [杜宁林]	0f67ff7dfb	reoganize code folder according to recent upstream folder changes, seperate it from avro code and take it into extensions-conrib. docs rewite too	2016-03-30 11:21:41 +08:00
Fangjin Yang	62c1dc7a09	Merge pull request #2602 from binlijin/distinctcount implement special distinctcount	2016-03-28 17:20:17 -07:00
Gian Merlino	977e867ad8	Downgrade geoip2, exclude com.google.http-client. Reverts "Update com.maxmind.geoip2 to 2.6.0" and exclude the google http client from com.maxmind.geoip2. This should satisfy the original need from #2646 (wanting to run Druid along with an upgraded com.google.http-client) while preventing Jackson conflicts pointed out in #2717. Fixes #2717. This reverts commit `21b7572533`.	2016-03-25 14:43:22 -07:00
Gian Merlino	7e7a886f65	Move druid-api into the druid repo. This is from druid-api-0.3.17, as of commit 51884f1d05d5512cacaf62cedfbb28c6ab2535cf in the druid-api repo.	2016-03-24 11:04:34 -07:00
binlijin	2729efca71	implement special distinctcount	2016-03-24 11:11:11 +08:00
jon-wei	a59c9ee1b1	Support use of DimensionSchema class in DimensionsSpec	2016-03-21 13:12:04 -07:00
Gian Merlino	738dcd8cd9	Update version to 0.9.1-SNAPSHOT. Fixes #2462	2016-03-17 10:34:20 -07:00
Nishant	773d6fe86c	Merge pull request #2646 from atomx/update-maxmind Update com.maxmind.geoip2 to 2.6.0	2016-03-14 11:20:48 -07:00
Erik Dubbelboer	21b7572533	Update com.maxmind.geoip2 to 2.6.0 com.maxmind.geoip2 2.6.0 depends on com.google.http-client 1.15.0-rc (3 years old). When trying to include other libraries in Druid that require an up to date version of com.google.http-client this causes a problem.	2016-03-12 09:44:00 +00:00
Gian Merlino	f22fb2c2cf	KafkaIndexTask. Reads a specific offset range from specific partitions, and can use dataSource metadata transactions to guarantee exactly-once ingestion. Each task has a finite lifecycle, so it is expected that some process will be supervising existing tasks and creating new ones when needed.	2016-03-10 18:41:43 -08:00
Nishant	ba1185963b	Fix a bunch of dependencies * Eliminate exclusion groups from pull-deps * Only consider dependency nodes in pull-deps if they are not in the following scopes * provided * test * system * Fix a bunch of `<scope>provided</scope>` missing tags * Better exclusions for a couple of problematic libs	2016-03-10 10:18:08 -08:00
fjy	e3e932a4d4	refactor extensions into core and contrib	2016-03-08 17:12:09 -08:00
Gian Merlino	004028b887	Make first few allocatePendingSegment retries quiet. Some light retrying can happen during normal operation (SELECT -> INSERT races) and the ensuing log messages would be scary for users.	2016-03-02 13:40:29 -08:00
Fangjin Yang	3a9fe2aad0	Merge pull request #2231 from lizhanhui/pull_request Add druid-rocketmq module	2016-02-25 17:19:57 -08:00
Bingkun Guo	9e4c908922	generate tarball by mvn package	2016-02-18 16:42:41 -06:00
Slim Bouguerra	4e119b7a24	Adding lookup ref manager and lookup dimension spec impl	2016-02-11 12:11:51 -06:00
Charles Allen	3a26b3926c	Identify druid.io as committer in pom.xml	2016-02-02 17:01:58 -08:00
Xavier Léauté	e3d1e07b34	Merge pull request #2261 from metamx/improve-segment-ordering Prioritize loading of segments based on segment interval	2016-01-27 10:05:54 -08:00
Nishant	fd6bf3fe22	Use interval comparator instead of bucketMonthComparator fix when two segments have same interval review comments	2016-01-27 17:35:43 +05:30
Charles Allen	937ae6ad20	Update druid-api to 0.3.16 Fixes https://github.com/druid-io/druid/issues/2316	2016-01-22 14:37:16 -08:00
Slim Bouguerra	e0d90f875c	Graphite emitter	2016-01-21 13:43:37 -06:00
Fangjin Yang	1b162a67ff	Merge pull request #2235 from druid-io/updateCommonsIO Update commons-io to 2.4	2016-01-10 08:48:25 -08:00
pdeva	62aa8fec94	Updated log4j version	2016-01-09 10:45:40 -08:00
Charles Allen	c1abcc3ef9	Update commons-io to 2.4 Hadoop2.3.0 uses version 2.4 as per http://central.maven.org/maven2/org/apache/hadoop/hadoop-project/2.3.0/hadoop-project-2.3.0.pom	2016-01-08 21:39:50 -08:00
Li Zhanhui	8eb332c1c4	Add druid-rocketmq module	2016-01-08 08:13:04 +08:00
Charles Allen	b7b4d9f284	Update bytebuffer-collections to 0.2.4 Pulls in fix for https://github.com/RoaringBitmap/RoaringBitmap/issues/61	2016-01-07 10:21:49 -08:00
Charles Allen	3c4bdb7cc8	Manually update <tag> from <scm> in pom.xml	2016-01-05 14:42:25 -08:00
Gian Merlino	b93feb5e77	Update java-util, fixes #2193	2016-01-05 11:16:03 -05:00
Zhao Weinan	5e57ddb8cc	Adding avro support to realtime & hadoop batch indexing.	2016-01-05 10:21:27 +08:00
Charles Allen	2097669cce	Update bytebuffer-collections to 0.2.3 * Fixes https://github.com/druid-io/druid/issues/2175	2016-01-04 11:20:45 -08:00
Gian Merlino	891d639188	Remove unused kafka-seven extension.	2015-12-29 12:05:27 -05:00
fjy	398a3ec620	add docs for more specs	2015-12-17 18:06:30 -08:00
jon-wei	c53bf85d83	Add docs and benchmark for JSON flattening parser	2015-12-09 16:13:30 -08:00
Gian Merlino	f6f7bec2b6	Update java-util.	2015-12-08 15:32:27 -08:00
Himanshu	5f2466afd1	Merge pull request #2045 from metamx/updateEmitter036 Update mmx emitter to 0.3.6	2015-12-05 23:20:17 -06:00
Charles Allen	ea5fdc30f8	Update mmx emitter to 0.3.6 * 0.3.5 updated better logging messages * 0.3.6 updates validator dependency to help prevent stale validator jars from being pulled in	2015-12-04 12:50:22 -08:00
Gian Merlino	fde4753e25	Disable javadoc linting.	2015-12-03 19:11:29 -08:00
Himanshu Gupta	9c569be11e	adding datasketches module to top level pom	2015-11-12 00:04:33 -06:00
Xavier Léauté	a57cbfd2c3	Merge pull request #1387 from metamx/enableShutdownLogging Add special handler to allow logger messages during shutdown	2015-11-09 17:20:09 -08:00
Xavier Léauté	c896818241	Update curator to 2.9.1 Lots of bugfixes since 2.8.0 - https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12314425&version=12333324 - https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12314425&version=12332392	2015-11-05 15:53:01 -08:00
Lou Marvin Caraig	c924f9fe56	Added cloudfiles-extensions in order to support Rackspace's cloudfiles as deep storage	2015-11-04 17:44:48 +01:00
Nishant	dcd4468156	update emitter version contains changes - - https://github.com/metamx/emitter/pull/9 - https://github.com/metamx/emitter/pull/13 - https://github.com/metamx/emitter/pull/12 - https://github.com/metamx/emitter/pull/10	2015-10-29 17:43:03 +05:30
Nishant	20a3ebc022	update server metrics version - fixes Sigar loading for JvmCpuMetrics https://github.com/metamx/server-metrics/pull/16 update server metrics	2015-10-29 17:37:45 +05:30
Gian Merlino	7df7370935	Merge pull request #1862 from metamx/indexingServiceMMGone Add timeout to shutdown request to middle manager for indexing service	2015-10-27 14:38:01 -07:00
Charles Allen	7a2ceef690	Add special handler to allow logger messages during shutdown * Adds a special PropertyChecker interface which is ONLY for setting string properties at the very start of psvm	2015-10-27 14:33:36 -07:00
Charles Allen	44a2b204df	Add timeout to shutdown request to middle manager for indexing service	2015-10-27 13:56:03 -07:00
Bingkun Guo	4914925d65	New extension loading mechanism 1) Remove maven client from downloading extensions at runtime. 2) Provide a way to load Druid extensions and hadoop dependencies through file system. 3) Refactor pull-deps so that it can download extensions into extension directories. 4) Add documents on how to use this new extension loading mechanism. 5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0 are packaged within the Druid tarball.	2015-10-21 14:22:36 -05:00
Xavier Léauté	e4ac78e43d	bump next snapshot to 0.9.0	2015-10-20 13:46:13 -07:00
Xavier Léauté	4c2c7a2c37	update version to 0.8.3	2015-10-14 21:40:55 -07:00
Xavier Léauté	5a98d4e650	update coveralls plugin	2015-10-14 10:25:23 -07:00
Charles Allen	e9b81430f4	Bump server-metrics to 0.2.5 to catch a few fixes.	2015-10-08 11:05:51 -07:00
Nishant	38664904e2	Revert Jetty version update to 9.2.10, latest version that is working revert to jetty 9.2.5, last known good version	2015-10-08 21:53:13 +05:30
Nishant	42e971d1c1	Merge pull request #1797 from himanshug/fix_ingest_segment_firehose_ut ingest segment firehose ut	2015-10-02 21:57:22 +05:30
Himanshu Gupta	e2b16ab281	update java-util dep version	2015-10-01 16:06:04 -05:00
Charles Allen	bdae0cb135	Update httpcomponents and aws-sdk	2015-10-01 13:28:46 -07:00
Himanshu	63a3a4a254	Merge pull request #1763 from metamx/server-metrics-fixes fix NPE and duplicate metric keys	2015-09-22 09:39:01 -05:00
Xavier Léauté	0fe9aeb3d6	fix NPE and duplicate metric keys	2015-09-21 22:50:49 -07:00
Charles Allen	045f72505c	Merge pull request #1759 from metamx/update-roaring better faster smaller roaring bitmaps	2015-09-21 18:50:19 -07:00
Xavier Léauté	df6988bbd2	better faster smaller roaring bitmaps	2015-09-21 17:00:57 -07:00
Xavier Léauté	af86c0e6ea	update druid-api + java-util for timstamp parsing speedup	2015-09-21 09:57:29 -07:00
Himanshu Gupta	ebdb612933	composing emitter module to use multiple emitters together	2015-09-09 16:45:50 -05:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00

1 2 3 4 5 ...

1253 Commits