Commit Graph

233 Commits

Author SHA1 Message Date
Gian Merlino 03ff72b7e2 [maven-release-plugin] prepare release druid-0.3.8 2013-03-01 16:57:38 -08:00
Gian Merlino 998ed0a8de [maven-release-plugin] prepare for next development iteration 2013-03-01 14:43:20 -08:00
Gian Merlino 89d5ddf2f7 [maven-release-plugin] prepare release druid-0.3.7 2013-03-01 14:43:13 -08:00
Fangjin Yang 360c6a1df1 [maven-release-plugin] prepare for next development iteration 2013-03-01 13:55:07 -08:00
Fangjin Yang a43dc44ebb [maven-release-plugin] prepare release druid-0.3.6 2013-03-01 13:55:01 -08:00
Fangjin Yang 3264a689f1 [maven-release-plugin] prepare for next development iteration 2013-03-01 11:43:23 -08:00
Fangjin Yang 334fffdef5 [maven-release-plugin] prepare release druid-0.3.5 2013-03-01 11:43:16 -08:00
Gian Merlino e40fba4de2 HadoopIndexTask: Jackson fixes and general overriding of storage-specific stuff 2013-02-28 07:53:28 -08:00
Gian Merlino f862d9205d Merge branch 'master' into hadoop-index-task
Conflicts:
	merger/src/main/java/com/metamx/druid/merger/common/task/Task.java
2013-02-27 06:53:06 -08:00
Fangjin Yang 2e1e1753d5 [maven-release-plugin] prepare for next development iteration 2013-02-26 15:15:54 -08:00
Fangjin Yang ab456cf3b1 [maven-release-plugin] prepare release druid-0.3.4 2013-02-26 15:15:36 -08:00
Gian Merlino 9964e7dfe4 [maven-release-plugin] prepare for next development iteration 2013-02-25 12:24:24 -08:00
Gian Merlino 99c4f9446c [maven-release-plugin] prepare release druid-0.3.3 2013-02-25 12:24:15 -08:00
Gian Merlino 7d7ce2b7fe Fix DeterminePartitionsJob ISE for dimensions not present in all rows 2013-02-25 11:22:25 -08:00
Eric Tschetter e0f6df1a5c [maven-release-plugin] prepare for next development iteration 2013-02-21 20:39:50 -06:00
Eric Tschetter 846bc0e4f0 [maven-release-plugin] prepare release druid-0.3.2 2013-02-21 20:39:45 -06:00
Eric Tschetter 699aadd2b1 [maven-release-plugin] prepare for next development iteration 2013-02-21 15:50:01 -06:00
Eric Tschetter 606a2e4b01 [maven-release-plugin] prepare release druid-0.3.1 2013-02-21 15:49:55 -06:00
Eric Tschetter f8c54a72c2 1) Changes to allow for local storage 2013-02-21 15:47:01 -06:00
Deep Ganguli 60da9ffddb Merge branch 'master' into hadoop-index-task
Conflicts:
	common/src/main/java/com/metamx/druid/utils/JodaUtils.java
	indexer/src/main/java/com/metamx/druid/indexer/DbUpdaterJob.java
	indexer/src/main/java/com/metamx/druid/indexer/HadoopDruidIndexerConfig.java
	indexer/src/main/java/com/metamx/druid/indexer/IndexGeneratorJob.java
2013-02-15 13:22:31 -08:00
Gian Merlino 824e3c0eb2 [maven-release-plugin] prepare for next development iteration 2013-02-15 13:03:39 -08:00
Gian Merlino 6bbc992101 [maven-release-plugin] prepare release druid-0.3.0 2013-02-15 13:03:32 -08:00
Deep Ganguli e042c29173 Fixed typos in comments, changed prefix on s3Paths to s3n from s3://, cleaned
up umbrellaIntervals code in JodaUtils, modified the DbUpdater to push segment
meta-data to the database in batches.
2013-02-15 11:39:49 -08:00
Gian Merlino e3822f6ca7 DeterminePartitionsJob: Fix docs 2013-02-13 18:51:33 -08:00
Gian Merlino ee193f0ca7 DeterminePartitionsJob: Select partition dimension to minimize segment size variance when cardinality is low 2013-02-13 18:51:32 -08:00
Eric Tschetter c5b78e75cf 1) Move up to Jackson 2.1
2) Some bugs were fixed, but they ended up getting mixed in with the Jackson upgrade :(
2013-02-13 18:51:32 -08:00
xvrl 07b0b4611b bump up to 0.3.0 2013-02-12 09:52:21 -08:00
Fangjin Yang e519f82e87 [maven-release-plugin] prepare for next development iteration 2013-02-11 10:58:51 -08:00
Fangjin Yang 73f7468a23 [maven-release-plugin] prepare release druid-0.2.7 2013-02-11 10:58:43 -08:00
Eric Tschetter 9ac5eeebb3 1) Fix NPE in DeterminePartitionsJob when it fails 2013-02-06 23:34:46 -08:00
Fangjin Yang ec2b906fad [maven-release-plugin] prepare for next development iteration 2013-02-06 18:43:42 -08:00
Fangjin Yang 22d9607169 [maven-release-plugin] prepare release druid-0.2.6 2013-02-06 18:43:34 -08:00
Fangjin Yang e09c658352 [maven-release-plugin] prepare for next development iteration 2013-02-06 17:42:39 -08:00
Fangjin Yang 668353e895 [maven-release-plugin] prepare release druid-0.2.5 2013-02-06 17:42:33 -08:00
Fangjin Yang d023d219ae [maven-release-plugin] prepare for next development iteration 2013-02-06 16:54:21 -08:00
Fangjin Yang d0fdf4ccf0 [maven-release-plugin] prepare release druid-0.2.4 2013-02-06 16:54:16 -08:00
Eric Tschetter 34b4383934 1) Adjust DataSegment to have a binaryVersion field that indicates the expected binary version of the segment files
2) Eliminate race condition from RemoteTaskRunnerTest.testAlreadyExecutedTask()
2013-02-01 18:59:33 -06:00
xvrl 7ec3563e7b [maven-release-plugin] prepare for next development iteration 2013-02-01 16:25:42 -08:00
xvrl a239f62158 [maven-release-plugin] prepare release druid-0.2.3 2013-02-01 16:25:35 -08:00
Fangjin Yang 00e6ac1e10 [maven-release-plugin] prepare for next development iteration 2013-01-30 17:09:23 -08:00
Fangjin Yang ef19de8270 [maven-release-plugin] prepare release druid-0.2.2 2013-01-30 17:09:18 -08:00
Fangjin Yang fa733565e8 [maven-release-plugin] prepare for next development iteration 2013-01-30 16:28:28 -08:00
Fangjin Yang 0b116a8fe4 [maven-release-plugin] prepare release druid-0.2.1 2013-01-30 16:28:22 -08:00
Fangjin Yang 934207be0d updating sdk version to fix indexer in aws 2013-01-30 16:26:24 -08:00
Fangjin Yang 2a0de2090b [maven-release-plugin] prepare for next development iteration 2013-01-30 13:47:15 -08:00
Fangjin Yang 049ca2bad4 [maven-release-plugin] prepare release druid-0.2.0 2013-01-30 13:47:08 -08:00
Deep Ganguli cb845e6f09 Addresed Gian's code review 2013-01-24 17:54:06 -08:00
Deep Ganguli 017d4779d6 Implemented Hadoop Index Task which takes as input a HadoopDruidIndexConfig and
generates index segments.

The HadoopIndexTask run method wraps a HadoopDruidIndexerJob run method. The
key modifications to the HadoopDruidIndexerJob are as follows:

- The UpDaterJobSpec field of the config that is used to set up the indexer job
  is set to null. This ensures that the job does not push a list of published
segments to the database, in order to allow the indexing service to handle this
later.
- Set the version field of the config file based on the TaskContext. Also
  changed config.setVersion method to take a string (as opposed to a Date) as
input, and propogated this change where necessary.
- Set the SegmentOutputDir field of the config file based on the TaskToolbox,
  to allow the indexing service to handle where to write the segments too.
- Added a method to IndexGeneratorJob called getPublishedSegments, that simply
  returns a list of published segments without publishing this list to the
database.
2013-01-23 19:27:14 -08:00
Gian Merlino 77a3f3cbe0 Merge branch 'master' into determine-partitions
Conflicts:
	indexer/src/main/java/com/metamx/druid/indexer/IndexGeneratorJob.java
2013-01-21 14:46:13 -08:00
Gian Merlino d9e6f1d954 DeterminePartitions follow-up
HadoopDruidIndexerConfig:
- Add partitionsSpec (backwards compatible with targetPartitionSize and partitionDimension)
- Add assumeGrouped flag to partitionsSpec

DeterminePartitionsJob:
- Skip group-by job if assumeGrouped is set
- Clean up code a bit
2013-01-21 14:38:35 -08:00
Eric Tschetter c8cb96b006 1) Remove vast majority of usages of IndexIO.mapDir() and deprecated it. IndexIO.loadIndex() is the new IndexIO.mapDir()
2) Fix bug with IndexMerger and null columns
3) Add QueryableIndexIndexableAdapter so that QueryableIndexes can be merged
4) Adjust twitter example to have multiple values for each hash tag
5) Adjusted GroupByQueryEngine to just drop dimensions that don't exist instead of throwing an NPE
2013-01-16 17:10:33 -06:00
Gian Merlino 7b42ee6a6e Rework DeterminePartitionsJob in the hadoop indexer
- Can handle non-rolled-up input (by grouping input rows using an additional MR stage)
- Can select its own partitioning dimension, if none is supplied
- Can detect and avoid oversized shards due to bad dimension value distribution
- Shares input parsing code with IndexGeneratorJob
2013-01-16 08:15:01 -08:00
Gian Merlino 616415cb7e UniformGranularitySpec: Only return bucketInterval for timestamps that legitimately
overlap our input intervals
2013-01-15 22:30:17 -08:00
Fangjin Yang 7e074e8158 fix pom breakage 2013-01-15 12:04:12 -08:00
xvrl 9627fdcd1c [maven-release-plugin] prepare for next development iteration 2013-01-03 10:09:52 -08:00
xvrl f2c2d3f4cf [maven-release-plugin] prepare release druid-0.1.25 2013-01-03 10:09:43 -08:00
xvrl 2a188996cd [maven-release-plugin] prepare for next development iteration 2012-12-31 14:56:10 -08:00
xvrl 6cdcc2becc [maven-release-plugin] prepare release druid-0.1.24 2012-12-31 14:56:03 -08:00
Fangjin Yang 5137af0750 [maven-release-plugin] prepare for next development iteration 2012-12-18 16:17:44 -08:00
Fangjin Yang c579ba5a42 [maven-release-plugin] prepare release druid-0.1.23 2012-12-18 16:17:38 -08:00
Fangjin Yang 090a5a105d [maven-release-plugin] prepare for next development iteration 2012-12-18 14:19:09 -08:00
Fangjin Yang fb8f4f1f41 [maven-release-plugin] prepare release druid-0.1.22 2012-12-18 14:19:02 -08:00
Fangjin Yang 75d465aed0 [maven-release-plugin] prepare for next development iteration 2012-12-18 10:37:16 -08:00
Fangjin Yang decc4e4c0e [maven-release-plugin] prepare release druid-0.1.21 2012-12-18 10:37:08 -08:00
Fangjin Yang e34944b1c5 [maven-release-plugin] prepare for next development iteration 2012-12-15 00:52:01 -08:00
Fangjin Yang 24f0207fd8 [maven-release-plugin] prepare release druid-0.1.20 2012-12-15 00:51:54 -08:00
Fangjin Yang f9386c3623 [maven-release-plugin] prepare for next development iteration 2012-12-15 00:44:58 -08:00
Fangjin Yang 4bcc8d27fa [maven-release-plugin] prepare release druid-0.1.19 2012-12-15 00:44:50 -08:00
Fangjin Yang 5fff07a8b0 [maven-release-plugin] prepare for next development iteration 2012-12-14 22:56:49 -08:00
Fangjin Yang 2b2818dbaf [maven-release-plugin] prepare release druid-0.1.18 2012-12-14 22:56:43 -08:00
Fangjin Yang b0e1f90e35 [maven-release-plugin] prepare for next development iteration 2012-12-14 22:07:57 -08:00
Fangjin Yang ce8e02b5a3 [maven-release-plugin] prepare release druid-0.1.17 2012-12-14 22:07:50 -08:00
Fangjin Yang 0821dce8c1 [maven-release-plugin] prepare for next development iteration 2012-12-14 21:29:20 -08:00
Fangjin Yang a0d22a6646 [maven-release-plugin] prepare release druid-0.1.16 2012-12-14 21:29:14 -08:00
Fangjin Yang e37e24cc25 [maven-release-plugin] prepare for next development iteration 2012-12-14 17:19:59 -08:00
Fangjin Yang 4e78d8e734 [maven-release-plugin] prepare release druid-0.1.15 2012-12-14 17:19:53 -08:00
Eric Tschetter b4a1bb4a00 [maven-release-plugin] prepare for next development iteration 2012-12-14 11:37:17 -06:00
Eric Tschetter cb9b18c7c2 [maven-release-plugin] prepare release druid-0.1.14 2012-12-14 11:37:12 -06:00
Eric Tschetter 29a64f50e2 [maven-release-plugin] prepare for next development iteration 2012-12-13 17:57:46 -06:00
Eric Tschetter c82007973e [maven-release-plugin] prepare release druid-0.1.13 2012-12-13 17:57:42 -06:00
Fangjin Yang 6729f166d9 [maven-release-plugin] prepare for next development iteration 2012-12-12 15:38:58 -08:00
Fangjin Yang 578cf94bc0 [maven-release-plugin] prepare release druid-0.1.12 2012-12-12 15:38:37 -08:00
Fangjin Yang 0105233fc3 [maven-release-plugin] prepare for next development iteration 2012-12-07 17:50:47 -08:00
Fangjin Yang 943fe5e6bf [maven-release-plugin] prepare release druid-0.1.11 2012-12-07 17:50:41 -08:00
Fangjin Yang da4b31c28f [maven-release-plugin] prepare for next development iteration 2012-12-07 15:18:19 -08:00
Fangjin Yang 6df32ac0b0 [maven-release-plugin] prepare release druid-0.1.10 2012-12-07 15:18:11 -08:00
Fangjin Yang 9eb28254f8 [maven-release-plugin] prepare for next development iteration 2012-12-07 14:49:13 -08:00
Fangjin Yang 9ce536e040 [maven-release-plugin] prepare release druid-0.1.9 2012-12-07 14:49:04 -08:00
Fangjin Yang 35362b3e6b [maven-release-plugin] prepare for next development iteration 2012-12-07 14:37:25 -08:00
Fangjin Yang bd3c05dddb [maven-release-plugin] prepare release druid-0.1.8 2012-12-07 14:37:18 -08:00
Fangjin Yang 50a28f827e [maven-release-plugin] prepare for next development iteration 2012-12-07 10:59:52 -08:00
Fangjin Yang b59361065e [maven-release-plugin] prepare release druid-0.1.7 2012-12-07 10:59:34 -08:00
Fangjin Yang 5822f4f5f7 refactor master to run rules before cleaning up; more master stats; general improvements 2012-12-03 14:43:04 -08:00
Fangjin Yang 2e5e1ce989 first commit of tiers for compute nodes; working UT at this point 2012-11-28 17:37:08 -08:00
Eric Tschetter 0390e7fa5c [maven-release-plugin] prepare for next development iteration 2012-11-21 22:13:15 -06:00
Eric Tschetter 93a9affa45 [maven-release-plugin] prepare release druid-0.1.6 2012-11-21 22:13:08 -06:00
Gian Merlino 5e447937e2 [maven-release-plugin] prepare for next development iteration 2012-11-21 16:09:14 -08:00
Gian Merlino 32d46a3eec [maven-release-plugin] prepare release druid-0.1.5 2012-11-21 16:09:08 -08:00
Eric Tschetter 7497a31943 1) Revert emr hadoop version that was accidentally commited in pom. 2012-11-20 16:14:22 -06:00
Eric Tschetter 0f63cb4f00 1) Have IndexGeneratorJob write the descriptors for each of the segments it creates to a path in the temporary working directory (generally HDFS)
2) Have the DbUpdaterJob read descriptors from the temporary working directory instead of looking in the final segment output location (often the eventually consistent S3)
3) 1 and 2 Fixes #30
2012-11-20 15:30:50 -06:00
Eric Tschetter 701cc9562b 1) Adjust the StorageAdapters to lowercase names of metrics and dimensions before looking them up.
2) Add some docs to InputRow/Row to indicate that column names passed into the methods are *always* lowercase and that the rows need to act accordingly. (fixes #29, or at least clarifies the behavior...)
2012-11-19 17:01:17 -06:00
Fangjin Yang cf1c741f8a [maven-release-plugin] prepare for next development iteration 2012-11-13 15:10:12 -08:00
Fangjin Yang 06bd77fca7 [maven-release-plugin] prepare release druid-0.1.4 2012-11-13 15:10:00 -08:00
Fangjin Yang 0ef40171a8 nodes no longer inherit from interfaces but instead extend classes 2012-11-13 13:18:31 -08:00
Fangjin Yang 24564d73e1 register subtypes for reducer 2012-11-12 16:41:34 -08:00
Fangjin Yang 57468d39ef reverting some of the last changes 2012-11-12 16:14:48 -08:00
Fangjin Yang c20dccd0f4 modifying the way registering serdes works to hopefully be a bit easier to use 2012-11-12 13:58:43 -08:00
Marshall Pierce 447dfc0cc6 Switch to using javax.servlet:servlet-api:2.5 instead of two different versions of jetty's servlet-api distribution 2012-11-09 17:31:03 -08:00
Fangjin Yang dd17992172 [maven-release-plugin] prepare for next development iteration 2012-11-09 15:50:24 -08:00
Fangjin Yang 47e71080db [maven-release-plugin] prepare release druid-0.1.3 2012-11-09 15:50:16 -08:00
Fangjin Yang 5859599365 [maven-release-plugin] prepare for next development iteration 2012-11-09 13:41:04 -08:00
Fangjin Yang 56f2c15e21 [maven-release-plugin] prepare release druid-0.1.2 2012-11-09 13:40:57 -08:00
Fangjin Yang 0b7a3f259a [maven-release-plugin] prepare for next development iteration 2012-11-09 10:10:32 -08:00
Fangjin Yang baba4e7b75 [maven-release-plugin] prepare release druid-0.1.1 2012-11-09 10:10:26 -08:00
Fangjin Yang 03e5d74087 [maven-release-plugin] prepare for next development iteration 2012-11-09 09:33:06 -08:00
Fangjin Yang 8aff8decad [maven-release-plugin] prepare release druid-0.1.0 2012-11-09 09:32:57 -08:00
Eric Tschetter 0a0e2a6cc1 1) Try to fix the dependency issues for running the HadoopDruidIndexer locally. 2012-11-08 17:06:02 -08:00
Fangjin Yang 6da047b5fa fix backwards compatibility issues 2012-11-08 15:09:00 -08:00
Fangjin Yang 34cb352cf8 working indexer with registererers 2012-11-06 14:26:53 -08:00
Fangjin Yang 5698f640d7 fix last commit with version 2012-11-06 12:40:53 -08:00
Fangjin Yang 0b6dd99452 set default version if one is not set 2012-11-06 12:36:55 -08:00
Fangjin Yang 34a221a586 fix bug with jackson conversion 2012-11-06 11:56:48 -08:00
Fangjin Yang 68e5adde33 register registererers in the config 2012-11-06 11:49:17 -08:00
Fangjin Yang eb2b5a61fa fix setters for hadoop node 2012-11-05 18:40:54 -08:00
Fangjin Yang 2ae0a15b5a add register abilities to mapper 2012-11-05 18:31:23 -08:00
Fangjin Yang 9fbee29eb4 change hadoop indexer to be node based 2012-11-05 18:19:04 -08:00
Fangjin Yang 7b2522ff3f allow hadoop druid indexer to register registererers 2012-11-05 16:13:50 -08:00
Eric Tschetter f07b15db1f Undo massive import re-ordering 2012-11-01 18:38:27 -05:00
Ian Brandt e7ec073d75 Cleaned Maven dependency configuration. In particular the 'mvn dependency:analyze' report no longer warns of used but undeclared dependencies. 2012-10-31 16:10:07 -07:00
Ian Brandt e8f02219bb Streamlined the Maven POMs, in particular the plugin configurations. The 'mvn versions:display-plugin-updates' report is now warning free, and there are no more Maven warnings reported in Eclipse. 2012-10-31 12:58:16 -07:00
Ian Brandt 0d24df7628 Removed unused imports. 2012-10-30 22:22:46 -07:00
Eric Tschetter 27999caca0 1) Create LICENSE
2) Attach copyright and notice of license to files
2012-10-24 05:09:47 -04:00
Eric Tschetter 9d41599967 Initial commit of OSS Druid Code 2012-10-24 03:39:51 -04:00