162 Commits

Author SHA1 Message Date
Gian Merlino
6bbc992101 [maven-release-plugin] prepare release druid-0.3.0 2013-02-15 13:03:32 -08:00
Deep Ganguli
e042c29173 Fixed typos in comments, changed prefix on s3Paths to s3n from s3://, cleaned
up umbrellaIntervals code in JodaUtils, modified the DbUpdater to push segment
meta-data to the database in batches.
2013-02-15 11:39:49 -08:00
Gian Merlino
e3822f6ca7 DeterminePartitionsJob: Fix docs 2013-02-13 18:51:33 -08:00
Gian Merlino
ee193f0ca7 DeterminePartitionsJob: Select partition dimension to minimize segment size variance when cardinality is low 2013-02-13 18:51:32 -08:00
Eric Tschetter
c5b78e75cf 1) Move up to Jackson 2.1
2) Some bugs were fixed, but they ended up getting mixed in with the Jackson upgrade :(
2013-02-13 18:51:32 -08:00
xvrl
07b0b4611b bump up to 0.3.0 2013-02-12 09:52:21 -08:00
Fangjin Yang
e519f82e87 [maven-release-plugin] prepare for next development iteration 2013-02-11 10:58:51 -08:00
Fangjin Yang
73f7468a23 [maven-release-plugin] prepare release druid-0.2.7 2013-02-11 10:58:43 -08:00
Eric Tschetter
9ac5eeebb3 1) Fix NPE in DeterminePartitionsJob when it fails 2013-02-06 23:34:46 -08:00
Fangjin Yang
ec2b906fad [maven-release-plugin] prepare for next development iteration 2013-02-06 18:43:42 -08:00
Fangjin Yang
22d9607169 [maven-release-plugin] prepare release druid-0.2.6 2013-02-06 18:43:34 -08:00
Fangjin Yang
e09c658352 [maven-release-plugin] prepare for next development iteration 2013-02-06 17:42:39 -08:00
Fangjin Yang
668353e895 [maven-release-plugin] prepare release druid-0.2.5 2013-02-06 17:42:33 -08:00
Fangjin Yang
d023d219ae [maven-release-plugin] prepare for next development iteration 2013-02-06 16:54:21 -08:00
Fangjin Yang
d0fdf4ccf0 [maven-release-plugin] prepare release druid-0.2.4 2013-02-06 16:54:16 -08:00
Eric Tschetter
34b4383934 1) Adjust DataSegment to have a binaryVersion field that indicates the expected binary version of the segment files
2) Eliminate race condition from RemoteTaskRunnerTest.testAlreadyExecutedTask()
2013-02-01 18:59:33 -06:00
xvrl
7ec3563e7b [maven-release-plugin] prepare for next development iteration 2013-02-01 16:25:42 -08:00
xvrl
a239f62158 [maven-release-plugin] prepare release druid-0.2.3 2013-02-01 16:25:35 -08:00
Fangjin Yang
00e6ac1e10 [maven-release-plugin] prepare for next development iteration 2013-01-30 17:09:23 -08:00
Fangjin Yang
ef19de8270 [maven-release-plugin] prepare release druid-0.2.2 2013-01-30 17:09:18 -08:00
Fangjin Yang
fa733565e8 [maven-release-plugin] prepare for next development iteration 2013-01-30 16:28:28 -08:00
Fangjin Yang
0b116a8fe4 [maven-release-plugin] prepare release druid-0.2.1 2013-01-30 16:28:22 -08:00
Fangjin Yang
934207be0d updating sdk version to fix indexer in aws 2013-01-30 16:26:24 -08:00
Fangjin Yang
2a0de2090b [maven-release-plugin] prepare for next development iteration 2013-01-30 13:47:15 -08:00
Fangjin Yang
049ca2bad4 [maven-release-plugin] prepare release druid-0.2.0 2013-01-30 13:47:08 -08:00
Deep Ganguli
cb845e6f09 Addresed Gian's code review 2013-01-24 17:54:06 -08:00
Deep Ganguli
017d4779d6 Implemented Hadoop Index Task which takes as input a HadoopDruidIndexConfig and
generates index segments.

The HadoopIndexTask run method wraps a HadoopDruidIndexerJob run method. The
key modifications to the HadoopDruidIndexerJob are as follows:

- The UpDaterJobSpec field of the config that is used to set up the indexer job
  is set to null. This ensures that the job does not push a list of published
segments to the database, in order to allow the indexing service to handle this
later.
- Set the version field of the config file based on the TaskContext. Also
  changed config.setVersion method to take a string (as opposed to a Date) as
input, and propogated this change where necessary.
- Set the SegmentOutputDir field of the config file based on the TaskToolbox,
  to allow the indexing service to handle where to write the segments too.
- Added a method to IndexGeneratorJob called getPublishedSegments, that simply
  returns a list of published segments without publishing this list to the
database.
2013-01-23 19:27:14 -08:00
Gian Merlino
77a3f3cbe0 Merge branch 'master' into determine-partitions
Conflicts:
	indexer/src/main/java/com/metamx/druid/indexer/IndexGeneratorJob.java
2013-01-21 14:46:13 -08:00
Gian Merlino
d9e6f1d954 DeterminePartitions follow-up
HadoopDruidIndexerConfig:
- Add partitionsSpec (backwards compatible with targetPartitionSize and partitionDimension)
- Add assumeGrouped flag to partitionsSpec

DeterminePartitionsJob:
- Skip group-by job if assumeGrouped is set
- Clean up code a bit
2013-01-21 14:38:35 -08:00
Eric Tschetter
c8cb96b006 1) Remove vast majority of usages of IndexIO.mapDir() and deprecated it. IndexIO.loadIndex() is the new IndexIO.mapDir()
2) Fix bug with IndexMerger and null columns
3) Add QueryableIndexIndexableAdapter so that QueryableIndexes can be merged
4) Adjust twitter example to have multiple values for each hash tag
5) Adjusted GroupByQueryEngine to just drop dimensions that don't exist instead of throwing an NPE
2013-01-16 17:10:33 -06:00
Gian Merlino
7b42ee6a6e Rework DeterminePartitionsJob in the hadoop indexer
- Can handle non-rolled-up input (by grouping input rows using an additional MR stage)
- Can select its own partitioning dimension, if none is supplied
- Can detect and avoid oversized shards due to bad dimension value distribution
- Shares input parsing code with IndexGeneratorJob
2013-01-16 08:15:01 -08:00
Gian Merlino
616415cb7e UniformGranularitySpec: Only return bucketInterval for timestamps that legitimately
overlap our input intervals
2013-01-15 22:30:17 -08:00
Fangjin Yang
7e074e8158 fix pom breakage 2013-01-15 12:04:12 -08:00
xvrl
9627fdcd1c [maven-release-plugin] prepare for next development iteration 2013-01-03 10:09:52 -08:00
xvrl
f2c2d3f4cf [maven-release-plugin] prepare release druid-0.1.25 2013-01-03 10:09:43 -08:00
xvrl
2a188996cd [maven-release-plugin] prepare for next development iteration 2012-12-31 14:56:10 -08:00
xvrl
6cdcc2becc [maven-release-plugin] prepare release druid-0.1.24 2012-12-31 14:56:03 -08:00
Fangjin Yang
5137af0750 [maven-release-plugin] prepare for next development iteration 2012-12-18 16:17:44 -08:00
Fangjin Yang
c579ba5a42 [maven-release-plugin] prepare release druid-0.1.23 2012-12-18 16:17:38 -08:00
Fangjin Yang
090a5a105d [maven-release-plugin] prepare for next development iteration 2012-12-18 14:19:09 -08:00
Fangjin Yang
fb8f4f1f41 [maven-release-plugin] prepare release druid-0.1.22 2012-12-18 14:19:02 -08:00
Fangjin Yang
75d465aed0 [maven-release-plugin] prepare for next development iteration 2012-12-18 10:37:16 -08:00
Fangjin Yang
decc4e4c0e [maven-release-plugin] prepare release druid-0.1.21 2012-12-18 10:37:08 -08:00
Fangjin Yang
e34944b1c5 [maven-release-plugin] prepare for next development iteration 2012-12-15 00:52:01 -08:00
Fangjin Yang
24f0207fd8 [maven-release-plugin] prepare release druid-0.1.20 2012-12-15 00:51:54 -08:00
Fangjin Yang
f9386c3623 [maven-release-plugin] prepare for next development iteration 2012-12-15 00:44:58 -08:00
Fangjin Yang
4bcc8d27fa [maven-release-plugin] prepare release druid-0.1.19 2012-12-15 00:44:50 -08:00
Fangjin Yang
5fff07a8b0 [maven-release-plugin] prepare for next development iteration 2012-12-14 22:56:49 -08:00
Fangjin Yang
2b2818dbaf [maven-release-plugin] prepare release druid-0.1.18 2012-12-14 22:56:43 -08:00
Fangjin Yang
b0e1f90e35 [maven-release-plugin] prepare for next development iteration 2012-12-14 22:07:57 -08:00