187 Commits

Author SHA1 Message Date
Gian Merlino
7e70881101 HdfsTaskLogs: Allow overwriting existing logs.
Necessary because ForkingTaskRunner pushes logs when gracefully stopping,
but it may need to re-push those logs when the task finishes for real after
restoring.
2016-02-26 15:10:19 -08:00
Charles Allen
1fe277ee29 Merge pull request #2367 from se7entyse7en/feature-rackspace-cloud-files-static-firehose
Adds support to use Rackspace's cloudfiles as static firehose
2016-02-25 17:31:06 -08:00
Fangjin Yang
3a9fe2aad0 Merge pull request #2231 from lizhanhui/pull_request
Add druid-rocketmq module
2016-02-25 17:19:57 -08:00
Charles Allen
a8820d837b More logging for JDBCExtractionNamespaceFunctionFactory 2016-02-25 09:48:56 -08:00
Slim
368988d187 Merge pull request #2291 from druid-io/lookupManager
Promoting LookupExtractor state and LookupExtractorFactory to be a first class druid state object.
2016-02-11 16:07:27 -06:00
Fangjin Yang
d435a9b1e9 Merge pull request #2448 from metamx/fixBigJarHadoopPlace
Fix dependencies.
2016-02-11 10:47:00 -08:00
Slim Bouguerra
4e119b7a24 Adding lookup ref manager and lookup dimension spec impl 2016-02-11 12:11:51 -06:00
Will Lauer
189376a6f9 Adding optional error bounds to sketch aggs and post-aggs
By setting a new optional parameter, `errorBoundsStdDev`, to the number
of standard deviations to use when computing error bounds, the return
type for both the SketchMergeAggregator and the SketchEstimate
PostAggregator can be changed from a simple double (estimate) to a JSON
object containing the estimate, expected high bound, expected low bound,
and standard deviations used when computing bounds (same value as passed
in).
2016-02-11 10:18:16 -06:00
Charles Allen
40ade32a1f Fix dependencies.
* Don't put druid****selfcontained.jar at the end of the hadoop isolated classpath
* Add `<scope>provided</scope>` to prevent repeated dependency inclusion in the extension directories
2016-02-11 07:30:14 -08:00
Lou Marvin Caraig
3294f33838 Added support to use Rackspace's cloudfiles as static firehose 2016-02-02 14:15:09 +01:00
Erik Dubbelboer
f72b613499 Remove incorrect comment.
The CloudFilesDataSegmentPuller can't handle URI data pulls.
This comment was obviously copied from the s3 module and never removed.
2016-01-30 11:02:44 +00:00
Charles Allen
508734c8b0 Long constant reformatting in tests l --> L 2016-01-27 08:59:19 -08:00
Gian Merlino
cac4651da0 Fix spelling of 'dimReverseExtractionNamespace'. 2016-01-26 23:08:02 -08:00
Charles Allen
e941303bc6 Remove sorting of dimensions in AvroStreamInputRowParserTest
Due to https://github.com/druid-io/druid-api/pull/68
2016-01-22 16:01:41 -08:00
Slim Bouguerra
e0d90f875c Graphite emitter 2016-01-21 13:43:37 -06:00
Fangjin Yang
0c31f007fc Merge pull request #1728 from himanshug/aggregators_in_segment_metadata
Store AggregatorFactory[] in segment metadata
2016-01-19 12:55:49 -08:00
Himanshu Gupta
a99aef29a1 adding aggregators to segment metadata 2016-01-19 14:23:39 -06:00
Himanshu Gupta
52eb0f04a7 adding a new method getMergingFactory(..) to AggregatorFactory 2016-01-18 22:03:46 -06:00
Himanshu Gupta
77fc86c015 making AggregatorFactory abstract class 2016-01-18 22:03:46 -06:00
Himanshu Gupta
dcd3a24f59 adding log line for segment being killed in HdfsDataSegmentKiller 2016-01-18 21:51:04 -06:00
Li Zhanhui
da5d7b9376 Reformat code according to Druid Java and Scala style 2016-01-18 16:37:03 +08:00
Kurt Young
1f2168fae5 add IndexMergerV9
add unit tests for IndexMergerV9 and fix some bugs

add more unit tests and fix bugs

handle null values and add more tests

minor changes & use LoggingProgressIndicator in IndexGeneratorReducer

make some static class public from IndexMerger

minor changes and add some comments

changes for comments
2016-01-16 11:25:28 +08:00
Li Zhanhui
0b627dde71 Revert adding synchronized to the swapRequest method: its caller, onWaitEnd(), is only invoked inside synchronized code blocks, so there is no need to synchronize again, even though Java's synchronized keyword has reentrant mutex semantics ^_^. 2016-01-12 19:33:39 +08:00
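The reasoning in the commit above can be illustrated with a minimal sketch. The class below is hypothetical and is not the druid-rocketmq code; only the method names echo the commit message. It assumes the helper is reached solely from a method that already holds the object's monitor, and it also shows that re-acquiring the same monitor would have been harmless because `synchronized` is reentrant.

```java
public class ReentrantSyncDemo {
    private int swaps = 0;

    // Hypothetical stand-in for the caller: it already holds the monitor on `this`.
    public synchronized int onWaitEnd() {
        swapRequests();
        return swaps;
    }

    // No `synchronized` keyword here: every caller is assumed to hold the lock already.
    private void swapRequests() {
        // Thread.holdsLock verifies that assumption at runtime.
        if (!Thread.holdsLock(this)) {
            throw new IllegalStateException("caller must hold the lock");
        }
        swaps++;
    }

    // Reentrancy: a thread that owns the monitor can acquire it again without deadlocking.
    public synchronized int reenter() {
        synchronized (this) {
            return ++swaps;
        }
    }

    public static void main(String[] args) {
        ReentrantSyncDemo demo = new ReentrantSyncDemo();
        System.out.println(demo.onWaitEnd()); // first increment
        System.out.println(demo.reenter());   // second increment, nested lock acquired
    }
}
```

Dropping the redundant keyword changes nothing about correctness under that assumption; it only avoids a second, nested lock acquisition.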
Li Zhanhui
03cdf3a8fe Update license text in pom.xml 2016-01-12 18:11:58 +08:00
Li Zhanhui
dc47e80924 1. Add newline to pom.xml
2. Change RocketMQ to rocketMQ
3. Make swapRequests methods synchronized in all places.
4. Make comparator static and final and use Long.compare.
2016-01-12 18:03:16 +08:00
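Item 4 of the commit above can be sketched as follows. The `Message` type and its `offset` field are hypothetical, chosen only for illustration, and the commit does not say what the comparator looked like before the change. One common reason to prefer `Long.compare` is that a subtraction-based comparison narrowed to `int` can report the wrong order for widely separated values.

```java
import java.util.Comparator;

public class OffsetOrdering {
    // Hypothetical element type, not taken from the module.
    static final class Message {
        final long offset;

        Message(long offset) {
            this.offset = offset;
        }
    }

    // Static and final: one shared, stateless instance instead of one per use.
    static final Comparator<Message> BY_OFFSET = new Comparator<Message>() {
        @Override
        public int compare(Message a, Message b) {
            return Long.compare(a.offset, b.offset);
        }
    };

    // Shown only for contrast: narrowing the difference to int loses the high bits.
    static int naiveCompare(long a, long b) {
        return (int) (a - b);
    }

    public static void main(String[] args) {
        long big = 1L << 32;
        // The low 32 bits of 2^32 are all zero, so the naive version reports "equal".
        System.out.println(naiveCompare(big, 0L));
        // Long.compare reports a positive value because big > 0.
        System.out.println(BY_OFFSET.compare(new Message(big), new Message(0L)) > 0);
    }
}
```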
Li Zhanhui
abe134bef6 Fix typo 2016-01-11 11:50:19 +08:00
Li Zhanhui
3d46356dc9 Update code on PR comments 2016-01-11 11:47:02 +08:00
Li Zhanhui
8eb332c1c4 Add druid-rocketmq module 2016-01-08 08:13:04 +08:00
Charles Allen
13c63bad72 Make timeouts more explicit on what is failing in JDBCExtractionNamespaceTest 2016-01-07 11:16:36 -08:00
Fangjin Yang
aaea95ed1b Merge pull request #2207 from himanshug/theta_sketch_select_query
fix bug for thetaSketch metric not working with select queries
2016-01-07 09:46:09 -08:00
fjy
2103906a48 add pusher tests for all deep storages 2016-01-05 22:22:48 -08:00
Himanshu Gupta
c6634d7c2c adding json for thetaSketch Memory object representation 2016-01-05 22:12:52 -06:00
Himanshu Gupta
62e5e45da8 add select query UT for thetaSketch 2016-01-05 22:12:52 -06:00
Himanshu Gupta
3f048f0b15 adding support to execute Select queries in AggregationTestHelper so that Select query based UTs can be written for complex aggregator implementations 2016-01-05 21:54:55 -06:00
Charles Allen
6d886da7d9 Merge pull request #2191 from duilio/fix-rackspace-cloudfiles-segment-size
store uncompressed index size on cloudfiles storage extension
2016-01-05 17:17:35 -08:00
Zhao Weinan
5e57ddb8cc Adding avro support to realtime & hadoop batch indexing. 2016-01-05 10:21:27 +08:00
Charles Allen
957646be2c Fixes to JDBCExtractionNamespaceTest 2016-01-04 09:56:07 -08:00
maurizio
5ea0b96d9a store uncompressed index size instead of the compressed one in cf storage extension 2016-01-04 14:50:27 +01:00
fjy
57d91d754d Comment out buggy unit tests, fix #2185 2016-01-03 09:50:16 -08:00
fjy
89fc18bb55 increase timeouts for jdbc tearDown 2016-01-01 20:08:06 -08:00
fjy
ca46f1d40c attempt to fix transient tests again 2015-12-30 21:39:28 -08:00
Bingkun Guo
492adeaaa7 Merge pull request #2172 from gianm/remove-kafka-seven
Remove unused kafka-seven extension.
2015-12-29 15:19:28 -06:00
Fangjin Yang
b1261035a7 Merge pull request #1861 from guobingkun/insert_segment_tool
insert-segment tool
2015-12-29 10:06:07 -08:00
Gian Merlino
891d639188 Remove unused kafka-seven extension. 2015-12-29 12:05:27 -05:00
fjy
38b0f1fbc2 fix transient failures in unit tests 2015-12-28 20:03:30 -08:00
Fangjin Yang
e490650865 Merge pull request #2110 from navis/fix-sporadic-testfail
Fix sporadic fail of URIExtractionNamespaceFunctionFactoryTest#testReverseFunction
2015-12-27 14:45:09 -08:00
Charles Allen
05c9e1b598 Reorder Before/After in JDBCExtractionNamespaceTest
* Fixes https://github.com/druid-io/druid/issues/2120
2015-12-22 11:39:46 -08:00
Bingkun Guo
89b477970f DataSegmentFinder tool
`insert-segment-to-db` is a tool that can insert segments into Druid metadata storage. It is intended to be used
to update the segment table in metadata storage after people manually migrate segments from one place to another.
It can also be used to insert missing segments into Druid, or even recover metadata storage by telling it where the
segments are stored.

Note: This tool expects users to have the Druid cluster running in a "safe" mode, where there are no active tasks to interfere with
the segments being inserted. Users can optionally bring down the cluster to make 100% sure nothing is interfering.
2015-12-21 00:02:04 -06:00
Fangjin Yang
1b46ea7b3d Merge pull request #2121 from metamx/jdbcExtractionNamespaceLocking
Add nicer locking and shorter timeouts to JDBCExtractionNamespaceTest
2015-12-18 19:02:36 -08:00
Fangjin Yang
14229ba0f2 Merge pull request #1922 from metamx/jsonIgnoresFinalFields
Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to
2015-12-18 15:38:32 -08:00