Commit Graph

188 Commits

Author SHA1 Message Date
Slim Bouguerra 623e89aa54 skip corrupt message 2016-03-04 08:30:40 -06:00
Gian Merlino 7e70881101 HdfsTaskLogs: Allow overwriting existing logs.
Necessary because ForkingTaskRunner pushes logs when gracefully stopping,
but it may need to re-push those logs when the task finishes for real after
restoring.
2016-02-26 15:10:19 -08:00
Charles Allen 1fe277ee29 Merge pull request #2367 from se7entyse7en/feature-rackspace-cloud-files-static-firehose
Adds support to use Rackspace's cloudfiles as static firehose
2016-02-25 17:31:06 -08:00
Fangjin Yang 3a9fe2aad0 Merge pull request #2231 from lizhanhui/pull_request
Add druid-rocketmq module
2016-02-25 17:19:57 -08:00
Charles Allen a8820d837b More logging for JDBCExtractionNamespaceFunctionFactory 2016-02-25 09:48:56 -08:00
Slim 368988d187 Merge pull request #2291 from druid-io/lookupManager
Promoting LookupExtractor state and LookupExtractorFactory to be a first class druid state object.
2016-02-11 16:07:27 -06:00
Fangjin Yang d435a9b1e9 Merge pull request #2448 from metamx/fixBigJarHadoopPlace
Fix dependencies.
2016-02-11 10:47:00 -08:00
Slim Bouguerra 4e119b7a24 Adding lookup ref manager and lookup dimension spec impl 2016-02-11 12:11:51 -06:00
Will Lauer 189376a6f9 Adding optional error bounds to sketch aggs and post-aggs
By setting a new optional parameter, `errorBoundsStdDev`, to the number
of standard deviations to use when computing error bounds, the return
type for both the SketchMergeAggregator and the SketchEstimate
PostAggregator can be changed from a simple double (estimate) to a JSON
object containing the estimate, expected high bound, expected low bound,
and standard devations used when computing bounds (same value as passed
in).
2016-02-11 10:18:16 -06:00
Charles Allen 40ade32a1f Fix dependencies.
* Don't put druid****selfcontained.jar at the end of the hadoop isolated classpath
* Add `<scope>provided</scope>` to prevent repeated dependency inclusion in the extension directories
2016-02-11 07:30:14 -08:00
Lou Marvin Caraig 3294f33838 Added support to use Rackspace's cloudfiles as static firehose 2016-02-02 14:15:09 +01:00
Erik Dubbelboer f72b613499 Remove incorrect comment.
The CloudFilesDataSegmentPuller can't handle URI data pulls.
This comment was obviously copied from the s3 module and never removed.
2016-01-30 11:02:44 +00:00
Charles Allen 508734c8b0 Long constant reformatting in tests `l` --> `L` 2016-01-27 08:59:19 -08:00
Gian Merlino cac4651da0 Fix spelling of 'dimReverseExtractionNamespace'. 2016-01-26 23:08:02 -08:00
Charles Allen e941303bc6 Remove sorting of dimensions in AvroStreamInputRowParserTest
Due to https://github.com/druid-io/druid-api/pull/68
2016-01-22 16:01:41 -08:00
Slim Bouguerra e0d90f875c Graphite emitter 2016-01-21 13:43:37 -06:00
Fangjin Yang 0c31f007fc Merge pull request #1728 from himanshug/aggregators_in_segment_metadata
Store AggregatorFactory[] in segment metadata
2016-01-19 12:55:49 -08:00
Himanshu Gupta a99aef29a1 adding aggregators to segment metadata 2016-01-19 14:23:39 -06:00
Himanshu Gupta 52eb0f04a7 adding a new method getMergingFactory(..) to AggregatorFactory 2016-01-18 22:03:46 -06:00
Himanshu Gupta 77fc86c015 making AggregatorFactory abstract class 2016-01-18 22:03:46 -06:00
Himanshu Gupta dcd3a24f59 adding log line for segment being killed in HdfsDataSegmentKiller 2016-01-18 21:51:04 -06:00
Li Zhanhui da5d7b9376 Reformat code according to Druid Java and Scala style 2016-01-18 16:37:03 +08:00
Kurt Young 1f2168fae5 add IndexMergerV9
add unit tests for IndexMergerV9 and fix some bugs

add more unit tests and fix bugs

handle null values and add more tests

minor changes & use LoggingProgressIndicator in IndexGeneratorReducer

make some static class public from IndexMerger

minor changes and add some comments

changes for comments
2016-01-16 11:25:28 +08:00
Li Zhanhui 0b627dde71 Revert adding synchronized to swapRequest method as the caller method onWaitEnd(), is only invoked in synchronized code blocks. There is no need to repeat sync it even if java's synchronized keyword holds a reentrant mutex semantic ^_^. 2016-01-12 19:33:39 +08:00
Li Zhanhui 03cdf3a8fe Update license text in pom.xml 2016-01-12 18:11:58 +08:00
Li Zhanhui dc47e80924 1. Add newline to pom.xml
2. Change RocketMQ to rocketMQ
3. Make swapRequests methods synchronized in all places.
4. Make comparator static and final and use Long.compare.
2016-01-12 18:03:16 +08:00
Li Zhanhui abe134bef6 Fix typo 2016-01-11 11:50:19 +08:00
Li Zhanhui 3d46356dc9 Update code on PR comments 2016-01-11 11:47:02 +08:00
Li Zhanhui 8eb332c1c4 Add druid-rocketmq module 2016-01-08 08:13:04 +08:00
Charles Allen 13c63bad72 Make timeouts more explicit on what is failing in JDBCExtractionNamespaceTest 2016-01-07 11:16:36 -08:00
Fangjin Yang aaea95ed1b Merge pull request #2207 from himanshug/theta_sketch_select_query
fix bug for thetaSketch metric not working with select queries
2016-01-07 09:46:09 -08:00
fjy 2103906a48 add pusher tests for all deep storages 2016-01-05 22:22:48 -08:00
Himanshu Gupta c6634d7c2c adding json for thetaSketch Memory object representation 2016-01-05 22:12:52 -06:00
Himanshu Gupta 62e5e45da8 add select query UT for thetaSketch 2016-01-05 22:12:52 -06:00
Himanshu Gupta 3f048f0b15 adding support to execute Select queries in AggregationTestHelper so that Select query based UTs can be written for complex aggregator implementations 2016-01-05 21:54:55 -06:00
Charles Allen 6d886da7d9 Merge pull request #2191 from duilio/fix-rackspace-cloudfiles-segment-size
store uncompressed index size on cloudfiles storage extension
2016-01-05 17:17:35 -08:00
Zhao Weinan 5e57ddb8cc Adding avro support to realtime & hadoop batch indexing. 2016-01-05 10:21:27 +08:00
Charles Allen 957646be2c Fixes to JDBCExtractionNamespaceTest 2016-01-04 09:56:07 -08:00
maurizio 5ea0b96d9a store uncompressed index size instead of the compressed one in cf storage extension 2016-01-04 14:50:27 +01:00
fjy 57d91d754d Comment out buggy unit tests, fix #2185 2016-01-03 09:50:16 -08:00
fjy 89fc18bb55 increase timeouts for jdbc tearDown 2016-01-01 20:08:06 -08:00
fjy ca46f1d40c attempt to fix transient tests again 2015-12-30 21:39:28 -08:00
Bingkun Guo 492adeaaa7 Merge pull request #2172 from gianm/remove-kafka-seven
Remove unused kafka-seven extension.
2015-12-29 15:19:28 -06:00
Fangjin Yang b1261035a7 Merge pull request #1861 from guobingkun/insert_segment_tool
insert-segment tool
2015-12-29 10:06:07 -08:00
Gian Merlino 891d639188 Remove unused kafka-seven extension. 2015-12-29 12:05:27 -05:00
fjy 38b0f1fbc2 fix transient failures in unit tests 2015-12-28 20:03:30 -08:00
Fangjin Yang e490650865 Merge pull request #2110 from navis/fix-sporadic-testfail
Fix sporadic fail of URIExtractionNamespaceFunctionFactoryTest#testReverseFunction
2015-12-27 14:45:09 -08:00
Charles Allen 05c9e1b598 Reorder Before/After in JDBCExtractionNamespaceTest
* Fixes https://github.com/druid-io/druid/issues/2120
2015-12-22 11:39:46 -08:00
Bingkun Guo 89b477970f DataSegmentFinder tool
`insert-segment-to-db` is a tool that can insert segments into Druid metadata storage. It is intended to be used
to update the segment table in metadata storage after people manually migrate segments from one place to another.
It can also be used to insert missing segment into Druid, or even recover metadata storage by telling it where the
segments are stored.

Note: This tool expects users to have Druid cluster running in a "safe" mode, where there are no active tasks to interfere
the segments being inserted. Users can optionally bring down the cluster to make 100% sure nothing is interfering.
2015-12-21 00:02:04 -06:00
Fangjin Yang 1b46ea7b3d Merge pull request #2121 from metamx/jdbcExtractionNamespaceLocking
Add nicer locking and shorter timeouts to JDBCExtractionNamespaceTest
2015-12-18 19:02:36 -08:00