Commit Graph

5930 Commits

Author SHA1 Message Date
sahner 3def847e28 add documentation about TimedShutoff firehose 2015-08-24 20:41:42 -05:00
Paul Otto 2301b60365 Add ability to provide taskResource for IndexTask. 2015-08-24 17:38:31 -07:00
Josh Yudaken 5e42aee49e Add documentation for pathFormat in batch ingestion 2015-08-24 14:39:57 -07:00
Parag Jain 073c32f01b Poll rules immediately after change 2015-08-24 16:08:25 -05:00
Gian Merlino 3331987b10 Merge pull request #1648 from himanshug/fix_comma_multiple_inputs
handle commas in the path before calling MultipleInputs.addInputPaths
2015-08-24 09:08:50 -07:00
Himanshu Gupta 5b5a76ef6c adding unit test for HdfsDataSegmentKiller.testKill(..) 2015-08-23 22:21:03 -05:00
Himanshu Gupta c2bebfe39e delete version, interval, dataSource directories on segment deletion if possible, so that they are not left behind and consume ns quota on HDFS 2015-08-23 22:06:12 -05:00
Fangjin Yang 0c11097b4f Merge pull request #1655 from druid-io/add-postgresql-docs
add postgresql setup docs
2015-08-21 13:37:53 -07:00
Xavier Léauté 5a48376788 add postgresql setup docs 2015-08-21 13:35:58 -07:00
Fangjin Yang 6ff6fd80e8 Merge pull request #1647 from himanshug/approx_histo_test
Approx histogram "integration" unit test
2015-08-21 11:19:12 -07:00
Fangjin Yang e3964cd9f6 Merge pull request #1653 from metamx/fix-success-flag
fix success flag emitted in router query metrics
2015-08-21 07:56:57 -07:00
Nishant 6d85a62ed2 fix success flag emitted in router query metrics 2015-08-21 12:25:57 +05:30
Himanshu 5bf4b81481 Merge pull request #1650 from druid-io/workaround-assembly-bug
remove maven metadata from embedded repo
2015-08-20 22:15:59 -05:00
Himanshu Gupta b3c570e78d update BatchDeltaIngestion.testDeltaIngestion(..) to check for proper glob path handling 2015-08-20 21:36:34 -05:00
Himanshu Gupta 85e3ce9096 split hadoop glob path before adding it to MultipleInputs
This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed
2015-08-20 21:36:34 -05:00
Himanshu Gupta a603bd9547 HadoopGlobPathSplitter implementation to split hadoop glob paths
This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed
2015-08-20 21:36:34 -05:00
Xavier Léauté 0eed015142 remove maven metadata from embedded repo
workaround for maven assembly bug
https://issues.apache.org/jira/browse/MASSEMBLY-677
2015-08-20 15:36:42 -07:00
Nishant e6b20db62d Support for JSON Smile format for EventReceiverFirehoseFactory
Support for Smile format for EventReceiverFireshoseFactory.
would help in decreasing network load on middle managers ingesting high
data volumes.

review comments
2015-08-20 19:52:45 +05:30
Himanshu Gupta 9b54124cd0 pseudo integration tests for approximate histogram 2015-08-20 01:27:20 -05:00
Himanshu Gupta c57c07f28a add ability for client code to provide InputStream of input data in addition to File
It would be needed when input data file does not reside in the same jar
but you could still use getResourceAsStream() to read the data inside a file
2015-08-20 00:54:58 -05:00
Fangjin Yang 1ecec1da5a Merge pull request #1645 from himanshug/approx_histo_doc_update
update approx-histogram document to explain how to ignore rows with no value
2015-08-19 13:44:16 -07:00
Himanshu Gupta 0daeb830b0 update approx-histogram document to explain how to ignore rows with no value at ingestion time 2015-08-19 15:20:37 -05:00
Gian Merlino 38b8ccab04 Merge pull request #1643 from himanshug/msg_update
helpful cause explaining why SegmentDescriptorInfo did not exist
2015-08-19 12:20:33 -07:00
Charles Allen 03f4c3e816 Merge pull request #1644 from druid-io/fix-tar-long-files
fix assembly tar with long filenames on some platforms
2015-08-19 12:11:57 -07:00
Xavier Léauté 21fa1ac6d6 fix assembly tar mode on some platforms 2015-08-19 10:13:16 -07:00
Himanshu Gupta cf3ec8eb46 helpful cause explaining why SegmentDescriptorInfo did not exist 2015-08-19 10:29:04 -05:00
Fangjin Yang d996e0aecc Merge pull request #1640 from druid-io/maven-repo-tarball-clean
Package extensions in tarball + run_druid_server goodness
2015-08-18 21:54:39 -07:00
Xavier Léauté 1abcd75696 Merge pull request #1624 from metamx/expandTimeouts
Expand timeouts on JDBCExtractionNamespaceTest
2015-08-18 21:32:50 -07:00
Fangjin Yang 6a0389297c Merge pull request #1634 from metamx/IrcFirehoseFixes
Allow IrcFirehoseFactory to shutdown cleanly
2015-08-18 21:06:34 -07:00
Gian Merlino 5ca440ff15 run_druid_server.sh 2015-08-18 18:32:33 -07:00
Gian Merlino 601cd82987 Use local repository instead of maven central:
- Use local repo in run_example_server.
- Add extensions-repo to common.runtime.properties for the tarball.
2015-08-18 18:32:33 -07:00
Xavier Léauté 2093187c91 rework tarball distribution:
- move assembly out of druid-services into a 'distribution' module
- create separate 'extensions-distribution' module and assembly to
  package extensions and their dependencies into a local maven
  repository
- include this extensions maven repository in the binaries tarball
2015-08-18 18:32:33 -07:00
Xavier Léauté 3b2e41e42a update for next release 2015-08-18 17:16:46 -07:00
Charles Allen 38110820c3 Expand timeouts on JDBCExtractionNamespaceTest 2015-08-18 14:28:40 -07:00
Charles Allen 13773865cf Make jetty scheduler threads daemon thread 2015-08-18 13:08:19 -07:00
Fangjin Yang 9ff6a637bb Merge pull request #1635 from druid-io/fix-versions
update version numbers in docs
2015-08-17 16:54:50 -07:00
Xavier Léauté 0cbda0c01d update version numbers in docs 2015-08-17 16:41:21 -07:00
Charles Allen b9792b57bc Merge pull request #1374 from himanshug/batch_delta_ingestion3
Feature for hadoop batch re-ingesion and delta ingestion
2015-08-17 15:52:49 -07:00
Charles Allen cd2c3776f8 Allow IrcFirehoseFactory to shutdown cleanly 2015-08-17 13:24:41 -07:00
Slim Bouguerra 7549f02578 support the case filter value is null 2015-08-17 15:09:37 -05:00
Fangjin Yang f9e3f922e6 Merge pull request #1633 from pjain1/edit_metrics
swap description and dimension column for some JVM metrics
2015-08-17 13:07:57 -07:00
Parag Jain 41fa9bf994 swap description and dimension for some JVM metrics 2015-08-17 15:03:06 -05:00
zhanghailei 234a958817 Inner Query should build on sub query 2015-08-17 18:18:26 +08:00
Himanshu Gupta cfd81bfac7 updating the docs on how to do hadoop batch re-ingesion and delta ingestion 2015-08-16 14:07:35 -05:00
Himanshu Gupta a3bab5b7d9 IndexGeneratorJobTest type unit test for batch delta ingestion and reindexing 2015-08-16 14:07:35 -05:00
Himanshu Gupta 15fa43dd43 changing DatasourcePathSpec, to get segment list, so that hadoop indexer uses overlord action to get list of segments and passes when running as an overlord task. and, uses metadata store directly when running as standalone hadoop indexer
also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up
2015-08-16 14:07:35 -05:00
Himanshu Gupta 45947a1021 add ability to specify Multiple PathSpecs in batch ingestion, so that we can grab data from multiple places in same ingestion
Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfig.java
	indexing-hadoop/src/main/java/io/druid/indexer/JobHelper.java

Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java
2015-08-16 13:15:38 -05:00
Himanshu Gupta 1ae56f139b Druid Hadoop InputFormat and pathSpec
Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java
	indexing-service/pom.xml
2015-08-16 13:15:38 -05:00
Himanshu Gupta f1d309a671 do not run parser if value from InputFormat is already an InputRow 2015-08-14 14:44:22 -05:00
Himanshu Gupta 4d4aa8bfc6 refactor IngestSegmentFirehoseFactory so that IngestSegmentFirehose becomes reusable
Conflicts:
	indexing-service/src/main/java/io/druid/indexing/firehose/IngestSegmentFirehoseFactory.java
2015-08-14 14:44:22 -05:00