5714 Commits

Author SHA1 Message Date
Josh Yudaken
5e42aee49e Add documentation for pathFormat in batch ingestion 2015-08-24 14:39:57 -07:00
Gian Merlino
3331987b10 Merge pull request #1648 from himanshug/fix_comma_multiple_inputs
handle commas in the path before calling MultipleInputs.addInputPaths
2015-08-24 09:08:50 -07:00
Fangjin Yang
0c11097b4f Merge pull request #1655 from druid-io/add-postgresql-docs
add postgresql setup docs
2015-08-21 13:37:53 -07:00
Xavier Léauté
5a48376788 add postgresql setup docs 2015-08-21 13:35:58 -07:00
Fangjin Yang
6ff6fd80e8 Merge pull request #1647 from himanshug/approx_histo_test
Approx histogram "integration" unit test
2015-08-21 11:19:12 -07:00
Fangjin Yang
e3964cd9f6 Merge pull request #1653 from metamx/fix-success-flag
fix success flag emitted in router query metrics
2015-08-21 07:56:57 -07:00
Nishant
6d85a62ed2 fix success flag emitted in router query metrics 2015-08-21 12:25:57 +05:30
Himanshu
5bf4b81481 Merge pull request #1650 from druid-io/workaround-assembly-bug
remove maven metadata from embedded repo
2015-08-20 22:15:59 -05:00
Himanshu Gupta
b3c570e78d update BatchDeltaIngestion.testDeltaIngestion(..) to check for proper glob path handling 2015-08-20 21:36:34 -05:00
Himanshu Gupta
85e3ce9096 split hadoop glob path before adding it to MultipleInputs
This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed
2015-08-20 21:36:34 -05:00
Himanshu Gupta
a603bd9547 HadoopGlobPathSplitter implementation to split hadoop glob paths
This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed
2015-08-20 21:36:34 -05:00
Xavier Léauté
0eed015142 remove maven metadata from embedded repo
workaround for maven assembly bug
https://issues.apache.org/jira/browse/MASSEMBLY-677
2015-08-20 15:36:42 -07:00
Himanshu Gupta
9b54124cd0 pseudo integration tests for approximate histogram 2015-08-20 01:27:20 -05:00
Himanshu Gupta
c57c07f28a add ability for client code to provide InputStream of input data in addition to File
It would be needed when input data file does not reside in the same jar
but you could still use getResourceAsStream() to read the data inside a file
2015-08-20 00:54:58 -05:00
Fangjin Yang
1ecec1da5a Merge pull request #1645 from himanshug/approx_histo_doc_update
update approx-histogram document to explain how to ignore rows with no value
2015-08-19 13:44:16 -07:00
Himanshu Gupta
0daeb830b0 update approx-histogram document to explain how to ignore rows with no value at ingestion time 2015-08-19 15:20:37 -05:00
Gian Merlino
38b8ccab04 Merge pull request #1643 from himanshug/msg_update
helpful cause explaining why SegmentDescriptorInfo did not exist
2015-08-19 12:20:33 -07:00
Charles Allen
03f4c3e816 Merge pull request #1644 from druid-io/fix-tar-long-files
fix assembly tar with long filenames on some platforms
2015-08-19 12:11:57 -07:00
Xavier Léauté
21fa1ac6d6 fix assembly tar mode on some platforms 2015-08-19 10:13:16 -07:00
Himanshu Gupta
cf3ec8eb46 helpful cause explaining why SegmentDescriptorInfo did not exist 2015-08-19 10:29:04 -05:00
Fangjin Yang
d996e0aecc Merge pull request #1640 from druid-io/maven-repo-tarball-clean
Package extensions in tarball + run_druid_server goodness
2015-08-18 21:54:39 -07:00
Xavier Léauté
1abcd75696 Merge pull request #1624 from metamx/expandTimeouts
Expand timeouts on JDBCExtractionNamespaceTest
2015-08-18 21:32:50 -07:00
Fangjin Yang
6a0389297c Merge pull request #1634 from metamx/IrcFirehoseFixes
Allow IrcFirehoseFactory to shutdown cleanly
2015-08-18 21:06:34 -07:00
Gian Merlino
5ca440ff15 run_druid_server.sh 2015-08-18 18:32:33 -07:00
Gian Merlino
601cd82987 Use local repository instead of maven central:
- Use local repo in run_example_server.
- Add extensions-repo to common.runtime.properties for the tarball.
2015-08-18 18:32:33 -07:00
Xavier Léauté
2093187c91 rework tarball distribution:
- move assembly out of druid-services into a 'distribution' module
- create separate 'extensions-distribution' module and assembly to
  package extensions and their dependencies into a local maven
  repository
- include this extensions maven repository in the binaries tarball
2015-08-18 18:32:33 -07:00
Xavier Léauté
3b2e41e42a update for next release 2015-08-18 17:16:46 -07:00
Charles Allen
38110820c3 Expand timeouts on JDBCExtractionNamespaceTest 2015-08-18 14:28:40 -07:00
Fangjin Yang
9ff6a637bb Merge pull request #1635 from druid-io/fix-versions
update version numbers in docs
2015-08-17 16:54:50 -07:00
Xavier Léauté
0cbda0c01d update version numbers in docs 2015-08-17 16:41:21 -07:00
Charles Allen
b9792b57bc Merge pull request #1374 from himanshug/batch_delta_ingestion3
Feature for hadoop batch re-ingestion and delta ingestion
2015-08-17 15:52:49 -07:00
Charles Allen
cd2c3776f8 Allow IrcFirehoseFactory to shutdown cleanly 2015-08-17 13:24:41 -07:00
Fangjin Yang
f9e3f922e6 Merge pull request #1633 from pjain1/edit_metrics
swap description and dimension column for some JVM metrics
2015-08-17 13:07:57 -07:00
Parag Jain
41fa9bf994 swap description and dimension for some JVM metrics 2015-08-17 15:03:06 -05:00
Himanshu Gupta
cfd81bfac7 updating the docs on how to do hadoop batch re-ingestion and delta ingestion 2015-08-16 14:07:35 -05:00
Himanshu Gupta
a3bab5b7d9 IndexGeneratorJobTest type unit test for batch delta ingestion and reindexing 2015-08-16 14:07:35 -05:00
Himanshu Gupta
15fa43dd43 changing DatasourcePathSpec, to get segment list, so that hadoop indexer uses overlord action to get list of segments and passes when running as an overlord task. and, uses metadata store directly when running as standalone hadoop indexer
also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up
2015-08-16 14:07:35 -05:00
Himanshu Gupta
45947a1021 add ability to specify Multiple PathSpecs in batch ingestion, so that we can grab data from multiple places in same ingestion
Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfig.java
	indexing-hadoop/src/main/java/io/druid/indexer/JobHelper.java

Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java
2015-08-16 13:15:38 -05:00
Himanshu Gupta
1ae56f139b Druid Hadoop InputFormat and pathSpec
Conflicts:
	indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java
	indexing-service/pom.xml
2015-08-16 13:15:38 -05:00
Himanshu Gupta
f1d309a671 do not run parser if value from InputFormat is already an InputRow 2015-08-14 14:44:22 -05:00
Himanshu Gupta
4d4aa8bfc6 refactor IngestSegmentFirehoseFactory so that IngestSegmentFirehose becomes reusable
Conflicts:
	indexing-service/src/main/java/io/druid/indexing/firehose/IngestSegmentFirehoseFactory.java
2015-08-14 14:44:22 -05:00
Fangjin Yang
958dd1a451 Merge pull request #1629 from metamx/fix-1584
Update post-aggregations.md
2015-08-14 11:07:47 -07:00
MarConSchneid
3f15d81f76 Update post-aggregations.md
fix for the issue: https://github.com/druid-io/druid/issues/1584
2015-08-14 11:03:58 -07:00
Fangjin Yang
c1d274faff Merge pull request #1628 from druid-io/revert-1608-guice40
Revert "Update to guice 4.0"
2015-08-14 09:57:20 -07:00
Charles Allen
db19d2d547 Revert "Update to guice 4.0" 2015-08-14 09:26:07 -07:00
Fangjin Yang
8d6fe021e3 Merge pull request #1627 from metamx/fix-realtime-tasks
explicitly call lifecycle stop
2015-08-14 08:17:00 -07:00
Nishant
08f50fcc37 explicitly call lifecycle stop 2015-08-14 17:35:17 +05:30
Xavier Léauté
24dc23967f Merge pull request #1607 from druid-io/even-more-docs
Fixes and more docs across many areas
2015-08-13 11:53:53 -07:00
fjy
43978eba99 Fixes and more docs across many areas 2015-08-13 10:35:53 -07:00
Himanshu
901cc22060 Merge pull request #1617 from metamx/increaseNECMETTimeout
Increase timeout in tests for NamespaceExtractionCacheManagerExecutorsTest
2015-08-12 12:38:01 -05:00