jisookim0513
37979282fe
enabled ansi-quote in mysql; insert statement should now work
2014-10-21 00:09:19 -07:00
jisookim0513
b8cbe2457a
fixed variable name
2014-10-17 00:13:30 -07:00
jisookim0513
7d5c5f2083
fixed createTable; fixed miscellaneous stuff; added DerbyMetadataRuleManagerProvider
2014-10-17 00:10:36 -07:00
nishantmonu51
f4a97aebbc
fix rollup for hashed partitions
...
truncate timestamp while calculating the partitionNumber
2014-10-15 22:32:56 +05:30
nishantmonu51
bce388fb27
merge changes from 0.7.x branch
2014-10-14 18:46:02 +05:30
nishantmonu51
b5d66381f3
more cleanup
2014-10-14 18:32:40 +05:30
nishantmonu51
454acd3f5a
remove backwards compatible code
...
1) remove backwards compatible and deprecated code
2) make hashed partitions spec default
2014-10-13 19:30:44 +05:30
Gian Merlino
e1fedbe741
RemoteTaskRunner should respect worker version changes ( fixes #787 ).
2014-10-12 11:27:44 -07:00
jisookim0513
521398267c
fixed inconsistent variable names
2014-10-10 17:00:50 -07:00
fjy
c7b4d5b7b4
Merge branch 'master' into druid-0.7.x
...
Conflicts:
processing/src/test/java/io/druid/segment/filter/SpatialFilterTest.java
2014-10-02 18:12:10 -07:00
fjy
c3bea245a7
fix up some bugs
2014-09-30 17:20:52 -07:00
fjy
a0782d4c54
fix compile
2014-09-30 16:56:44 -07:00
jisookim0513
0e50852985
fixed MetadataTaskStorage and handler
2014-09-30 14:09:23 -07:00
Gian Merlino
0781781b99
Merge pull request #766 from metamx/extend-rtr
...
make the worker selection strategy in remotetaskrunner extendable
2014-09-30 12:52:12 -07:00
fjy
fab7caafff
final code reviews
2014-09-30 12:32:14 -07:00
fjy
06757034f2
add default impl
2014-09-30 11:54:29 -07:00
fjy
2b2b028e5c
why am i so bad at coding
2014-09-30 11:53:18 -07:00
fjy
b1b9e0a267
i suck
2014-09-30 11:45:19 -07:00
fjy
575d51b0ce
fix compilation error
2014-09-30 11:44:50 -07:00
fjy
4c23a5e9f6
address cr again
2014-09-30 11:40:29 -07:00
fjy
55db06ccb1
address cr
2014-09-30 10:29:02 -07:00
nishantmonu51
358ff915bb
fix merge conflicts
2014-09-30 22:19:18 +05:30
nishantmonu51
2789536bed
merge changes from druid-0.7.x
2014-09-30 22:05:49 +05:30
nishantmonu51
61c7fd2e6e
make ingestOffheap tuneable
2014-09-30 15:30:02 +05:30
Gian Merlino
1e6ce8ac9a
TaskLogs fixes and cleanups.
...
- Fix negative offsets in FileTaskLogs and HdfsTaskLogs.
- Consolidate file offset code into LogUtils (currently used in two places).
- Clean up style for HdfsTaskLogs and related classes.
- Remove unused code in ForkingTaskRunner.
2014-09-29 16:20:34 -07:00
fjy
4a09678739
make the selection strategy in rtr extendable
2014-09-29 14:24:02 -07:00
jisookim0513
74565c9371
cleaned up the code
2014-09-27 13:10:01 -07:00
jisookim0513
6a641621b2
finished merging into druid-0.7.x; derby not working (to be fixed)
2014-09-26 14:24:53 -07:00
jisookim0513
43cc6283d3
trying to revert files that have overwritten changes
2014-09-26 12:38:04 -07:00
fjy
eaf0a48b92
Merge branch 'master' into druid-0.7.x
...
Conflicts:
cassandra-storage/pom.xml
common/pom.xml
examples/pom.xml
hdfs-storage/pom.xml
histogram/pom.xml
indexing-hadoop/pom.xml
indexing-service/pom.xml
kafka-eight/pom.xml
kafka-seven/pom.xml
pom.xml
processing/pom.xml
processing/src/main/java/io/druid/guice/PropertiesModule.java
rabbitmq/pom.xml
s3-extensions/pom.xml
server/pom.xml
services/pom.xml
2014-09-26 11:39:24 -07:00
jisookim0513
3bf39cc9f8
attempted to fix merge-conflicts
2014-09-24 15:55:42 -07:00
nishantmonu51
f51ab84386
merge changes from druid-0.7.x
2014-09-22 23:48:45 +05:30
jisookim0513
273205f217
initial attempt for abstraction; druid cluster works with Derby as a default
2014-09-19 17:39:59 -07:00
nishantmonu51
8eb6466487
revert buffer size and add back rowFlushBoundary
2014-09-19 23:06:04 +05:30
nishantmonu51
e6d93a3070
fix NPE
...
fix NPE when the dimension of metric is not present one of the segments
to be reIndexed.
2014-09-17 15:57:58 +05:30
nishantmonu51
f006de8639
fix #732
...
fix metric list discovery
2014-09-16 22:12:36 +05:30
Xavier Léauté
c8b8e3f6e9
negating compare is bad
2014-09-15 13:00:06 -07:00
Xavier Léauté
137ad50bf1
classes that should be static
2014-09-15 13:00:06 -07:00
Xavier Léauté
e57e2d97ba
make constants final
2014-09-15 12:53:40 -07:00
Xavier Léauté
cfa92e8217
fix incorrect nullable annotations
2014-09-15 12:13:52 -07:00
fjy
469ccbbe5e
Merge branch 'master' into druid-0.7.x
...
Conflicts:
cassandra-storage/pom.xml
common/pom.xml
examples/pom.xml
hdfs-storage/pom.xml
histogram/pom.xml
indexing-hadoop/pom.xml
indexing-service/pom.xml
kafka-eight/pom.xml
kafka-seven/pom.xml
pom.xml
processing/pom.xml
processing/src/main/java/io/druid/query/FinalizeResultsQueryRunner.java
processing/src/main/java/io/druid/query/UnionQueryRunner.java
processing/src/main/java/io/druid/query/groupby/GroupByQueryRunnerFactory.java
processing/src/main/java/io/druid/query/topn/TopNQueryEngine.java
processing/src/main/java/io/druid/query/topn/TopNQueryRunnerFactory.java
rabbitmq/pom.xml
s3-extensions/pom.xml
server/pom.xml
server/src/test/java/io/druid/server/initialization/JettyTest.java
services/pom.xml
2014-09-11 16:20:50 -07:00
fjy
fec7b43fcb
make making v9 segments something completely configurable
2014-09-10 15:28:30 -07:00
fjy
351afb8be7
allow legacy index generator
2014-09-09 17:04:35 -07:00
Xavier Léauté
508e982190
Merge remote-tracking branch 'origin/master' into druid-0.7.x
...
Conflicts:
cassandra-storage/pom.xml
common/pom.xml
examples/config/historical/runtime.properties
examples/config/overlord/runtime.properties
examples/config/realtime/runtime.properties
examples/pom.xml
hdfs-storage/pom.xml
histogram/pom.xml
indexing-hadoop/pom.xml
indexing-service/pom.xml
kafka-eight/pom.xml
kafka-seven/pom.xml
pom.xml
processing/pom.xml
rabbitmq/pom.xml
s3-extensions/pom.xml
server/pom.xml
server/src/main/java/io/druid/server/ClientQuerySegmentWalker.java
services/pom.xml
2014-08-30 22:42:36 -07:00
Xavier Léauté
58ab759fc6
remove unused imports
2014-08-29 14:03:47 -07:00
Xavier Léauté
ac05836833
make Java 8 javadoc happy
2014-08-29 13:58:50 -07:00
Gian Merlino
8bcb73a0eb
Java programs are best run with a java command.
2014-08-28 18:10:01 -07:00
Gian Merlino
5564a67bfc
File.pathSeparator is better than ":".
2014-08-28 18:09:33 -07:00
Gian Merlino
68aeafaacd
Allow indexing tasks to specify extra classpaths.
...
This could be used by Hadoop tasks to reference configs for different clusters, assuming
that the possible configs have been pre-distributed to middle managers.
2014-08-28 18:00:26 -07:00
fjy
d64879ccca
more cleanup
2014-08-20 13:22:42 -07:00
fjy
1614f40f1a
fix index task
2014-08-20 13:11:00 -07:00
fjy
92f26d9a1f
cleanup rowflushboundary
2014-08-20 13:09:37 -07:00
nishantmonu51
fe105d52ee
use bufferSize for IndexTask
2014-08-20 22:41:34 +05:30
nishantmonu51
33354cf7fe
replace maxRowsInMemory with BufferSize
2014-08-20 20:59:44 +05:30
nishantmonu51
e525562767
review comments - cleanup ColumnSelectorFactory
2014-08-20 15:04:43 +05:30
nishantmonu51
60906c3244
Revert "make valueType configurable"
...
This reverts commit 6f60a3f604
.
2014-08-20 11:55:26 +05:30
fjy
4fd5479559
fix typo
2014-08-19 12:34:10 -07:00
fjy
77e514688a
Merge branch 'druid-0.7.x' into offheap-incremental-index
2014-08-18 11:14:19 -07:00
Xavier Léauté
1fd30ab588
default service/host/port for all nodes
2014-08-15 17:14:05 -07:00
nishantmonu51
6f60a3f604
make valueType configurable
2014-08-13 14:37:57 +05:30
nishantmonu51
1b0a72751b
Add support for LongColumn
2014-08-13 08:52:36 +05:30
nishantmonu51
c6712739dc
merge changes from druid-0.7.x
2014-08-12 15:47:42 +05:30
fjy
91ebe45b4e
support both rejectionPolicy and rejectionPolicyFactory in serde
2014-08-07 10:06:27 -07:00
nishantmonu51
637bd35785
merge changes from druid-0.7.x
2014-07-31 16:07:22 +05:30
Gian Merlino
09fcfc3b6d
Fix race in RemoteTaskRunner that could lead to zombie tasks.
2014-07-18 11:41:50 -07:00
nishantmonu51
4ce12470a1
Add way to skip determine partitions for index task
...
Add a way to skip determinePartitions for IndexTask by manually
specifying numShards.
2014-07-18 18:52:15 +05:30
fjy
d8b8826c2e
Merge branch 'cleanup-ingest' of github.com:metamx/druid into cleanup-ingest
...
Conflicts:
server/src/test/java/io/druid/realtime/firehose/CombiningFirehoseFactoryTest.java
2014-07-17 20:26:11 -07:00
fjy
291f4c00ae
Merge branch 'master' of github.com:metamx/druid into cleanup-ingest
2014-07-17 20:24:59 -07:00
nishantmonu51
0e0454a34c
switch reingest task to noop & fix compilation
...
switch back to noop task, its confusing to have a reinvest task that
does nothing.
fix compilation
2014-07-18 06:50:58 +05:30
fjy
ded83557dd
Merge pull request #640 from metamx/disable-worker
...
An alternative way to disable middlemanagers based on worker version
2014-07-17 19:07:27 -06:00
fjy
beac0be45b
fix enabled endpoint
2014-07-17 18:04:36 -07:00
fjy
c6078ca841
address code review
2014-07-17 13:34:05 -07:00
fjy
ba978d8b79
some minor cleanups to ingest firehose
2014-07-17 13:05:59 -07:00
fjy
bc650a1c80
Merge pull request #627 from metamx/druid-firehose
...
Functionality to ingest a Druid segment and change the schema
2014-07-17 13:41:16 -06:00
fjy
5197ea527a
disable middlemanagers based on worker version
2014-07-17 12:35:45 -07:00
nishantmonu51
e59c9ebdbc
minor fixes
...
fix IndexOutOfBoundsException
fix ingestFirehose
2014-07-17 17:24:57 +05:30
nishantmonu51
972c5dac31
improve memory usage and rename firehose
2014-07-14 21:17:53 +05:30
nishantmonu51
f5f05e3a9b
Sync changes from branch new-ingestion PR #599
...
Sync and Resolve Conflicts
2014-07-11 16:15:10 +05:30
nishantmonu51
7168adcca7
Use Ingest task instead of Noop Task
2014-07-08 21:20:17 +05:30
fjy
ce478be899
Remove redundant forking task runner config
2014-07-07 18:08:37 -06:00
nishantmonu51
97b58eb193
review comments
...
1) Rename firehose to IngestSegment
2) fix segment overlapping, intervals
3) fix overshadow
2014-07-04 12:31:33 +05:30
nishantmonu51
518ab473f3
improve port finding strategy for task runner
...
1) Recycle free ports
2) Choose only ports that are free & not used by any other application
2014-07-03 09:58:12 +05:30
nishantmonu51
1de390801f
Druid Fireshose
...
Add druidFirehose, can be used to reIndex filtered data from a
datasource
2014-07-02 20:34:18 +05:30
fjy
4c40e71e54
address cr
2014-06-19 14:48:46 -07:00
fjy
a870fe5cbe
inject column config
2014-06-19 14:47:57 -07:00
Xavier Léauté
09346b0a3c
make column cache configurable
2014-06-19 14:43:03 -07:00
fjy
0bc1915067
Merge pull request #578 from metamx/new-guava
...
Update guava, java-util, and druid-api
2014-06-18 14:23:32 -06:00
fjy
b0517dc55d
Merge pull request #610 from metamx/ec2-user-data
...
More flexible EC2 user data classes.
2014-06-18 11:36:13 -06:00
Gian Merlino
5c4b9a7d5b
Indexing: Allow null userData.
2014-06-18 10:08:50 -07:00
Gian Merlino
65c1553a76
Fix busted TaskConfig.
2014-06-17 17:27:41 -07:00
Gian Merlino
9f16f0a955
More flexible EC2 user data classes.
2014-06-17 17:10:17 -07:00
fjy
5227b43050
fix more test breakage
2014-06-17 10:35:01 -07:00
fjy
8a13e34c50
fix broken ut
2014-06-17 10:31:50 -07:00
Gian Merlino
54833fa43d
Indexing: Make default hadoop coordinates configurable.
2014-06-17 08:53:39 -07:00
nishantmonu51
0629be622c
remove unnecessary changes & fix index closing subquery
2014-06-16 18:50:49 +05:30
fjy
5bd4ad3f0f
address code review
2014-06-13 11:29:49 -07:00
fjy
a63cda3281
Merge branch 'master' into new-guava
...
Conflicts:
server/src/main/java/io/druid/server/QueryResource.java
2014-06-13 10:08:10 -07:00
nishantmonu51
0ddaf4c307
fix race while index closing and querying
2014-06-13 15:36:57 +05:30
nishantmonu51
025814cfff
Merge branch 'master' into offheap-incremental-index
...
Conflicts:
processing/src/main/java/io/druid/query/GroupByParallelQueryRunner.java
processing/src/main/java/io/druid/query/groupby/GroupByQueryRunnerFactor
y.java
processing/src/main/java/io/druid/segment/incremental/IncrementalIndexSt
orageAdapter.java
2014-06-12 20:02:27 +05:30
nishantmonu51
a7e19ad892
configure buffer sizes
2014-06-12 19:32:37 +05:30
fjy
ad7db018d5
add default tuning config for index task
2014-06-09 16:28:47 -07:00
fjy
5e48f3907b
fix index task new schema defaults
2014-06-09 16:24:12 -07:00
nishantmonu51
5bdc4a761a
close IncrementalIndex properly and free up buffer
2014-06-05 19:41:43 +05:30
nishantmonu51
6265613bb9
Merge branch 'master' into offheap-incremental-index
2014-06-05 17:42:57 +05:30
nishantmonu51
01e8a713b6
unit tests passing with offheap-indexing
2014-06-05 17:42:53 +05:30
fjy
9f4cc5ca1f
fix test
2014-06-04 16:29:20 -07:00
fjy
77ec4df797
update guava, java-util, and druid-api
2014-06-03 13:43:38 -07:00
Gian Merlino
8a6384862f
Better task log errors.
2014-06-02 12:55:17 -07:00
Gian Merlino
48511e15cf
Autoscaling: Fix case where target < current, but target is too low.
...
This can happen when workers are provisioned manually.
2014-06-02 12:36:48 -07:00
fjy
7ffe60ca60
clean up a bit of the logic in RTR failure with exceptions in announceTask
2014-06-02 10:49:28 -07:00
nishantmonu51
6f176cee85
fail task on exception
...
If the task announcement throw exception, set status to fail instead of
retrying it again and again.
2014-06-02 22:52:50 +05:30
fjy
4c13327297
more logging for determine hashed
2014-05-30 16:19:20 -07:00
Xavier Léauté
1a8eb25852
update realtime tasks to use FilteredServerView
2014-05-29 16:41:26 -07:00
fjy
7be93a770a
make all firehoses work with tasks, add a lot more documentation about configuration
2014-05-28 16:33:59 -07:00
fjy
09ad32c5c5
fix race condition with merge and persist and sink adding
...
Conflicts:
indexing-service/src/main/java/io/druid/indexing/common/task/IndexTask.java
server/src/main/java/io/druid/segment/realtime/RealtimeManager.java
server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumber.java
2014-05-16 15:21:09 -07:00
fjy
d75cc7b9b8
fix more serde
2014-05-06 15:17:38 -07:00
fjy
1100d2f2a1
rename configs to make a bit more sense
2014-05-06 14:52:50 -07:00
fjy
b6fb4245aa
Merge branch 'master' into new-schema
...
Conflicts:
indexing-hadoop/src/main/java/io/druid/indexer/HadoopDriverConfig.java
indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfig.java
indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfigBuilder.java
pom.xml
server/src/main/java/io/druid/segment/realtime/RealtimeManager.java
server/src/main/java/io/druid/segment/realtime/firehose/EventReceiverFirehoseFactory.java
2014-05-06 14:32:51 -07:00
fjy
79e6d4eb56
Merge pull request #528 from metamx/union-query-source
...
Union query source
2014-05-06 14:13:33 -06:00
Gian Merlino
bdf9e74a3b
Allow config-based overriding of hadoop job properties.
2014-05-06 09:11:31 -07:00
nishantmonu51
728a606d32
Add support for union queries
2014-05-01 01:54:52 +05:30
fjy
171d20d52d
Merge branch 'move-firehose' of github.com:metamx/druid into move-firehose
2014-04-25 14:13:19 -07:00
fjy
eef034ca7e
Merge branch 'master' of github.com:metamx/druid into move-firehose
2014-04-25 14:13:08 -07:00
fjy
76e0a48527
Merge branch 'master' into new-schema
...
Conflicts:
indexing-hadoop/src/main/java/io/druid/indexer/DbUpdaterJob.java
indexing-hadoop/src/test/java/io/druid/indexer/HadoopDruidIndexerConfigTest.java
indexing-service/src/main/java/io/druid/indexing/common/task/HadoopIndexTask.java
server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumber.java
server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumberSchool.java
2014-04-25 14:03:28 -07:00
fjy
174ff66a88
Merge pull request #504 from metamx/shard-count-limit-fix
...
Fix shard count limit
2014-04-23 10:12:05 -06:00
nishantmonu51
6eb282b3a8
Fix shard count limit
...
Fix shard count limit at another place,
Use partitioned as a flag
2014-04-23 13:56:06 +05:30
Gian Merlino
8d650ae131
PullDependencies changes.
...
- Pull default Hadoop version by default.
- Allow additional coordinates on the command line.
2014-04-22 14:33:53 -07:00
fjy
c00fb1d08e
Merge pull request #445 from metamx/hadoop-version-update
...
Hadoop version update
2014-04-17 19:16:44 -06:00
fjy
9b06b6aa58
misc fixes for router
2014-04-15 14:23:54 -07:00
fjy
9bfc738032
im deeply saddened but another case sensitivity problem and want to resolve these once and for all
2014-04-14 16:20:31 -07:00
nishantmonu51
0c95c0b689
moved file
2014-04-09 14:25:31 +05:30
fjy
1843316db6
commonalize event receiver firehose
2014-04-03 20:46:36 -07:00
Xavier Léauté
e0ff2aa0d6
make isPostgreSQL check URI instead of metadata
2014-03-27 12:13:10 -07:00
nishantmonu51
a24fc84d0d
Merge branch 'master' into hadoop-version-update
2014-03-27 03:32:11 +05:30
nishantmonu51
a9a6682a0e
Upgrade to Hadoop 2.3.0
2014-03-26 11:43:31 +05:30
fjy
4360c050eb
fix broken ut
2014-03-25 11:13:52 -07:00
fjy
771aa2ae68
backwards compat
2014-03-25 09:23:48 -07:00
fjy
5ebb2d27e3
fix hadoop
2014-03-24 18:43:31 -07:00
fjy
2adcf07f5f
Merge branch 'master' into new-schema
...
Conflicts:
indexing-hadoop/src/main/java/io/druid/indexer/DetermineHashedPartitionsJob.java
indexing-service/src/main/java/io/druid/indexing/common/task/RealtimeIndexTask.java
indexing-service/src/test/java/io/druid/indexing/common/task/TaskSerdeTest.java
processing/src/test/java/io/druid/segment/TestIndex.java
server/src/main/java/io/druid/segment/realtime/RealtimeManager.java
server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumberSchool.java
2014-03-17 10:59:31 -07:00
nishantmonu51
04f3d0a13a
fix cli hadoop indexer
...
* run determine partitions job using CLI
2014-03-11 16:36:19 +05:30
fjy
b5181a8a89
Merge pull request #422 from tucksaun/feat-postgresql-support
...
Fixed segments SQL queries for PostgreSQL compatibility
2014-03-07 13:42:57 -07:00
Yuval Oren
4b63645802
Merge branch 'master' into subquery
...
Conflicts:
processing/src/main/java/io/druid/query/BaseQuery.java
processing/src/main/java/io/druid/query/groupby/GroupByQueryQueryToolChest.java
processing/src/test/java/io/druid/query/QueryRunnerTestHelper.java
processing/src/test/java/io/druid/query/topn/TopNQueryRunnerTest.java
2014-03-06 13:33:59 -08:00
Tugdual Saunier
ae38c92491
Fixed segments SQL queries for PostgreSQL compatibility
2014-03-06 19:14:32 +00:00
Gian Merlino
70db460f97
Blocking Executors and maxPendingPersists, oh my!
...
- Execs.newBlockingSingleThreaded can now accept capacity = 0.
- Changed default maxPendingPersists from 2 to 0.
- Fixed serde of maxPendingPersists in RealtimeIndexTasks.
2014-03-05 10:55:12 -08:00
fjy
5db00afb37
clean up and default values
2014-03-04 14:38:27 -08:00
fjy
c4c4d80336
make local testing pass
2014-03-03 14:52:43 -08:00
fjy
46b9ac78e7
Merge branch 'master' into new-schema
...
Conflicts:
indexing-hadoop/src/test/java/io/druid/indexer/HadoopDruidIndexerConfigTest.java
pom.xml
publications/whitepaper/druid.pdf
publications/whitepaper/druid.tex
2014-03-03 14:48:15 -08:00
fjy
bf2ddda897
unit tests passing after more refactoring
2014-02-27 15:21:09 -08:00
Yuval Oren
314e38a2c6
Fixed realtime index query serving failure due to incorrect treatment of DataSource
2014-02-26 22:36:27 -08:00
fjy
d57974d58b
Merge pull request #407 from metamx/restore-task
...
Restore task
2014-02-26 19:18:58 -07:00
Tugdual Saunier
e40725d5f3
Added support for PostgreSQL on overlord nodes
2014-02-26 01:24:37 +01:00
Xavier Léauté
7dbafa5453
fix task id
2014-02-25 13:49:39 -08:00
Xavier Léauté
e2defe8bf1
update copyright date
2014-02-25 13:44:15 -08:00
Xavier Léauté
2f61035585
add restore task
2014-02-25 13:41:40 -08:00
fjy
5d2367f0fd
unit tests pass at this point
2014-02-20 15:52:12 -08:00
fjy
20cac8c506
not compiling yet but close
2014-02-19 15:54:27 -08:00
fjy
4b7c76762d
unit tests passingn at this point, finished rt port maybe
2014-02-18 15:14:38 -08:00
fjy
3979eb270c
Revert "Revert "Merge branch 'determine-partitions-improvements'""
...
This reverts commit 189b3e2b9b
.
2014-02-14 12:58:56 -08:00
fjy
a8c4362d72
rejiggering druid api
2014-02-14 12:57:52 -08:00
fjy
189b3e2b9b
Revert "Merge branch 'determine-partitions-improvements'"
...
This reverts commit 7ad228ceb5
, reversing
changes made to 9c55e2b779
.
2014-02-14 12:47:34 -08:00
nishantmonu51
7ad228ceb5
Merge branch 'determine-partitions-improvements'
...
Conflicts:
pom.xml
2014-02-12 10:51:26 +05:30
Gian Merlino
b9556e2e2e
SimpleResourceManagementStrategy: Fix log/return
2014-02-07 10:02:45 -08:00
Gian Merlino
5ec634e498
SimpleResourceManagementStrategy: Scale up to minWorkerCount when increased
2014-02-06 13:20:09 -08:00
fjy
0f6af72ea4
Merge branch 'master' into new-schema
2014-02-06 12:46:13 -08:00
fjy
af48273369
fix bug with dynamic configs in coordinator not working
2014-02-06 11:19:05 -08:00
nishantmonu51
bacc72415f
correct locking and partitionsSpec
2014-02-05 03:17:47 +05:30
fjy
14d0e54327
first commit
2014-02-03 14:15:03 -08:00
Gian Merlino
994c7e3fa8
RemoteTaskActionClient: Retry on ChannelExceptions too
2014-02-03 08:16:51 -08:00
fjy
019be5c3b0
update jquery
2014-01-31 12:37:37 -08:00
nishantmonu51
97e5d68635
determine intervals working with determine partitions
2014-01-31 19:04:52 +05:30
fjy
0c789412bb
add a workaround for jackson bug where jacksoninject fails when a null value is passed through json creator annotated constructor
2014-01-25 07:07:27 +08:00
fjy
2ff86984da
fix broken ut
2014-01-21 10:47:45 -08:00
fjy
ebc66df27d
use terminateWithIds in terminate
2014-01-20 16:58:53 -08:00
fjy
1d81ad2946
remove unused class
2014-01-20 16:45:54 -08:00
fjy
1ecc9d0f98
fix the edge case where autoscaling tries to terminate node without ip
2014-01-20 16:44:19 -08:00
nishantmonu51
3fb72aff93
rename maxPendingPersistBatches to maxPendingPersists
2014-01-17 12:59:21 +05:30
nishantmonu51
fb819abd6f
make maxPendingPersistPeriod configurable
2014-01-17 11:01:55 +05:30
Gian Merlino
1331f2ce56
TaskStorage.add() now throws TaskExistsException, and the servlet respects it
...
The servlet will throw 400 rather than 500 when a task already exists, to
signify that the request has no hope of ever working.
2014-01-13 15:42:05 -08:00
Gian Merlino
a72c4429f7
RemoteTaskRunner: Fix NPE on cleanup due to missing withWorker
2014-01-13 15:41:04 -08:00
fjy
f3476f40e1
fix typo
2014-01-10 18:08:33 -08:00
fjy
f4e3f02c3b
more exceptions
2014-01-10 18:06:42 -08:00
fjy
1ecc94cfb6
another attempt at index task
2014-01-10 17:56:22 -08:00
fjy
f0b4d0c1e4
fix small bug with unusable dims
2014-01-10 14:59:09 -08:00
fjy
fe50104053
fix the index task and more docs
2014-01-10 14:47:18 -08:00
nishantmonu51
da01c4a78a
Add registration for backward compatibility
2014-01-10 02:02:06 +05:30
nishantmonu51
d28f9daccb
Remove duplicate registration of service
...
If the serviceName does not contain ":" this leads to duplicate
registration of firehose with same name
2014-01-10 01:17:41 +05:30
Gian Merlino
9037141c00
IndexTask: Better logging at the end of each segment
2014-01-08 15:22:12 -08:00
Gian Merlino
2c53af4d66
ForkingTaskRunner: Upload task logs even when job fails
2014-01-08 14:46:18 -08:00
Gian Merlino
7f430d9fde
RealtimeIndexTask: If a Throwable was thrown it is not a normalExit
2014-01-08 14:45:35 -08:00
Gian Merlino
83b4641e31
ForkingTaskRunnerConfig: Add java.io.tmpdir to allowedPrefixes
2014-01-07 16:12:24 -08:00
Gian Merlino
bf158102c4
IndexTask: Print metrics even if finishJob fails
2014-01-07 07:17:19 -08:00
Gian Merlino
26991b5a2a
Indexing service: Fix termination related log message
2013-12-20 12:05:42 -08:00
Gian Merlino
4d83837e88
RealtimeIndexTask: Clean up imports and comments
2013-12-20 11:37:16 -08:00
Gian Merlino
17ad4ee2f0
Fix RemoteTaskRunnerTest
2013-12-20 11:23:28 -08:00
Gian Merlino
e5b8546d19
Autoscaling fixes.
...
- Initial targetWorkerCount must be subject to pool size limits
- Use consistent workerSetupData for the entire autoscaling run
- Don't call terminate() when we have nothing to terminate
- Terminate obsolete workers even faster
2013-12-20 11:17:01 -08:00
fjy
3ec2766cd3
Merge pull request #339 from metamx/autoscaling
...
Autoscaling: Move target count independent of actual count.
2013-12-20 10:04:26 -08:00
Gian Merlino
6224577ed1
Autoscaling: Terminate obsolete workers faster
2013-12-20 10:01:32 -08:00
Gian Merlino
4a722c0a6d
Autoscaling changes from code review.
...
- Log and return immediately when workerSetupData is null
- Allow provisioning more nodes while other nodes are still provisioning
- Add tests for bumping up the minimum version
2013-12-20 08:59:35 -08:00
Gian Merlino
0ee6136ea3
NoopTask: Fix things that should be static. Add simple factory method.
2013-12-20 08:56:17 -08:00
Gian Merlino
3dd9a25546
Fix import
2013-12-19 16:18:16 -08:00
Gian Merlino
0ff7f0e8e0
TaskActionToolbox: Combine adjacent ifs
2013-12-19 16:16:34 -08:00
Gian Merlino
f86342f7dc
DbTaskStorage: Protect against invalid lock_payload
2013-12-19 16:16:20 -08:00
Gian Merlino
1f4b99634f
Autoscaling: Move target count independent of actual count.
...
This should let us grow and shrink the worker pool in chunks when necessary
(like when a bunch of them go offline, or when there is a worker version
change).
2013-12-19 16:11:30 -08:00
Gian Merlino
846c3da4ab
Empty task intervals, and empty lock intervals, aren't useful.
...
So prevent them from being created, through checks in AbstractFixedIntervalTask
and TaskLockbox.tryLock.
2013-12-19 13:21:41 -08:00
Gian Merlino
566a3a6112
Indexing service: Break up segment actions
...
Each one now one operates on at most a collection of segments that comprise
a single partition. The main purpose of this change is to prevent audit log
payload sizes from getting out of control.
2013-12-19 13:10:40 -08:00
Gian Merlino
6fbe67eeea
IndexerDBCoordinator: Work around SELECT -> INSERT races when adding segments
2013-12-19 13:10:40 -08:00
Gian Merlino
1ff855d744
Fix MoveTask serde and ArchiveTask id creation
2013-12-18 15:17:12 -08:00
Gian Merlino
58d1262edf
Indexing console: Clarify "Complete" with "recently completed"
2013-12-17 08:16:49 -08:00
Xavier Léauté
e333776aca
rename SegmentMoveAction to SegmentMetadataUpdateAction
2013-12-16 14:00:56 -08:00
Xavier Léauté
ac2ca0e46c
separate move and archive tasks
2013-12-16 14:00:55 -08:00
Xavier Léauté
123bddd615
update for new interfaces
2013-12-16 13:59:16 -08:00
Xavier Léauté
4a291fdf30
better naming
2013-12-16 13:59:16 -08:00
Xavier Léauté
a417cd5df2
add archive task
2013-12-16 13:59:15 -08:00
fjy
87b83bceb1
fix task storage config serde and prepare for next release
2013-12-13 16:55:22 -08:00
fjy
01f9c1df31
fix broken task storage config and prepare for next release
2013-12-13 16:45:32 -08:00
Gian Merlino
600dc7546f
Configurability of recency threshold
2013-12-13 16:02:54 -08:00
fjy
4a8140be81
better messaging to console again
2013-12-13 15:04:25 -08:00
fjy
52cdb20f10
add better messaging and error handling
2013-12-13 15:01:07 -08:00
Gian Merlino
e63c69dd57
TaskStorage: Return recently complete tasks in reverse chronological order
2013-12-13 12:27:45 -08:00
Gian Merlino
6c993d87bf
Indexing service API and GUI improvements!
...
- New APIs: waitingTasks, completeTasks, task payload
- GUI for the above, and for task logs + status
2013-12-13 11:38:18 -08:00
Gian Merlino
f36a5b677c
TaskLifecycleTest: Add test for noop task
2013-12-13 07:48:28 -08:00
Gian Merlino
3b053a66ff
TaskLifecycleTest: Add test for never-ready task
2013-12-13 07:48:27 -08:00
Gian Merlino
863012c384
TaskQueue: Exception during isReady does not warrant an alert.
2013-12-13 07:48:27 -08:00
Gian Merlino
6227963af9
TaskQueue: Copy task list before management loop.
2013-12-13 07:48:27 -08:00
Gian Merlino
70c153592f
CliPeon: Fix local mode
2013-12-12 14:22:57 -08:00
Gian Merlino
370e2f855a
TaskSerdeTest: Fix IndexTask test by including an actual firehoseFactory
2013-12-12 13:58:44 -08:00
Gian Merlino
169f149cf9
TaskLifecycleTest: Fix broken setUp and broken assumptions.
2013-12-12 13:51:13 -08:00
Gian Merlino
ba757b1e5a
IndexTask: Actually make and publish segments for the correct intervals.
2013-12-12 13:50:53 -08:00
Gian Merlino
be25d51a2c
RemoteTaskRunner: Fix issues leading to failing tests
2013-12-12 13:49:49 -08:00
Gian Merlino
c60158a21a
RemoteTaskRunner: Remove task from pendingTaskPayloads on shutdown if needed
2013-12-12 10:59:16 -08:00
Gian Merlino
0129ea99cf
RemoteTaskRunner changes to make bootstrapping actually work.
...
- Workers are not added to zkWorkers until caches have been initialized.
- Worker status we haven't heard about will be added to runningTasks or
completeTasks as appropriate.
- TaskRunnerWorkItem now only needs a taskId, not the entire Task. This makes
it possible to create them from TaskStatus objects, if that's all we have.
- Also remove some dead code.
2013-12-12 10:44:46 -08:00
Gian Merlino
d92b88718c
OverlordResource: Fix comment
2013-12-12 08:46:24 -08:00
Gian Merlino
b6a52610bc
IndexTask: Call plumber.startJob()
2013-12-12 08:46:10 -08:00
Gian Merlino
db9b515e71
IndexTask: Remove unnecessary args to determinePartitions.
2013-12-12 08:46:00 -08:00
Gian Merlino
f4a09d4ee3
TaskAction: Add JsonSubType for LockTryAcquireAction
2013-12-12 08:45:23 -08:00
Gian Merlino
b17dc6f744
Task interval, isReady hygiene
2013-12-11 22:42:20 -08:00
Gian Merlino
05e24bd85c
RemoteTaskRunner: Fix typo
2013-12-11 22:38:04 -08:00
Gian Merlino
bed263efa5
VersionConverterTask: Less goofy import for Preconditions
2013-12-11 22:37:55 -08:00
Gian Merlino
53d90efe30
TaskQueueConfig: Copyright header
2013-12-11 22:37:40 -08:00
Gian Merlino
0adda97776
AbstractFixedIntervalTask: Copyright header
2013-12-11 22:37:28 -08:00
Gian Merlino
c4b8c8bc6f
Rework indexing service internals to hopefully be more reliable.
...
The TaskQueue directly manages the TaskRunner. The main management loop runs
periodically and checks that the runner is doing reasonable things. If not, it
attempts to adjust the runner. The management loop also runs on-demand when a
task is added to keep task assignment relatively low latency. The TaskConsumer
is no longer necessary and so it no longer exists.
Task interval locks are handled differently. Instead of some tasks acquiring
locks at runtime and some tasks having implicit fixed lock intervals, all tasks
ask for locks explicitly. This occurs either in "isReady" (which runs on the
overlord) or in "run" (which runs on the peon).
Other changes:
- The TaskQueue is attached to the leader lifecycle, instead of global
- The TaskLockbox is able to sync itself from storage and is no longer
bootstrapped by the TaskQueue.
- RemoteTaskRunner does not clean up zk paths until asked to. This will
prevent deletion of statuses that have not yet been committed.
- Added retries on DbTaskStorage operations.
- Removed SpawnTasksAction (no more subtasks)
- Removed obsolete EventReceiverFirehose configs
- Removed obsolete OldOverlordResource
- Removed TaskStorageQueryAdapter methods related to subtasks
2013-12-11 15:05:16 -08:00
fjy
96f679f31c
clean up for merge
2013-12-10 17:51:13 -08:00
Gian Merlino
f3cfd1d781
Introduce FileTaskLogs, and move TaskLogs module from server to indexing-service
2013-12-10 17:39:43 -08:00
Gian Merlino
47c1c8cab2
TaskStorage: Rename getRunningTasks -> getActiveTasks
2013-12-10 17:39:42 -08:00
fjy
303f6ff334
fix worker config setup problems
2013-12-09 18:25:29 -08:00
nishantmonu51
2186bd6cb7
Minor fixes and documentation changes
2013-12-09 19:07:48 +05:30
fjy
5d7173ac98
fix autoscaling termination duration bug
2013-11-15 19:21:25 -08:00
fjy
346cf0e04c
fix out of order in urls for hadoop classpath
2013-11-14 18:18:39 -08:00
fjy
6b41681424
fix ordering of urls, ugh, need to write more tests
2013-11-14 15:06:07 -08:00
fjy
23ebca6d32
more hadoop dependency hell, getting the right urls to hadoop is hard
2013-11-14 14:29:42 -08:00
fjy
c51eed060f
actually use a class loader pull out dependencies from extension modules
2013-11-14 11:52:21 -08:00
fjy
fcf0c6ce06
change the classpath ordering for batch processing and prepare for next release
2013-11-14 10:52:51 -08:00
fjy
64b93bf448
fix broken autoscaling and prepare for next release
2013-11-08 11:29:42 -08:00
fjy
a049b42674
fix an issue with task tables not getting created automatically and prepare for next release
2013-11-07 18:01:35 -08:00
fjy
7f85a126ac
fix broken event receiving firehose
2013-11-07 16:59:01 -08:00
fjy
621133d6f2
removing dead code
2013-11-07 16:53:34 -08:00
fjy
084c90aa19
cleanup to prepare for next release
2013-11-07 15:55:51 -08:00
fjy
bad1a7e9f8
fix according to code review
2013-11-07 15:52:34 -08:00
fjy
aeb411a3a3
fix according to code review and fix broken examples
2013-11-07 15:42:48 -08:00
fjy
e2e10fae1f
clean up code
2013-11-07 15:03:14 -08:00
fjy
913ff3a082
clean up code
2013-11-07 15:00:11 -08:00
fjy
6b573c76f1
more fixes
2013-11-07 14:40:45 -08:00
Gian Merlino
8660db93fc
RemoteTaskRunner: Run taskComplete after a task times out
2013-11-04 10:56:26 -08:00
Gian Merlino
186bbd1cb6
WorkerTaskMonitor: Add log message
2013-11-04 10:25:16 -08:00
Gian Merlino
781673a8f8
ForkingTaskRunner: Fix pass-down of nodeType
2013-10-23 14:13:05 -07:00
Gian Merlino
b68b3526e8
IndexingServiceFirehoseModule: Add header
2013-10-18 13:27:40 -07:00
Gian Merlino
2e9c46867f
Fixes for indexing service.
...
- Create IndexingServiceFirehoseModule so firehoses can be loaded by all mains
- Fix implicit lock acquisition in AbstractTask
2013-10-18 11:14:33 -07:00
fjy
4862852b43
more docs about how to use different versions of hadoop in druid
2013-10-16 17:54:49 -07:00
fjy
6192602893
fix extensions config not getting picked up in hadoop index task
2013-10-16 16:52:23 -07:00
fjy
a1c09df17f
make the hadoop index task work again
2013-10-16 09:45:17 -07:00
fjy
9796a40b92
port docs over to 0.6 and a bunch of misc fixes
2013-10-11 18:38:53 -07:00
fjy
4e509d1d09
Merge branch 'master' into is-docs
2013-10-09 14:05:10 -07:00
cheddar
c47fe202c7
Fix HadoopDruidIndexer to work with the new way of things
...
There are multiple and sundry changes in here.
First, "HadoopDruidIndexer" has been split into two pieces, (1) CliHadoop which pulls the hadoop version and builds up the right classpath with the proper hadoop version to run the indexer and (2) CliInternalHadoopIndexer which actually runs the indexer.
In order to work around a bunch of jets3t version conflicts with Hadoop and Druid, I needed to extract the S3 deep storage stuff into its own module. I then also moved the HDFS stuff into its own module so that I could eliminate the dependency on Hadoop for druid-server.
In doing these changes, I wanted to make the extensions buildable with only the druid-api jar, so a few other things had to move out of Druid and into druid-api. They are all API-level things, however, so they really belong in druid-api instead.
Lastly, I removed the druid-realtime module and put it all in druid-server.
2013-10-09 15:15:44 -05:00
fjy
4ec4b8e024
rewrite indexing service docs
2013-10-08 16:34:58 -07:00
fjy
703b674800
add availability zone info to autoscaling
2013-10-07 12:16:50 -07:00
fjy
ac330f72bb
first set of changes to standarize the naming convention we use in druid
2013-10-03 16:36:48 -07:00
fjy
17874eeb67
make the CliPeon actually able to run on its own
2013-10-02 15:55:10 -07:00
fjy
bc8db7daa5
1) make chat handler resource work again
...
2) add more default configs
3) make examples work again
2013-10-02 14:22:39 -07:00
Gian Merlino
384dcda7e4
Chat handlers still don't work, but, they're closer maybe.
2013-10-01 17:45:53 -07:00
fjy
59f2d0711d
Merge branch 'master' into local-index
2013-10-01 13:21:43 -07:00
fjy
30df53671e
remove line iterator factory because it is not needed
2013-10-01 13:21:20 -07:00
Gian Merlino
62eda5020c
ShardSpec: Remove isInChunk(Map<String, String>)
2013-10-01 12:50:08 -07:00
fjy
5d0d71250b
fix chat handler resources not correctly registering themselves
2013-10-01 11:25:39 -07:00
fjy
53698a135a
add interface to new firehose as per code review comments
2013-09-30 18:00:59 -07:00
fjy
f55a5199b1
add a firehose module to remove so much copy and pasted code
2013-09-30 16:29:20 -07:00
fjy
ed9e0cf9f6
add a local firehose for indexing local files
2013-09-30 16:03:26 -07:00
Gian Merlino
dc5dab8747
Fixes for property conversion, firehose registration, and the indexing service
2013-09-27 17:09:59 -07:00
fjy
0b04325ee8
fix things up according to code review comments
2013-09-27 10:17:45 -07:00
fjy
e404295c1f
make indexing service work
2013-09-26 17:44:21 -07:00
fjy
8bc56daa66
fix things up according to code review comments
2013-09-26 11:35:45 -07:00
fjy
87259321b6
port hadoop druid indexer to new guice framework
2013-09-26 11:04:42 -07:00
fjy
15843c3978
refactor how server service discovery is done
2013-09-24 10:36:26 -07:00
fjy
dc8a119787
fix broken unit tests are a result of the last merge
2013-09-23 12:56:01 -07:00
cheddar
5712b29c8c
Fix issues with bindings and handling extensions
...
The way the Guice bindings were setup previously, each process only had bindings
for the things it cared about. This became problematic when adding extension modules
that bound everything that they could possibly need expecting that the processes would
only instantiate what they actually do need. Guice tries to fail-fast and verifies that all
bindings exist before it does anything, which is a problem because the extension bind
some objects that don't necessarily have all of their dependencies bound in all processes.
The fix for this is to build a single Injector with all bindings in it and let each of the
processes only load the things that they care about. This also requires the use of
Module overrides and other such interesting things, which are node done.
In doing the fix, I also swapped out the way that the DataSegmentPusher/Puller stuff is bound, as well as made the Cassandra stuff fail if its settings are not provided. This all of a sudden made all of the things require Cassandra's settings, so I migrated the Cassandra deep storage stuff into its own module.
In doing these changes, I also discovered that some properties weren't properly converting for the ConvertProperties command (specifically, the properties related to data segment loading and pushing), so I fixed that.
2013-09-20 17:45:01 -05:00
fjy
cabae7993d
port over multi threaded realtime and also fix broken realtime nodes that can't start up
2013-09-16 16:03:47 -07:00
fjy
f7c10e3594
rework tests in indexing service to be more unit testy
2013-09-12 16:37:58 -07:00
cheddar
a2dcc45a8e
1) Remove SingleSegmentLoader and replace with OmniSegmentLoader
2013-09-12 11:47:03 -05:00
cheddar
6c9a107356
1) remove duplicate package initialization.initialization
2013-09-09 17:02:57 -05:00
cheddar
3c39f90c89
1) Move Firehose interface and dependencies to druid-api
...
2) Move DataSegment* interfaces and dependencies to druid-api
2013-08-31 16:43:28 -05:00
cheddar
5ab671050e
No more com.metamx.druid, it is now all io.druid!
2013-08-30 19:42:12 -05:00