Gian Merlino
fc55314d1c
ForkingTaskRunner: Log without buffering.
...
In #933 the ForkingTaskRunner's logging was changed from unbuffered to
buffered. This means that the last few KB of the logs are generally
not visible while a task is running, which makes debugging running
tasks difficult.
2015-11-07 15:16:53 -08:00
Charles Allen
929b981710
Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to
2015-11-05 18:10:13 -08:00
Gian Merlino
cb409ee928
RemoteTaskActionClient: Fix statusCode check.
2015-11-05 10:03:49 -08:00
fjy
8f231fd3e3
cleanup druid codebase
2015-11-04 13:59:53 -08:00
Himanshu Gupta
84f7d8d264
making static final variables in HadoopDruidIndexerConfig upper case
2015-11-02 23:24:26 -06:00
Himanshu Gupta
8b67417ac8
make methods in Index[Merger,Maker,IO] non-static so that they can have
...
appropriate ObjectMapper injected instead of creating one statically
2015-11-02 23:24:26 -06:00
Gian Merlino
16ae8866b8
Log and continue on failure to schedule cleanup for missing workers at startup.
2015-10-28 08:10:54 -07:00
Gian Merlino
513bc76252
RTR: Ensure that there is only one cleanup task scheduled for a worker at once.
...
This is accomplished by making sure that scheduleTasksCleanupForWorker is only called
from the PathChildrenCache event thread, having it cancel existing cleanup tasks when
it adds a new one, and having tasks check on finish that the thing they are removing
from the task list is actually themselves.
2015-10-27 21:16:58 -07:00
Fangjin Yang
ea2267e08c
Merge pull request #1868 from gianm/fix-announcements
...
Historical and MiddleManager server announcements should not remove parents.
2015-10-27 14:50:05 -07:00
Gian Merlino
7df7370935
Merge pull request #1862 from metamx/indexingServiceMMGone
...
Add timeout to shutdown request to middle manager for indexing service
2015-10-27 14:38:01 -07:00
Charles Allen
44a2b204df
Add timeout to shutdown request to middle manager for indexing service
2015-10-27 13:56:03 -07:00
Gian Merlino
4b92752deb
Historical and MiddleManager server announcements should not remove parents.
...
Removing parent paths causes watchers of the "announcements" path to get stuck
and stop seeing new updates.
2015-10-27 08:06:11 -07:00
Bingkun Guo
4914925d65
New extension loading mechanism
...
1) Stop using the Maven client to download extensions at runtime.
2) Provide a way to load Druid extensions and hadoop dependencies through the file system.
3) Refactor pull-deps so that it can download extensions into extension directories.
4) Add documentation on how to use this new extension loading mechanism.
5) Change how the Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0
are packaged within the Druid tarball.
2015-10-21 14:22:36 -05:00
Charles Allen
532e1c9fd5
Do not pass `druid.indexer.runner.javaOpts` to Peon as a property
...
* Still places `druid.indexer.runner.javaOpts` on the command line, but the Peon no longer tries to have the property `druid.indexer.runner.javaOpts` set
* Fixes https://github.com/druid-io/druid/issues/1841
2015-10-20 09:24:01 -07:00
Charles Allen
bf11723a52
Update usages of io.druid.client.selector.Server to build URL or URI directly instead of using String.format
2015-10-12 12:30:56 -07:00
Charles Allen
2d847ad654
Merge pull request #1730 from metamx/union-queries-fix
...
fix #1727 - Union bySegment queries fix
2015-09-29 12:23:25 -07:00
Nishant
573aa96bd6
fix #1727 - Union bySegment queries fix
...
Fixes #1727.
revert to doing merging for results for union queries on broker.
revert unrelated changes
Add test for union query runner
Add test
remove unused imports
fix imports
fix renamed file
fix test
update docs.
2015-09-29 23:32:36 +05:30
Charles Allen
d2e400f063
Merge pull request #1740 from metamx/validate-locks
...
fix #1715
2015-09-29 09:38:42 -07:00
Xavier Léauté
25bbc0b923
Merge pull request #1778 from gianm/redirect-fixes
...
Redirect fixes
2015-09-25 09:54:48 -07:00
Gian Merlino
348172203f
OverlordRedirectInfo: Fix ability to detect that there is no leader.
2015-09-25 09:30:09 -07:00
Parag Jain
b630720164
fail task if finishJob throws any exception
...
add realtime task failure test
2015-09-25 10:55:45 -05:00
Fangjin Yang
aa9d90355e
Merge pull request #1772 from gianm/fix-overlord-startup
...
RemoteTaskRunner: Fix for starting an overlord before any workers ever existed.
2015-09-24 21:55:03 -07:00
Gian Merlino
63bf021077
RemoteTaskRunner: Fix for starting an overlord before any workers ever existed.
2015-09-24 21:15:36 -07:00
Himanshu Gupta
6e550d5346
update doc about aggregation field in merge task and add a null check
2015-09-24 22:25:07 -05:00
Nishant
b638400acb
fix #1715
...
fixes #1715
- TaskLockBox has a set of active tasks
- lock requests throw an exception if they are from a task not in the
active task set.
- TaskQueue is responsible for updating the active task set on
tasklockbox
review comment
remove duplicate line
use ISE instead
organise imports
2015-09-24 10:06:50 +05:30
Himanshu
61b0743943
Merge pull request #1748 from metamx/forkingJavaOptionsWithQuotes
...
Allow ForkingTaskRunner javaOpts to have quoted arguments which contain spaces
2015-09-21 21:03:00 -05:00
Charles Allen
465035e531
Allow ForkingTaskRunner javaOpts to have quoted arguments which contain spaces
2015-09-21 17:32:27 -07:00
Fangjin Yang
e48f6dd660
Merge pull request #1736 from gianm/additional-ingest-segment-timeline-test
...
IngestSegmentFirehoseFactoryTimelineTest for overshadowing of the middle of a segment.
2015-09-17 14:42:29 -07:00
Gian Merlino
64e33b2bcb
IngestSegmentFirehoseFactoryTimelineTest for overshadowing of the middle of a segment.
2015-09-16 10:17:43 -07:00
Himanshu Gupta
74f4572bd4
Lazily deserialize "parser" to InputRowParser in DataSchema
...
so that user hadoop related InputRowParsers are created only when needed
this allows overlord to accept a HadoopIndexTask with a hadoopy InputRowParser
and not fail because hadoopy InputRowParser might need hadoop libraries
2015-09-16 10:58:13 -05:00
Charles Allen
f5ed6e885c
Merge pull request #1702 from himanshug/double_datasource_in_storage_dir
...
do not have dataSource twice in path to segment storage on hdfs
2015-09-15 14:00:35 -07:00
Nishant
4681ff22ed
add task duration in response for completed tasks
2015-09-10 13:51:50 +05:30
Himanshu Gupta
fe0233adf2
removing unused imports from HadoopIndexTask
2015-09-09 11:12:01 -05:00
Nishant
47aac991ec
add null check for task context.
...
make variable final
2015-09-04 22:19:01 +05:30
Fangjin Yang
75a582974b
Merge pull request #1639 from gianm/new-plumber
...
New plumber
2015-09-03 18:52:57 -07:00
Gian Merlino
062a47fba4
Modify Plumbers in these ways,
...
1) Persist using Committer instead of Runnable. (Although the metadata object
is ignored in this patch)
2) Remove the getSink method.
3) Plumbers are now responsible for time-based and hydrant-full-based periodic
committing. (FireChief, RealtimeIndexTask, and IndexTask used to do this)
2015-09-03 11:13:06 -07:00
Nishant
726326abc3
Add Task Context and ability to override task specific properties
...
override javaOpts
fix compilation
review comments
Add Test for typecast
review comments - remove unused method.
2015-09-03 23:36:32 +05:30
Gian Merlino
940e1aa3eb
Replace funky imports with standard ones.
...
1) Lots of Guava imports were not coming from the actual Guava
2) junit.framework.Assert should be org.junit.Assert
2015-08-28 18:02:05 -07:00
Gian Merlino
414a6fb477
Fix overlapping segments in IngestSegmentFirehose, DatasourceInputFormat.
...
Fixes #1678. IngestSegmentFirehose (and its users) need to remember which
windows of which segments should actually be read, based on a timeline.
2015-08-28 07:32:41 -07:00
Himanshu Gupta
2e0dd1d792
adding UTs and addressing review comments to
...
firehoseV2 addition to Realtime[Manager|Plumber],
essential segment metadata persist support,
kafka-simple-consumer-firehose extension patch
2015-08-27 20:50:46 -05:00
lvjq
2237a8cf0f
kafka 8 simple consumer firehose
2015-08-27 20:50:46 -05:00
Nishant
b306739e9c
fix convert segment task
...
1) fix serde
2) fix wrong parameter being passed when creating subtask
remove sysout
2015-08-27 11:34:41 +05:30
Charles Allen
e38cf54bc8
Migrate TestDerbyConnector to a JUnit @Rule
2015-08-26 21:47:40 -07:00
Xavier Léauté
fdb6a6651b
Merge pull request #1669 from metamx/upgrade-dependencies
...
Upgrade dependencies
2015-08-25 21:30:22 -07:00
Xavier Léauté
5c19ffa98c
Merge pull request #1663 from gianm/segment-insert-constraints
...
TaskActionToolbox: Remove allowOlderVersions, lift interval constraint
2015-08-25 18:11:46 -07:00
Xavier Léauté
51f6a9a2c9
update jackson to 2.6.1
2015-08-25 16:07:01 -07:00
Gian Merlino
33681525e3
TaskActionToolbox: Remove allowOlderVersions switch, lift interval constraint.
...
allowOlderVersions has been stuck true for a while due to a bug (introduced in
566a3a61), but I think it's actually OK this way. I think it's reasonable to
expect tasks to choose versions in some way that makes sense, so long as they
don't choose one larger than their taskLock version. This is still verified.
The interval constraint was introduced to force tasks to break up their
segment insert lists into manageable chunks. They are already doing this, and
I think it's reasonable to expect them to do so without enforcement.
Lifting these constraints paves the way for transactional insertion of segments
that have varying versions and may be for varying intervals.
2015-08-25 14:17:38 -07:00
Paul Otto
2301b60365
Add ability to provide taskResource for IndexTask.
2015-08-24 17:38:31 -07:00
Himanshu Gupta
15fa43dd43
change how DatasourcePathSpec gets its segment list: the hadoop indexer uses an overlord action to get the list of segments and passes it along when running as an overlord task, and uses the metadata store directly when running as a standalone hadoop indexer
...
also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up
2015-08-16 14:07:35 -05:00
Himanshu Gupta
4d4aa8bfc6
refactor IngestSegmentFirehoseFactory so that IngestSegmentFirehose becomes reusable
...
Conflicts:
indexing-service/src/main/java/io/druid/indexing/firehose/IngestSegmentFirehoseFactory.java
2015-08-14 14:44:22 -05:00
Gian Merlino
bc0c7dd65d
Avoid the Hadoop objectMapper in the local IndexTask. Fixes #1545.
2015-08-11 10:40:53 -07:00
Charles Allen
1ddaa3fb33
Merge pull request #1592 from metamx/clean-test-files
...
clean temporary files
2015-08-03 11:47:20 -07:00
Nishant
2679efee7a
clean temporary files
2015-08-03 23:32:58 +05:30
Fangjin Yang
6f65e6d3ef
Merge pull request #1547 from pjain1/improve_overlord_test
...
add test to OverlordResourceTest
2015-07-28 07:35:48 -10:00
Parag Jain
2e1b617346
add more tests
2015-07-24 15:12:08 -05:00
Fangjin Yang
97242356b4
Merge pull request #1480 from guobingkun/kill_task_test
...
Unit tests for KillTask and MetadataTaskStorage
2015-07-20 16:31:45 -07:00
Fangjin Yang
3f7ba58227
Merge pull request #1504 from metamx/fix-1447
...
fix for #1447
2015-07-14 08:50:08 -07:00
Himanshu
e2ddfb7a1a
Merge pull request #1511 from pjain1/remove_test
...
remove flaky overlord test
2015-07-13 18:38:34 -05:00
Parag Jain
59dec89f6a
remove flaky overlord test
2015-07-13 15:32:12 -05:00
Himanshu
725086cc89
Merge pull request #1506 from gianm/realtime-plumber-nulls
...
Consider null inputRows and parse errors as unparseable during realtime ingestion.
2015-07-13 10:12:12 -05:00
Gian Merlino
9068bcd062
Consider null inputRows and parse errors as unparseable during realtime ingestion.
...
Also, harmonize exception handling between the RealtimeIndexTask and the RealtimeManager.
Conditions other than null inputRows and parse errors bubble up in both.
2015-07-11 20:40:03 -07:00
Himanshu
cac722968e
Merge pull request #1503 from metamx/fix-leaking-zk-nodes
...
Fix leaking Status Path nodes in ZK
2015-07-10 17:40:18 -05:00
Fangjin Yang
9f19e96658
Merge pull request #1477 from pjain1/overlord_test
...
overlord and task master test
2015-07-10 14:27:14 -07:00
Parag Jain
55c4fe64f3
overlord and task master test
2015-07-10 16:17:45 -05:00
Nishant
5fe27fe4ad
fix for #1447
...
fixes #1447
2015-07-09 19:05:48 +05:30
Nishant
8d7a566bae
Fix leaking Status Path nodes in ZK
...
- remove ZK status path nodes for workers after they are removed
2015-07-09 17:20:09 +05:30
Charles Allen
c0b60c0d2f
I'm not your mom, indexing-service/test... cleanup after yourself
2015-07-01 15:00:09 -07:00
Bingkun Guo
282a0f9760
Unit tests for KillTask and MetadataTaskStorage
2015-06-29 17:55:41 -05:00
Himanshu
b5b9ca1446
Merge pull request #1470 from pjain1/rtindex_test
...
Realtime Index Task test
2015-06-29 16:51:35 -05:00
Parag Jain
284b80b09e
Realtime Index Task test
2015-06-29 09:52:41 -05:00
nishant
fb4052d577
JavaScript Worker Select Strategy
...
this PR adds a JavaScriptWorkerSelectStrategy which allows defining
arbitrary logic for selecting workers to run tasks using a JavaScript
function.
This gives users full control to implement complex worker selection
strategies based on task attributes.
more tests and a complex javascript config
fix for java8; modify for nashorn compatibility
2015-06-20 02:01:34 +05:30
Charles Allen
acc0a3fbf7
Add jitter to the retries for RemoteTaskActionClient
2015-06-12 17:43:25 -07:00
nishant
e9afec4a2b
fix task status issues on zk outages
...
docs
review comments
fix test
fix compilation
fix typo
2015-06-11 00:49:52 +05:30
Xavier Léauté
78d468700b
Merge pull request #1388 from metamx/fix-1360
...
fix race described in 1360
2015-06-10 11:59:36 -07:00
Xavier Léauté
f6b336ac3e
Merge pull request #1432 from metamx/config-fix
...
fix passing of config from IndexTuningConfig to RealtimeTuningConfig
2015-06-10 11:42:58 -07:00
nishant
963682d696
Add check for valid rowFlushBoundary configuration and fix tests
2015-06-10 21:38:34 +05:30
nishant
191b302f6a
fix passing of config from IndexTuningConfig to RealtimeTuningConfig
...
- pass rowFlushBoundary correctly instead of using the default.
- fixes IndexTask failing with
io.druid.segment.incremental.IndexSizeExceededException when
rowFlushBoundary is set higher than
RealtimeTuningConfig.defaultMaxRowsInMemory
rename test method
2015-06-10 21:07:25 +05:30
nishant
af9ea08041
fix race described in 1360
...
review comments
no need to remove
fix test
review comments
2015-06-10 12:19:12 +05:30
Charles Allen
056cab93ed
Add Hadoop Converter Job and task
...
* Fixes https://github.com/druid-io/druid/issues/1363
* Add extra utils in JobHelper based on PR feedback
2015-06-09 14:47:38 -07:00
Charles Allen
ef9b67cce3
Merge pull request #1422 from metamx/fix-ec2-public-ip
...
fix public IP not working in EC2 autoscaling
2015-06-03 16:30:51 -07:00
Xavier Léauté
4ebdfea76f
fix public IP not working in EC2 autoscaling
2015-06-03 16:05:59 -07:00
Charles Allen
8289914f76
Make AbstractTask.makeId use AbstractTask.joinId
...
* Also remove TaskUtil
2015-06-03 13:24:20 -07:00
Fangjin Yang
ac9057c00e
Merge pull request #1401 from metamx/ec2-public-ip
...
flag to enable public IP in EC2 autoscaling
2015-05-28 20:21:32 -07:00
Xavier Léauté
d834a974ba
flag to enable public IP in EC2-VPC autoscaling
2015-05-28 18:14:12 -07:00
fjy
bb1145ef56
Make the index task use indexmerger and not indexmaker
2015-05-28 13:34:57 -07:00
Xavier Léauté
5ad5d7d18b
Merge pull request #1379 from flowbehappy/fix-hadoop-ha
...
bug fix: hdfs task log and indexing task do not work properly with Hadoop HA
2015-05-22 09:14:50 -04:00
flow
07659f30ab
bug fix: hdfs task log and indexing task do not work properly with Hadoop HA
2015-05-21 20:49:42 +08:00
Charles Allen
29ba05c04f
Abstractify HadoopTask
...
* Add `invokeForeignLoader` to unify how tasks are launched in a foreign class loader
* Add `buildClassLoader` to accomplish the common tasks for hadoop jobs when building a ClassLoader
2015-05-14 17:04:43 -07:00
Gian Merlino
e69d82a2b4
Realtime: Delay firehose connection until job is started.
...
Some firehoses (like the Kafka firehose) acquire input resources when they
connect, so it helps to delay this until after plumber.startJob() runs.
2015-05-04 10:54:07 -07:00
Xavier Léauté
721505c017
Merge pull request #1208 from druid-io/rework-metrics
...
Schemaless metrics + additional metrics for things we care about
2015-04-27 15:04:54 -07:00
fjy
963e5765bf
Schemaless metrics + additional metrics for things we care about
2015-04-27 13:39:40 -07:00
Charles Allen
633fdb029e
Add option to ConvertSegmentTask to skip validation
...
* Validation is enabled by default
2015-04-27 08:37:55 -07:00
Charles Allen
29341f9837
Fix random unit test failure from NoopTask ID collision
2015-04-24 13:07:48 -07:00
Xavier Léauté
f73f14ab91
Merge pull request #1297 from metamx/versionConverterTaskUpdates
...
Update VersionConverterTask for IndexSpec and allowing Forced updates
2015-04-20 16:44:35 -07:00
Charles Allen
7479ac9012
Update VersionConverterTask for IndexSpec and allowing Forced updates
2015-04-20 16:17:06 -07:00
fjy
d260515a43
update druid-api version
2015-04-17 14:58:35 -07:00
Xavier Léauté
ea5572d001
Merge pull request #1271 from metamx/strictErrorChecking
...
Add stricter checking for potential coding errors
2015-04-15 15:21:41 -07:00
Charles Allen
abdeaa0746
Add stricter checking for potential coding errors
...
Can use via `mvn clean compile test-compile -P strict`
2015-04-15 14:52:25 -07:00
Xavier Léauté
3a3046ccf3
add support for dimension compression
...
- compression for single-value dimensions using CompressedVSizeIntsIndexedSupplier
- makes dimension compression configurable via IndexSpec
- IndexSpec also enables configuring bitmap and metric compression
2015-04-14 10:44:18 -07:00
fjy
195a3b8bb8
ignore rows with invalid interval
2015-04-06 16:08:40 -07:00