druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	501dcb43fa	Some changes that make it possible to restart tasks on the same hardware. This is done by killing and respawning the jvms rather than reconnecting to existing jvms, for a couple reasons. One is that it lets you restore tasks after server reboots too, and another is that it lets you upgrade all the software on a box at once by just restarting everything. The main changes are, 1) Add "canRestore" and "stopGracefully" methods to Tasks that say if a task can stop gracefully, and actually do a graceful stop. RealtimeIndexTask is the only one that currently implements this. 2) Add "stop" method to TaskRunners that attempts to do an orderly shutdown. ThreadPoolTaskRunner- call stopGracefully on restorable tasks, wait for exit ForkingTaskRunner- close output stream to restorable tasks, wait for exit RemoteTaskRunner- do nothing special, we actually don't want to shutdown 3) Add "restore" method to TaskRunners that attempts to bootstrap tasks from last run. Only ForkingTaskRunner does anything here. It maintains a "restore.json" file with a list of restorable tasks. 4) Have the CliPeon's ExecutorLifecycle lock the task base directory to avoid a restored task and a zombie old task from stomping on each other.	2015-11-23 11:22:08 -08:00
Gian Merlino	666d785787	Switch TaskActions from Optionals to nullable. Deserialization of Optionals does not work quite right- they come back as actual nulls, rather than absent Optionals. So these probably only ever worked for the local task action client.	2015-11-20 09:14:07 -08:00
Fangjin Yang	21c84b5ff7	Merge pull request #1896 from gianm/allocate-segment SegmentAllocateAction (fixes #1515)	2015-11-18 21:05:46 -08:00
Fangjin Yang	e52c156066	Merge pull request #1880 from gianm/rtr-adjust RTR: Ensure that there is only one cleanup task scheduled for a worker at once.	2015-11-18 15:12:55 -08:00
Charles Allen	8fcf2403e3	Merge pull request #1943 from metamx/realtime-caching Enable caching on intermediate realtime persists	2015-11-17 15:06:43 -08:00
Charles Allen	dbe201aeed	Merge pull request #1929 from pjain1/jetty_threads separate ingestion and query thread pool	2015-11-17 12:14:25 -08:00
Parag Jain	6c498b7d4a	separate ingestion and query thread pool	2015-11-17 13:42:41 -06:00
Xavier Léauté	d7eb2f717e	enable query caching on intermediate realtime persists	2015-11-17 10:58:00 -08:00
Charles Allen	46527a9610	Merge pull request #1954 from metamx/fix-stupid-aws-limit EC2 autoscaler: avoid hitting aws filter limits	2015-11-13 10:52:35 -08:00
Fangjin Yang	4f46d457f1	Merge pull request #1947 from noddi/feature/count-parameter-history-endpoints Add count parameter to history endpoints	2015-11-12 10:23:44 -08:00
Xavier Léauté	749ac12f88	EC2 autoscaler: avoid hitting aws filter limits	2015-11-11 20:28:06 -08:00
Fangjin Yang	465cbcf9a7	Merge pull request #1956 from metamx/remove-unused-imports Cleanup + remove unused imports	2015-11-11 17:36:47 -08:00
Gian Merlino	e4e5f0375b	SegmentAllocateAction (fixes #1515 ) This is a feature meant to allow realtime tasks to work without being told upfront what shardSpec they should use (so we can potentially publish a variable number of segments per interval). The idea is that there is a "pendingSegments" table in the metadata store that tracks allocated segments. Each one has a segment id (the same segment id we know and love) and is also part of a sequence. The sequences are an idea from @cheddar that offers a way of doing replication. If there are N tasks reading exactly the same data with exactly the same logic (think Kafka tasks reading a fixed range of offsets) then you can place them in the same sequence, and they will generate the same sequence of segments.	2015-11-11 16:54:35 -08:00
Bartosz Ługowski	6e5d2c6745	Add count parameter to history endpoints.	2015-11-11 23:03:57 +01:00
Xavier Léauté	fa6142e217	cleanup and remove unused imports	2015-11-11 12:25:21 -08:00
zhxiaog	c197a4cf32	fix #1918 , add unit tests for RemoteTaskActionClient	2015-11-12 03:15:17 +08:00
Charles Allen	abae47850a	Add backwards compatability for PR #1922	2015-11-11 10:27:00 -08:00
Charles Allen	1df4baf489	Move Jackson Guice adapters into io.druid * Removes access to protected methods in com.fasterxml * Eliminates druid-common's use of foreign package com.fasterxml	2015-11-09 10:50:45 -08:00
Gian Merlino	fc55314d1c	ForkingTaskRunner: Log without buffering. In #933 the ForkingTaskRunner's logging was changed to buffered from unbuffered. This means that the last few KB of the logs are generally not visible while a task is running, which makes debugging running tasks difficult.	2015-11-07 15:16:53 -08:00
Charles Allen	929b981710	Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to	2015-11-05 18:10:13 -08:00
Gian Merlino	cb409ee928	RemoteTaskActionClient: Fix statusCode check.	2015-11-05 10:03:49 -08:00
fjy	8f231fd3e3	cleanup druid codebase	2015-11-04 13:59:53 -08:00
Himanshu Gupta	84f7d8d264	making static final variables in HadoopDruidIndexerConfig upper case	2015-11-02 23:24:26 -06:00
Himanshu Gupta	8b67417ac8	make methods in Index[Merger,Maker,IO] non-static so that they can have appropriate ObjectMapper injected instead of creating one statically	2015-11-02 23:24:26 -06:00
Gian Merlino	16ae8866b8	Log and continue on failure to schedule cleanup for missing workers at startup.	2015-10-28 08:10:54 -07:00
Gian Merlino	513bc76252	RTR: Ensure that there is only one cleanup task scheduled for a worker at once. This is accomplished by making sure that scheduleTasksCleanupForWorker is only called from the PathChildrenCache event thread, having it cancel existing cleanup tasks when it adds a new one, and having tasks check on finish that the thing they are removing from the task list is actually themselves.	2015-10-27 21:16:58 -07:00
Fangjin Yang	ea2267e08c	Merge pull request #1868 from gianm/fix-announcements Historical and MiddleManager server announcements should not remove parents.	2015-10-27 14:50:05 -07:00
Gian Merlino	7df7370935	Merge pull request #1862 from metamx/indexingServiceMMGone Add timeout to shutdown request to middle manager for indexing service	2015-10-27 14:38:01 -07:00
Charles Allen	44a2b204df	Add timeout to shutdown request to middle manager for indexing service	2015-10-27 13:56:03 -07:00
Gian Merlino	4b92752deb	Historical and MiddleManager server announcements should not remove parents. Removing parent paths causes watchers of the "announcements" path to get stuck and stop seeing new updates.	2015-10-27 08:06:11 -07:00
Bingkun Guo	4914925d65	New extension loading mechanism 1) Remove maven client from downloading extensions at runtime. 2) Provide a way to load Druid extensions and hadoop dependencies through file system. 3) Refactor pull-deps so that it can download extensions into extension directories. 4) Add documents on how to use this new extension loading mechanism. 5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0 are packaged within the Druid tarball.	2015-10-21 14:22:36 -05:00
Himanshu	b7c68ec449	Merge pull request #1842 from metamx/DRUID-1841 Do not pass `druid.indexer.runner.javaOpts` to Peon as a property	2015-10-21 13:15:36 -05:00
Xavier Léauté	e4ac78e43d	bump next snapshot to 0.9.0	2015-10-20 13:46:13 -07:00
Charles Allen	532e1c9fd5	Do not pass `druid.indexer.runner.javaOpts` to Peon as a property * Still places `druid.indexer.runner.javaOpts` on the command line, but the Peon no longer tries to have the property `druid.indexer.runner.javaOpts` set * Fixes https://github.com/druid-io/druid/issues/1841	2015-10-20 09:24:01 -07:00
Xavier Léauté	4c2c7a2c37	update version to 0.8.3	2015-10-14 21:40:55 -07:00
Charles Allen	bf11723a52	Update usages of io.druid.client.selector.Server to build URL or URI directly instead of using String.format	2015-10-12 12:30:56 -07:00
Charles Allen	2d847ad654	Merge pull request #1730 from metamx/union-queries-fix fix #1727 - Union bySegment queries fix	2015-09-29 12:23:25 -07:00
Nishant	573aa96bd6	fix #1727 - Union bySegment queries fix Fixes #1727. revert to doing merging for results for union queries on broker. revert unrelated changes Add test for union query runner Add test remove unused imports fix imports fix renamed file fix test update docs.	2015-09-29 23:32:36 +05:30
Charles Allen	d2e400f063	Merge pull request #1740 from metamx/validate-locks fix #1715	2015-09-29 09:38:42 -07:00
Xavier Léauté	25bbc0b923	Merge pull request #1778 from gianm/redirect-fixes Redirect fixes	2015-09-25 09:54:48 -07:00
Gian Merlino	348172203f	OverlordRedirectInfo: Fix ability to detect that there is no leader.	2015-09-25 09:30:09 -07:00
Parag Jain	b630720164	fail task if finishjob throws any exception add realtime task failure test	2015-09-25 10:55:45 -05:00
Fangjin Yang	aa9d90355e	Merge pull request #1772 from gianm/fix-overlord-startup RemoteTaskRunner: Fix for starting an overlord before any workers ever existed.	2015-09-24 21:55:03 -07:00
Gian Merlino	63bf021077	RemoteTaskRunner: Fix for starting an overlord before any workers ever existed.	2015-09-24 21:15:36 -07:00
Himanshu Gupta	6e550d5346	update doc about aggregation field in merge task and a null check	2015-09-24 22:25:07 -05:00
Nishant	b638400acb	fix #1715 fixes #1715 - TaskLockBox has a set of active tasks - lock requests throws exception for if they are from a task not in active task set. - TaskQueue is responsible for updating the active task set on tasklockbox fix #1715 fixes #1715 - TaskLockBox has a set of active tasks - lock requests throws exception for if they are from a task not in active task set. - TaskQueue is responsible for updating the active task set on tasklockbox review comment remove duplicate line use ISE instead organise imports	2015-09-24 10:06:50 +05:30
Himanshu	61b0743943	Merge pull request #1748 from metamx/forkingJavaOptionsWithQuotes Allow ForkingTaskRunner javaOpts to have quoted arguments which contain spaces	2015-09-21 21:03:00 -05:00
Charles Allen	465035e531	Allow ForkingTaskRunner javaOpts to have quoted arguments which contain spaces	2015-09-21 17:32:27 -07:00
Fangjin Yang	e48f6dd660	Merge pull request #1736 from gianm/additional-ingest-segment-timeline-test IngestSegmentFirehostFactoryTimelineTest for overshadowing of the middle of a segment.	2015-09-17 14:42:29 -07:00
Gian Merlino	64e33b2bcb	IngestSegmentFirehostFactoryTimelineTest for overshadowing of the middle of a segment.	2015-09-16 10:17:43 -07:00
Himanshu Gupta	74f4572bd4	Lazily deserialize "parser" to InputRowParser in DataSchema so that user hadoop related InputRowParsers are created only when needed this allows overlord to accept a HadoopIndexTask with a hadoopy InputRowParser and not fail because hadoopy InputRowParser might need hadoop libraries	2015-09-16 10:58:13 -05:00
Charles Allen	f5ed6e885c	Merge pull request #1702 from himanshug/double_datasource_in_storage_dir do not have dataSource twice in path to segment storage on hdfs	2015-09-15 14:00:35 -07:00
Nishant	4681ff22ed	add task duration in response for completed tasks	2015-09-10 13:51:50 +05:30
Himanshu Gupta	fe0233adf2	removing unused imports from HadoopIndexTask	2015-09-09 11:12:01 -05:00
Nishant	47aac991ec	add null check for task context. make variable final	2015-09-04 22:19:01 +05:30
Fangjin Yang	75a582974b	Merge pull request #1639 from gianm/new-plumber New plumber	2015-09-03 18:52:57 -07:00
Gian Merlino	062a47fba4	Modify Plumbers in these ways, 1) Persist using Committer instead of Runnable. (Although the metadata object is ignored in this patch) 2) Remove the getSink method. 3) Plumbers are now responsible for time-based and hydrant-full-based periodic committing. (FireChief, RealtimeIndexTask, and IndexTask used to do this)	2015-09-03 11:13:06 -07:00
Nishant	726326abc3	Add Task Context and ability to override task specific properties override javaOpts fix compilation review comments Add Test for typecast review comments - remove unused method.	2015-09-03 23:36:32 +05:30
Gian Merlino	940e1aa3eb	Replace funky imports with standard ones. 1) Lots of Guava imports were not coming from the actual Guava 2) junit.framework.Assert should be org.junit.Assert	2015-08-28 18:02:05 -07:00
Gian Merlino	414a6fb477	Fix overlapping segments in IngestSegmentFirehose, DatasourceInputFormat. Fixes #1678. IngestSegmentFirehose (and its users) need to remember which windows of which segments should actually be read, based on a timeline.	2015-08-28 07:32:41 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
Nishant	b306739e9c	fix convert segment task 1) fix serde 2) fix wrong parameter being passed when creating subtask remove sysout	2015-08-27 11:34:41 +05:30
Charles Allen	e38cf54bc8	Migrate TestDerbyConnector to a JUnit @Rule	2015-08-26 21:47:40 -07:00
Xavier Léauté	fdb6a6651b	Merge pull request #1669 from metamx/upgrade-dependencies Upgrade dependencies	2015-08-25 21:30:22 -07:00
Xavier Léauté	5c19ffa98c	Merge pull request #1663 from gianm/segment-insert-constraints TaskActionToolbox: Remove allowOlderVersions, lift interval constraint	2015-08-25 18:11:46 -07:00
Xavier Léauté	51f6a9a2c9	update jackson to 2.6.1	2015-08-25 16:07:01 -07:00
Gian Merlino	33681525e3	TaskActionToolbox: Remove allowOlderVersions switch, lift interval constraint. allowOlderVersions has been stuck true for a while due to a bug (introduced in `566a3a61`), but I think it's actually OK this way. I think it's reasonable to expect tasks to choose versions in some way that makes sense, so long as they don't choose one larger than their taskLock version. This is still verified. The interval constraint was introduced to force tasks to break up their segment insert lists into manageable chunks. They are already doing this, and I think it's reasonable to expect them to do so without enforcement. Lifting these constraints paves the way for transactional insertion of segments that have varying versions and may be for varying intervals.	2015-08-25 14:17:38 -07:00
Paul Otto	2301b60365	Add ability to provide taskResource for IndexTask.	2015-08-24 17:38:31 -07:00
Xavier Léauté	3b2e41e42a	update for next release	2015-08-18 17:16:46 -07:00
Himanshu Gupta	15fa43dd43	changing DatasourcePathSpec, to get segment list, so that hadoop indexer uses overlord action to get list of segments and passes when running as an overlord task. and, uses metadata store directly when running as standalone hadoop indexer also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up	2015-08-16 14:07:35 -05:00
Himanshu Gupta	4d4aa8bfc6	refactor IngestSegmentFirehoseFactory so that IngestSegmentFirehose becomes reusable Conflicts: indexing-service/src/main/java/io/druid/indexing/firehose/IngestSegmentFirehoseFactory.java	2015-08-14 14:44:22 -05:00
Gian Merlino	bc0c7dd65d	Avoid the Hadoop objectMapper in the local IndexTask. Fixes #1545 .	2015-08-11 10:40:53 -07:00
Charles Allen	1ddaa3fb33	Merge pull request #1592 from metamx/clean-test-files clean temporary files	2015-08-03 11:47:20 -07:00
Nishant	2679efee7a	clean temporary files	2015-08-03 23:32:58 +05:30
Fangjin Yang	6f65e6d3ef	Merge pull request #1547 from pjain1/improve_overlord_test add test to OverlordResourceTest	2015-07-28 07:35:48 -10:00
Parag Jain	2e1b617346	add more tests	2015-07-24 15:12:08 -05:00
Fangjin Yang	97242356b4	Merge pull request #1480 from guobingkun/kill_task_test Unit tests for KillTask and MetadataTaskStorage	2015-07-20 16:31:45 -07:00
Xavier Léauté	4cfb00bc8a	inrement version	2015-07-15 13:09:05 -07:00
Fangjin Yang	3f7ba58227	Merge pull request #1504 from metamx/fix-1447 fix for #1447	2015-07-14 08:50:08 -07:00
Himanshu	e2ddfb7a1a	Merge pull request #1511 from pjain1/remove_test remove flaky overlord test	2015-07-13 18:38:34 -05:00
Parag Jain	59dec89f6a	remove flaky overlord test	2015-07-13 15:32:12 -05:00
Himanshu	725086cc89	Merge pull request #1506 from gianm/realtime-plumber-nulls Consider null inputRows and parse errors as unparseable during realtime ingestion.	2015-07-13 10:12:12 -05:00
Gian Merlino	9068bcd062	Consider null inputRows and parse errors as unparseable during realtime ingestion. Also, harmonize exception handling between the RealtimeIndexTask and the RealtimeManager. Conditions other than null inputRows and parse errors bubble up in both.	2015-07-11 20:40:03 -07:00
Himanshu	cac722968e	Merge pull request #1503 from metamx/fix-leaking-zk-nodes Fix leaking Status Path nodes in ZK	2015-07-10 17:40:18 -05:00
Fangjin Yang	9f19e96658	Merge pull request #1477 from pjain1/overlord_test overlord and task master test	2015-07-10 14:27:14 -07:00
Parag Jain	55c4fe64f3	overlord and task master test	2015-07-10 16:17:45 -05:00
Nishant	5fe27fe4ad	fix for #1447 fixes #1447	2015-07-09 19:05:48 +05:30
Nishant	8d7a566bae	Fix leaking Status Path nodes in ZK - remove ZK status path nodes for workers after they are removed	2015-07-09 17:20:09 +05:30
Charles Allen	c0b60c0d2f	I'm not your mom, indexing-service/test... cleanup after yourself	2015-07-01 15:00:09 -07:00
Bingkun Guo	282a0f9760	Unit tests for KillTask and MetadataTaskStorage	2015-06-29 17:55:41 -05:00
Himanshu	b5b9ca1446	Merge pull request #1470 from pjain1/rtindex_test Realtime Index Task test	2015-06-29 16:51:35 -05:00
Parag Jain	284b80b09e	Realtime Index Task test	2015-06-29 09:52:41 -05:00
Himanshu	4a83a22f8c	Merge pull request #1445 from metamx/JSWorkerSelectStrategy JavaScript Worker Select Strategy	2015-06-22 17:19:13 -05:00
nishant	fb4052d577	JavaScript Worker Select Strategy this PR adds a JavaScriptWorkerSelectStrategy which allows defining arbitrary logic for selecting workers to run task using a JavaScript function. This gives users full control to implement complex worker selection strategies based on task attributes. more tests and a complex javascript config fix for java8 modify for nashorn compatibility	2015-06-20 02:01:34 +05:30
Xavier Léauté	0a5bb909a2	[maven-release-plugin] prepare for next development iteration	2015-06-18 17:35:19 -07:00
Xavier Léauté	59c6b2b279	[maven-release-plugin] prepare release druid-0.8.0-rc1	2015-06-18 17:35:14 -07:00
Charles Allen	acc0a3fbf7	Add jitter to the retries for RemoteTaskActionClient	2015-06-12 17:43:25 -07:00
nishant	e9afec4a2b	fix task status issues on zk outages docs review comments fix test review comments Review comments fix compilation fix typo	2015-06-11 00:49:52 +05:30
Xavier Léauté	78d468700b	Merge pull request #1388 from metamx/fix-1360 fix race described in 1360	2015-06-10 11:59:36 -07:00

1 2 3 4 5 ...

1150 Commits