druid

Commit Graph

Author	SHA1	Message	Date
binlijin	cd1c71ceb4	rename persistBackgroundCount to numBackgroundPersistThreads	2016-01-22 14:29:41 +08:00
Charles Allen	2a69a58570	Merge pull request #2149 from binlijin/master Do persist IncrementalIndex in another thread in IndexGeneratorReducer	2016-01-20 17:06:42 -08:00
Fangjin Yang	996c1173c6	Merge pull request #2223 from navis/besteffort-split-locations Best effort to find locations for input splits	2016-01-20 16:53:43 -08:00
Fangjin Yang	695f107870	Merge pull request #2302 from metamx/lowerCaseGranPathTest Make GranularityPathSpecTest check with lower-case enums	2016-01-20 09:18:06 -08:00
Charles Allen	3c5ca3a5f2	Make GranularityPathSpecTest check with lower-case enums	2016-01-20 08:35:13 -08:00
binlijin	8e43e2c446	Do persist IncrementalIndex in another thread in IndexGeneratorReducer	2016-01-20 09:20:09 +08:00
jon-wei	747343e621	Preserve dimension order across indexes during ingestion	2016-01-19 13:34:11 -08:00
Gian Merlino	1dcf22edb7	Respect buildV9Directly in PlumberSchools, so it works on standalone realtime nodes. Also parameterize some tests to run with/without buildV9Directly: - IndexGeneratorJobTest - RealtimeIndexTaskTest - RealtimePlumberSchoolTest	2016-01-19 12:15:06 -08:00
navis.ryu	f03f7fb625	Best effort to find locations for input splits	2016-01-18 08:31:05 +09:00
Kurt Young	82ff98c2bf	add config for build v9 directly and update docs	2016-01-16 11:26:34 +08:00
dclim	2308c8c07f	continue hadoop job for sparse intervals	2016-01-07 01:35:08 -07:00
Fangjin Yang	14229ba0f2	Merge pull request #1922 from metamx/jsonIgnoresFinalFields Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to	2015-12-18 15:38:32 -08:00
Fangjin Yang	d957a6602c	Merge pull request #2049 from himanshug/hadoop_indexing_unique_path add a unique string to intermediate path for the hadoop indexing task	2015-12-07 11:46:16 -08:00
Himanshu Gupta	6cfaf59d7e	add a unique string to intermediate path for the hadoop indexing task	2015-12-06 22:20:38 -06:00
Himanshu Gupta	62ba9ade37	unifying license header in all java files	2015-12-05 22:16:23 -06:00
Himanshu Gupta	61aaa09012	support multiple intervals in dataSource input spec	2015-12-03 21:28:04 -06:00
Xavier Léauté	fa6142e217	cleanup and remove unused imports	2015-11-11 12:25:21 -08:00
Charles Allen	abae47850a	Add backwards compatability for PR #1922	2015-11-11 10:27:00 -08:00
fjy	8f231fd3e3	cleanup druid codebase	2015-11-04 13:59:53 -08:00
Himanshu Gupta	84f7d8d264	making static final variables in HadoopDruidIndexerConfig upper case	2015-11-02 23:24:26 -06:00
Himanshu Gupta	8b67417ac8	make methods in Index[Merger,Maker,IO] non-static so that they can have appropriate ObjectMapper injected instead of creating one statically	2015-11-02 23:24:26 -06:00
Nishant	3641a0e553	Fix Race in jar upload during hadoop indexing - https://github.com/druid-io/druid/issues/582 few fixes delete intermediate file early better exception handling use static pattern instead of compiling it every time Add retry for transient exceptions remove usage of deprecated method. Add test fix imports fix javadoc review comment. review comment: handle crazy snapshot naming review comments remove default retry count in favour of already present constant review comment make random intermediate and final paths. review comment, use temporaryFolder where possible	2015-10-22 21:41:07 +05:30
Gian Merlino	3aba401ee0	SQLMetadataConnector: Retry table creation, in case something goes wrong. Also rejigger table creation methods to not take a DBI. It's already available inside the connector, and everyone was just using that one anyway.	2015-09-24 21:39:36 -07:00
Himanshu Gupta	e8b9ee85a7	HadoopyStringInputRowParser to convert stringy Text, BytesWritable etc into InputRow	2015-09-16 10:58:13 -05:00
Himanshu Gupta	74f4572bd4	Lazily deserialize "parser" to InputRowParser in DataSchema so that user hadoop related InputRowParsers are created only when needed this allows overlord to accept a HadoopIndexTask with a hadoopy InputRowParser and not fail because hadoopy InputRowParser might need hadoop libraries	2015-09-16 10:58:13 -05:00
Himanshu Gupta	9ca6106128	user specified hadoop settings are ignored if explicitly set in code	2015-08-31 10:50:18 -05:00
Gian Merlino	940e1aa3eb	Replace funky imports with standard ones. 1) Lots of Guava imports were not coming from the actual Guava 2) junit.framework.Assert should be org.junit.Assert	2015-08-28 18:02:05 -07:00
jon-wei	e5c4927b14	Add support for parsing BytesWritable strings to Hadoop Indexer	2015-08-28 14:27:14 -07:00
Gian Merlino	414a6fb477	Fix overlapping segments in IngestSegmentFirehose, DatasourceInputFormat. Fixes #1678. IngestSegmentFirehose (and its users) need to remember which windows of which segments should actually be read, based on a timeline.	2015-08-28 07:32:41 -07:00
Charles Allen	e38cf54bc8	Migrate TestDerbyConnector to a JUnit @Rule	2015-08-26 21:47:40 -07:00
Himanshu Gupta	b3c570e78d	update BatchDeltaIngestion.testDeltaIngestion(..) to check for proper glob path handling	2015-08-20 21:36:34 -05:00
Himanshu Gupta	a603bd9547	HadoopGlobPathSplitter implementation to split hadoop glob paths This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed	2015-08-20 21:36:34 -05:00
Himanshu Gupta	a3bab5b7d9	IndexGeneratorJobTest type unit test for batch delta ingestion and reindexing	2015-08-16 14:07:35 -05:00
Himanshu Gupta	15fa43dd43	changing DatasourcePathSpec, to get segment list, so that hadoop indexer uses overlord action to get list of segments and passes when running as an overlord task. and, uses metadata store directly when running as standalone hadoop indexer also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up	2015-08-16 14:07:35 -05:00
Himanshu Gupta	45947a1021	add ability to specify Multiple PathSpecs in batch ingestion, so that we can grab data from multiple places in same ingestion Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfig.java indexing-hadoop/src/main/java/io/druid/indexer/JobHelper.java Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java	2015-08-16 13:15:38 -05:00
Himanshu Gupta	1ae56f139b	Druid Hadoop InputFormat and pathSpec Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java indexing-service/pom.xml	2015-08-16 13:15:38 -05:00
Himanshu Gupta	0eec1bbee2	json serde tests for HadoopTuningConfig	2015-07-20 12:01:53 -05:00
Himanshu Gupta	f836c3a7ac	adding flag useCombiner to hadoop tuning config that can be used to add a hadoop combiner to hadoop batch ingestion to do merges on the mappers if possible	2015-07-20 12:01:53 -05:00
Himanshu Gupta	f7a92db332	generic byte[] serde for InputRow	2015-07-20 12:01:53 -05:00
Michael Schiff	6ad451a44a	JobHelper.ensurePaths will set job properties from config (tuningConfig.jobProperties) before adding input paths to the config. Adding input paths will create Path and FileSystem instances which may depend on the values in the job config. This allows all properties to be set from the spec file, avoiding having to directly edit cluster xml files. IndexGeneratorJob.run adds job properties before adding input paths (adding input paths may depend on having job properies set) JobHelperTest confirms that JobHelper.ensurePaths adds job properties javadoc for addInputPaths to explain relationship with addJobProperties	2015-07-01 12:45:32 -07:00
Charles Allen	94a567732a	Wipe FileContext off the face of the earth * Fixes https://github.com/druid-io/druid/issues/1433 * Works arround https://issues.apache.org/jira/browse/HADOOP-10643 * Reverts to the prior method of renaming	2015-06-16 09:48:09 -07:00
Charles Allen	056cab93ed	Add Hadoop Converter Job and task * Fixes https://github.com/druid-io/druid/issues/1363 * Add extra utils in JobHelper based on PR feedback	2015-06-09 14:47:38 -07:00
Charles Allen	2a76bdc60a	Abstractify hadoopy indexer configuration. * Moves many items to JobHelper * Remove dependencies of these functions on HadoopDruidIndexerConfig in favor of more general items * Changes functionalities of some of the path methods to always return a path with scheme * Adds retry to uploads * Change output loadSpec determining from using outputFS.getClass().getName() to using outputFS.getScheme()	2015-06-08 10:53:27 -07:00
fjy	be2a35188e	Additional schema validations and better logs for common extensions	2015-05-27 16:25:02 -07:00
Bingkun Guo	b46aff12ae	Unit test for IndexGeneratorJob	2015-05-18 12:31:16 -05:00
Fangjin Yang	a2dc58cd2d	Merge pull request #1345 from pjain1/unit_test_warn_fix fix warn msg and some unit tests	2015-05-08 08:06:20 -07:00
Parag Jain	01448d264c	Fix warn msg and added some unit tests	2015-05-07 17:10:05 -05:00
Bingkun Guo	1ee550dd91	Fix a potential issue in DeterminePartitionsJob by making HadoopDruidIndexerConfig non-static, and two unit tests for DeterminPartitionsJob and LocalDataSegmentKiller	2015-05-04 20:00:29 -07:00
Xavier Léauté	3a3046ccf3	add support for dimension compression - compression for single-value dimensions using CompressedVSizeIntsIndexedSupplier - makes dimension compression configurable via IndexSpec - IndexSpec also enables configuring bitmap and metric compression	2015-04-14 10:44:18 -07:00
Prajwal Tuladhar	3044bf5592	use Job.getInstance() to fix deprecated warnings	2015-04-09 13:22:21 -04:00
Himanshu Gupta	8c1f0834ba	Removing MapWritableInputRowParser from indexing-hadoop it should really be an extension if user needs	2015-03-19 18:37:08 -05:00
Himanshu Gupta	3f7a7ba5d3	For batch hadoop indexing, make hadoop input format configuration. Given input format must extend from org.apache.hadoop.mapreduce.InputFormat	2015-03-18 16:09:45 -05:00
Himanshu Gupta	30f64ff19e	UTs update for indexing-hadoop	2015-02-25 15:45:57 -08:00
Himanshu Gupta	126262edce	support for PasswordProvider interface to enable writing druid extension which can get metadata store password from secured location or anywhere instead of plain text properties file	2015-02-25 14:05:19 -06:00
Fangjin Yang	92e616de11	Merge pull request #1077 from metamx/remove-unused-imports remove unused imports	2015-02-02 10:45:27 -08:00
nishantmonu51	ba932bb1f2	remove unused imports	2015-02-02 21:53:39 +05:30
fjy	d05032b98a	towards a community led druid	2015-01-31 20:57:36 -08:00
Xavier Léauté	cd9635ff5e	Merge pull request #1034 from druid-io/minor-rename minor rename of things in hadoop ingestion config to match 0.6.x	2015-01-15 15:46:13 -08:00
fjy	ccddbf8747	minor rename of things in hadoop ingestion config to match 0.6.x	2015-01-15 14:04:55 -08:00
Charles Allen	b1b5c9099e	Update all String conversions to and from byte[] to use the java-util StringUtils functions * Speedup of GroupBy with javaScript filters by ~10% * Requires https://github.com/metamx/java-util/pull/15	2015-01-05 11:22:32 -08:00
Xavier Léauté	7cd45a6e1f	IncrementalIndex throws exception if limit exceeded - For now uses a hardcoded ratio of aggregator to timeanddim buffer sizes - canAppendRow is a workaround for realtime index since the Firehose currently does not have a way of rolling back the last event in case of error - canAppendRow needs a fudge factor; there is a race between checking if we can add a row and actually adding a row, because of the way MapDB reports its size.	2014-12-04 14:38:16 -08:00
Gian Merlino	20a7239ffd	Replace google-http-client imports with real guava imports.	2014-12-04 10:57:57 -08:00
nishantmonu51	da8bd7836b	Introduce buffer size	2014-12-03 16:28:22 +05:30
nishantmonu51	f0452c5968	merge from master	2014-11-18 19:34:51 +05:30
nishantmonu51	edf0fc0851	Make hashed partitions spec default - make hashed partitionsSpec as default partitions spec for 0.7	2014-11-17 19:48:12 +05:30
nishantmonu51	0c2d06475d	merge from master	2014-11-17 19:19:18 +05:30
Xavier Léauté	20a9aef96a	fix test	2014-11-06 17:27:05 -08:00
Xavier Léauté	9c06db021f	rename db->metadata postgres->postgresql	2014-10-31 10:30:27 -07:00
jisookim0513	aa754b86e8	build success!	2014-10-24 11:28:42 -07:00
fjy	bef74104d9	merge with 0.7.x and resolve any conflicts	2014-10-23 17:24:06 -07:00
jisookim0513	37979282fe	enabled ansi-quote in mysql; insert statement should now work	2014-10-21 00:09:19 -07:00
jisookim0513	7d5c5f2083	fixed createTable; fixed miscellaneous stuff; added DerbyMetadataRuleManagerProvider	2014-10-17 00:10:36 -07:00
nishantmonu51	41e88baeca	Add test for bucket selection	2014-10-15 23:09:28 +05:30
nishantmonu51	454acd3f5a	remove backwards compatible code 1) remove backwards compatible and deprecated code 2) make hashed partitions spec default	2014-10-13 19:30:44 +05:30
jisookim0513	74565c9371	cleaned up the code	2014-09-27 13:10:01 -07:00
jisookim0513	6a641621b2	finished merging into druid-0.7.x; derby not working (to be fixed)	2014-09-26 14:24:53 -07:00
jisookim0513	273205f217	initial attempt for abstraction; druid cluster works with Derby as a default	2014-09-19 17:39:59 -07:00
Xavier Léauté	58ab759fc6	remove unused imports	2014-08-29 14:03:47 -07:00
fjy	a870fe5cbe	inject column config	2014-06-19 14:47:57 -07:00
Xavier Léauté	09346b0a3c	make column cache configurable	2014-06-19 14:43:03 -07:00
fjy	1100d2f2a1	rename configs to make a bit more sense	2014-05-06 14:52:50 -07:00
fjy	76e0a48527	Merge branch 'master' into new-schema Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/DbUpdaterJob.java indexing-hadoop/src/test/java/io/druid/indexer/HadoopDruidIndexerConfigTest.java indexing-service/src/main/java/io/druid/indexing/common/task/HadoopIndexTask.java server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumber.java server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumberSchool.java	2014-04-25 14:03:28 -07:00
nishantmonu51	0d8c1ffe54	review comments and add partitioner	2014-04-23 03:30:30 +05:30
nishantmonu51	ea4a80e8d2	Add serde test for shardCount	2014-04-23 00:24:08 +05:30
fjy	c4c4d80336	make local testing pass	2014-03-03 14:52:43 -08:00
fjy	46b9ac78e7	Merge branch 'master' into new-schema Conflicts: indexing-hadoop/src/test/java/io/druid/indexer/HadoopDruidIndexerConfigTest.java pom.xml publications/whitepaper/druid.pdf publications/whitepaper/druid.tex	2014-03-03 14:48:15 -08:00
nishantmonu51	8af63005a6	refactor randomPartitionsSpec to hashedPartitionsSpec refactor to a more appropriate name	2014-02-25 03:07:31 +05:30
fjy	5d2367f0fd	unit tests pass at this point	2014-02-20 15:52:12 -08:00
fjy	20cac8c506	not compiling yet but close	2014-02-19 15:54:27 -08:00
fjy	3979eb270c	Revert "Revert "Merge branch 'determine-partitions-improvements'"" This reverts commit `189b3e2b9b`.	2014-02-14 12:58:56 -08:00
fjy	189b3e2b9b	Revert "Merge branch 'determine-partitions-improvements'" This reverts commit `7ad228ceb5`, reversing changes made to `9c55e2b779`.	2014-02-14 12:47:34 -08:00
nishantmonu51	48d0c37f98	documentation for random partition spec	2014-02-05 15:30:44 +05:30
nishantmonu51	bacc72415f	correct locking and partitionsSpec	2014-02-05 03:17:47 +05:30
fjy	a1c09df17f	make the hadoop index task work again	2013-10-16 09:45:17 -07:00
cheddar	5ab671050e	No more com.metamx.druid, it is now all io.druid!	2013-08-30 19:42:12 -05:00
cheddar	bd0756e360	More stuff moved, things still compiling and tests still passing. Yay!	2013-08-30 18:58:35 -05:00
cheddar	eee1efdcb5	Merge branch 'master' into guice Conflicts: client/src/main/java/com/metamx/druid/client/DruidServerConfig.java indexing-service/src/main/java/com/metamx/druid/indexing/common/index/ChatHandlerProvider.java indexing-service/src/main/java/com/metamx/druid/indexing/coordinator/TaskMasterLifecycle.java indexing-service/src/main/java/com/metamx/druid/indexing/worker/executor/ExecutorNode.java indexing-service/src/test/java/com/metamx/druid/indexing/coordinator/TaskLifecycleTest.java	2013-08-06 13:33:31 -07:00
cheddar	3c808b15c3	1) Fix HadoopDruidIndexerConfigTest to actually verify the current correct behavior.	2013-08-05 11:37:20 -07:00
cheddar	2361e0112a	Make it all compile again...	2013-08-02 10:14:46 -07:00
cheddar	9e78bb38f5	Merge branch 'master' into guice Conflicts: client/src/main/java/com/metamx/druid/QueryableNode.java client/src/main/java/com/metamx/druid/client/ServerInventoryView.java client/src/main/java/com/metamx/druid/coordination/SingleDataSegmentAnnouncer.java client/src/main/java/com/metamx/druid/initialization/CuratorDiscoveryConfig.java client/src/main/java/com/metamx/druid/query/MetricsEmittingExecutorService.java indexing-hadoop/src/test/java/com/metamx/druid/indexer/HadoopDruidIndexerConfigTest.java indexing-service/src/main/java/com/metamx/druid/indexing/common/TaskToolbox.java indexing-service/src/main/java/com/metamx/druid/indexing/coordinator/http/IndexerCoordinatorNode.java indexing-service/src/main/java/com/metamx/druid/indexing/worker/executor/ExecutorNode.java indexing-service/src/main/java/com/metamx/druid/indexing/worker/http/WorkerNode.java pom.xml server/src/main/java/com/metamx/druid/coordination/ServerManager.java server/src/main/java/com/metamx/druid/coordination/ZkCoordinator.java server/src/main/java/com/metamx/druid/db/DatabaseRuleManager.java server/src/main/java/com/metamx/druid/db/DatabaseSegmentManager.java server/src/main/java/com/metamx/druid/http/ComputeNode.java server/src/main/java/com/metamx/druid/http/MasterMain.java server/src/main/java/com/metamx/druid/loading/SegmentLoaderConfig.java server/src/main/java/com/metamx/druid/loading/SingleSegmentLoader.java server/src/main/java/com/metamx/druid/master/DruidMaster.java	2013-08-01 16:42:47 -07:00
Jan Rudert	ad087a7a22	correct segment path for hadoop indexer	2013-07-10 09:21:45 +02:00
cheddar	f68df7ab69	1) Make tests work and continue trying to make the DruidMaster start up with just Guice	2013-06-07 12:01:46 -07:00
fjy	26e0eb62cb	merge and other refactorings	2013-05-15 17:28:08 -07:00

... 2 3 4 5 6

253 Commits