druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	3534483433	Better handling of ParseExceptions. Two changes: - Allow IncrementalIndex to suppress ParseExceptions on "aggregate". - Add "reportParseExceptions" option to realtime tuning configs. By default this is "false". Behavior of the counters should now be: - processed: Number of rows indexed, including rows where some fields could be parsed and some could not. - thrownAway: Number of rows thrown away due to rejection policy. - unparseable: Number of rows thrown away due to being completely unparseable (no fields salvageable at all). If "reportParseExceptions" is true then "unparseable" will always be zero (because a parse error would cause an exception to be thrown). In addition, "processed" will only include fully parseable rows (because even partial parse failures will cause exceptions to be thrown). Fixes #2510.	2016-02-23 10:11:43 -08:00
Himanshu Gupta	09ffcae4ae	give user the option to specify the segments for dataSource inputSpec	2016-02-21 23:15:31 -06:00
Himanshu Gupta	2faae9d0d1	In JobHelper.makeSegmentOutputPath(..) use DataSegmentPusherUtils to construct the segment storage path	2016-02-09 21:42:32 -06:00
Himanshu Gupta	b3437825f0	add ignoreWhenNoSegments flag to optionally ignore the dataSource inputSpec when no segments were found	2016-01-26 17:23:55 -06:00
binlijin	cd1c71ceb4	rename persistBackgroundCount to numBackgroundPersistThreads	2016-01-22 14:29:41 +08:00
Charles Allen	2a69a58570	Merge pull request #2149 from binlijin/master Do persist IncrementalIndex in another thread in IndexGeneratorReducer	2016-01-20 17:06:42 -08:00
Fangjin Yang	996c1173c6	Merge pull request #2223 from navis/besteffort-split-locations Best effort to find locations for input splits	2016-01-20 16:53:43 -08:00
Fangjin Yang	695f107870	Merge pull request #2302 from metamx/lowerCaseGranPathTest Make GranularityPathSpecTest check with lower-case enums	2016-01-20 09:18:06 -08:00
Charles Allen	3c5ca3a5f2	Make GranularityPathSpecTest check with lower-case enums	2016-01-20 08:35:13 -08:00
binlijin	8e43e2c446	Do persist IncrementalIndex in another thread in IndexGeneratorReducer	2016-01-20 09:20:09 +08:00
jon-wei	747343e621	Preserve dimension order across indexes during ingestion	2016-01-19 13:34:11 -08:00
Jonathan Wei	df2906a91c	Merge pull request #2290 from gianm/index-merger-v9-stuff Respect buildV9Directly in PlumberSchools, so it works on standalone realtime.	2016-01-19 13:04:00 -08:00
Gian Merlino	1dcf22edb7	Respect buildV9Directly in PlumberSchools, so it works on standalone realtime nodes. Also parameterize some tests to run with/without buildV9Directly: - IndexGeneratorJobTest - RealtimeIndexTaskTest - RealtimePlumberSchoolTest	2016-01-19 12:15:06 -08:00
Himanshu Gupta	164b0aad7a	removing Map<String,Object> segmentMetadata from methods in Index[Maker/Merger] and using Metadata class instead of a Map to store segment metadata	2016-01-18 22:03:46 -06:00
navis.ryu	f03f7fb625	Best effort to find locations for input splits	2016-01-18 08:31:05 +09:00
Kurt Young	82ff98c2bf	add config for build v9 directly and update docs	2016-01-16 11:26:34 +08:00
Kurt Young	1f2168fae5	add IndexMergerV9 add unit tests for IndexMergerV9 and fix some bugs add more unit tests and fix bugs handle null values and add more tests minor changes & use LoggingProgressIndicator in IndexGeneratorReducer make some static class public from IndexMerger minor changes and add some comments changes for comments	2016-01-16 11:25:28 +08:00
navis.ryu	976ebc45c0	Simplify information in IncrementalIndex	2016-01-12 10:18:11 +09:00
dclim	2308c8c07f	continue hadoop job for sparse intervals	2016-01-07 01:35:08 -07:00
fjy	faf421726b	remove IndexMaker	2015-12-28 14:19:02 -08:00
Fangjin Yang	14229ba0f2	Merge pull request #1922 from metamx/jsonIgnoresFinalFields Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to	2015-12-18 15:38:32 -08:00
binlijin	219367221b	optimize InputRowSerde	2015-12-09 09:51:56 +08:00
Fangjin Yang	d957a6602c	Merge pull request #2049 from himanshug/hadoop_indexing_unique_path add a unique string to intermediate path for the hadoop indexing task	2015-12-07 11:46:16 -08:00
Himanshu Gupta	6cfaf59d7e	add a unique string to intermediate path for the hadoop indexing task	2015-12-06 22:20:38 -06:00
Himanshu Gupta	62ba9ade37	unifying license header in all java files	2015-12-05 22:16:23 -06:00
Himanshu Gupta	61aaa09012	support multiple intervals in dataSource input spec	2015-12-03 21:28:04 -06:00
Fangjin Yang	21c84b5ff7	Merge pull request #1896 from gianm/allocate-segment SegmentAllocateAction (fixes #1515)	2015-11-18 21:05:46 -08:00
Gian Merlino	e4e5f0375b	SegmentAllocateAction (fixes #1515 ) This is a feature meant to allow realtime tasks to work without being told upfront what shardSpec they should use (so we can potentially publish a variable number of segments per interval). The idea is that there is a "pendingSegments" table in the metadata store that tracks allocated segments. Each one has a segment id (the same segment id we know and love) and is also part of a sequence. The sequences are an idea from @cheddar that offers a way of doing replication. If there are N tasks reading exactly the same data with exactly the same logic (think Kafka tasks reading a fixed range of offsets) then you can place them in the same sequence, and they will generate the same sequence of segments.	2015-11-11 16:54:35 -08:00
Xavier Léauté	fa6142e217	cleanup and remove unused imports	2015-11-11 12:25:21 -08:00
Charles Allen	abae47850a	Add backwards compatability for PR #1922	2015-11-11 10:27:00 -08:00
Gian Merlino	dfbd0e2b60	Merge pull request #1925 from gianm/fix-index-generator Fix reference to INDEX_MAKER in IndexGeneratorJob.	2015-11-06 09:56:30 -08:00
Gian Merlino	75122dc396	Fix reference to INDEX_MAKER in IndexGeneratorJob.	2015-11-06 09:19:58 -08:00
Himanshu Gupta	6bed633121	do not use LoggingProcessIndicator in IndexGeneratorJob because that uses Stopwatch methods from guava not available in older guava versions, this makes the behavior same as LegacyIndexGeneratorJob	2015-11-06 00:40:51 -06:00
Charles Allen	929b981710	Change DefaultObjectMapper to NOT overwrite final fields unless explicitly asked to	2015-11-05 18:10:13 -08:00
Xavier Léauté	223d1ebe9f	fix a very old todo	2015-11-05 13:00:30 -08:00
fjy	8f231fd3e3	cleanup druid codebase	2015-11-04 13:59:53 -08:00
Himanshu Gupta	84f7d8d264	making static final variables in HadoopDruidIndexerConfig upper case	2015-11-02 23:24:26 -06:00
Himanshu Gupta	8b67417ac8	make methods in Index[Merger,Maker,IO] non-static so that they can have appropriate ObjectMapper injected instead of creating one statically	2015-11-02 23:24:26 -06:00
Himanshu Gupta	aeffeaf3e2	fixing hadoop test scope dependencies in indexing-hadoop	2015-10-26 17:09:39 -05:00
Nishant	3641a0e553	Fix Race in jar upload during hadoop indexing - https://github.com/druid-io/druid/issues/582 few fixes delete intermediate file early better exception handling use static pattern instead of compiling it every time Add retry for transient exceptions remove usage of deprecated method. Add test fix imports fix javadoc review comment. review comment: handle crazy snapshot naming review comments remove default retry count in favour of already present constant review comment make random intermediate and final paths. review comment, use temporaryFolder where possible	2015-10-22 21:41:07 +05:30
Xavier Léauté	e4ac78e43d	bump next snapshot to 0.9.0	2015-10-20 13:46:13 -07:00
Xavier Léauté	4c2c7a2c37	update version to 0.8.3	2015-10-14 21:40:55 -07:00
Himanshu Gupta	0368260018	For dataSource inputSpec in hadoop batch ingestion, use configured query granularity for reading existing segments instead of NONE	2015-10-12 22:19:44 -05:00
Gian Merlino	3aba401ee0	SQLMetadataConnector: Retry table creation, in case something goes wrong. Also rejigger table creation methods to not take a DBI. It's already available inside the connector, and everyone was just using that one anyway.	2015-09-24 21:39:36 -07:00
Himanshu Gupta	e8b9ee85a7	HadoopyStringInputRowParser to convert stringy Text, BytesWritable etc into InputRow	2015-09-16 10:58:13 -05:00
Himanshu Gupta	74f4572bd4	Lazily deserialize "parser" to InputRowParser in DataSchema so that user hadoop related InputRowParsers are created only when needed this allows overlord to accept a HadoopIndexTask with a hadoopy InputRowParser and not fail because hadoopy InputRowParser might need hadoop libraries	2015-09-16 10:58:13 -05:00
Himanshu Gupta	9ca6106128	user specified hadoop settings are ignored if explicitly set in code	2015-08-31 10:50:18 -05:00
Gian Merlino	940e1aa3eb	Replace funky imports with standard ones. 1) Lots of Guava imports were not coming from the actual Guava 2) junit.framework.Assert should be org.junit.Assert	2015-08-28 18:02:05 -07:00
jon-wei	e5c4927b14	Add support for parsing BytesWritable strings to Hadoop Indexer	2015-08-28 14:27:14 -07:00
Gian Merlino	414a6fb477	Fix overlapping segments in IngestSegmentFirehose, DatasourceInputFormat. Fixes #1678. IngestSegmentFirehose (and its users) need to remember which windows of which segments should actually be read, based on a timeline.	2015-08-28 07:32:41 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
Charles Allen	e38cf54bc8	Migrate TestDerbyConnector to a JUnit @Rule	2015-08-26 21:47:40 -07:00
Himanshu Gupta	b3c570e78d	update BatchDeltaIngestion.testDeltaIngestion(..) to check for proper glob path handling	2015-08-20 21:36:34 -05:00
Himanshu Gupta	85e3ce9096	split hadoop glob path before adding it to MultipleInputs This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed	2015-08-20 21:36:34 -05:00
Himanshu Gupta	a603bd9547	HadoopGlobPathSplitter implementation to split hadoop glob paths This can be safely reverted once https://issues.apache.org/jira/browse/MAPREDUCE-5061 is fixed	2015-08-20 21:36:34 -05:00
Himanshu Gupta	cf3ec8eb46	helpful cause explaining why SegmentDescriptorInfo did not exist	2015-08-19 10:29:04 -05:00
Xavier Léauté	3b2e41e42a	update for next release	2015-08-18 17:16:46 -07:00
Himanshu Gupta	a3bab5b7d9	IndexGeneratorJobTest type unit test for batch delta ingestion and reindexing	2015-08-16 14:07:35 -05:00
Himanshu Gupta	15fa43dd43	changing DatasourcePathSpec, to get segment list, so that hadoop indexer uses overlord action to get list of segments and passes when running as an overlord task. and, uses metadata store directly when running as standalone hadoop indexer also, serialized list of segments is passed to DatasourcePathSpec so that hadoop classloader issues do not creep up	2015-08-16 14:07:35 -05:00
Himanshu Gupta	45947a1021	add ability to specify Multiple PathSpecs in batch ingestion, so that we can grab data from multiple places in same ingestion Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/HadoopDruidIndexerConfig.java indexing-hadoop/src/main/java/io/druid/indexer/JobHelper.java Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java	2015-08-16 13:15:38 -05:00
Himanshu Gupta	1ae56f139b	Druid Hadoop InputFormat and pathSpec Conflicts: indexing-hadoop/src/main/java/io/druid/indexer/path/PathSpec.java indexing-service/pom.xml	2015-08-16 13:15:38 -05:00
Himanshu Gupta	f1d309a671	do not run parser if value from InputFormat is already an InputRow	2015-08-14 14:44:22 -05:00
Himanshu Gupta	0eec1bbee2	json serde tests for HadoopTuningConfig	2015-07-20 12:01:53 -05:00
Himanshu Gupta	f836c3a7ac	adding flag useCombiner to hadoop tuning config that can be used to add a hadoop combiner to hadoop batch ingestion to do merges on the mappers if possible	2015-07-20 12:01:53 -05:00
Himanshu Gupta	4ef484048a	take control of InputRow serde between Mapper/Reducer in Hadoop Indexing This allows for arbitrary InputFormat while hadoop batch ingestion that can return records of value type other than Text	2015-07-20 12:01:53 -05:00
Himanshu Gupta	f7a92db332	generic byte[] serde for InputRow	2015-07-20 12:01:53 -05:00
Xavier Léauté	4cfb00bc8a	inrement version	2015-07-15 13:09:05 -07:00
Charles Allen	b2bc46be17	Merge pull request #1484 from tubemogul/feature/1463 JobHelper.ensurePaths will set job properties from config (tuningConf…	2015-07-07 10:58:16 -07:00
Michael Schiff	6ad451a44a	JobHelper.ensurePaths will set job properties from config (tuningConfig.jobProperties) before adding input paths to the config. Adding input paths will create Path and FileSystem instances which may depend on the values in the job config. This allows all properties to be set from the spec file, avoiding having to directly edit cluster xml files. IndexGeneratorJob.run adds job properties before adding input paths (adding input paths may depend on having job properies set) JobHelperTest confirms that JobHelper.ensurePaths adds job properties javadoc for addInputPaths to explain relationship with addJobProperties	2015-07-01 12:45:32 -07:00
Davide Anastasia	4a3a7dd1ad	read hadoop-indexer configuration file from HDFS	2015-06-24 14:08:53 -07:00
Hao Xia	1931491c9f	A couple of hdfs related fixes * Class loading issue with hdfs-storage extension * Exception when using hdfs with non-fully qualified segment path	2015-06-19 17:22:20 -07:00
Xavier Léauté	0a5bb909a2	[maven-release-plugin] prepare for next development iteration	2015-06-18 17:35:19 -07:00
Xavier Léauté	59c6b2b279	[maven-release-plugin] prepare release druid-0.8.0-rc1	2015-06-18 17:35:14 -07:00
Charles Allen	94a567732a	Wipe FileContext off the face of the earth * Fixes https://github.com/druid-io/druid/issues/1433 * Works arround https://issues.apache.org/jira/browse/HADOOP-10643 * Reverts to the prior method of renaming	2015-06-16 09:48:09 -07:00
Charles Allen	6230ac90ae	Use IndexMerger for conversion	2015-06-10 11:34:58 -07:00
Charles Allen	056cab93ed	Add Hadoop Converter Job and task * Fixes https://github.com/druid-io/druid/issues/1363 * Add extra utils in JobHelper based on PR feedback	2015-06-09 14:47:38 -07:00
Charles Allen	2a76bdc60a	Abstractify hadoopy indexer configuration. * Moves many items to JobHelper * Remove dependencies of these functions on HadoopDruidIndexerConfig in favor of more general items * Changes functionalities of some of the path methods to always return a path with scheme * Adds retry to uploads * Change output loadSpec determining from using outputFS.getClass().getName() to using outputFS.getScheme()	2015-06-08 10:53:27 -07:00
fjy	be2a35188e	Additional schema validations and better logs for common extensions	2015-05-27 16:25:02 -07:00
Xavier Léauté	4466e77b25	Merge pull request #1371 from guobingkun/unit_test Unit test for IndexGeneratorJob	2015-05-22 10:34:24 -04:00
flow	07659f30ab	bug fix: hdfs task log and indexing task not work properly with Hadoop HA	2015-05-21 20:49:42 +08:00
Bingkun Guo	b46aff12ae	Unit test for IndexGeneratorJob	2015-05-18 12:31:16 -05:00
fjy	7a6acf5c1b	update pom to 0.8	2015-05-11 19:41:58 -06:00
Fangjin Yang	a2dc58cd2d	Merge pull request #1345 from pjain1/unit_test_warn_fix fix warn msg and some unit tests	2015-05-08 08:06:20 -07:00
Parag Jain	01448d264c	Fix warn msg and added some unit tests	2015-05-07 17:10:05 -05:00
fjy	b19435d172	fix typos with batch ingestion in docs	2015-05-07 14:46:17 -07:00
Bingkun Guo	1ee550dd91	Fix a potential issue in DeterminePartitionsJob by making HadoopDruidIndexerConfig non-static, and two unit tests for DeterminPartitionsJob and LocalDataSegmentKiller	2015-05-04 20:00:29 -07:00
Xavier Léauté	3a3046ccf3	add support for dimension compression - compression for single-value dimensions using CompressedVSizeIntsIndexedSupplier - makes dimension compression configurable via IndexSpec - IndexSpec also enables configuring bitmap and metric compression	2015-04-14 10:44:18 -07:00
Prajwal Tuladhar	3044bf5592	use Job.getInstance() to fix deprecated warnings	2015-04-09 13:22:21 -04:00
Xavier Léauté	8b5fa8f85d	always upload SNAPSHOT self-contained jars	2015-04-03 21:18:09 -07:00
fjy	aea7f9d192	[maven-release-plugin] prepare for next development iteration	2015-03-30 16:35:24 -07:00
fjy	060d7aef03	[maven-release-plugin] prepare release druid-0.7.1	2015-03-30 16:35:20 -07:00
Dia Kharrat	3a6dc99384	log invalid rows in mapper of Hadoop indexer	2015-03-19 22:31:04 -07:00
Dia Kharrat	58d5f5e7f0	Honor ignoreInvalidRows in Hadoop indexer The reducer of the hadoop indexer now ignores lines with parsing exceptions (if enabled by the indexer config).	2015-03-19 22:31:04 -07:00
Himanshu Gupta	8c1f0834ba	Removing MapWritableInputRowParser from indexing-hadoop it should really be an extension if user needs	2015-03-19 18:37:08 -05:00
Xavier Léauté	a98187f798	Merge pull request #1177 from himanshug/custom_input_format1 Feature: Make hadoop input format configurable for batch ingestion	2015-03-19 15:49:36 -07:00
fjy	b389cfe404	[maven-release-plugin] prepare for next development iteration	2015-03-19 12:38:17 -07:00
fjy	60e7d543cc	[maven-release-plugin] prepare release druid-0.7.1-rc1	2015-03-19 12:38:13 -07:00
Himanshu Gupta	3f7a7ba5d3	For batch hadoop indexing, make hadoop input format configuration. Given input format must extend from org.apache.hadoop.mapreduce.InputFormat	2015-03-18 16:09:45 -05:00
fjy	bfe10bd156	This fixes arbitrary gran spec breaking	2015-03-17 12:19:43 -07:00
Himanshu Gupta	6a0405de20	fail early if there is no input data for batch hadoop indexing	2015-03-07 12:45:57 -06:00
Himanshu Gupta	30f64ff19e	UTs update for indexing-hadoop	2015-02-25 15:45:57 -08:00
Xavier Léauté	0784d7e30e	Merge pull request #1152 from himanshug/metastorage-pwd-provider support for metadata store PasswordProvider interface	2015-02-25 15:19:37 -08:00
Fangjin Yang	708f35151d	Merge pull request #1121 from gianm/issue-1116 Use the proper FileSystems for writing segments and caching jars. (for issue #1116)	2015-02-25 13:03:59 -08:00
Fangjin Yang	6424815f88	Merge pull request #1097 from metamx/better-hadoop-sort-key Sort HadoopIndexer rows by time+dim bucket to help reduce spilling	2015-02-25 12:49:58 -08:00
Fangjin Yang	3d50a3771a	Merge pull request #1151 from himanshug/remove-s3-fs-dep removing dependency on NativeS3FileSystem and other file systems	2015-02-25 12:31:45 -08:00
Himanshu Gupta	126262edce	support for PasswordProvider interface to enable writing druid extension which can get metadata store password from secured location or anywhere instead of plain text properties file	2015-02-25 14:05:19 -06:00
Xavier Léauté	b167dcf82c	[maven-release-plugin] prepare for next development iteration	2015-02-23 14:28:06 -08:00
Xavier Léauté	e81ac2ba43	[maven-release-plugin] prepare release druid-0.7.0	2015-02-23 14:27:58 -08:00
Himanshu Gupta	01a4f19ea2	removing dependency on NativeS3FileSystem and other file systems	2015-02-23 14:27:50 -06:00
Xavier Léauté	78df7f6165	Move Druid release artifacts to Sonatype - Switch to using Druid parent POM - Add required fields for Sonatype - Common plugin versions and settings have been moved to the parent pom - Cleanup artifacts and POMs for consistent formatting - Remove org.hyperic.sigar dependency and update docs to reflect necessary jars to add at runtime when sigar is needed	2015-02-13 14:26:31 -08:00
Gian Merlino	fd5a7d1f08	Use the proper FileSystems for writing segments and caching jars. (for issue #1116 )	2015-02-12 16:20:10 -08:00
fjy	d29740ed9f	[maven-release-plugin] prepare for next development iteration	2015-02-12 16:16:00 -08:00
fjy	211fd15b7e	[maven-release-plugin] prepare release druid-0.7.0-rc3	2015-02-12 16:15:56 -08:00
Xavier Léauté	b1ec7afc12	Sort HadoopIndexer rows by time+dim bucket to help reduce spilling	2015-02-10 14:26:28 -08:00
fjy	1f12c5b2f1	[maven-release-plugin] prepare for next development iteration	2015-02-03 12:06:49 -08:00
fjy	e82d431be7	[maven-release-plugin] prepare release druid-0.7.0-rc2	2015-02-03 12:06:41 -08:00
Fangjin Yang	92e616de11	Merge pull request #1077 from metamx/remove-unused-imports remove unused imports	2015-02-02 10:45:27 -08:00
nishantmonu51	ba932bb1f2	remove unused imports	2015-02-02 21:53:39 +05:30
fjy	d05032b98a	towards a community led druid	2015-01-31 20:57:36 -08:00
fjy	1f94de22c6	[maven-release-plugin] prepare for next development iteration	2015-01-20 14:23:55 -08:00
fjy	17476edc31	[maven-release-plugin] prepare release druid-0.7.0-rc1	2015-01-20 14:23:51 -08:00
Xavier Léauté	cd9635ff5e	Merge pull request #1034 from druid-io/minor-rename minor rename of things in hadoop ingestion config to match 0.6.x	2015-01-15 15:46:13 -08:00
fjy	ccddbf8747	minor rename of things in hadoop ingestion config to match 0.6.x	2015-01-15 14:04:55 -08:00
Fangjin Yang	5bfcc43377	Merge pull request #1008 from metamx/stringConversionJavaUtilUpdate Update all String conversions to and from byte[] to use the java-util StringUtils functions	2015-01-15 13:50:27 -08:00
Fangjin Yang	852e863425	Merge pull request #981 from druid-io/strictModuleTyping Use Module instead of generic Object in Guice related items	2015-01-05 12:43:20 -08:00
Charles Allen	b1b5c9099e	Update all String conversions to and from byte[] to use the java-util StringUtils functions * Speedup of GroupBy with javaScript filters by ~10% * Requires https://github.com/metamx/java-util/pull/15	2015-01-05 11:22:32 -08:00
Xavier Léauté	f1375b0bfb	workaround to pass down bitmap type to map-reduce tasks	2015-01-02 17:29:00 -08:00
Charles Allen	7c8d4a7433	Use Module instead of generic Object in Guice related items	2014-12-19 10:54:06 -08:00
fjy	43d27ddaf0	update http client and fix logging	2014-12-15 16:59:57 -08:00
fjy	e872952390	fix working path default bug	2014-12-15 14:51:58 -08:00
fjy	28b72a69ad	redocumenting ingestion	2014-12-08 16:15:46 -08:00
nishantmonu51	40f223215a	fix buffer pool usage	2014-12-05 16:09:26 +05:30
nishantmonu51	6e03a6245f	Merge branch 'master' into onheap-incremental-index	2014-12-05 10:40:28 +05:30
Xavier Léauté	7cd45a6e1f	IncrementalIndex throws exception if limit exceeded - For now uses a hardcoded ratio of aggregator to timeanddim buffer sizes - canAppendRow is a workaround for realtime index since the Firehose currently does not have a way of rolling back the last event in case of error - canAppendRow needs a fudge factor; there is a race between checking if we can add a row and actually adding a row, because of the way MapDB reports its size.	2014-12-04 14:38:16 -08:00
Gian Merlino	20a7239ffd	Replace google-http-client imports with real guava imports.	2014-12-04 10:57:57 -08:00
Charles Allen	c2add5730b	Fix Hadoop CLI jobs * Change "schema" --> "spec" for cli hadoop to keep up with internal hadoop * Added check for HadoopDruidIndexerConfig deserialization from Map to see if it is trying to get a HadoopDruidIndexerConfig or a HadoopIngestionSpec	2014-12-04 10:57:56 -08:00
xvrl	c867d59ee0	fix error message	2014-12-03 15:30:32 -08:00
Xavier Léauté	2e6c254937	metadata injection not needed for indexing service	2014-12-03 15:09:31 -08:00
Gian Merlino	d388a8fe89	Replace google-http-client imports with real guava imports.	2014-12-03 10:52:57 -08:00
nishantmonu51	4dc0fdba8a	consider mapped size in limit calculation & review comments	2014-12-03 23:47:30 +05:30
nishantmonu51	da8bd7836b	Introduce buffer size	2014-12-03 16:28:22 +05:30
Charles Allen	7cd689be75	Fix Hadoop CLI jobs * Change "schema" --> "spec" for cli hadoop to keep up with internal hadoop * Added check for HadoopDruidIndexerConfig deserialization from Map to see if it is trying to get a HadoopDruidIndexerConfig or a HadoopIngestionSpec	2014-12-02 11:23:04 -08:00
nishantmonu51	eac776f1a7	tests passing with on heap incremental index	2014-12-02 22:29:28 +05:30
Xavier Léauté	59542c41f8	fix port not set in DruidNode	2014-12-01 14:37:28 -08:00
Charles Allen	8b3652a67a	Modify HadoopDruidIndexerConfig to give a port of 0 instead of -1 when binding DruidNode @Self annotation	2014-12-01 14:08:41 -08:00
fjy	fdeab0c6af	make Druid case sensitive	2014-11-19 14:27:31 -08:00
Fangjin Yang	590d31799e	Merge pull request #876 from metamx/remove-backwards-compatible Remove backwards compatible	2014-11-19 14:33:14 -07:00
Charles Allen	dc66e1708e	Added src jar build to maven poms and re-formatted to conform to style guidelines.	2014-11-18 09:05:30 -08:00
nishantmonu51	f0452c5968	merge from master	2014-11-18 19:34:51 +05:30
nishantmonu51	edf0fc0851	Make hashed partitions spec default - make hashed partitionsSpec as default partitions spec for 0.7	2014-11-17 19:48:12 +05:30
nishantmonu51	0c2d06475d	merge from master	2014-11-17 19:19:18 +05:30
Xavier Léauté	0498df25df	override metadata storage injection in CliHadoopIndexer	2014-11-07 13:44:56 -08:00
Xavier Léauté	50a191425c	fix injection on MetadataStorageUpdaterJob	2014-11-07 11:11:14 -08:00
Xavier Léauté	20a9aef96a	fix test	2014-11-06 17:27:05 -08:00
Xavier Léauté	9c06db021f	rename db->metadata postgres->postgresql	2014-10-31 10:30:27 -07:00
jisookim0513	aa754b86e8	build success!	2014-10-24 11:28:42 -07:00
fjy	bef74104d9	merge with 0.7.x and resolve any conflicts	2014-10-23 17:24:06 -07:00
fjy	3b29e77866	[maven-release-plugin] prepare for next development iteration	2014-10-22 16:25:32 -07:00
fjy	dcab2997f2	[maven-release-plugin] prepare release druid-0.6.160	2014-10-22 16:25:27 -07:00
fjy	d76d57d95d	update docs	2014-10-22 16:16:28 -07:00
jisookim0513	37979282fe	enabled ansi-quote in mysql; insert statement should now work	2014-10-21 00:09:19 -07:00
jisookim0513	7d5c5f2083	fixed createTable; fixed miscellaneous stuff; added DerbyMetadataRuleManagerProvider	2014-10-17 00:10:36 -07:00
nishantmonu51	41e88baeca	Add test for bucket selection	2014-10-15 23:09:28 +05:30
nishantmonu51	f4a97aebbc	fix rollup for hashed partitions truncate timestamp while calculating the partitionNumber	2014-10-15 22:32:56 +05:30
nishantmonu51	b5d66381f3	more cleanup	2014-10-14 18:32:40 +05:30
fjy	a4c8f04409	[maven-release-plugin] prepare for next development iteration	2014-10-13 12:50:45 -07:00
fjy	7fd1747ffa	[maven-release-plugin] prepare release druid-0.6.159	2014-10-13 12:50:41 -07:00
nishantmonu51	454acd3f5a	remove backwards compatible code 1) remove backwards compatible and deprecated code 2) make hashed partitions spec default	2014-10-13 19:30:44 +05:30
fjy	d0fbd5f729	[maven-release-plugin] prepare for next development iteration	2014-10-09 14:24:53 -07:00
fjy	dada60a301	[maven-release-plugin] prepare release druid-0.6.158	2014-10-09 14:24:48 -07:00
fjy	8a4b32c482	[maven-release-plugin] prepare for next development iteration	2014-10-07 14:30:26 -07:00
fjy	96e29c2fdd	[maven-release-plugin] prepare release druid-0.6.157	2014-10-07 14:30:20 -07:00
fjy	c7b4d5b7b4	Merge branch 'master' into druid-0.7.x Conflicts: processing/src/test/java/io/druid/segment/filter/SpatialFilterTest.java	2014-10-02 18:12:10 -07:00
nishantmonu51	ad75a21040	separate offheapIncrementalIndex implementation	2014-10-01 13:58:51 +05:30
jisookim0513	9d7b5d9b0f	fixed javadoc; fixed pom files; deleted unnecessary class	2014-09-30 13:47:35 -07:00
nishantmonu51	358ff915bb	fix merge conflicts	2014-09-30 22:19:18 +05:30
nishantmonu51	2789536bed	merge changes from druid-0.7.x	2014-09-30 22:05:49 +05:30
nishantmonu51	61c7fd2e6e	make ingestOffheap tuneable	2014-09-30 15:30:02 +05:30
nishantmonu51	adb4a65e0a	Merge branch 'offheap-incremental-index' into mapdb-branch	2014-09-29 12:38:31 +05:30
jisookim0513	74565c9371	cleaned up the code	2014-09-27 13:10:01 -07:00
jisookim0513	aa887edb73	added two seperate modules for mysql and postgres	2014-09-27 13:08:53 -07:00
flow	2dd62979bb	Fixed the issue of batch ingestion with indexing service to hdfs end up with the path of metadata in mysql missing "hdfs://host" prefix. The detail describe can be found here: https://groups.google.com/forum/#!topic/druid-development/ofvSxiPpCxI	2014-09-27 22:26:52 +08:00
jisookim0513	6a641621b2	finished merging into druid-0.7.x; derby not working (to be fixed)	2014-09-26 14:24:53 -07:00
jisookim0513	43cc6283d3	trying to revert files that have overwritten changes	2014-09-26 12:38:04 -07:00
fjy	eaf0a48b92	Merge branch 'master' into druid-0.7.x Conflicts: cassandra-storage/pom.xml common/pom.xml examples/pom.xml hdfs-storage/pom.xml histogram/pom.xml indexing-hadoop/pom.xml indexing-service/pom.xml kafka-eight/pom.xml kafka-seven/pom.xml pom.xml processing/pom.xml processing/src/main/java/io/druid/guice/PropertiesModule.java rabbitmq/pom.xml s3-extensions/pom.xml server/pom.xml services/pom.xml	2014-09-26 11:39:24 -07:00
jisookim0513	a5d02b9030	second attempt to fix merge-conflicts	2014-09-24 15:59:00 -07:00
jisookim0513	3bf39cc9f8	attempted to fix merge-conflicts	2014-09-24 15:55:42 -07:00
Xavier Léauté	7cfe86b7bc	[maven-release-plugin] prepare for next development iteration	2014-09-23 16:31:25 -07:00
Xavier Léauté	80166314d9	[maven-release-plugin] prepare release druid-0.6.156	2014-09-23 16:31:21 -07:00
fjy	4235e14fc6	[maven-release-plugin] prepare for next development iteration	2014-09-23 16:12:43 -07:00
fjy	576e9b8a84	[maven-release-plugin] prepare release druid-0.6.155	2014-09-23 16:12:39 -07:00
nishantmonu51	f51ab84386	merge changes from druid-0.7.x	2014-09-22 23:48:45 +05:30
nishantmonu51	443e5788fb	make OffheapIncrementalIndex tuneable	2014-09-22 19:26:10 +05:30
jisookim0513	273205f217	initial attempt for abstraction; druid cluster works with Derby as a default	2014-09-19 17:39:59 -07:00
nishantmonu51	8eb6466487	revert buffer size and add back rowFlushBoundary	2014-09-19 23:06:04 +05:30
fjy	a17794a516	[maven-release-plugin] prepare for next development iteration	2014-09-16 15:22:55 -07:00
fjy	f9c91ace3f	[maven-release-plugin] prepare release druid-0.6.154	2014-09-16 15:22:49 -07:00
Xavier Léauté	d501b052ea	remove unused columnConfig	2014-09-15 13:02:47 -07:00
Xavier Léauté	e57e2d97ba	make constants final	2014-09-15 12:53:40 -07:00
fjy	469ccbbe5e	Merge branch 'master' into druid-0.7.x Conflicts: cassandra-storage/pom.xml common/pom.xml examples/pom.xml hdfs-storage/pom.xml histogram/pom.xml indexing-hadoop/pom.xml indexing-service/pom.xml kafka-eight/pom.xml kafka-seven/pom.xml pom.xml processing/pom.xml processing/src/main/java/io/druid/query/FinalizeResultsQueryRunner.java processing/src/main/java/io/druid/query/UnionQueryRunner.java processing/src/main/java/io/druid/query/groupby/GroupByQueryRunnerFactory.java processing/src/main/java/io/druid/query/topn/TopNQueryEngine.java processing/src/main/java/io/druid/query/topn/TopNQueryRunnerFactory.java rabbitmq/pom.xml s3-extensions/pom.xml server/pom.xml server/src/test/java/io/druid/server/initialization/JettyTest.java services/pom.xml	2014-09-11 16:20:50 -07:00
Xavier Léauté	32e256a842	[maven-release-plugin] prepare for next development iteration	2014-09-10 17:24:15 -07:00
Xavier Léauté	f7dc13d240	[maven-release-plugin] prepare release druid-0.6.153	2014-09-10 17:24:11 -07:00
fjy	fec7b43fcb	make making v9 segments something completely configurable	2014-09-10 15:28:30 -07:00
fjy	351afb8be7	allow legacy index generator	2014-09-09 17:04:35 -07:00
Xavier Léauté	22553b0b28	[maven-release-plugin] prepare for next development iteration	2014-09-05 11:07:22 -07:00
Xavier Léauté	570a7a07bf	[maven-release-plugin] prepare release druid-0.6.152	2014-09-05 11:07:14 -07:00
fjy	e24be03a38	[maven-release-plugin] prepare for next development iteration	2014-09-04 14:10:13 -07:00
fjy	a933ea1866	[maven-release-plugin] prepare release druid-0.6.151	2014-09-04 14:10:09 -07:00
fjy	f473e62f1f	Merge pull request #714 from metamx/cleanup-javadocs Cleanup javadocs	2014-09-04 12:28:10 -06:00
Gian Merlino	96d7b18163	[maven-release-plugin] prepare for next development iteration	2014-08-29 15:29:58 -07:00
Gian Merlino	860cf36ab1	[maven-release-plugin] prepare release druid-0.6.150	2014-08-29 15:29:54 -07:00
Xavier Léauté	58ab759fc6	remove unused imports	2014-08-29 14:03:47 -07:00
Xavier Léauté	ac05836833	make Java 8 javadoc happy	2014-08-29 13:58:50 -07:00
Xavier Léauté	daca178631	[maven-release-plugin] prepare for next development iteration	2014-08-29 10:43:19 -07:00
Xavier Léauté	0793a5aee6	[maven-release-plugin] prepare release druid-0.6.149	2014-08-29 10:43:15 -07:00
Xavier Léauté	1d6842c7f0	[maven-release-plugin] prepare for next development iteration	2014-08-27 12:22:53 -07:00
Xavier Léauté	575951f99a	[maven-release-plugin] prepare release druid-0.6.148	2014-08-27 12:22:49 -07:00
fjy	00f7077f50	[maven-release-plugin] prepare for next development iteration	2014-08-22 08:28:56 -07:00
fjy	df488563e4	[maven-release-plugin] prepare release druid-0.6.147	2014-08-22 08:28:51 -07:00
fjy	12f4147df5	switch index gen job to use logging indicator	2014-08-21 13:28:15 -07:00
fjy	df48b48e35	[maven-release-plugin] prepare for next development iteration	2014-08-21 12:50:33 -07:00
fjy	4f0b994037	[maven-release-plugin] prepare release druid-0.6.146	2014-08-21 12:50:30 -07:00
fjy	d64879ccca	more cleanup	2014-08-20 13:22:42 -07:00
fjy	bb73b2556e	fix compilation	2014-08-20 13:17:00 -07:00
fjy	92f26d9a1f	cleanup rowflushboundary	2014-08-20 13:09:37 -07:00
nishantmonu51	79ff993b31	increase default buffer size to 512m	2014-08-20 22:15:06 +05:30
nishantmonu51	33354cf7fe	replace maxRowsInMemory with BufferSize	2014-08-20 20:59:44 +05:30
fjy	88a904e0b3	address cr about progress ind	2014-08-19 12:59:01 -07:00
fjy	51988be776	[maven-release-plugin] prepare for next development iteration	2014-08-14 14:59:00 -07:00
fjy	2830028092	[maven-release-plugin] prepare release druid-0.6.145	2014-08-14 14:58:57 -07:00
fjy	4e3f4fbc22	[maven-release-plugin] prepare for next development iteration	2014-08-12 13:24:55 -07:00
fjy	f3aafc246c	[maven-release-plugin] prepare release druid-0.6.144	2014-08-12 13:24:50 -07:00
nishantmonu51	c6712739dc	merge changes from druid-0.7.x	2014-08-12 15:47:42 +05:30
nishantmonu51	9598a524a8	review comment - move index closure to finally	2014-08-12 14:58:55 +05:30
fjy	0cff63c7c6	[maven-release-plugin] prepare for next development iteration	2014-08-09 19:36:11 -07:00
fjy	724562b181	[maven-release-plugin] prepare release druid-0.6.143	2014-08-09 19:36:05 -07:00
fjy	d58ae945a4	[maven-release-plugin] prepare for next development iteration	2014-08-08 16:22:38 -07:00
fjy	d240479133	[maven-release-plugin] prepare release druid-0.6.142	2014-08-08 16:22:34 -07:00
fjy	199f2b1683	[maven-release-plugin] prepare for next development iteration	2014-08-08 15:53:53 -07:00
fjy	29a077d893	[maven-release-plugin] prepare release druid-0.6.141	2014-08-08 15:53:49 -07:00
fjy	958792db5c	[maven-release-plugin] prepare for next development iteration	2014-08-08 15:16:38 -07:00
fjy	4482e1b7d8	[maven-release-plugin] prepare release druid-0.6.140	2014-08-08 15:16:34 -07:00
fjy	ddcea18631	fix dependencies	2014-08-08 14:02:14 -07:00
fjy	74646f18f4	[maven-release-plugin] prepare for next development iteration	2014-08-08 11:40:52 -07:00
fjy	78c7140b52	[maven-release-plugin] prepare release druid-0.6.139	2014-08-08 11:40:46 -07:00
fjy	9f4dd7b33e	[maven-release-plugin] prepare for next development iteration	2014-08-06 13:34:04 -07:00
fjy	aa5cb91618	[maven-release-plugin] prepare release druid-0.6.138	2014-08-06 13:33:59 -07:00
fjy	65b8278add	[maven-release-plugin] prepare for next development iteration	2014-08-01 10:14:34 -07:00
fjy	0ecbef6c5d	[maven-release-plugin] prepare release druid-0.6.137	2014-08-01 10:14:30 -07:00

... 3 4 5 6 7 ...

963 Commits