druid

Commit Graph

Author	SHA1	Message	Date
Fokko Driesprong	3a58431bff	Bump jackson-jq from 0.0.7 to 0.0.10 (#8293 ) * Bump jackson-jq from 0.0.7 to 0.0.10 For the changelog: https://github.com/eiiches/jackson-jq/releases * Update dependent licenses	2019-08-20 16:09:04 -07:00
Chi Cao Minh	6fa22f6939	Enable code coverage (#8303 ) * Enable code coverage Code coverage was disabled via https://github.com/apache/incubator-druid/pull/3122 due to an issue with cobertura in Travis CI. Switch code coverage tool from cobertura to jacoco to avoid issue and re-enable coveralls for Travis CI. * Exclude non-production code * Exclude benchmark generated code * Exclude DruidTestRunnerFactory	2019-08-20 15:36:19 -07:00
Fokko Driesprong	cb1339e19a	Bump derby from 10.11.1.1 to 10.14.2.0 (#8292 ) * Bump derby from 10.11.1.1 to 10.15.1.3 * Update server/pom.xml as well * Move to derby 10.14.2.0 10.15.* is Java9+ https://db.apache.org/derby/derby_downloads.html	2019-08-20 14:03:32 -07:00
Fokko Driesprong	1a3aa1cfc0	Bump commons-io from 2.5 to 2.6 (#8006 ) * Bump commons-io from 2.5 to 2.6 * Update licenses.yaml * Address comments	2019-08-13 17:10:37 -07:00
Benedict Jin	170368999d	Bump rhino from 1.7R5 to 1.7.11 (#8008 ) * Bump rhino from 1.7R5 to 1.7.11 * Update the version of rhino in licenses.yaml	2019-08-09 13:10:54 -07:00
Benedict Jin	f7cf2f7cad	Bump httpcore from 4.4.4 to 4.4.11 (#7870 ) * Bump httpcore from 4.4.4 to 4.4.11 * Update the version of httpcore in licenses.yaml	2019-08-09 19:53:20 +03:00
Chi Cao Minh	b359c5b3d9	Fix SIGAR dependency connection timeout (#8258 ) After enabling parallel builds for "mvn install", the sigar dependency would sometimes resolve to the incorrect artifact repo for some of the maven modules. This issue seems to be fixed by moving the definition of the sigar dependency's artifact repo to the root POM. Also, depending on network speeds, "mvn -q install" may take longer than the default 10 minute timeout to print any output. Use travis_wait to extend the timeout to 15 minutes.	2019-08-08 20:13:18 -05:00
Chi Cao Minh	05b44e3467	Speedup Travis CI jobs (#8240 ) Reorganize Travis CI jobs into smaller faster (and more) jobs. Add various maven options to skip unnecessary work and refactored Travis CI job definitions to follow DRY. Detailed changes: .travis.yml - Refactor build logic to get rid of copy-and-paste logic - Skip static checks and enable parallelism for maven install - Split static analysis into different jobs to ease triage - Use "name" attribute instead of NAME environment variable - Split "indexing" and "web console" out of "other modules test" - Split 2 integration test jobs into multiple smaller jobs build.sh - Enable parallelism - Disable more static checks travis_script_integration.sh travis_script_integration_part2.sh integration-tests/README.md - Use TestNG groups instead of shell scripts and move definition of jobs into Travis CI yaml integration-tests/pom.xml - Show elapsed time of individual tests to aid in future rebalancing of Travis CI integration test jobs run time TestNGGroup.java - Use TestNG groups to make it easy to have multiple Travis CI integration test jobs. TestNG groups also make it easier to have an "other" integration test group and make it less likely a test will accidentally not be included in a CI job. IT*Test.java AbstractITBatchIndexTest.java AbstractKafkaIndexerTest.java - Add TestNG group - Fix various IntelliJ inspection warnings - Reduce scope of helper methods since the TestNG group annotation on the class makes TestNG consider all public methods as test methods pom.xml - Allow enforce plugin to be run from command-line - Bump resources plugin version so that "[debug] execute contextualize" output is correctly suppressed by "mvn -q" - Bump exec plugin version so that skip property is renamed from "skip" to "exec.skip" web-console/pom.xml - Add property to allow disabling javascript-related work. This property is overridden in Travis CI to speed up the jobs.	2019-08-07 09:52:42 -07:00
Chi Cao Minh	7783b31846	Add IPv4 druid expressions (#8197 ) * Add IPv4 druid expressions New druid expressions for filtering IPv4 addresses: - ipv4address_match: Check if IP address belongs to a subnet - ipv4address_parse: Convert string IP address to long - ipv4address_stringify: Convert long IP address to string These expressions operate on IP addresses represented as either strings or longs, so that they can be applied to dimensions with mixed representation of IP addresses. The filtering is more efficient when operating on IP addresses as longs. In other words, the intended use case is: 1) Use ipv4address_parse to convert to long at ingestion time 2) Use ipv4address_match to filter (on longs) at query time 3) Use ipv4adress_stringify to convert to (readable) string at query time * Fix licenses and null handling * Simplify IPv4 expressions * Fix tests * Fix check for valid ipv4 address string	2019-08-01 11:45:04 -07:00
Chi Cao Minh	ab71a2e1e4	Revert "Fix dependency analyze warnings (#8128 )" (#8189 ) This reverts commit `5dd0d8e873`.	2019-07-29 11:42:16 -07:00
Chi Cao Minh	5dd0d8e873	Fix dependency analyze warnings (#8128 ) * Fix dependency analyze warnings Update the maven dependency plugin to the latest version and fix all warnings for unused declared and used undeclared dependencies in the compile scope. Added new travis job to add the check to CI. Also fixed some source code files to use the correct packages for their imports. * Fix licenses and dependencies * Fix licenses and dependencies again * Fix integration test dependency * Address review comments * Fix unit test dependencies * Fix integration test dependency * Fix integration test dependency again * Fix integration test dependency third time * Fix integration test dependency fourth time * Fix compile error * Fix assert package	2019-07-26 10:49:03 -07:00
Gian Merlino	ffa25b7832	Query vectorization. (#6794 ) * Benchmarks: New SqlBenchmark, add caching & vectorization to some others. - Introduce a new SqlBenchmark geared towards benchmarking a wide variety of SQL queries. Rename the old SqlBenchmark to SqlVsNativeBenchmark. - Add (optional) caching to SegmentGenerator to enable easier benchmarking of larger segments. - Add vectorization to FilteredAggregatorBenchmark and GroupByBenchmark. * Query vectorization. This patch includes vectorized timeseries and groupBy engines, as well as some analogs of your favorite Druid classes: - VectorCursor is like Cursor. (It comes from StorageAdapter.makeVectorCursor.) - VectorColumnSelectorFactory is like ColumnSelectorFactory, and it has methods to create analogs of the column selectors you know and love. - VectorOffset and ReadableVectorOffset are like Offset and ReadableOffset. - VectorAggregator is like BufferAggregator. - VectorValueMatcher is like ValueMatcher. There are some noticeable differences between vectorized and regular execution: - Unlike regular cursors, vector cursors do not understand time granularity. They expect query engines to handle this on their own, which a new VectorCursorGranularizer class helps with. This is to avoid too much batch-splitting and to respect the fact that vector selectors are somewhat more heavyweight than regular selectors. - Unlike FilteredOffset, FilteredVectorOffset does not leverage indexes for filters that might partially support them (like an OR of one filter that supports indexing and another that doesn't). I'm not sure that this behavior is desirable anyway (it is potentially too eager) but, at any rate, it'd be better to harmonize it between the two classes. Potentially they should both do some different thing that is smarter than what either of them is doing right now. - When vector cursors are created by QueryableIndexCursorSequenceBuilder, they use a morphing binary-then-linear search to find their start and end rows, rather than linear search. Limitations in this patch are: - Only timeseries and groupBy have vectorized engines. - GroupBy doesn't handle multi-value dimensions yet. - Vector cursors cannot handle virtual columns or descending order. - Only some filters have vectorized matchers: "selector", "bound", "in", "like", "regex", "search", "and", "or", and "not". - Only some aggregators have vectorized implementations: "count", "doubleSum", "floatSum", "longSum", "hyperUnique", and "filtered". - Dimension specs other than "default" don't work yet (no extraction functions or filtered dimension specs). Currently, the testing strategy includes adding vectorization-enabled tests to TimeseriesQueryRunnerTest, GroupByQueryRunnerTest, GroupByTimeseriesQueryRunnerTest, CalciteQueryTest, and all of the filtering tests that extend BaseFilterTest. In all of those classes, there are some test cases that don't support vectorization. They are marked by special function calls like "cannotVectorize" or "skipVectorize" that tell the test harness to either expect an exception or to skip the test case. Testing should be expanded in the future -- a project in and of itself. Related to #3011. * WIP * Adjustments for unused things. * Adjust javadocs. * DimensionDictionarySelector adjustments. * Add "clone" to BatchIteratorAdapter. * ValueMatcher javadocs. * Fix benchmark. * Fixups post-merge. * Expect exception on testGroupByWithStringVirtualColumn for IncrementalIndex. * BloomDimFilterSqlTest: Tag two non-vectorizable tests. * Minor adjustments. * Update surefire, bump up Xmx in Travis. * Some more adjustments. * Javadoc adjustments * AggregatorAdapters adjustments. * Additional comments. * Remove switching search. * Only missiles.	2019-07-12 12:54:07 -07:00
Clint Wylie	42a7b8849a	remove FirehoseV2 and realtime node extensions (#8020 ) * remove firehosev2 and realtime node extensions * revert intellij stuff * rat exclusion	2019-07-04 15:40:22 -07:00
Benedict Jin	6395c08309	Bump commons-codec from 1.7 to 1.12 (#7995 )	2019-06-29 07:40:19 -07:00
Benedict Jin	7a5bc5ffcd	Bump jaxb-api from 2.3.0 to 2.3.1 (#7978 )	2019-06-27 08:51:00 -07:00
Roman Leventov	46ea5b88b7	Add the pull-request template (#7206 ) * Add the pull-request template * Rewording * Replaced checklist link, added Rat exclusion * Update the PR template. Add Concurrency Checklist to the repository * Merge Description and Design sections. Softer language. Removed requirement to test in production environment. Added a committer's instruction to justify addition of meta tags. * Rephrase item about comments * Add license header * Add item to concurrency checklist	2019-06-27 15:51:25 +03:00
Benedict Jin	bc1413e4e3	Bump commons-cli from 1.2 to 1.3.1 (#7966 )	2019-06-26 08:05:13 -07:00
Fokko Driesprong	48f20fe754	Add Spotbugs (#7894 ) * Add Spotbugs Exclude all the issues for now, so we can add them one by one. (cherry picked from commit ceda4754dc8c703d1e0de85b48cd5f5409cfd5b7) * Add additional rules to the list * More rules * More rules * Add comments to the xml * Move the spotbugs-exclude.xml to codestyle/	2019-06-20 21:06:52 +03:00
Fokko Driesprong	41f23b5120	Bump commons-compress from 1.16 to 1.18 (#7924 )	2019-06-19 10:43:01 -07:00
Xue Yu	20d1db9dff	bump fastutil to 8.2.3 (#7920 )	2019-06-18 09:17:34 -07:00
Benedict Jin	fb7f8ec362	Bump RoaringBitmap from 0.8.0 to 0.8.6 (#7906 )	2019-06-17 17:02:52 +08:00
Jihoon Son	d00a9676b7	Set aws.region for unit tests automatically (#7868 ) * Set aws.region for unit tests automatically * Update README.template	2019-06-14 15:34:21 -07:00
Fokko Driesprong	f2b00023f8	Bump Checkstyle to 8.21 (#7826 )	2019-06-04 01:02:46 -07:00
Fokko Driesprong	c8e1511f12	Bump Joda time to 2.10.2 (#7809 )	2019-05-31 14:25:35 -07:00
Jihoon Son	7abfbb066a	Bump up snapshot version to 0.16.0 (#7802 )	2019-05-30 17:17:33 -07:00
Xavier Léauté	58a6f0d5d0	Enable compiling against Java 9+ (tests disabled) This change only enables compilation to ensure code compiles against recent Java versions going forward. Tests are still disabled in this profile until test failures are addressed.	2019-05-27 18:40:19 -07:00
Clint Wylie	db3792727e	use unminified jquery to be more friendly for source releases, fix license stuff (#7751 ) * use unminified jquery to be more friendly for source releases, fix license stuff * other license file * rats	2019-05-24 11:53:25 -07:00
awelsh93	6964ac23a2	Adding influxdb emitter as a contrib extension (#7717 ) * Adding influxdb emitter as a contrib extension * addressing code review comments	2019-05-23 11:11:48 -07:00
mcbrewster	1b284ca847	add tests to dialogs, compnents and views. Add index files to components and dialogs. add nested file structure (#7669 )	2019-05-22 20:36:51 -07:00
Gian Merlino	b6941551ae	Upgrade various build and doc links to https. (#7722 ) * Upgrade various build and doc links to https. Where it wasn't possible to upgrade build-time dependencies to https, I kept http in place but used hardcoded checksums or GPG keys to ensure that artifacts fetched over http are verified properly. * Switch to https://apache.org.	2019-05-21 11:30:14 -07:00
Clint Wylie	ddda8b74cb	update lz4-java to 1.6.0 (lz4 1.9.1) (#7700 )	2019-05-20 13:01:48 -07:00
Fokko Driesprong	2aa9613bed	Bump Checkstyle to 8.20 (#7651 ) * Bump Checkstyle to 8.20 Moderate severity vulnerability that affects: com.puppycrawl.tools:checkstyle Checkstyle prior to 8.18 loads external DTDs by default, which can potentially lead to denial of service attacks or the leaking of confidential information. Affected versions: < 8.18 * Oops, missed one * Oops, missed a few	2019-05-14 11:53:37 -07:00
Xue Yu	35a1fbefea	upgrade avatica to 1.12.0 (#7644 )	2019-05-12 14:38:06 -07:00
Clint Wylie	6a6c6d573d	Add plain text README.txt, use relative link from README.md to build.md (#7611 ) * use relative link to build instructions from top level readme * add textfile to readme * formatting * make README.BINARY plaintext, move LABELS.md to LABELS, README.txt to README * exclude README.BINARY still * remove jdk links/recommmendations * add script to use DRUIDVERSION in textfile README instead of latest, add links to recommended jdk to build.md * license * better readme template, links to latest if does not detect an apache release version * fix	2019-05-09 21:29:26 -07:00
Samarth Jain	b542bb9f34	TDigest backed sketch aggregators (#7331 ) * First set of changes for tDigest histogram * Add license * Address code review comments * Add a doc page for new T-Digest sketch aggregators. Minor code cleanup and comments. * Remove synchronization from BufferAggregators. Address code review comments * Fix typo	2019-05-09 17:22:55 -07:00
Xavier Léauté	751e1c9ba7	add javax.xml.bind dependencies removed in jdk11 (#7604 ) We depend on javax.xml.bind at runtime. This change adds an explicit dependency on the J2EE module that was removed in Java 11.	2019-05-06 19:30:14 -07:00
Jonathan Wei	7c2ca474da	Add single-machine deployment example cfgs and scripts (#7590 ) * Add single-machine deployment example cfgs and scripts * Add (8u92+) * Use combined coordinator-overlord for single machine confs * RAT fix	2019-05-06 19:11:13 -07:00
Xavier Léauté	51a62cb31b	Update dependencies for JDK11 support (#7601 ) * update asm for jdk11 support * update jvm-attach-api for jdk11 support	2019-05-06 14:07:44 -07:00
Xavier Léauté	f7bfe8f269	Update mocking libraries for Java 11 support (#7596 ) * update easymock / powermock for to 4.0.2 / 2.0.2 for JDK11 support * update tests to use new easymock interfaces * fix tests failing due to easymock fixes * remove dependency on jmockit * fix race condition in ResourcePoolTest	2019-05-06 12:28:56 -07:00
Eyal Yurman	f02251ab2d	Contributing Moving-Average Query to open source. (#6430 ) * Contributing Moving-Average Query to open source. * Fix failing code inspections. * See if explicit types will invoke the correct comparison function. * Explicitly remove support for druid.generic.useDefaultValueForNull configuration parameter. * Update styling and headers for complience. * Refresh code with latest master changes: * Remove NullDimensionSelector. * Apply changes of RequestLogger. * Apply changes of TimelineServerView. * Small checkstyle fix. * Checkstyle fixes. * Fixing rat errors; Teamcity errors. * Removing support theta sketches. Will be added back in this pr or a following once DI conflicts with datasketches are resolved. * Implements some of the review fixes. * Contributing Moving-Average Query to open source. * Fix failing code inspections. * See if explicit types will invoke the correct comparison function. * Explicitly remove support for druid.generic.useDefaultValueForNull configuration parameter. * Update styling and headers for complience. * Refresh code with latest master changes: * Remove NullDimensionSelector. * Apply changes of RequestLogger. * Apply changes of TimelineServerView. * Small checkstyle fix. * Checkstyle fixes. * Fixing rat errors; Teamcity errors. * Removing support theta sketches. Will be added back in this pr or a following once DI conflicts with datasketches are resolved. * Implements some of the review fixes. * More fixes for review. * More fixes from review. * MapBasedRow is Unmodifiable. Create new rows instead of modifying existing ones. * Remove more changes related to datasketches support. * Refactor BaseAverager startFrom field and add a comment. * fakeEvents field: Refactor initialization and add comment. * Rename parameters (tiny change). * Fix variable name typo in test (JAN_4). * Fix styling of non camelCase fields. * Fix Preconditions.checkArgument for cycleSize. * Add more documentation to RowBucketIterable and other classes. * key/value comment on in MovingAverageIterable. * Fix anonymous makeColumnValueSelector returning null. * Replace IdentityYieldingAccumolator with Yielders.each(). * * internalNext() should return null instead of throwing exception. * Remove unused variables/prarameters. * Harden MovingAverageIterableTest (Switch anyOf to exact match). * Change internalNext() from recursion to iteration; Simplify next() and hasNext(). * Remove unused imports. * Address review comments. * Rename fakeEvents to emptyEvents. * Remove redundant parameter key from computeMovingAverage. * Check yielder as well in RowBucketIterable#hasNext() * Fix javadoc.	2019-04-26 17:07:48 -07:00
Clint Wylie	89bb43f382	'core' ORC extension (#7138 ) * orc extension reworked to use apache orc map-reduce lib, moved to core extensions, support for flattenSpec, tests, docs * change binary handling to be compatible with avro and parquet, Rows.objectToStrings now converts byte[] to base64, change date handling * better docs and tests * fix it * formatting * doc fix * fix it * exclude redundant dependencies * use latest orc-mapreduce, add hadoop jobProperties recommendations to docs * doc fix * review stuff and fix binaryAsString * cache for root level fields * more better	2019-04-09 09:03:26 -07:00
Richard Startin	d29a32062f	upgrade to RoaringBitmap 0.8.0 and serialise directly to ByteBuffer (#7408 )	2019-04-04 13:22:50 -04:00
Charles Allen	eeb3dbe79d	Move GCP to a core extension (#6953 ) * Move GCP to a core extension * Don't provide druid-core >.< * Keep AWS and GCP modules separate * Move AWSModule to its own module * Add aws ec2 extension and more modules in more places * Fix bad imports * Fix test jackson module * Include AWS and GCP core in server * Add simple empty method comment * Update version to 15 * One more 0.13.0-->0.15.0 change * Fix multi-binding problem * Grep for s3-extensions and update docs * Update extensions.md	2019-03-27 09:00:43 -07:00
Jonathan Wei	8ca7cb4886	Fix rat check for source assembly after build (#7333 )	2019-03-22 22:48:35 -07:00
Jonathan Wei	7a57bc0dc3	Exclude git.version from rat check (#7322 )	2019-03-21 20:54:27 -07:00
Jonathan Wei	5486c2abf8	Update LICENSE and NOTICE files (#7026 ) * Update LICENSE and NOTICE files * Update react-table version	2019-03-04 18:45:22 -08:00
Jonathan Wei	258485a2fb	Exclude github issue templates from license check (#7070 ) * Exclude github issue templates from license check * Adjust capitalization	2019-02-19 12:38:52 -08:00
Surekha	80a2ef7be4	Support kafka transactional topics (#5404 ) (#6496 ) * Support kafka transactional topics * update kafka to version 2.0.0 * Remove the skipOffsetGaps option since it's not used anymore * Adjust kafka consumer to use transactional semantics * Update tests * Remove unused import from test * Fix compilation * Invoke transaction api to fix a unit test * temporary modification of travis.yml for debugging * another attempt to get travis tasklogs * update kafka to 2.0.1 at all places * Remove druid-kafka-eight dependency from integration-tests, remove the kafka firehose test and deprecate kafka-eight classes * Add deprecated in docs for kafka-eight and kafka-simple extensions * Remove skipOffsetGaps and code changes for transaction support * Fix indentation * remove skipOffsetGaps from kinesis * Add transaction api to KafkaRecordSupplierTest * Fix indent * Fix test * update kafka version to 2.1.0	2019-02-18 11:50:08 -08:00
Edward Gan	90c1a54b86	Moments Sketch custom aggregator (#6581 ) * Moments Sketch Integration with Druid * updates, add documentation, fix warnings * nits * disallowed base64 * update to druid 0.14	2019-02-13 14:03:47 -08:00
Jonathan Wei	fafbc4a80e	Set version to 0.15.0-incubating-SNAPSHOT (#7014 )	2019-02-07 14:02:52 -08:00
Jonathan Wei	8bc5eaa908	Set version to 0.14.0-incubating-SNAPSHOT (#7003 )	2019-02-04 19:36:20 -08:00
Vadim Ogievetsky	7f1b19bfb1	Adding a Unified web console. (#6923 ) * Adding new web console. * fixed css * fix form height * fix typo * do import custom react-table css * added repo field so npm does not complain * ask travis for node 10 * move indexing-service/src/main/resources/indexer_static into web-console * fix resource names and paths * add licenses * fix exclude file * add licenses to misc files and tidy up * remove rebase marker * fix link * updated env variable name * tidy up licenses and surface errors * cleanup * remove unused code, fix missing await * TeamCity does not like the name aux * add more links to tasks view * rm pages * update gitignore * update readme to be accurate * make clean script * removed old console dependancy * update Jetty routes * add a comment for welcome files for coordinator * do not show inital notifaction for now * renamed overlord console back to console.html * fix coordinator console * rename coordinator-console.html to index.html	2019-01-31 17:26:41 -08:00
Ankit Kothari	8492d94f59	Kill Hadoop MR task on kill of Hadoop ingestion task (#6828 ) * KillTask from overlord UI now makes sure that it terminates the underlying MR job, thus saving unnecessary compute Run in jobby is now split into 2 1. submitAndGetHadoopJobId followed by 2. run submitAndGetHadoopJobId is responsible for submitting the job and returning the jobId as a string, run monitors this job for completion JobHelper writes this jobId in the path provided by HadoopIndexTask which in turn is provided by the ForkingTaskRunner HadoopIndexTask reads this path when kill task is clicked to get hte jobId and fire the kill command via the yarn api. This is taken care in the stopGracefully method which is called in SingleTaskBackgroundRunner. Have enabled `canRestore` method to return `true` for HadoopIndexTask in order for the stopGracefully method to be called HadoopJob files have been changed to incorporate the changes to jobby Addressing PR comments * Addressing PR comments - Fix taskDir * Addressing PR comments - For changing the contract of Task.stopGracefully() `SingleTaskBackgroundRunner` calls stopGracefully in stop() and then checks for canRestore condition to return the status of the task * Addressing PR comments 1. Formatting 2. Removing `submitAndGetHadoopJobId` from `Jobby` and calling writeJobIdToFile in the job itself * Addressing PR comments 1. POM change. Moving hadoop dependency to indexing-hadoop * Addressing PR comments 1. stopGracefully now accepts TaskConfig as a param Handling isRestoreOnRestart in stopGracefully for `AppenderatorDriverRealtimeIndexTask, RealtimeIndexTask, SeekableStreamIndexTask` Changing tests to make TaskConfig param isRestoreOnRestart to true	2019-01-25 15:43:06 -08:00
Gian Merlino	e497141e92	Update Curator to 4.1.0. (#6862 )	2019-01-15 14:12:07 -08:00
Richard Startin	99097617a1	use RoaringBitmapWriter for RoaringBitmap construction (#6764 )	2019-01-08 17:18:41 -08:00
dongyifeng	def823124c	add version comparator for StringComparator (#6745 ) * add version comparator for StringComparator * add more test case and docs	2019-01-08 17:17:03 -08:00
Joshua Sun	7c7997e8a1	Add Kinesis Indexing Service to core Druid (#6431 ) * created seekablestream classes * created seekablestreamsupervisor class * first attempt to integrate kafa indexing service to use SeekableStream * seekablestream bug fixes * kafkarecordsupplier * integrated kafka indexing service with seekablestream * implemented resume/suspend and refactored some package names * moved kinesis indexing service into core druid extensions * merged some changes from kafka supervisor race condition * integrated kinesis-indexing-service with seekablestream * unite tests for kinesis-indexing-service * various bug fixes for kinesis-indexing-service * refactored kinesisindexingtask * finished up more kinesis unit tests * more bug fixes for kinesis-indexing-service * finsihed refactoring kinesis unit tests * removed KinesisParititons and KafkaPartitions to use SeekableStreamPartitions * kinesis-indexing-service code cleanup and docs * merge #6291 merge #6337 merge #6383 * added more docs and reordered methods * fixd kinesis tests after merging master and added docs in seekablestream * fix various things from pr comment * improve recordsupplier and add unit tests * migrated to aws-java-sdk-kinesis * merge changes from master * fix pom files and forbiddenapi checks * checkpoint JavaType bug fix * fix pom and stuff * disable checkpointing in kinesis * fix kinesis sequence number null in closed shard * merge changes from master * fixes for kinesis tasks * capitalized <partitionType, sequenceType> * removed abstract class loggers * conform to guava api restrictions * add docker for travis other modules test * address comments * improve RecordSupplier to supply records in batch * fix strict compile issue * add test scope for localstack dependency * kinesis indexing task refactoring * comments * github comments * minor fix * removed unneeded readme * fix deserialization bug * fix various bugs * KinesisRecordSupplier unable to catch up to earliest position in stream bug fix * minor changes to kinesis * implement deaggregate for kinesis * Merge remote-tracking branch 'upstream/master' into seekablestream * fix kinesis offset discrepancy with kafka * kinesis record supplier disable getPosition * pr comments * mock for kinesis tests and remove docker dependency for unit tests * PR comments * avg lag in kafkasupervisor #6587 * refacotred SequenceMetadata in taskRunners * small fix * more small fix * recordsupplier resource leak * revert .travis.yml formatting * fix style * kinesis docs * doc part2 * more docs * comments * comments2 revert string replace changes * comments * teamcity * comments part 1 * comments part 2 * comments part 3 * merge #6754 * fix injection binding * comments * KinesisRegion refactor * comments part idk lol * can't think of a commit msg anymore * remove possiblyResetDataSourceMetadata() for IncrementalPublishingTaskRunner * commmmmmmmmmments * extra error handling in KinesisRecordSupplier getRecords * comments * quickfix * typo * oof	2018-12-21 12:49:24 -07:00
Clint Wylie	522ef1a013	update roaring to latest (#6750 )	2018-12-20 15:10:24 -08:00
David Lim	3443f9a008	add missing contrib extensions to distribution packaging, fix influx package name, fix checkstyle plugin config to support Maven < 3.3.9 (#6717 )	2018-12-10 17:56:32 -08:00
Furkan KAMACI	bbb283fa34	Double-checked locking bugs (#6662 ) * Double-checked locking bug is fixed. * @Nullable is removed since there is no need to use along with @MonotonicNonNull. * Static import is removed. * Lazy initialization is implemented. * Local variables used instead of volatile ones. * Local variables used instead of volatile ones.	2018-12-07 17:10:29 +01:00
Roman Leventov	87b96fb1fd	Add checkstyle rules about imports and empty lines between members (#6543 ) * Add checkstyle rules about imports and empty lines between members * Add suppressions * Update Eclipse import order * Add empty line * Fix StatsDEmitter	2018-11-20 12:42:15 +01:00
Roman Leventov	8f3fe9cd02	Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies (#6607 ) * Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies * Fix bug * Replace checkstyle regexp with IntelliJ inspection	2018-11-15 13:21:34 -08:00
David Lim	afb239b17a	add missing license headers, in particular to MD files; clean up RAT … (#6563 ) * add missing license headers, in particular to MD files; clean up RAT exclusions * revert inadvertent doc changes * docs * cr changes * fix modified druid-production.svg	2018-11-13 09:38:37 -08:00
Roman Leventov	54351a5c75	Fix various bugs; Enable more IntelliJ inspections and update error-prone (#6490 ) * Fix various bugs; Enable more IntelliJ inspections and update error-prone * Fix NPE * Fix inspections * Remove unused imports	2018-11-06 14:38:08 -08:00
Clint Wylie	1224d8b746	overhaul 'druid-parquet-extensions' module, promoting from 'contrib' to 'core' (#6360 ) * move parquet-extensions from contrib to core, adds new hadoop parquet parser that does not convert to avro first and supports flattenSpec and int96 columns, add support for flattenSpec for parquet-avro conversion parser, much test with a bunch of files lifted from spark-sql * fix avro flattener to support nullable primitives for auto discovery and now only supports primitive arrays instead of all arrays * remove leftover print * convert micro timestamp to millis * checkstyle * add ignore for .parquet and .parq to rat exclude * fix legit test failure from avro flattern behavior change * fix rebase * add exclusions to pom to cut down on redundant jars * refactor tests, add support for unwrapping lists for parquet-avro, review comments * more comment * fix oops * tweak parquet-avro list handling * more docs * fix style * grr styles	2018-11-05 21:33:42 -08:00
Joshua Sun	bf90b2b183	fix shallow clone git-commit-id plugin unable to find commits until tag issue (#6495 )	2018-10-20 14:45:09 -07:00
David Lim	e1a53fd17a	fix distribution to not include contrib extensions by default, don't pull the entire AWS SDK bundle (#6494 )	2018-10-19 13:50:05 -07:00
David Lim	73780536a6	Apache-ize POM (#6482 ) * Apache-ize POM * put revision information into MANIFEST.MF for binary release * remove nightly profile * fix flaky travis by overriding maven-remote-resources-plugin execution from the parent POM	2018-10-17 18:37:01 -07:00
patelh	f0b977ea7f	Upgrade lz4-java to 1.5.0 (#6478 )	2018-10-17 18:34:20 -07:00
Clint Wylie	84598fba3b	combine druid-api, druid-common, java-util into druid-core (#6443 ) * combine druid-api, druid-common, java-util * spacing	2018-10-14 20:37:37 -07:00
Charles Allen	0f4f5f2877	Cleanup jackson dependency exclusions (#6438 ) * Remove pulling in jackson from fasterxml in a lot of places * Remove codehaus extensions	2018-10-12 17:25:39 -06:00
David Lim	20ab213ba6	change project versions to 0.13.0-incubating-SNAPSHOT (#6453 )	2018-10-11 19:28:01 -07:00
Charles Allen	1c4f787ed4	Upgrade Netty to 4.1.x (#6417 ) * Update netty to 4.1.30.Final * Fix compile time problems with new netty * Remove netty-all from rocketmq extension	2018-10-05 12:30:00 -07:00
Roman Leventov	3ae563263a	Renamed 'Generic Column' -> 'Numeric Column'; Fixed a few resource leaks in processing; misc refinements (#5957 ) This PR accumulates many refactorings and small improvements that I did while preparing the next change set of https://github.com/druid-io/druid/projects/2. I finally decided to make them a separate PR to minimize the volume of the main PR. Some of the changes: - Renamed confusing "Generic Column" term to "Numeric Column" (what it actually implies) in many class names. - Generified `ComplexMetricExtractor`	2018-10-02 14:50:22 -03:00
Gian Merlino	3548396a45	SQL: Update to Calcite 1.17.0. (#6404 ) * SQL: Update to Calcite 1.17.0. Other than keeping things fresh, another motivation is that this fixes CALCITE-1436 (AggregateNode NPE for aggregators other than SUM/COUNT), which affects aggregate functions on our system tables. Also sets shouldConvertRaggedUnionTypesToVarying = true, a new type system parameter that prefers VARCHAR over CHAR. This is better for Druid, because we don't really have support for a true CHAR type. * Remove unused import.	2018-09-29 18:33:29 -07:00
QiuMM	13bbbbf608	Fix issue that forbidden-api check prevents building individual modules (#6394 )	2018-09-27 11:25:10 -07:00
Nishant Bangarwa	c9d281a2e9	Add ability to pass in Bloom filter from Hive Queries (#6222 ) * Bloom filter initial implementation fix checkstyle review comments Fix wierd failure review comments Revert "Fix wierd failure" This reverts commit a13a83ad7887e679f6d539191b52aeaaea85b613. * fix test * review comment	2018-09-26 16:04:26 -07:00
QiuMM	6843cbba1d	Fix issue that the forbidden-apis check do not always work (#6371 ) * Fix the forbidden apis check do not work issue * use SuppressForbidden annotation	2018-09-26 19:39:39 +03:00
Roman Leventov	4afa85e4e8	Move the Rat plugin to a separate Maven profile (#6376 ) * Move the Rat plugin to a separate Maven profile * Revert self-check	2018-09-25 14:39:24 -07:00
Jonathan Wei	8972244c68	Mutual TLS support (#6076 ) * Mutual TLS support * Kafka test fixes * TeamCity fix * Split integration tests * Use localhost DOCKER_IP * Increase server thread count * Increase SSL handshake timeouts * Add broken pipe retries, use injected client config params * PR comments, Rat license check exclusion	2018-09-19 09:56:15 -07:00
Jonathan Wei	2e82edc5e0	More exclusions for Rat license check (#6346 )	2018-09-18 20:47:56 -07:00
Slim Bouguerra	028354eea8	Adding licenses and enable apache-rat-plugin. (#6215 ) * Adding licenses and enable apache-rat-plugi. Change-Id: I4685a2d9f1e147855dba69329b286f2d5bee3c18 * restore the copywrite of demo_table and add it to the list of allowed ones Change-Id: I2a9efde6f4b984bc1ac90483e90d98e71f818a14 * revirew comments Change-Id: I0256c930b7f9a5bb09b44b5e7a149e6ec48cb0ca * more fixup Change-Id: I1355e8a2549e76cd44487abec142be79bec59de2 * align Change-Id: I70bc47ecb577bdf6b91639dd91b6f5642aa6b02f	2018-09-18 08:39:26 -07:00
Clint Wylie	96a1076e23	allow 3 retries for failing tests (#6324 ) * allow 1 retry for failing tests idk if this is a good idea, but false failure rate due to flaky tests seems pretty bad lately * try to fix retry issue with teardown * Update pom.xml * Update pom.xml	2018-09-11 19:16:59 -07:00
Gian Merlino	431d3d8497	Rename io.druid to org.apache.druid. (#6266 ) * Rename io.druid to org.apache.druid. * Fix META-INF files and remove some benchmark results. * MonitorsConfig update for metrics package migration. * Reorder some dimensions in inner queries for some reason. * Fix protobuf tests.	2018-08-30 09:56:26 -07:00
Gian Merlino	0172326c62	SQL: Support more result formats, add columns header. (#6191 ) * SQL: Support more result formats, add columns header. - Add result formats for line-based JSON and CSV. - Add X-Druid-Sql-Columns header with a list of all columns that the response will contain. - Add more comprehensive documentation on what callers should expect when making Druid SQL queries. * Fix some tests. * Adjust tests. * Adjust trailer, add types header. * Fix trailers.	2018-08-26 23:00:14 -06:00
QiuMM	ef91fdbf03	Zstandard decompression support (#6224 )	2018-08-26 16:09:24 -07:00
Kirill Kozlov	62e580050c	Use JUnit TemporaryFolder rule instead of system temp folder (#6070 ) * Use JUnit TemporaryFolder rule instead of system tmp folder * Allow to forbid apis which present not in all mvn modules	2018-08-16 11:05:45 -07:00
Jihoon Son	ecee3e0a24	Further optimize memory for Travis jobs (#6150 ) * Further optimize memory for Travis jobs * fix build * sudo false	2018-08-10 22:03:36 -07:00
Gian Merlino	04ea3c9f8c	Update license headers. (#5976 ) * Update license headers. For compliance with http://www.apache.org/legal/src-headers.html. * More license adjustments. * Fix mistakenly edited package line.	2018-07-11 09:55:18 -07:00
Dylan Wylie	8c6651022d	Update jsonpath dependency (#5794 ) * Update JSONPath Library Re: #5792 - Add a unit test containing a JSONPath conditional - Update the JSONPath library and no longer exclude the json-smart dependency. - I believe the original reason for excluding this has been fixed: https://github.com/json-path/JsonPath/pull/315 * Add test * Fix test	2018-06-15 13:50:48 -07:00
zhangxinyu	e43e5ebbcd	Materialized view implementation (#5556 ) * implement materialized view * modify code according to jihoonson's comments * modify code according to jihoonson's comments - 2 * add documentation about materialized view * use new HadoopTuningConfig in pr 5583 * add minDataLag and fix optimizer bug * correct value of DEFAULT_MIN_DATA_LAG_MS * modify code according to jihoonson's comments - 3 * use the boolean expression instead of if-else	2018-06-09 12:24:54 -07:00
Hongze Zhang	cfa94b747b	Update to jetty 9.4; Enable request decompression (#5624 ) * Update to jetty 9.4; Enable request decompression; Add http compression config options * Fix BadMessageException from jetty server at HttpGenerator.generateHeaders(...)	2018-06-08 14:53:08 -07:00
Jonathan Wei	684b5d18c1	Moving averages for ingestion row stats (#5748 ) * Moving averages for ingestion row stats * PR comments * Make RowIngestionMeters extensible * test and checkstyle fixes * More PR comments * Fix metrics * Add some comments * PR comments * Comments	2018-06-05 09:08:57 -07:00
Fokko Driesprong	a95ec92296	Move to the org.lz4 dependency (#5746 ) The net.jpountz.lz4 moved to org.lz4	2018-05-07 08:16:45 -07:00
Gian Merlino	5ab17668c0	CompressionUtils: Add support for decompressing xz, bz2, zip. (#5586 ) Also switch various firehoses to the new method. Fixes #5585.	2018-04-06 08:06:45 -07:00
Charles Allen	b86ed99d9a	Deprecate spark2 profile in pom.xml (#5581 ) Deprecated due to https://github.com/druid-io/druid/pull/5382	2018-04-06 05:37:16 -07:00
Nathan Hartwell	ea30c05355	Adding ParserSpec for Influx Line Protocol (#5440 ) * Adding ParserSpec for Influx Line Protocol * Addressing PR feedback - Remove extraneous TODO - Better handling of parse errors (e.g. invalid timestamp) - Handle sub-millisecond timestamps * Adding documentation for Influx parser * Fixing docs	2018-03-26 14:28:46 -07:00
Jihoon Son	1ad898bde2	Use the official aws-sdk instead of jet3t (#5382 ) * Use the official aws-sdk instead of jet3t * fix compile and serde tests * address comments and fix test * add http version string * remove redundant dependencies, fix potential NPE, and fix test * resolve TODOs * fix build * downgrade jackson version to 2.6.7 * fix test * resolve the last TODO * support proxy and endpoint configurations * fix build * remove debugging log * downgrade hadoop version to 2.8.3 * fix tests * remove unused log * fix it test * revert KerberosAuthenticator change * change hadoop-aws scope to provided in hdfs-storage * address comments * address comments	2018-03-21 15:36:54 -07:00
Charles Allen	58f110f7f8	Future-proof some Guava usage (#5414 ) * Future-proof some Guava usage * Use a java-util EmptyIterator instead of Guava's * Change some of the guava future handling to do manual async transforms. Guava changes transform into transformAsync by deprecating transform in ONLY Guava 19. Then its gone in 20 * Use `Collections.emptyIterator()` * Pretty formatting * Make listenable future transforms a thing in default druid * Format fix * Add forbidden guava apis * Make the ListenableFutrues.transformAsync have comments * Undo intellij bad pattern matching in comments * Futrues --> Futures * Add empty iterators forbidding * Fix extra `A` * Correct method signature * Address review comments * Finish Gian review comments * Proper syntax from https://github.com/policeman-tools/forbidden-apis/wiki/SignaturesSyntax	2018-03-20 08:59:33 -07:00
bolkedebruin	8f07a39af7	Skip OS cache on Linux when pulling segments (#5421 ) Druid relies on the page cache of Linux in order to have memory segments. However when loading segments from deep storage or rebalancing the page cache can get poisoned by segments that should not be in memory yet. This can significantly slow down Druid in case rebalancing happens as data that might not be queried often is suddenly in the page cache. This PR implements the same logic as is in Apache Cassandra and Apache Bookkeeper. Closes #4746	2018-03-08 07:54:21 -08:00
Vinesh Chemmala Paul	fb493ae13a	Add repository url (#5437 ) snapshot build fails to find dependent artifacts. Add repoistory URL to resolved dependencies	2018-03-01 07:38:24 -08:00
QiuMM	aa7aee53ce	Opentsdb emitter extension (#5380 ) * opentsdb emitter extension * doc for opentsdb emitter extension * update opentsdb emitter doc * add the ms unit to the constant name * add a configurable event limit * fix version to 0.13.0-SNAPSHOT * using a thread to consume metric event * rename method and parameter	2018-02-13 13:10:22 -08:00
Slim	37c09ce3f8	Use both Joad Ids and Java IDs as Timezone to string readers (#5349 ) * Use both Joad Ids and Java IDs as Timezone to string readers Change-Id: Ieb5c18559879f3f3a0104912ce2f0a354ad0aac3 * move the function to DateTimes and add org.joda.time.DateTimeZone#forID as part of forbidden api Change-Id: Iff97fa044758019ed0c231587d10e31a9cc18da0 * exclude class and remove other usage Change-Id: Ib458c2caaa1865535767e1009fbf017a92c8f615 * remove it from test classes Change-Id: I9b576324f6c7e17a74bd8b13879232c9a8cd40b4 * remove unused Change-Id: If1c5b70c26c2b7c83c20434cb72b2060653f5052	2018-02-06 16:34:11 +05:30
Gian Merlino	7e02408510	Update versions to 0.13.0-SNAPSHOT. (#5323 )	2018-02-02 12:06:38 -06:00
Jonathan Wei	80419752b5	Add metamx emitter, http clients, and metrics packages to druid java-util (#5289 ) * Add metamx java-util emitter, http clients, and metrics packages to druid java-util * Remove metamx java-util from pom.xml files * Checkstyle fixes * Import fix * TeamCity inspection fixes * Use slf4j, move some version defs to master pom.xml * Use parent jvm-attach-api and maven-surefire-plugin versions * Add ] to log msg, suppress inspection	2018-01-24 22:10:36 +01:00
Roman Leventov	f99c27e9e0	Fix bugs in ImmutableRTree; Merge bytebuffer-collections module into druid-processing (#5275 ) * Fix bugs in ImmutableRTree; optimize ImmmutableRTreeObjectStrategy.writeTo(); Merge bytebuffer-collections module into druid-processing * Remove unused declaration * Fix another bug	2018-01-23 00:49:59 +05:30
Akash Dwivedi	d6932c1621	java-util version update + Add UnusedConnectionTimeout config. (#5239 ) * java-util version update + Add UnusedConnectionTimeout config. * warn if unusedConnectionTime >= readTimeout. * Doc update + addressed comment. * Use compareTo to compare duration. * remove unused variable. * addressed comments and default for unusedConnectionTimeout.	2018-01-17 15:54:18 -06:00
Jonathan Wei	935ac646f4	Upgrade to Calcite 1.15.0 (#5210 ) * Upgrade to Calcite 1.15.0 * Use Filtration.eternity()	2018-01-04 12:11:24 -08:00
Nishant Bangarwa	4cc31e4e7a	Update Zookeeper version (#5184 )	2018-01-04 10:59:20 +08:00
Roman Leventov	5787d04fad	Bump Druid version to 0.12.0 (#5138 )	2017-12-15 07:37:01 -08:00
Jonathan Wei	f48c9d7be1	Basic auth extension (#5099 ) * Basic auth extension * Add auth configuration integration test * Fix missing authorizerName property * PR comments * Fix missing @JsonProperty annotation * PR comments * more PR comments	2017-12-14 10:36:04 -08:00
Roman Leventov	a7a6a0487e	Replace IOPeon with SegmentWriteOutMedium; Improve buffer compression (#4762 ) * Replace IOPeon with OutputMedium; Improve compression * Fix test * Cleanup CompressionStrategy * Javadocs * Add OutputBytesTest * Address comments * Random access in OutputBytes and GenericIndexedWriter * Fix bugs * Fixes * Test OutputBytes.readFully() * Address comments * Rename OutputMedium to SegmentWriteOutMedium and OutputBytes to WriteOutBytes * Add comments to ByteBufferInputStream * Remove unused declarations	2017-12-04 18:04:27 -08:00
Fokko Driesprong	2487152b59	Update Avro to 1.8.2 (#5075 ) And add exclusions that are required to have a single version of Apache Avro on the classpath.	2017-11-20 20:29:17 -08:00
Roman Leventov	95154f9ef6	Netty downgrade (#5059 )	2017-11-09 11:27:50 -08:00
Gian Merlino	e6ec4310b1	IT: Switch to OpenJDK8 base image. (#5060 ) * IT: Switch to OpenJDK8 base image. Also split the Docker image into a base image and a child image, and build the base image ahead of time for efficiency's sake. Also upgrade ZK to 3.4.10. * Additional comments about ZK upgrades.	2017-11-08 19:56:31 -08:00
Roman Leventov	5eb08c27cb	Add Emitter monitoring (#4973 ) * Add Emitter monitoring * Fix typo * Fixes * testing new emitter * Fix failed test (#71) * testing new emitter * fix on failed test * Remove emitter's readTimeout from docs * Update docs * Add HttpEmittingMonitor * Update java-util to 1.3.2	2017-11-03 21:27:57 -06:00
Jihoon Son	d7024f22e1	Upgrade fastutil to 8.1.0 (#4988 ) * Upgrade failutil to 8.1.0 * unused import	2017-10-19 23:37:43 -05:00
Slim	af2bc5f814	Make float default representation for DoubleSum/Min/Max aggregators (#4944 ) * Introduce System wide property to select how to store double. Set the default to store as float Change-Id: Id85cca04ed0e7ecbce78624168c586dcc2adafaa * fix tests Change-Id: Ib42db724b8a8f032d204b58c366caaeabdd0d939 * Change the property name Change-Id: I3ed69f79fc56e3735bc8f3a097f52a9f932b4734 * add tests and make default distribution store doubles as 64bits Change-Id: I237b07829117ac61e247a6124423b03992f550f2 * adding mvn argument to parallel-test profile Change-Id: Iae5d1328f901c4876b133894fa37e0d9a4162b05 * move property name and helper function to io.druid.segment.column.Column Change-Id: I62ea903d332515de2b7ca45c02587a1b015cb065 * fix docs and clean style Change-Id: I726abb8f52d25dc9dc62ad98814c5feda5e4d065 * fix docs Change-Id: If10f4cf1e51a58285a301af4107ea17fe5e09b6d	2017-10-16 17:17:22 -07:00
Gian Merlino	b20e3038b6	SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. (#4889 ) * SQL: Upgrade to Calcite 1.14.0, some refactoring of internals. This brings benefits: - Ability to do GROUP BY and ORDER BY with ordinals. - Ability to support IN filters beyond 19 elements (fixes #4203). Some refactoring of druid-sql internals: - Builtin aggregators and operators are implemented as SqlAggregators and SqlOperatorConversions rather being special cases. This simplifies the Expressions and GroupByRules code, which were becoming complex. - SqlAggregator implementations are no longer responsible for filtering. Added new functions: - Expressions: strpos. - SQL: TRUNCATE, TRUNC, LENGTH, CHAR_LENGTH, STRLEN, STRPOS, SUBSTR, and DATE_TRUNC. * Add missing @Override annotation. * Adjustments for forbidden APIs. * Adjustments for forbidden APIs. * Disable GROUP BY alias. * Doc reword.	2017-10-10 12:44:05 -07:00
Gian Merlino	1f2074c247	Bump versions in master to 0.11.1-SNAPSHOT. (#4878 ) * Bump versions in master to 0.11.1-SNAPSHOT. * Missed a few.	2017-09-28 17:09:51 -05:00
Goh Wei Xiang	2c30d5ba55	Add org.joda.time.DateTime.parse() to forbidden APIs (#4857 ) * Added org.joda.time.DateTime#(java.lang.String) to forbidden API. * Added org.joda.time.DateTime#(java.lang.String, org.joda.time.format.DateTimeFormatter) to forbidden API. * Add additional APIs that may create DateTime with default time zone * Add helper function that accepts formatter to parse String. * Add additional forbidden APIs * Replace existing usage of forbidden APIs * Use wrapper class to enforce Chronology on DateTimeFormatter. * Creates constant UtcFormatter for constant ISODateTimeFormat.	2017-09-27 17:46:44 -05:00
Roman Leventov	9c126e2aa9	Forbid MapMaker (#4845 ) * Forbid MapMaker * Shorter syntax * Forbid Maps.newConcurrentMap()	2017-09-27 06:49:47 -07:00
Charles Allen	a6470c1d03	Move caffeine out of extension and make it the default cache implementation. (#4810 ) * Move caffeine out of extension. * Remove `JsonTypeName` from the class itself * Fix bad docs * Fix distribution pom * Fix unused import * Make caffeine default * Address code comments * Add more description around the jre version in the readme * Add suggested comments	2017-09-22 10:46:55 -07:00
Charles Allen	edd9c76fa5	Add profile for building for use with Spark 2.x (#4808 ) * Add profile for building for use with Spark 2.x * Update aws sdk version	2017-09-18 23:39:40 -05:00
Jihoon Son	d606bd72de	Upgrade curator (#4786 )	2017-09-15 10:48:32 -07:00
Roman Leventov	267f415dc3	Update emitter library and add support for ParametrizedUriEmitter (#4722 ) * Move emitters from io.druid.server.initialization to the dedicated io.druid.server.emitter package; Update emitter library to 0.6.0; Add support for ParametrizedUriEmitter; Support hierarical properties in JsonConfigurator (was needed for ParametrizedUriEmitter) * Log created RequestLoggers * Fix forbidden API * Test fix * More Http and Parametrized Http Emitter docs * Switch to debug level	2017-09-13 17:17:19 -05:00
Gian Merlino	2ce8123bdb	Move scan-query from a contrib extension into core. (#4751 ) * Move scan-query from a contrib extension into core. Based on a proposal at: https://groups.google.com/d/topic/druid-development/ME_OatUDnbk/discussion This patch also adds support for virtual columns to the Scan query, and updates Druid SQL to use Scan instead of Select. This patch also makes some behavioral changes to handling of the __time column. In particular, it is now is returned as "__time" rather than "timestamp"; it is no longer included if you do not specifically ask for it in your "columns"; and it is returned as a long rather than a string. Users can revert time handling to the legacy extension behavior by setting "legacy" : true in their queries, or setting the property druid.query.scan.legacy = true. This is meant to provide a migration path for users that were formerly using the contrib extension. * Adjustments from review. * Add back Select query. * Adjust SQL docs. * Restore SelectQuery link.	2017-09-13 09:51:24 -07:00
Kenji Noguchi	c0be050242	Add jq expression support in flattenSpec (#4171 ) * add jq expression in the flattenSpec * more tests * add benchmark * fix style * use JsonNode for both JSONPath and JQ * clean up * more clean up * add documentation * fix style * move jackson-jq version to dependencyManagement section. remove commented code * oops. revert wrong fix * throw IllegalArgumentException for JQ syntax error * remove e.printStackTrace() that is forbidden * touch	2017-09-12 14:18:34 -05:00
Gian Merlino	dc5c6f13b1	Use trusty for travis jobs. (#4755 ) * Use trusty for travis jobs. The distro was set to "precise" in #4572 due to memory issues on trusty, but we've been seeing performance issues on "precise" recently so let's see how trusty is working these days. * Less quiet. * Adjust memory settings. * Add back -q option. * Tweak memory again. * Adjustments. * Try squeezing memory a bit more.	2017-09-06 18:51:06 +09:00
Roman Leventov	cbd1902db8	Add forbidden-apis plugin; prohibit using system time zone (#4611 ) * Forbidden APIs WIP * Remove some tests * Restore io.druid.math.expr.Function * Integration tests fix * Add comments * Fix in SimpleWorkerProvisioningStrategy * Formatting * Replace String.format() with StringUtils.format() in RemoteTaskRunnerTest * Address comments * Fix GroupByMultiSegmentTest	2017-08-21 13:02:42 -07:00
QiuMM	f18cc5df97	Redis cache extension (#4615 ) * Redis cache extension * Fix some trival and optimize code * Add Override annotation in RedisCacheTest	2017-08-08 10:11:45 -07:00
Charles Allen	729e44d767	Update server-metrics to 0.5.2 (#4624 )	2017-08-01 14:45:03 -07:00
Roman Leventov	684cfbf889	Upgrade to server-metrics 0.5.0 (#4480 ) * Upgrade to server-metrics 0.4.3 * Upgrade to 0.5.0 * Add CpuAcctDeltaMonitor description to docs	2017-07-26 08:56:00 -07:00
Roman Leventov	c0beb78ffd	Enforce brace formatting with Checkstyle (#4564 )	2017-07-21 10:26:59 -05:00
Roman Leventov	60cdf94677	Add PMD and prohibit unnecessary fully qualified class names in code (#4350 ) * Add PMD and prohibit unnecessary fully qualified class names in code * Extra fixes * Remove extra unnecessary fully-qualified names * Remove qualifiers * Remove qualifier	2017-07-17 22:22:29 +09:00
Gian Merlino	16817e408d	SQL + Expressions = Best friends forever. (#4360 ) * SQL + Expressions = Best friends forever. - Use expressions as a projection layer for anything that can't be expressed using traditional Druid extractionFns. Sometimes they're embedded directly (like "expression" filters, builtin aggregators, or "expression" post-aggregators). Sometimes they're referenced through virtual columns (like dimensionSpecs, which can't innately reference functions of more than one column without the virtual column layer). - Add many new functions and operators, taking advantage of the expression capability (see the querying/sql.md doc). - Improve consistency of constant reduction and of casting by using Druid expressions for this instead of Calcite's RexExecutor. * Fix casting bug, and other code review comments. * Fix docs.	2017-07-07 08:48:26 -07:00
Parag Jain	6e2f78f552	TLS support (#4270 )	2017-07-06 17:40:12 -07:00
Roman Leventov	ae900a4934	Update versions to 0.11.0-SNAPSHOT (#4483 )	2017-06-28 17:05:58 -07:00
David Lim	0f99467cfb	rollback to previous httpclient/httpcore versions (#4457 )	2017-06-23 21:43:49 -05:00
Himanshu	61c38b66ad	exclude aws-java-sdk from hadoop-aws dep in hdfs-storage module (#4437 ) * exclude aws-java-sdk from hdfs-storage module * address review comments	2017-06-22 15:56:35 -05:00
Roman Leventov	5285eb961b	Update dependencies (#4313 ) * Update dependencies * Downgrade curator * Rollback aws-java-sdk dependency to 1.10.77 * Revert exclusions in integration-tests * Depend only on aws-java-sdk-ec2 instead of umbrella aws-java-sdk (fixes #4382)	2017-06-09 14:32:07 -07:00
Roman Leventov	31d33b333e	Make using implicit system Charset an error (#4326 ) * Make using implicit system charset an error * Use StringUtils.toUtf8() and fromUtf8() instead of String.getBytes() and new String() * Use English locale in StringUtils.safeFormat() * Restore comment	2017-06-05 23:57:25 -07:00
Slim	a2584d214a	Delagate creation of segmentPath/LoadSpec to DataSegmentPushers and add S3a support (#4116 ) * Adding s3a schema and s3a implem to hdfs storage module. * use 2.7.3 * use segment pusher to make loadspec * move getStorageDir and makeLoad spec under DataSegmentPusher * fix uts * fix comment part1 * move to hadoop 2.8 * inject deep storage properties * set version to 2.7.3 * fix build issue about static class * fix comments * fix default hadoop default coordinate * fix create filesytem * downgrade aws sdk * bump the version	2017-06-04 00:55:09 -06:00
Kenji Noguchi	3400f601db	Protobuf extension (#4039 ) * move ProtoBufInputRowParser from processing module to protobuf extensions * Ported PR #3509 * add DynamicMessage * fix local test stuff that slipped in * add license header * removed redundant type name * removed commented code * fix code style * rename ProtoBuf -> Protobuf * pom.xml: shade protobuf classes, handle .desc resource file as binary file * clean up error messages * pick first message type from descriptor if not specified * fix protoMessageType null check. add test case * move protobuf-extension from contrib to core * document: add new configuration keys, and descriptions * update document. add examples * move protobuf-extension from contrib to core (2nd try) * touch * include protobuf extensions in the distribution * fix whitespace * include protobuf example in the distribution * example: create new pb obj everytime * document: use properly quoted json * fix whitespace * bump parent version to 0.10.1-SNAPSHOT * ignore Override check * touch	2017-05-30 13:11:58 -07:00
Jihoon Son	000b0ffed7	Increase the max heap size for strict compilation (#4306 )	2017-05-21 03:42:44 +09:00
Roman Leventov	b7a52286e8	Make @Override annotation obligatory (#4274 ) * Make MissingOverride an error * Make travis stript to fail fast * Add missing Override annotations * Comment	2017-05-16 13:30:30 -05:00
Roman Leventov	1ebfa22955	Update Error prone configuration; Fix bugs (#4252 ) * Make Errorprone the default compiler * Address comments * Make Error Prone's ClassCanBeStatic rule a error * Preconditions allow only %s pattern * Fix DruidCoordinatorBalancerTester * Try to give the compiler more memory * Remove distribution module activation on jdk 1.8 because only jdk 1.8 is used now * Don't show compiler warnings * Try different travis script * Fix travis.yml * Make Error Prone optional again * For error-prone compiler * Increase compiler's maxmem * Don't run Error Prone for benchmarks because of OOM * Skip install step in Travis * Remove MetricHolder.writeToChannel() * In travis.yml, check compilation before tests, because it may fail faster	2017-05-12 15:55:17 +09:00
Gian Merlino	2ca7b00346	Update versions to 0.10.1-SNAPSHOT. (#4191 )	2017-04-20 18:12:28 -07:00
Dongkyu Hwangbo	0d2e91ed50	Adding Kafka-emitter (#3860 ) * Initial commit * Apply another config: clustername * Rename variable * Fix bug * Add retry logic * Edit retry logic * Upgrade kafka-clients version to the most recent release * Make callback single object * Write documentation * Rewrite error message and emit logic * Handling AlertEvent * Override toString() * make clusterName more optional * bump up druid version * add producer.config option which make user can apply another optional config value of kafka producer * remove potential blocking in emit() * using MemoryBoundLinkedBlockingQueue * Fixing coding convention * Remove logging every exception and just increment counting * refactoring * trivial modification * logging when callback has exception * Replace kafka-clients 0.10.1.1 with 0.10.2.0 * Resolve the problem related of classloader * adopt try statement * code reformatting * make variables final * rewrite toString	2017-04-04 14:07:43 -07:00
Gian Merlino	81d6b49d69	Downgrade Curator. (#4103 ) Reverts #4060, fixes #4095, unfixes #4056, #3837. Better the devil you know than the devil you don't, I always say. See also https://issues.apache.org/jira/browse/CURATOR-394.	2017-03-23 13:44:00 -07:00
Roman Leventov	84fe91ba0b	Monomorphic processing of TopN queries with 1 and 2 aggregators (key part of #3798 ) (#3889 ) * Monomorphic processing: add HotLoopCallee, CalledFromHotLoop, RuntimeShapeInspector, SpecializationService. Specialize topN queries with 1 or 2 aggregators. Add Cursor.advanceUninterruptibly() and isDoneOrInterrupted() for exception-free query processing. * Use Execs.singleThreaded() * RuntimeShapeInspector to support nullable fields * Make CalledFromHotLoop annotation Inherited * Remove unnecessary conversion of array of ColumnSelectorPluses to list and back to array in CardinalityAggregatorFactory * Close InputStream in SpecializationService * Formatting * Test specialized PooledTopNScanners * Set flags in PooledTopNAlgorithm directly * Fix tests, dependent on CountAggragatorFactory toString() form * Fix * Revert CountAggregatorFactory changes * Implement inspectRuntimeShape() for LongWrappingDimensionSelector and FloatWrappingDimensionSelector * Remove duplicate RoaringBitmap dependency in the extendedset pom.xml * Fix * Treat ByteBuffers specially in StringRuntimeShape * Doc fix * Annotate BufferAggregator.init() with CalledFromHotLoop * Make triggerSpecializationIterationsThreshold an int * Remove SpecializationService.PerPrototypeClassState.of() * Add comments * Limit the amount of specializations that SpecializationService could make * Add default implementation for BufferAggregator.inspectRuntimeShape(), for compatibility with extensions * Use more efficient ConcurrentMap's idioms in SpecializationService	2017-03-17 14:44:36 -05:00
Gian Merlino	9cd666282c	Update Curator to 2.12.0. (#4060 ) Fixes #4056, #3837.	2017-03-15 09:38:31 -07:00
Charles Allen	805d85afda	Allow compilation as Java8 source and target (#3328 ) * Allow compilation as Java8 source and target for everything except API * Remove conditions in tests which assume that we may run with Java 7 * Update easymock to 3.4 * Make Animal Sniffer to check Java 1.8 usage; remove redundant druid-caffeine-cache configuration * Use try-with-resources in LargeColumnSupportedComplexColumnSerializerTest.testSanity() * Remove java7 special for druid-api	2017-03-14 22:23:47 -06:00
Eugene Sevastyanov	16bf62bacc	BACKEND-564: Emitter upgrade from 0.4.0 to 0.4.1 (#3977 )	2017-03-01 13:03:01 -08:00
Gian Merlino	78b0d134ae	Require Java 8 and include some Java 8 dependencies. (#3914 ) * Require Java 8 and include some Java 8 dependencies. - Upgrade Jetty to 9.3.16.v20170120. - Upgrade DataSketches to 0.8.4. - Bundle caffeine-cache by default. - Still target Java 7 when compiling base Druid classes. * Update cluster, quickstart docs. * Remove oraclejdk7 from travis.yml.	2017-02-14 12:51:51 -08:00
Roman Leventov	38000576ea	Optimizations of union, intersection and iterators of concise bitsets (part of #3798 ) (#3883 ) * Port of metamx/extendedset#10, metamx/extendedset#13, metamx/extendedset#14, metamx/extendedset#15, metamx/bytebuffer-collections@9b199e3349, metamx/bytebuffer-collections#38 to Druid, remove unused code from extendedset module * Remove ConciseSet.modCount * Replace comments with assertions in ImmutableConciseSet * Fix comments * Fix asssertions in ImmutableConciseSet * Add tests * Comment fix	2017-02-10 18:02:26 -08:00
Gian Merlino	9191588656	Fix mvn javadoc:jar failure due to HadoopFsWrapper. (#3912 )	2017-02-08 13:54:41 -06:00
Gian Merlino	12317fd001	Bump version to 0.10.0-SNAPSHOT. (#3913 )	2017-02-06 17:54:35 -08:00
DaimonPl	93b71e265e	Extract HLL related code to separate module (#3900 )	2017-02-03 09:45:11 -08:00
Nishant Bangarwa	a457cded28	Druid Extension to enable Authentication using Kerberos. (#3853 ) * Add extension for supporting kerberos security - This PR adds an extension for supporting druid authentication via Kerberos. - Working on the docs. * Add docs * review comments * more review comments * Block all paths by default * more review comments - use proper Oid * Allow extensions to override httpclient for integration tests * Add kerberos lock to prevent multithreaded issues. * review comment - remove enabled flag and fix router injection * Add Cookie Handling and more detailed docs * review comment - rename DruidKerberosConfig -> AuthKerberosConfig * review comments * fix travis failure on jdk7	2017-02-02 14:55:21 -06:00
Himanshu	efb1b40fe0	build sqlserver-metadata-storage contrib extension (#3871 )	2017-01-20 14:39:15 -08:00
kaijianding	33ae9dd485	streaming version of select query (#3307 ) * streaming version of select query * use columns instead of dimensions and metrics;prepare for valueVector;remove granularity * respect query limit within historical * use constant * fix thread name corrupted bug when using jetty qtp thread rather than processing thread while working with SpecificSegmentQueryRunner * add some test for scan query * add scan query document * fix merge conflicts * add compactedList resultFormat, this format is better for json ser/der * respect query timeout * respect query limit on broker * use static consts and remove unused code	2017-01-19 16:09:53 -06:00
Slim	ae5a349a54	Exclude the transitive dependency LGPL jar since it is not needed (#3865 ) * Exclude the transitive dependency LGPL jar since it is not needed * add reason why exclude * exclude from the root dependency * add banning tool to enforce exclusions	2017-01-19 11:49:08 -08:00
Akash Dwivedi	dd0c4e2ead	Migrating extendedset from Metamarkets. (#3694 ) * Migrating extendedset from Metamarkets. * Notice change * More details in NOTICE * NOTICE formatting. * suppress header checkstlye for extendedset.	2017-01-17 10:10:27 -08:00
Jihoon Son	d80bec83cc	Enable auto license checking (#3836 ) * Enable license checking * Clean duplicated license headers	2017-01-10 18:13:47 -08:00
Gian Merlino	a4f81a6471	Update to Calcite 1.11.0. (#3825 )	2017-01-06 14:45:17 -08:00
Gian Merlino	1f35120c7e	Downgrade to avatica-server 1.8.0, skip avatica-core. (#3813 ) This matches the version bundled by Calcite 1.10.0.	2017-01-03 16:00:37 -08:00
Roman Leventov	76cb06a8d8	Lookup cache refactoring (the main part of #3667 ) (#3697 ) * Lookup cache refactoring (the main part of druid-io/druid#3667) * Use PowerMock's static methods in NamespaceLookupExtractorFactoryTest * Fix KafkaLookupExtractorFactoryTest * Use VisibleForTesting annotation instead of Javadoc comment * Create a NamespaceExtractionCacheManager separately for each test in NamespaceExtractionCacheManagersTest * Rename CacheScheduler.NoCache.ENTRY_DISPOSED to ENTRY_CLOSED * Reduce visibility of NamespaceExtractionCacheManager.cacheCount() and monitor() implementations, and don't run NamespaceExtractionCacheManagerExecutorsTest with off-heap cache (it didn't before) * In NamespaceLookupExtractorFactory, use safer idiom to check if CacheState is NoCache or VersionedCache * More logging in CacheHandler constructor and close(), VersionedCache.close() * PR comments addressed * Make CacheScheduler.EntryImpl AutoCloseable, avoid 'dispose' verb in comments, logging and naming in CacheScheduler in favor of 'close' * More Javadoc comments to CacheScheduler * Fix NPE * Remove logging in OnHeapNamespaceExtractionCacheManager.expungeCollectedCaches() * Make NamespaceExtractionCacheManagersTest.testRacyCreation() to have similar load to what it be before the refactoring * Unwrap NamespaceExtractionCacheManager.scheduledExecutorService from unneeded MoreExecutors.listeningDecorator() and specify that this is ScheduledThreadPoolExecutor, which ensures happens-before between periodic runs of the tasks * More comments on MapDbCacheDisposer.disposed * Replace concat with Long.toString() * Comment on why NamespaceExtractionCacheManager.scheduledExecutorService() returns ScheduledThreadPoolExecutor * Place logging statements in VersionedCache.close() and CacheHandler.close() after actual closing logic, because logging may fail * Make JDBCExtractionNamespaceCacheFactory and StaticMapExtractionNamespaceCacheFactory to try to close newly created VersionedCache if population has failed, as it is done already in URIExtractionNamespaceCacheFactory * Don't close the whole CacheScheduler.Entry, if the cache update task failed * Replace AtomicLong updateCounter and firstRunLatch with Phaser-based UpdateCounter in CacheScheduler.EntryImpl	2016-12-23 18:04:27 -08:00
Gian Merlino	6440ddcbca	Fix #3795 (Java 7 compatibility). (#3796 ) * Fix #3795 (Java 7 compatibility). Also introduce Animal Sniffer checks during build, which would have caught the original problems. * Add Animal Sniffer on caffeine-cache for JDK8.	2016-12-21 10:19:13 -08:00
Nishant	f576a0ff14	Contrib Extension for Ambari Metrics Emitter (#3767 ) * Contrib Extension for Ambari Metrics Emitter extension to enable druid to send metrics to ambari metrics server (https://cwiki.apache.org/confluence/display/AMBARI/Metrics) review comments switch to public repo * review comments * add docs * fix pom version * Add link for doc page in extensions.md * remove unused imports * review comments review comments remove unused dependency review comment	2016-12-19 11:12:47 -08:00
Gian Merlino	dd63f54325	Built-in SQL. (#3682 )	2016-12-16 17:15:59 -08:00
Jihoon Son	5e39578eee	Enable parallel test (#3774 ) * Enable parallel test * Remove unnecessary NotThreadSafe annocation * Randomize the start port when finding available ports * Fix test failure * Change to handle all negatives	2016-12-14 21:05:56 -08:00
Ninglin Du	469ab21091	[Feature] Thrift support for realtime and batch ingestion (#3418 ) * Thrift ingestion plugin 1. thrift binary is platform dependent, use scrooge to generate java files to avoid style check failure 2. stream and hadoop ingesion are both supported, input format can be sequence file and lzo thrift block file. 3. base64 and protocol aware change header * fix conlicts in pom	2016-12-13 10:05:15 -08:00
Gleb Smirnov	07384d6f40	Update Apache curator to a non-leaky version (see CURATOR-354) (#3769 )	2016-12-12 09:52:40 -08:00
Akash Dwivedi	6386e6a4dc	root and java-util pom cleanup (#3764 ) * Remove bytebuffer-collections dependency from the root pom and java-util pom cleanup. * Remove json-smart exclusion from root pom	2016-12-08 11:30:19 -08:00
Gian Merlino	943982b7b0	Configurable HTTP compression. (#3759 ) * Configurable HTTP compression. * Call real-time nodes real-time processes in docs.	2016-12-07 17:40:39 -08:00
Roman Leventov	949e65165c	Bitset iteration optimization and improve safety (#3753 ) * Deduplicate looking for bitset.nextSetBit() in BitSetIterator.next() and hasNext() * Add BitmapIterationTest * More elaborate comment on why Roaring is not tested in BitmapIterationTest	2016-12-07 15:49:16 -08:00
Navis Ryu	c74d267f50	Support virtual column for select query (#2511 ) * Support virtual column for select query * Addressed comments	2016-12-05 15:14:35 -08:00
Erik Dubbelboer	7d36f540e8	WIP: Add Google Storage support (#2458 ) Also excludes the correct artifacts from #2741	2016-11-16 14:06:45 +05:30
Keuntae Park	094f5b851b	Support Min/Max for Timestamp (#3299 ) * Min/Max aggregator for Timestamp * remove unused imports and method * rebase and zip the test data * add docs	2016-11-14 23:00:21 -08:00
Roman Leventov	fbbb55f867	Update emitter dependency to 0.4.0 and emit "version" dimension for all druid metrics (#3679 ) * Update emitter dependency to 0.4.0 and emit "version" dimension for all druid metrics, not only query metrics * Remove unused imports * Use empty string instead of "testing-version" as a version placeholder	2016-11-11 17:17:27 -06:00
Akash Dwivedi	3e408497b3	Migrating bytebuffercollections from Metamarkets. (#3647 ) * Migrating bytebuffercollections from Metamarkets. * resolving code conflicts and removing <p> from bytebuffer-collections.	2016-11-11 10:51:07 -08:00
Gian Merlino	657e4512d2	Checkstyle checks for AvoidStaticImport, UnusedImports. (#3660 ) Excludes tests from AvoidStaticImport, since those are used often there and I didn't want to make this changeset too large. Production code use was minimal and I switched those to non-static imports.	2016-11-05 11:34:36 -07:00
Akash Dwivedi	4b3bd8bd63	Migrating java-util from Metamarkets. (#3585 ) * Migrating java-util from Metamarkets. * checkstyle and updated license on java-util files. * Removed unused imports from whole project. * cherry pick metamx/java-util@826021f. * Copyright changes on java-util pom, address review comments.	2016-10-21 14:57:07 -07:00
Roman Leventov	5dc95389f7	Add Checkstyle framework (#3551 ) * Add Checkstyle framework * Avoid star import * Need braces for control flow statements * Redundant imports * Add NewLineAtEndOfFile check	2016-10-13 13:37:47 -07:00
Roman Leventov	85ac8eff90	Improve performance of IndexMergerV9 (#3440 ) * Improve performance of StringDimensionMergerV9 and StringDimensionMergerLegacy by avoiding primitive int boxing by using IntIterator in IndexedInts instead of Iterator<Integer>; Extract some common logic for V9 and Legacy mergers; Minor improvements to resource handling in StringDimensionMergerV9 * Don't mask index in MergeIntIterator.makeQueueElement() * DRY conversion RoaringBitmap's IntIterator to fastutil's IntIterator * Do implement skip(n) in IntIterators extending AbstractIntIterator because original implementation is not reliable * Use Test(expected=Exception.class) instead of try { } catch (Exception e) { /* ignore */ }	2016-10-13 08:28:46 -07:00
Gian Merlino	40f2fe7893	Bump versions to 0.9.3-SNAPSHOT (#3524 )	2016-09-29 13:53:32 -07:00
John Zhang	78b06a7d7e	make global http client worker threads configurable (#3514 )	2016-09-28 23:18:51 -07:00
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
Gian Merlino	76fcbd8fc5	Update Curator, ZK to latest stable versions. (#3461 )	2016-09-16 09:16:14 -07:00
Gian Merlino	2613e68477	Update java-util to 0.27.10. (#3337 )	2016-08-09 13:37:30 +05:30
Navis Ryu	5b3f0ccb1f	Support variance and standard deviation (#2525 ) * Support variance and standard deviation * addressed comments	2016-08-04 17:32:58 -07:00
Keuntae Park	95a58097e2	Hadoop InputRowParser for Orc file (#3019 ) * InputRowParser to decode OrcStruct from OrcNewInputFormat * add unit test for orc hadoop indexing * update docs and fix test code bug * doc updated * resove maven dependency conflict * remove unused imports * fix returning array type from Object[] to correct primitive array type * fix to support getDimension() of MapBasedRow : changing return type of orc list from array to list * rebase and updated based on comments * updated based on comments * on reflecting review comments * fix bug in typeStringFromParseSpec() and add unit test * add license header	2016-07-26 09:42:56 -07:00
Nishant	47894c4eff	add comment for default hadoop coordinates (#3257 ) 1) Modify CliHadoopIndexer to share constant from `TaskConfig.DEFAULT_DEFAULT_HADOOP_COORDINATES` 2) add comment to pom.xml as discussed in https://github.com/druid-io/druid/pull/3044 fix name	2016-07-18 15:23:11 -07:00
Gian Merlino	13d8d96bc6	Update to guice-4.1.0. (#3222 )	2016-07-18 08:08:43 -07:00
Charles Allen	3f1681c16c	Caffeine cache extension (#3028 ) * Initial commit of caffeine cache * Address code comments * Move and fixup README.md a bit * Improve caffeine readme information * Cleanup caffeine pom * Address review comments * Bump caffeine to 2.3.1 * Bump druid version to 0.9.2-SNAPSHOT * Make test not fail randomly. See https://github.com/ben-manes/caffeine/pull/93#issuecomment-227617998 for an explanation * Fix distribution and documentation * Add caffeine to extensions.md * Fix links in extensions.md * Lexicographic	2016-07-06 15:42:54 -07:00
Gian Merlino	ebf890fe79	Update master version to 0.9.2-SNAPSHOT. (#3133 )	2016-06-13 13:10:38 -07:00
Charles Allen	aa2982ee31	Update bytebuffer-collections to 0.2.5 (#3117 )	2016-06-13 08:41:20 -07:00
Fangjin Yang	53886a677c	include avro in the druid tarball (#3123 )	2016-06-13 16:58:21 +05:30
David Lim	6d38dde2f8	exclude slf4j-log4j12 (#3075 )	2016-06-03 11:39:23 -07:00
Charles Allen	8024b915e2	[QTL] Implement LookupExtractorFactory of namespaced lookup (#2926 ) * support LookupReferencesManager registration of namespaced lookup and eliminate static configurations for lookup from namespecd lookup extensions - druid-namespace-lookup and druid-kafka-extraction-namespace are modified - However, druid-namespace-lookup still has configuration about ON/OFF HEAP cache manager selection, which is not namespace wide configuration but node wide configuration as multiple namespace shares the same cache manager * update KafkaExtractionNamespaceTest to reflect argument signature changes * Add more synchronization functionality to NamespaceLookupExtractorFactory * Remove old way of using extraction namespaces * resolve compile error by supporting LookupIntrospectHandler * Remove kafka lookups * Remove unused stuff * Fix start and stop behavior to be consistent with new javadocs * Remove unused strings * Add timeout option * Address comments on configurations and improve docs * Add more options and update hash key and replaces * Move monitoring to the overriding classes * Add better start/stop logging * Remove old docs about namespace names * Fix bad comma * Add `@JsonIgnore` to lookup factory * Address code review comments * Remove ExtractionNamespace from module json registration * Fix problems with naming and initialization. Add tests * Optimize imports / reformat * Fix future not being properly cancelled on failed initial scheduling * Fix delete returns * Add more docs about whole introspection * Add `/version` introspection point for lookups * Add more tests and address comments * Add StaticMap extraction namespace for testing. Also add a bunch of tests * Move cache system property to `druid.lookup.namespace.cache.type` * Make VERSION lower case * Change poll period to 0ms for StaticMap * Move cache key to bytebuffer * Change hashCode and equals on static map extraction fn * Add more comments on StaticMap * Address comments * Make scheduleAndWait use a latch * Sanity renames and fix imports * Remove extra info in docs * Fix review comments * Strengthen failure on start from warn to error * Address comments * Rename namespace-lookup to lookups-cached-global * Fix injective mis-naming * Also add serde test	2016-05-24 10:56:40 -07:00
Xavier Léauté	e79284da59	new interval based cost function (#2972 ) * new interval based cost function Addresses issues with balancing of segments in the existing cost function - `gapPenalty` led to clusters of segments ~30 days apart - `recencyPenalty` caused imbalance among recent segments - size-based cost could be skewed by compression New cost function is purely based on segment intervals: - assumes each time-slice of a partition is a constant cost - cost is additive, i.e. cost(A, B union C) = cost(A, B) + cost(A, C) - cost decays exponentially based on distance between time-slices * comments and formatting * add more comments to explain the calculation	2016-05-17 09:56:00 -07:00
michaelschiff	2203a812bc	statsd-emitter (#2410 )	2016-04-28 18:41:02 -07:00
Xavier Léauté	fc91120b54	Merge pull request #2857 from metamx/upgrade-zk upgrade zookeeper client dependency to 3.4.8	2016-04-20 10:36:07 +05:30
Xavier Léauté	838768c632	upgrade curator, fixes #2829 (#2849 )	2016-04-18 13:17:36 -07:00
Himanshu Gupta	308211cc18	math expression language with parser/lexer generated using ANTLR	2016-04-08 11:40:29 -05:00
DuNinglin [杜宁林]	0f67ff7dfb	reoganize code folder according to recent upstream folder changes, seperate it from avro code and take it into extensions-conrib. docs rewite too	2016-03-30 11:21:41 +08:00
Fangjin Yang	62c1dc7a09	Merge pull request #2602 from binlijin/distinctcount implement special distinctcount	2016-03-28 17:20:17 -07:00
Gian Merlino	977e867ad8	Downgrade geoip2, exclude com.google.http-client. Reverts "Update com.maxmind.geoip2 to 2.6.0" and exclude the google http client from com.maxmind.geoip2. This should satisfy the original need from #2646 (wanting to run Druid along with an upgraded com.google.http-client) while preventing Jackson conflicts pointed out in #2717. Fixes #2717. This reverts commit `21b7572533`.	2016-03-25 14:43:22 -07:00
Gian Merlino	7e7a886f65	Move druid-api into the druid repo. This is from druid-api-0.3.17, as of commit 51884f1d05d5512cacaf62cedfbb28c6ab2535cf in the druid-api repo.	2016-03-24 11:04:34 -07:00
binlijin	2729efca71	implement special distinctcount	2016-03-24 11:11:11 +08:00
jon-wei	a59c9ee1b1	Support use of DimensionSchema class in DimensionsSpec	2016-03-21 13:12:04 -07:00
Gian Merlino	738dcd8cd9	Update version to 0.9.1-SNAPSHOT. Fixes #2462	2016-03-17 10:34:20 -07:00
Nishant	773d6fe86c	Merge pull request #2646 from atomx/update-maxmind Update com.maxmind.geoip2 to 2.6.0	2016-03-14 11:20:48 -07:00
Erik Dubbelboer	21b7572533	Update com.maxmind.geoip2 to 2.6.0 com.maxmind.geoip2 2.6.0 depends on com.google.http-client 1.15.0-rc (3 years old). When trying to include other libraries in Druid that require an up to date version of com.google.http-client this causes a problem.	2016-03-12 09:44:00 +00:00
Gian Merlino	f22fb2c2cf	KafkaIndexTask. Reads a specific offset range from specific partitions, and can use dataSource metadata transactions to guarantee exactly-once ingestion. Each task has a finite lifecycle, so it is expected that some process will be supervising existing tasks and creating new ones when needed.	2016-03-10 18:41:43 -08:00
Nishant	ba1185963b	Fix a bunch of dependencies * Eliminate exclusion groups from pull-deps * Only consider dependency nodes in pull-deps if they are not in the following scopes * provided * test * system * Fix a bunch of `<scope>provided</scope>` missing tags * Better exclusions for a couple of problematic libs	2016-03-10 10:18:08 -08:00
fjy	e3e932a4d4	refactor extensions into core and contrib	2016-03-08 17:12:09 -08:00
Gian Merlino	004028b887	Make first few allocatePendingSegment retries quiet. Some light retrying can happen during normal operation (SELECT -> INSERT races) and the ensuing log messages would be scary for users.	2016-03-02 13:40:29 -08:00
Fangjin Yang	3a9fe2aad0	Merge pull request #2231 from lizhanhui/pull_request Add druid-rocketmq module	2016-02-25 17:19:57 -08:00
Bingkun Guo	9e4c908922	generate tarball by mvn package	2016-02-18 16:42:41 -06:00
Slim Bouguerra	4e119b7a24	Adding lookup ref manager and lookup dimension spec impl	2016-02-11 12:11:51 -06:00
Charles Allen	3a26b3926c	Identify druid.io as committer in pom.xml	2016-02-02 17:01:58 -08:00
Xavier Léauté	e3d1e07b34	Merge pull request #2261 from metamx/improve-segment-ordering Prioritize loading of segments based on segment interval	2016-01-27 10:05:54 -08:00
Nishant	fd6bf3fe22	Use interval comparator instead of bucketMonthComparator fix when two segments have same interval review comments	2016-01-27 17:35:43 +05:30
Charles Allen	937ae6ad20	Update druid-api to 0.3.16 Fixes https://github.com/druid-io/druid/issues/2316	2016-01-22 14:37:16 -08:00
Slim Bouguerra	e0d90f875c	Graphite emitter	2016-01-21 13:43:37 -06:00
Fangjin Yang	1b162a67ff	Merge pull request #2235 from druid-io/updateCommonsIO Update commons-io to 2.4	2016-01-10 08:48:25 -08:00
pdeva	62aa8fec94	Updated log4j version	2016-01-09 10:45:40 -08:00
Charles Allen	c1abcc3ef9	Update commons-io to 2.4 Hadoop2.3.0 uses version 2.4 as per http://central.maven.org/maven2/org/apache/hadoop/hadoop-project/2.3.0/hadoop-project-2.3.0.pom	2016-01-08 21:39:50 -08:00
Li Zhanhui	8eb332c1c4	Add druid-rocketmq module	2016-01-08 08:13:04 +08:00
Charles Allen	b7b4d9f284	Update bytebuffer-collections to 0.2.4 Pulls in fix for https://github.com/RoaringBitmap/RoaringBitmap/issues/61	2016-01-07 10:21:49 -08:00
Charles Allen	3c4bdb7cc8	Manually update <tag> from <scm> in pom.xml	2016-01-05 14:42:25 -08:00
Gian Merlino	b93feb5e77	Update java-util, fixes #2193	2016-01-05 11:16:03 -05:00
Zhao Weinan	5e57ddb8cc	Adding avro support to realtime & hadoop batch indexing.	2016-01-05 10:21:27 +08:00
Charles Allen	2097669cce	Update bytebuffer-collections to 0.2.3 * Fixes https://github.com/druid-io/druid/issues/2175	2016-01-04 11:20:45 -08:00
Gian Merlino	891d639188	Remove unused kafka-seven extension.	2015-12-29 12:05:27 -05:00
fjy	398a3ec620	add docs for more specs	2015-12-17 18:06:30 -08:00
jon-wei	c53bf85d83	Add docs and benchmark for JSON flattening parser	2015-12-09 16:13:30 -08:00
Gian Merlino	f6f7bec2b6	Update java-util.	2015-12-08 15:32:27 -08:00
Himanshu	5f2466afd1	Merge pull request #2045 from metamx/updateEmitter036 Update mmx emitter to 0.3.6	2015-12-05 23:20:17 -06:00
Charles Allen	ea5fdc30f8	Update mmx emitter to 0.3.6 * 0.3.5 updated better logging messages * 0.3.6 updates validator dependency to help prevent stale validator jars from being pulled in	2015-12-04 12:50:22 -08:00
Gian Merlino	fde4753e25	Disable javadoc linting.	2015-12-03 19:11:29 -08:00
Himanshu Gupta	9c569be11e	adding datasketches module to top level pom	2015-11-12 00:04:33 -06:00
Xavier Léauté	a57cbfd2c3	Merge pull request #1387 from metamx/enableShutdownLogging Add special handler to allow logger messages during shutdown	2015-11-09 17:20:09 -08:00
Xavier Léauté	c896818241	Update curator to 2.9.1 Lots of bugfixes since 2.8.0 - https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12314425&version=12333324 - https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12314425&version=12332392	2015-11-05 15:53:01 -08:00
Lou Marvin Caraig	c924f9fe56	Added cloudfiles-extensions in order to support Rackspace's cloudfiles as deep storage	2015-11-04 17:44:48 +01:00
Nishant	dcd4468156	update emitter version contains changes - - https://github.com/metamx/emitter/pull/9 - https://github.com/metamx/emitter/pull/13 - https://github.com/metamx/emitter/pull/12 - https://github.com/metamx/emitter/pull/10	2015-10-29 17:43:03 +05:30
Nishant	20a3ebc022	update server metrics version - fixes Sigar loading for JvmCpuMetrics https://github.com/metamx/server-metrics/pull/16 update server metrics	2015-10-29 17:37:45 +05:30
Gian Merlino	7df7370935	Merge pull request #1862 from metamx/indexingServiceMMGone Add timeout to shutdown request to middle manager for indexing service	2015-10-27 14:38:01 -07:00
Charles Allen	7a2ceef690	Add special handler to allow logger messages during shutdown * Adds a special PropertyChecker interface which is ONLY for setting string properties at the very start of psvm	2015-10-27 14:33:36 -07:00
Charles Allen	44a2b204df	Add timeout to shutdown request to middle manager for indexing service	2015-10-27 13:56:03 -07:00
Bingkun Guo	4914925d65	New extension loading mechanism 1) Remove maven client from downloading extensions at runtime. 2) Provide a way to load Druid extensions and hadoop dependencies through file system. 3) Refactor pull-deps so that it can download extensions into extension directories. 4) Add documents on how to use this new extension loading mechanism. 5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0 are packaged within the Druid tarball.	2015-10-21 14:22:36 -05:00
Xavier Léauté	e4ac78e43d	bump next snapshot to 0.9.0	2015-10-20 13:46:13 -07:00
Xavier Léauté	4c2c7a2c37	update version to 0.8.3	2015-10-14 21:40:55 -07:00
Xavier Léauté	5a98d4e650	update coveralls plugin	2015-10-14 10:25:23 -07:00
Charles Allen	e9b81430f4	Bump server-metrics to 0.2.5 to catch a few fixes.	2015-10-08 11:05:51 -07:00
Nishant	38664904e2	Revert Jetty version update to 9.2.10, latest version that is working revert to jetty 9.2.5, last known good version	2015-10-08 21:53:13 +05:30
Nishant	42e971d1c1	Merge pull request #1797 from himanshug/fix_ingest_segment_firehose_ut ingest segment firehose ut	2015-10-02 21:57:22 +05:30
Himanshu Gupta	e2b16ab281	update java-util dep version	2015-10-01 16:06:04 -05:00
Charles Allen	bdae0cb135	Update httpcomponents and aws-sdk	2015-10-01 13:28:46 -07:00
Himanshu	63a3a4a254	Merge pull request #1763 from metamx/server-metrics-fixes fix NPE and duplicate metric keys	2015-09-22 09:39:01 -05:00
Xavier Léauté	0fe9aeb3d6	fix NPE and duplicate metric keys	2015-09-21 22:50:49 -07:00
Charles Allen	045f72505c	Merge pull request #1759 from metamx/update-roaring better faster smaller roaring bitmaps	2015-09-21 18:50:19 -07:00
Xavier Léauté	df6988bbd2	better faster smaller roaring bitmaps	2015-09-21 17:00:57 -07:00
Xavier Léauté	af86c0e6ea	update druid-api + java-util for timstamp parsing speedup	2015-09-21 09:57:29 -07:00
Himanshu Gupta	ebdb612933	composing emitter module to use multiple emitters together	2015-09-09 16:45:50 -05:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
Xavier Léauté	25853aee9b	update druid-api for jackson 2.4.6	2015-08-26 19:23:37 -07:00
Gian Merlino	2a866f49df	Downgrade Jackson to 2.4.6.	2015-08-26 18:25:55 -07:00
Xavier Léauté	d5143c0807	update java-util to 0.27.2	2015-08-25 16:07:02 -07:00
Xavier Léauté	21aadb8927	update joda to 2.8.2	2015-08-25 16:07:02 -07:00
Xavier Léauté	dc11f907c9	update jdbi to 2.63.1	2015-08-25 16:07:02 -07:00
Xavier Léauté	123f6340d5	update coveralls plugin to 3.2.0	2015-08-25 16:07:02 -07:00
Xavier Léauté	3a83a0fe40	update irc library to 1.0-0014	2015-08-25 16:07:02 -07:00
Xavier Léauté	2fa87c4bba	update joda-time to 2.8	2015-08-25 16:07:02 -07:00
Xavier Léauté	f681b2022d	update aws-sdk to 1.10.12	2015-08-25 16:07:01 -07:00
Xavier Léauté	8be5fe091f	update slf4j to 1.7.12	2015-08-25 16:07:01 -07:00
Xavier Léauté	172a44b794	update log4j to 2.3	2015-08-25 16:07:01 -07:00
Xavier Léauté	51f6a9a2c9	update jackson to 2.6.1	2015-08-25 16:07:01 -07:00
Xavier Léauté	ac4a856a17	update jetty to 9.2.13.v20150730 (latest Java 7 compatible version)	2015-08-25 16:07:01 -07:00
Xavier Léauté	8b294d4d98	update mapdb to 1.0.8	2015-08-25 16:07:00 -07:00
Xavier Léauté	edaaf528ab	update jets3t to 0.9.4	2015-08-25 16:07:00 -07:00
Xavier Léauté	9a0c15c52c	update airline to 0.7	2015-08-25 16:07:00 -07:00
Xavier Léauté	d601038271	update druid-api to 0.3.11	2015-08-25 16:07:00 -07:00
Fangjin Yang	e21954a70f	Merge pull request #1619 from metamx/update-server-metrics update server metrics	2015-08-25 13:58:28 -07:00
Xavier Léauté	2093187c91	rework tarball distribution: - move assembly out of druid-services into a 'distribution' module - create separate 'extensions-distribution' module and assembly to package extensions and their dependencies into a local maven repository - include this extensions maven repository in the binaries tarball	2015-08-18 18:32:33 -07:00
Xavier Léauté	3b2e41e42a	update for next release	2015-08-18 17:16:46 -07:00
Charles Allen	db19d2d547	Revert "Update to guice 4.0"	2015-08-14 09:26:07 -07:00
Nishant	a6a9886339	update server metrics changes include - 1) https://github.com/metamx/server-metrics/pull/8 - Add custom dimensions for Jvm and Sys monitors 2) https://github.com/metamx/server-metrics/pull/9 - Add net, tcp, uptime and cpu metrics	2015-08-12 22:52:17 +05:30
Charles Allen	c8c8169c69	Bump druid-api to 0.3.10 to include guice 4.0 update	2015-08-10 13:57:55 -07:00
Charles Allen	7e61216287	Update to guice 4.0 - Mark a lot of `@Provides` methods as final since guice 4.0 disallows overriding them	2015-08-10 13:57:18 -07:00
Charles Allen	7d5a77b882	Bumb Jersey to 1.19	2015-08-07 17:32:27 -07:00
Himanshu Gupta	a9ee2a383f	update druid-api to 0.3.9	2015-07-30 16:35:37 -05:00
Charles Allen	86ede702b1	Add namespaced lookups as extensions * Adds kafka, URI, and JDBC namespace defintions * Add ability to explicitly rename using a "namespace" which is a particular data collection that is loaded on all realtime, historic nodes, and brokers. If any of these nodes has the namespace extension, ALL nodes have the namespace extension. * Add namespace caching and populating (can be on heap or off heap) * Add NamespaceExtractionCacheManager for handling caches * Added ExtractionNamespace for handling metadata on the extraction namespaces * Added ExtractionNamespaceUpdate for handling metadata related to updates * Add extension which caches renames from a kafka stream (requires kafka8) * Added README.md for the namespace kafka extension * Added docs * Added namespace/size, namespace/count, namespace/deltaTasksStarted metrics Add static config for namespaces via `druid.query.extraction.namespace` * This is a rebase of https://github.com/b-slim/druid/tree/static_config_only	2015-07-28 11:14:14 -07:00
Himanshu Gupta	efef20e754	update curator version to 2.8.0 to get fix for https://issues.apache.org/jira/browse/CURATOR-168	2015-07-23 13:49:55 -05:00
Xavier Léauté	4cfb00bc8a	inrement version	2015-07-15 13:09:05 -07:00
Nishant	6fa49b771c	Merge pull request #1453 from metamx/concicebenchmarkComplement Update bytebuffer-collections and add basic ConciseComplementBenchmark for testing concise complements	2015-06-23 08:41:42 +05:30
Charles Allen	88b2ef5d6b	Update bytebuffer-collections	2015-06-22 18:07:24 -07:00

... 4 5 6 7 8 ...

1622 Commits