druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	2ce8123bdb	Move scan-query from a contrib extension into core. (#4751 ) * Move scan-query from a contrib extension into core. Based on a proposal at: https://groups.google.com/d/topic/druid-development/ME_OatUDnbk/discussion This patch also adds support for virtual columns to the Scan query, and updates Druid SQL to use Scan instead of Select. This patch also makes some behavioral changes to handling of the __time column. In particular, it is now is returned as "__time" rather than "timestamp"; it is no longer included if you do not specifically ask for it in your "columns"; and it is returned as a long rather than a string. Users can revert time handling to the legacy extension behavior by setting "legacy" : true in their queries, or setting the property druid.query.scan.legacy = true. This is meant to provide a migration path for users that were formerly using the contrib extension. * Adjustments from review. * Add back Select query. * Adjust SQL docs. * Restore SelectQuery link.	2017-09-13 09:51:24 -07:00
Bartosz Ługowski	8dddccc687	Graphite emitter - add plaintext protocol (#4265 ) * Graphite emitter - add plaintext protocol. Configurable option of replacing slash to dot in metric name. * Graphite emitter - fix misspelling in docs. * Graphite emitter - extend docs. * Graphite emitter - fix code style.	2017-08-29 06:23:06 -07:00
Gian Merlino	9fbfc1be32	Add @ExtensionPoint and @PublicApi annotations. (#4433 ) * Add @ExtensionPoint and @PublicApi annotations. * Clean up wording. * Remove unused import. * Remove unused imports. * Only types can be extension points. * Adjust annotations some more. * Remove unused import. * Make ServletFilterHolder an extension point. * Add a couple extension points, and update docs.	2017-08-28 14:50:58 -07:00
QiuMM	59a48a560a	Redis cache extension doc (#4702 ) * Redis cache extension doc * link redis cache doc in extensions.md	2017-08-24 09:53:51 -05:00
Yuewen Wang	c821bc9a5a	Implement "earlyMessageRejectionPeriod" config discussed in issue #4599 (#4607 ) * Implement "earlyMessageRejectionPeriod" config discussed in issue #4599 * implement the logics of this param * Added doc of this config * Added unit tests of it * Update KafkaSupervisor.java ameliorate comment * fix format * fix bug when rebasing	2017-08-11 09:12:08 +09:00
Peter Cunningham	ede7cf9eef	Added support for where clauses to JDBC lookups. (#4643 ) * Added support for where clauses to filter lookup values on ingestion. Added a filter field to the JDBC lookups that is used to generate a where clause so that only rows matching the filter value will be brought into Druid. Example being filter="SOMECOLUMN=1" * Required changes based on code review. * Required changes based on code review. * Added support for where clauses to filter lookup values on ingestion. Added a filter field to the JDBC lookups that is used to generate a where clause so that only rows matching the filter value will be brought into Druid. Example being filter="SOMECOLUMN=1" * Updates based on code review, mainly formatting and small refactor of the buildLookupQuery method. * Fixed broken buildLookupQuery method * Removed empty line. * Updates per review comments	2017-08-09 10:47:46 -07:00
Parag Jain	6e2f78f552	TLS support (#4270 )	2017-07-06 17:40:12 -07:00
Roman Leventov	2fa4b10145	More fine-grained DI for management node types. Don't allocate processing resources on Router (#4429 ) * Remove DruidProcessingModule, QueryableModule and QueryRunnerFactoryModule from DI for coordinator, overlord, middle-manager. Add RouterDruidProcessing not to allocate processing resources on router * Fix examples * Fixes * Revert Peon configs and add comments * Remove qualifier	2017-06-27 22:58:01 -07:00
Roman Leventov	05d58689ad	Remove the ability to create segments in v8 format (#4420 ) * Remove ability to create segments in v8 format * Fix IndexGeneratorJobTest * Fix parameterized test name in IndexMergerTest * Remove extra legacy merging stuff * Remove legacy serializer builders * Remove ConciseBitmapIndexMergerTest and RoaringBitmapIndexMergerTest	2017-06-26 13:21:39 -07:00
Fokko Driesprong	ff501e8f13	Add Date support to the parquet reader (#4423 ) * Add Date support to the parquet reader Add support for the Date logical type. Currently this is not supported. Since the parquet date is number of days since epoch gets interpreted as seconds since epoch, it will fails on indexing the data because it will not map to the appriopriate bucket. * Cleaned up code and tests Got rid of unused json files in the examples, cleaned up the tests by using try-with-resources. Now get the filenames from the json file instead of hard coding them and integrated general improvements from the feedback provided by leventov. * Got rid of the caching Remove the caching of the logical type of the time dimension column and cleaned up the code a bit.	2017-06-22 15:56:08 -05:00
Yuya Fujiwara	152d4e89ab	Fix typo in the avro.md. (#4370 )	2017-06-06 07:14:08 -07:00
David Lim	13ecf90923	Report Kafka lag information in supervisor status report (#4314 ) * refactor lag reporting and report lag at status endpoint * refactor offset reporting logic to fetch offsets periodically vs. at request time * remove JavaCompatUtils * code review changes * code review changes	2017-06-05 13:26:25 -07:00
Jihoon Son	1150bf7a2c	Refactoring Appenderator Driver (#4292 ) * Refactoring Appenderator 1) Added publishExecutor and handoffExecutor for background publishing and handing segments off 2) Change add() to not move segments out in it * Address comments 1) Remove publishTimeout for KafkaIndexTask 2) Simplifying registerHandoff() 3) Add increamental handoff test * Remove unused variable * Add persist() to Appenderator and more tests for AppenderatorDriver * Remove unused imports * Fix strict build * Address comments	2017-06-02 07:09:11 +09:00
Kenji Noguchi	3400f601db	Protobuf extension (#4039 ) * move ProtoBufInputRowParser from processing module to protobuf extensions * Ported PR #3509 * add DynamicMessage * fix local test stuff that slipped in * add license header * removed redundant type name * removed commented code * fix code style * rename ProtoBuf -> Protobuf * pom.xml: shade protobuf classes, handle .desc resource file as binary file * clean up error messages * pick first message type from descriptor if not specified * fix protoMessageType null check. add test case * move protobuf-extension from contrib to core * document: add new configuration keys, and descriptions * update document. add examples * move protobuf-extension from contrib to core (2nd try) * touch * include protobuf extensions in the distribution * fix whitespace * include protobuf example in the distribution * example: create new pb obj everytime * document: use properly quoted json * fix whitespace * bump parent version to 0.10.1-SNAPSHOT * ignore Override check * touch	2017-05-30 13:11:58 -07:00
Jihoon Son	11b7b1bea6	Add support for HttpFirehose (#4297 ) * Add support for HttpFirehose * Fix document * Add documents	2017-05-25 16:13:04 -05:00
Jihoon Son	733dfc9b30	Add PrefetchableTextFilesFirehoseFactory for cloud storage types (#4193 ) * Add PrefetcheableTextFilesFirehoseFactory * fix comment * exception handling * Fix wrong json property * Remove ReplayableFirehoseFactory and fix misspelling * Defer object initialization * Add a temporaryDirectory parameter to FirehoseFactory.connect() * fix when cache and fetch are disabled * Address comments * Add more test * Increase timeout for test * Add wrapObjectStream * Move methods to Firehose from PrefetchableFirehoseFactory * Cleanup comment * add directory listing to s3 firehose * Rename a variable * Addressing comments * Update document * Support disabling prefetch * Fix race condition * Add fetchLock * Remove ReplayableFirehoseFactoryTest * Fix compilation error * Fix test failure * Address comments * Add default implementation for new method	2017-05-18 15:37:18 +09:00
David Lim	8333043b7b	add skipOffsetGaps flag (#4256 )	2017-05-16 12:19:28 -06:00
Jihoon Son	50a4ec2b0b	Add support for headers and skipping thereof for CSV and TSV (#4254 ) * initial commit * small fixes * fix bug * fix bug * address code review * more cr * more cr * more cr * fix * Skip head rows for CSV and TSV * Move checking skipHeadRows to FileIteratingFirehose * Remove checking null iterators * Remove unused imports * Address comments * Fix compilation error * Address comments * Add more tests * Add a comment to ReplayableFirehose * Addressing comments * Add docs and fix typos	2017-05-15 22:57:31 -07:00
hzy001	0c464f4a84	Fix docs (#4225 ) * Fix one typo Signed-off-by: Hao Ziyu <haoziyu@qiyi.com> * Fix deprecated links Signed-off-by: Hao Ziyu <haoziyu@qiyi.com>	2017-05-01 09:55:43 -07:00
Gian Merlino	631068b099	Fix broken DataSketches link. (#4221 ) * Fix broken DataSketches link. * Better fixed link.	2017-04-27 17:37:12 -07:00
satishbhor	d51097c809	Fix lz4 library incompatibility in kafka-indexing-service extension (#4115 ) * Fix lz4 library incompatibility in kafka-indexing-service extension #3266 * Bumped Kafka version to 0.10.2.0 for : Fix lz4 library incompatibility in kafka-indexing-service extension #3266 * Replaced Lists.newArrayList() with Collections.singletonList() For Fix lz4 library incompatibility in kafka-indexing-service extension #4115	2017-04-25 12:23:51 +09:00
Dongkyu Hwangbo	0d2e91ed50	Adding Kafka-emitter (#3860 ) * Initial commit * Apply another config: clustername * Rename variable * Fix bug * Add retry logic * Edit retry logic * Upgrade kafka-clients version to the most recent release * Make callback single object * Write documentation * Rewrite error message and emit logic * Handling AlertEvent * Override toString() * make clusterName more optional * bump up druid version * add producer.config option which make user can apply another optional config value of kafka producer * remove potential blocking in emit() * using MemoryBoundLinkedBlockingQueue * Fixing coding convention * Remove logging every exception and just increment counting * refactoring * trivial modification * logging when callback has exception * Replace kafka-clients 0.10.1.1 with 0.10.2.0 * Resolve the problem related of classloader * adopt try statement * code reformatting * make variables final * rewrite toString	2017-04-04 14:07:43 -07:00
Fokko Driesprong	add17fa7db	Remove the metadataUpdateSpec from specfile (#3973 ) Get rid of the metadataUpdateSpec section in the json example to ingest parquet into druid. When this element is present, it will fail start an indexing job.	2017-03-01 14:24:36 -08:00
Akash Dwivedi	94da5e80f9	Namespace optimization for hdfs data segments. (#3877 ) * NN optimization for hdfs data segments. * HdfsDataSegmentKiller, HdfsDataSegment finder changes to use new storage format.Docs update. * Common utility function in DataSegmentPusherUtil. * new static method `makeSegmentOutputPathUptoVersionForHdfs` in JobHelper * reuse getHdfsStorageDirUptoVersion in DataSegmentPusherUtil.getHdfsStorageDir() * Addressed comments. * Review comments. * HdfsDataSegmentKiller requested changes. * extra newline * Add maprfs.	2017-03-01 09:51:20 -08:00
michaelschiff	e5fb0e1ff5	New property for each metric that tells the StatsDEmitter to convert metric values from range 0-1 to 0-100. This (#3936 ) prevents rates and percentages expressed as Doubles (0.xx) from being rounded down to 0.	2017-02-16 13:55:56 -08:00
Himanshu	9dfcf0763a	disable javascript execution by default (#3818 )	2017-02-13 15:11:18 -08:00
Erik Dubbelboer	2aa2fa57b5	Simple doc fix (#3907 )	2017-02-06 15:52:17 +05:30
Nishant Bangarwa	a457cded28	Druid Extension to enable Authentication using Kerberos. (#3853 ) * Add extension for supporting kerberos security - This PR adds an extension for supporting druid authentication via Kerberos. - Working on the docs. * Add docs * review comments * more review comments * Block all paths by default * more review comments - use proper Oid * Allow extensions to override httpclient for integration tests * Add kerberos lock to prevent multithreaded issues. * review comment - remove enabled flag and fix router injection * Add Cookie Handling and more detailed docs * review comment - rename DruidKerberosConfig -> AuthKerberosConfig * review comments * fix travis failure on jdk7	2017-02-02 14:55:21 -06:00
kaijianding	33ae9dd485	streaming version of select query (#3307 ) * streaming version of select query * use columns instead of dimensions and metrics;prepare for valueVector;remove granularity * respect query limit within historical * use constant * fix thread name corrupted bug when using jetty qtp thread rather than processing thread while working with SpecificSegmentQueryRunner * add some test for scan query * add scan query document * fix merge conflicts * add compactedList resultFormat, this format is better for json ser/der * respect query timeout * respect query limit on broker * use static consts and remove unused code	2017-01-19 16:09:53 -06:00
Nishant	f576a0ff14	Contrib Extension for Ambari Metrics Emitter (#3767 ) * Contrib Extension for Ambari Metrics Emitter extension to enable druid to send metrics to ambari metrics server (https://cwiki.apache.org/confluence/display/AMBARI/Metrics) review comments switch to public repo * review comments * add docs * fix pom version * Add link for doc page in extensions.md * remove unused imports * review comments review comments remove unused dependency review comment	2016-12-19 11:12:47 -08:00
David Lim	8eee259629	add documentation on segments generated (#3785 )	2016-12-19 09:41:47 -08:00
Ninglin Du	469ab21091	[Feature] Thrift support for realtime and batch ingestion (#3418 ) * Thrift ingestion plugin 1. thrift binary is platform dependent, use scrooge to generate java files to avoid style check failure 2. stream and hadoop ingesion are both supported, input format can be sequence file and lzo thrift block file. 3. base64 and protocol aware change header * fix conlicts in pom	2016-12-13 10:05:15 -08:00
Erik Dubbelboer	9f7050e221	Fix some grammar and spelling mistakes (#3717 )	2016-11-28 11:49:30 -08:00
Himanshu	7d37f675ba	fix the documented property name for specifying avro reader schema (#3708 )	2016-11-22 15:02:41 -08:00
Parag Jain	7ee6bb7410	option to reset offest automatically in case of OffsetOutOfRangeException (#3678 ) * option to reset offset automatically in case of OffsetOutOfRangeException if the next offset is less than the earliest available offset for that partition * review comments * refactoring * refactor * review comments	2016-11-21 16:29:46 -06:00
Erik Dubbelboer	7d36f540e8	WIP: Add Google Storage support (#2458 ) Also excludes the correct artifacts from #2741	2016-11-16 14:06:45 +05:30
Keuntae Park	094f5b851b	Support Min/Max for Timestamp (#3299 ) * Min/Max aggregator for Timestamp * remove unused imports and method * rebase and zip the test data * add docs	2016-11-14 23:00:21 -08:00
Gian Merlino	bcd20441be	Make buildV9Directly the default. (#3688 )	2016-11-14 09:29:32 -08:00
Mark	575aeb843a	Metadata Storage extension for Microsoft SqlServer (sqlserver-metadata-storage) (#3421 )	2016-11-08 14:56:52 -08:00
Nicolas Colomer	37ecffb648	Add support for Confluent Schema Registry in the avro extension (#3529 )	2016-11-08 16:10:45 -06:00
cheddar	c49a9d5693	Call out semver expectations for modules (#3659 ) * Call out semver expectations for modules * Update modules.md * Link to versioning	2016-11-04 12:52:05 -07:00
Gian Merlino	4203580290	URIExtractionNamespace: Treat null values in lookup maps as missing entries. (#3512 ) * URIExtractionNamespace: Treat null values in lookup maps as missing entries. This is useful when many logical lookups are derived from the same base JSON file, and some lookups' values may be unknown sometimes. * Add test, logging message, and address other comments. * Update docs.	2016-11-03 13:53:04 -07:00
David Lim	9226d4af3c	configurable shutdownTimeout for Kakfa supervisor (#3497 ) * configurable shutdownTimeout * cr change	2016-09-23 13:26:45 -06:00
David Lim	ca9114b41b	add supervisor reset API (#3484 ) * add supervisor reset API * CR doc changes and kill running tasks / clear offsets from supervisor	2016-09-22 17:51:06 -07:00
Gian Merlino	27bd5cb13a	Add forceExtendableShardSpecs option to Hadoop indexing, IndexTask. (#3473 ) Fixes #3241.	2016-09-21 13:40:04 -06:00
David Lim	96fcca18ea	update KafkaSupervisor to make HTTP requests to tasks in parallel where possible (#3452 )	2016-09-20 22:51:15 +05:30
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
Gian Merlino	e0e28866ee	JavaScript docs: Fix links and typos, add to TOC. (#3457 )	2016-09-13 15:26:44 -07:00
Himanshu	a069257d37	avro-extension -- feature to specify multiple avro reader schemas inline (#3368 ) * rename SimpleAvroBytesDecoder to InlineSchemaAvroBytesDecoder * feature to specify multiple schemas inline in avro module	2016-09-13 14:54:31 -07:00
Gian Merlino	76a24054e3	JavaScript docs, including docs for globals. (#3454 )	2016-09-13 13:46:55 -07:00
Slim	ba6ddf307e	Adding hadoop kerberos authentification. (#3419 ) * adding kerberos authentication * make the 2 functions identical	2016-09-13 10:42:50 -07:00
David Lim	3a97fd4d6c	doc fix (#3430 )	2016-09-06 13:13:30 -06:00
Stéphane Derosiaux	48dce88aab	Add flag binaryAsString for parquet ingestion (#3381 )	2016-08-30 17:30:50 -07:00
Dave Li	c4e8440c22	Adds long compression methods (#3148 ) * add read * update deprecated guava calls * add write and vsizeserde * add benchmark * separate encoding and compression * add header and reformat * update doc * address PR comment * fix buffer order * generate benchmark files * separate encoding strategy and format * fix benchmark * modify supplier write to channel * add float NONE handling * address PR comment * address PR comment 2	2016-08-30 16:17:46 -07:00
Fangjin Yang	edb0eca3a9	fix docs (#3370 )	2016-08-16 16:25:50 -07:00
Fangjin Yang	6beb8ac342	fix some docs and add new content (#3369 )	2016-08-16 15:00:18 -07:00
Himanshu	46da682231	avro-extensions -- feature to specify avro reader schema inline in the task json for all events (#3249 )	2016-08-10 10:49:26 -07:00
Jonathan Wei	decefb7477	Add time interval dim filter and retention analysis example (#3315 ) * Add time interval dim filter and retention analysis example * Use closed-open matching for intervals, update cache key generation * Fix time filtering tests for interval boundary change	2016-08-05 07:25:04 -07:00
Navis Ryu	5b3f0ccb1f	Support variance and standard deviation (#2525 ) * Support variance and standard deviation * addressed comments	2016-08-04 17:32:58 -07:00
Fangjin Yang	d51ec398d4	fix parquet docs (#3304 )	2016-08-01 07:54:48 -07:00
Keuntae Park	95a58097e2	Hadoop InputRowParser for Orc file (#3019 ) * InputRowParser to decode OrcStruct from OrcNewInputFormat * add unit test for orc hadoop indexing * update docs and fix test code bug * doc updated * resove maven dependency conflict * remove unused imports * fix returning array type from Object[] to correct primitive array type * fix to support getDimension() of MapBasedRow : changing return type of orc list from array to list * rebase and updated based on comments * updated based on comments * on reflecting review comments * fix bug in typeStringFromParseSpec() and add unit test * add license header	2016-07-26 09:42:56 -07:00
Gian Merlino	ea03906fcf	Configurable compressRunOnSerialization for Roaring bitmaps. (#3228 ) Defaults to true, which is a change in behavior (this used to be false and unconfigurable).	2016-07-08 10:24:19 +05:30
Charles Allen	3f1681c16c	Caffeine cache extension (#3028 ) * Initial commit of caffeine cache * Address code comments * Move and fixup README.md a bit * Improve caffeine readme information * Cleanup caffeine pom * Address review comments * Bump caffeine to 2.3.1 * Bump druid version to 0.9.2-SNAPSHOT * Make test not fail randomly. See https://github.com/ben-manes/caffeine/pull/93#issuecomment-227617998 for an explanation * Fix distribution and documentation * Add caffeine to extensions.md * Fix links in extensions.md * Lexicographic	2016-07-06 15:42:54 -07:00
Charles Allen	8b7d9750ee	Update extension docs for global lookup module (#3206 )	2016-06-29 12:51:52 -07:00
David Lim	b24425a280	update docs with new behavior (#3200 )	2016-06-28 16:17:04 -07:00
Gian Merlino	c12712e8b8	Move "libraries.md" out of docs, onto the main site. (#3159 )	2016-06-16 18:14:35 -07:00
michaelschiff	7294ea87c3	link to statsd metrics emitter docs from development/extensions.html doc page (#3125 )	2016-06-10 16:27:16 -07:00
Gian Merlino	99ee3f4dc3	Fixups, clarifications to lookup docs. (#3060 )	2016-06-07 10:43:35 -07:00
Charles Allen	fa41a6466a	Cleanup the base lookup cluster wide config docs (#3061 ) * Cleanup the base lookup cluster wide config docs * Add better examples in lookups-cached-global.md * Use actual valid stock lookups * Fixed maps with : * Add mix of lookups * Better examples in extension * Remove unneeded namespace requirement * Add extra line space * Add link to lookup tiers * Renamed header	2016-06-07 10:42:41 -07:00
Charles Allen	8cac710546	Async lookups-cached-global by default (#3074 ) * Async lookups-cached-global by default * Also better lookup docs * Fix test timeouts * Fix timing of deserialized test * Fix problem with 0 wait failing immediately	2016-06-03 15:58:10 -05:00
David Lim	a2290a8f05	support seamless config changes (#3051 )	2016-06-03 13:50:19 -07:00
Erik Dubbelboer	b4737336e5	Added info about Google Cloud Storage (#3056 )	2016-06-02 10:06:07 -07:00
David Lim	f6c39cc844	Kafka task minimum message time (#3035 ) * add KafkaIndexTask support for minimumMessageTime * add Kafka supervisor support for lateMessageRejectionPeriod	2016-05-31 11:37:00 -07:00
scusjs	ebb6831770	rm , of jobProperties. jackson can not parse it (#3012 )	2016-05-26 09:46:33 -07:00
Charles Allen	245077b47f	Fix formatting in lookups-cached-global.md (#3009 )	2016-05-24 17:28:39 -07:00
Charles Allen	c738c0e1cd	Silly Typo in docs	2016-05-24 13:31:58 -07:00
Charles Allen	8024b915e2	[QTL] Implement LookupExtractorFactory of namespaced lookup (#2926 ) * support LookupReferencesManager registration of namespaced lookup and eliminate static configurations for lookup from namespecd lookup extensions - druid-namespace-lookup and druid-kafka-extraction-namespace are modified - However, druid-namespace-lookup still has configuration about ON/OFF HEAP cache manager selection, which is not namespace wide configuration but node wide configuration as multiple namespace shares the same cache manager * update KafkaExtractionNamespaceTest to reflect argument signature changes * Add more synchronization functionality to NamespaceLookupExtractorFactory * Remove old way of using extraction namespaces * resolve compile error by supporting LookupIntrospectHandler * Remove kafka lookups * Remove unused stuff * Fix start and stop behavior to be consistent with new javadocs * Remove unused strings * Add timeout option * Address comments on configurations and improve docs * Add more options and update hash key and replaces * Move monitoring to the overriding classes * Add better start/stop logging * Remove old docs about namespace names * Fix bad comma * Add `@JsonIgnore` to lookup factory * Address code review comments * Remove ExtractionNamespace from module json registration * Fix problems with naming and initialization. Add tests * Optimize imports / reformat * Fix future not being properly cancelled on failed initial scheduling * Fix delete returns * Add more docs about whole introspection * Add `/version` introspection point for lookups * Add more tests and address comments * Add StaticMap extraction namespace for testing. Also add a bunch of tests * Move cache system property to `druid.lookup.namespace.cache.type` * Make VERSION lower case * Change poll period to 0ms for StaticMap * Move cache key to bytebuffer * Change hashCode and equals on static map extraction fn * Add more comments on StaticMap * Address comments * Make scheduleAndWait use a latch * Sanity renames and fix imports * Remove extra info in docs * Fix review comments * Strengthen failure on start from warn to error * Address comments * Rename namespace-lookup to lookups-cached-global * Fix injective mis-naming * Also add serde test	2016-05-24 10:56:40 -07:00
Nishant	dea4391a49	fix broken links (#3003 )	2016-05-23 06:38:21 -07:00
Fangjin Yang	00de26c76a	fix extensions docs (#2995 ) * fix extensions docs * fix mistakes	2016-05-19 14:01:06 -07:00
Slim	45b2e65d75	[QTL] adding listDelimiter to lookup parser spec (#2941 ) * adding listDelimiter to lookup parser spec * cleaning code	2016-05-10 15:41:16 +05:30
David Lim	b489f63698	Supervisor for KafkaIndexTask (#2656 ) * supervisor for kafka indexing tasks * cr changes	2016-05-04 23:13:13 -07:00
Gian Merlino	e680665f1c	Fix Avro parseSpec example, "type" should be "format". (#2918 )	2016-05-03 09:22:43 -07:00
Charles Allen	6b957aa072	[QTL] Make URI Exctraction Namespace take more sane arguments (#2738 ) * Make URI Exctraction Namespace take more sane arguments * Fixes https://github.com/druid-io/druid/issues/2669 * Update docs * Rename error message * Undo overzealous deletion of docs * Explain caching mechanism a bit more in docs	2016-05-02 12:54:34 -07:00
Charles Allen	54b717bdc3	[QTL] Move kafka-extraction-namespace to the Lookup framework. (#2800 ) * Move kafka-extraction-namespace to the Lookup framework. * Address comments * Fix missing kafka introspection * Fix tests to be less racy * Make testing a bit more leniant * Make tests even more forgiving * Add comments to kafka lookup cache method * Move startStopLock to just use started * Make start() and stop() idempotent * Forgot to update test after last change, test now accounts for idempotency * Add extra idempotency on stop check * Add more descriptive docs of behavior	2016-05-02 09:45:13 -07:00
michaelschiff	2203a812bc	statsd-emitter (#2410 )	2016-04-28 18:41:02 -07:00
Slim	58510d826b	fix emit wait time (#2869 )	2016-04-26 17:07:03 -07:00
Gaurav Kumar	f5822faca3	Fixed wrong parseSpec in Avro Hadoop Parser (#2846 ) `parseSpec` should contain `format` instead of `type`. It was wrongly defaulting to `tsv`	2016-04-16 11:34:54 -07:00
Gian Merlino	e320d13385	Fix various broken links in the docs. (#2833 )	2016-04-13 13:30:01 -07:00
Charles Allen	ed5377465a	add AirBnB Caravel to list of libraries (#2719 )	2016-04-12 12:53:50 -07:00
Charles Allen	2b99f717e4	Move lookup config doc to proper location	2016-04-08 08:15:38 -07:00
fjy	14dbc431ef	clean up for extensions docs	2016-03-30 17:14:58 -07:00
Fangjin Yang	a8b28879f1	Merge pull request #2369 from du00cs/master [Feature] Extension: Offline Ingestion with limited Parquet Support	2016-03-29 23:19:35 -07:00
DuNinglin [杜宁林]	0f67ff7dfb	reoganize code folder according to recent upstream folder changes, seperate it from avro code and take it into extensions-conrib. docs rewite too	2016-03-30 11:21:41 +08:00
r4ruchir	4bff008d65	Update libraries.md Adding embedded-druid information in helper libraries	2016-03-29 15:16:36 -07:00
fjy	c418a55638	cleanup distinct count agg	2016-03-28 17:29:41 -07:00
Fangjin Yang	62c1dc7a09	Merge pull request #2602 from binlijin/distinctcount implement special distinctcount	2016-03-28 17:20:17 -07:00
Gian Merlino	dbdfcd2443	Fix extension reference in Kafka namespaced lookup docs. The reference to io.druid.extensions:kafka-extraction-namespace is wrong (should be druid-kafka-extraction-namespace) and unnecessary (the extension id is written at the top of the doc file).	2016-03-28 09:23:24 -07:00
Fangjin Yang	a0216dcf7d	Merge pull request #2735 from metamx/fixlookupDocs Move lookup docs that are in druid-proper back into lookups.md	2016-03-26 15:38:48 -07:00
Charles Allen	ab324e4ac0	Move lookup docs that are in druid-proper back into lookups.md	2016-03-25 10:46:50 -07:00
Gian Merlino	6d18382fb2	Fix broken link in datasketches-aggregators.md.	2016-03-25 09:32:40 -07:00
binlijin	2729efca71	implement special distinctcount	2016-03-24 11:11:11 +08:00
fjy	943cbe6e76	refactor extensions into their own docs	2016-03-22 18:54:10 -07:00
Charles Allen	7b1bfbf704	Add documentation to modules about what should be excluded.	2016-03-10 10:18:33 -08:00
fjy	e3e932a4d4	refactor extensions into core and contrib	2016-03-08 17:12:09 -08:00
Fangjin Yang	4f300cfe49	Merge pull request #2526 from druid-io/b-slim-patch-1 fix docs about sketches	2016-02-23 10:23:53 -08:00
Slim	86c4900347	fix thetaSketch post aggregator doc	2016-02-23 10:43:54 -06:00
Himanshu Gupta	c7cb5bff14	fix thetaSketchSetOp doc	2016-02-23 09:17:49 -06:00
Himanshu Gupta	f7679dd5a9	updating thetaSketchSetOp post agg documentation to reflect the possibility of nesting	2016-02-22 09:38:58 -06:00
Bingkun Guo	9e4c908922	generate tarball by mvn package	2016-02-18 16:42:41 -06:00
fjy	7da6594bfe	more doc fixes	2016-02-17 09:43:47 -08:00
Fangjin Yang	f204dfbebe	Merge pull request #2413 from pdeva/patch-9 added note about including extension lib	2016-02-10 17:01:27 -08:00
Himanshu	f6eebf5884	Merge pull request #2422 from rasahner/docMinorFixes some minor doc changes	2016-02-09 10:03:22 -06:00
Robin	1d57e3267d	some minor doc changes	2016-02-09 08:20:53 -06:00
pdeva	b75862da7e	make 0.9 compatible	2016-02-08 17:25:34 -08:00
fjy	6fc5bcb1ef	fix docs	2016-02-08 13:40:53 -08:00
pdeva	525a911a3c	added note about including extension lib	2016-02-08 12:59:41 -08:00
fjy	9e2295aa61	whitespace fixes	2016-02-04 16:25:51 -08:00
fjy	b52e1e9161	fix spacing again	2016-02-04 16:13:12 -08:00
fjy	962e7bac14	fix rendering	2016-02-04 15:58:20 -08:00
fjy	003f54e268	add doc rendering	2016-02-04 14:21:59 -08:00
fjy	1aa363cea7	new quickstart	2016-02-04 09:37:38 -08:00
Sameer Al-Sakran	ee2a0e4afa	Update libraries.md	2016-02-01 11:47:50 -08:00
Fangjin Yang	bbfb8aa7dd	Merge pull request #2358 from druid-io/addCommunityExtensions Add Community Extensions	2016-01-31 17:45:11 -08:00
Erik Dubbelboer	246473c58a	Remove duplicate doc section	2016-01-30 13:50:32 +00:00
Charles Allen	5ec5c7221b	Add Community Extensions Add a "Community Extensions" section to the known libraries	2016-01-29 13:09:15 -08:00
navis.ryu	55a888ea2f	time-descending result of select queries	2016-01-29 10:06:05 +09:00
Robin	c9368702fa	do some editing of the instructions for using mysql for metadata	2016-01-21 10:37:30 -06:00
Himanshu Gupta	0d5f82aee7	document size attribute in thetaSketchSetOp post aggregator	2016-01-07 23:59:03 -06:00
fjy	d3d2ee03ce	minor fixes to docs	2016-01-03 11:37:06 -08:00
Gian Merlino	5a63c3dd63	Merge pull request #2186 from druid-io/dev-docs2 Add intro developer docs	2016-01-03 11:36:41 -05:00
fjy	88f6b9b5ad	Multiple improvements for docs	2016-01-02 21:54:54 -08:00
fjy	06a8e14820	Add intro developer docs	2016-01-02 14:44:45 -08:00
Bingkun Guo	89b477970f	DataSegmentFinder tool `insert-segment-to-db` is a tool that can insert segments into Druid metadata storage. It is intended to be used to update the segment table in metadata storage after people manually migrate segments from one place to another. It can also be used to insert missing segment into Druid, or even recover metadata storage by telling it where the segments are stored. Note: This tool expects users to have Druid cluster running in a "safe" mode, where there are no active tasks to interfere the segments being inserted. Users can optionally bring down the cluster to make 100% sure nothing is interfering.	2015-12-21 00:02:04 -06:00
Gian Merlino	8e594a2e72	Change service names in docs, examples to match defaults in the code.	2015-12-06 10:04:21 -08:00
Himanshu Gupta	fde9df2720	update to sketches-core-0.2.2 . adds support for "cardinality" aggregator. do not create sketch per event at ingestion time to make realtime ingestion faster	2015-11-19 01:05:59 -06:00
Himanshu Gupta	7788f7c2a1	update doc with new thetaSketch api	2015-11-12 00:04:34 -06:00
Himanshu Gupta	6c6a38cedb	adding datasketches aggregator to documentation	2015-11-12 00:04:33 -06:00
Bingkun Guo	b24eccfb9e	add doc for bundling custom extensions with other Druid extensions	2015-11-09 13:11:22 -06:00
Bingkun Guo	962f65cc76	fix metadata typo and rename default extension directory	2015-11-03 14:50:42 -06:00
Himanshu Gupta	c74a4490e1	add metamarket histogram post to approx-histo doc	2015-11-03 01:19:22 -06:00
Angel M de Miguel	a2510c9b0b	Update ruby-druid URL	2015-10-28 10:31:30 +01:00
Angel M de Miguel	04c5d0f8e2	Update Ruby libraries in docs	2015-10-28 09:08:26 +01:00
Bingkun Guo	4914925d65	New extension loading mechanism 1) Remove maven client from downloading extensions at runtime. 2) Provide a way to load Druid extensions and hadoop dependencies through file system. 3) Refactor pull-deps so that it can download extensions into extension directories. 4) Add documents on how to use this new extension loading mechanism. 5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0 are packaged within the Druid tarball.	2015-10-21 14:22:36 -05:00
Xavier Léauté	faf4c865d5	update R / Python clients	2015-10-01 13:42:09 -04:00
fjy	beab6fd487	add pivot as a UI	2015-09-15 14:58:32 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
fjy	4055f9ca48	more docs for common questions	2015-08-25 17:49:04 -07:00
Himanshu Gupta	0daeb830b0	update approx-histogram document to explain how to ignore rows with no value at ingestion time	2015-08-19 15:20:37 -05:00
Xavier Léauté	f583cad2e2	disclaimer + more docs for approximate histograms	2015-08-10 13:13:49 -07:00

1 2 3 4 5 ...

254 Commits