druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	4203580290	URIExtractionNamespace: Treat null values in lookup maps as missing entries. (#3512 ) * URIExtractionNamespace: Treat null values in lookup maps as missing entries. This is useful when many logical lookups are derived from the same base JSON file, and some lookups' values may be unknown sometimes. * Add test, logging message, and address other comments. * Update docs.	2016-11-03 13:53:04 -07:00
David Lim	9226d4af3c	configurable shutdownTimeout for Kakfa supervisor (#3497 ) * configurable shutdownTimeout * cr change	2016-09-23 13:26:45 -06:00
David Lim	ca9114b41b	add supervisor reset API (#3484 ) * add supervisor reset API * CR doc changes and kill running tasks / clear offsets from supervisor	2016-09-22 17:51:06 -07:00
Gian Merlino	27bd5cb13a	Add forceExtendableShardSpecs option to Hadoop indexing, IndexTask. (#3473 ) Fixes #3241.	2016-09-21 13:40:04 -06:00
David Lim	96fcca18ea	update KafkaSupervisor to make HTTP requests to tasks in parallel where possible (#3452 )	2016-09-20 22:51:15 +05:30
Slim	3175e17a3b	Cached lookup module. first cut implementing JDBC cache (#2819 )	2016-09-16 13:45:54 -07:00
Gian Merlino	e0e28866ee	JavaScript docs: Fix links and typos, add to TOC. (#3457 )	2016-09-13 15:26:44 -07:00
Himanshu	a069257d37	avro-extension -- feature to specify multiple avro reader schemas inline (#3368 ) * rename SimpleAvroBytesDecoder to InlineSchemaAvroBytesDecoder * feature to specify multiple schemas inline in avro module	2016-09-13 14:54:31 -07:00
Gian Merlino	76a24054e3	JavaScript docs, including docs for globals. (#3454 )	2016-09-13 13:46:55 -07:00
Slim	ba6ddf307e	Adding hadoop kerberos authentification. (#3419 ) * adding kerberos authentication * make the 2 functions identical	2016-09-13 10:42:50 -07:00
David Lim	3a97fd4d6c	doc fix (#3430 )	2016-09-06 13:13:30 -06:00
Stéphane Derosiaux	48dce88aab	Add flag binaryAsString for parquet ingestion (#3381 )	2016-08-30 17:30:50 -07:00
Dave Li	c4e8440c22	Adds long compression methods (#3148 ) * add read * update deprecated guava calls * add write and vsizeserde * add benchmark * separate encoding and compression * add header and reformat * update doc * address PR comment * fix buffer order * generate benchmark files * separate encoding strategy and format * fix benchmark * modify supplier write to channel * add float NONE handling * address PR comment * address PR comment 2	2016-08-30 16:17:46 -07:00
Fangjin Yang	edb0eca3a9	fix docs (#3370 )	2016-08-16 16:25:50 -07:00
Fangjin Yang	6beb8ac342	fix some docs and add new content (#3369 )	2016-08-16 15:00:18 -07:00
Himanshu	46da682231	avro-extensions -- feature to specify avro reader schema inline in the task json for all events (#3249 )	2016-08-10 10:49:26 -07:00
Jonathan Wei	decefb7477	Add time interval dim filter and retention analysis example (#3315 ) * Add time interval dim filter and retention analysis example * Use closed-open matching for intervals, update cache key generation * Fix time filtering tests for interval boundary change	2016-08-05 07:25:04 -07:00
Navis Ryu	5b3f0ccb1f	Support variance and standard deviation (#2525 ) * Support variance and standard deviation * addressed comments	2016-08-04 17:32:58 -07:00
Fangjin Yang	d51ec398d4	fix parquet docs (#3304 )	2016-08-01 07:54:48 -07:00
Keuntae Park	95a58097e2	Hadoop InputRowParser for Orc file (#3019 ) * InputRowParser to decode OrcStruct from OrcNewInputFormat * add unit test for orc hadoop indexing * update docs and fix test code bug * doc updated * resove maven dependency conflict * remove unused imports * fix returning array type from Object[] to correct primitive array type * fix to support getDimension() of MapBasedRow : changing return type of orc list from array to list * rebase and updated based on comments * updated based on comments * on reflecting review comments * fix bug in typeStringFromParseSpec() and add unit test * add license header	2016-07-26 09:42:56 -07:00
Gian Merlino	ea03906fcf	Configurable compressRunOnSerialization for Roaring bitmaps. (#3228 ) Defaults to true, which is a change in behavior (this used to be false and unconfigurable).	2016-07-08 10:24:19 +05:30
Charles Allen	3f1681c16c	Caffeine cache extension (#3028 ) * Initial commit of caffeine cache * Address code comments * Move and fixup README.md a bit * Improve caffeine readme information * Cleanup caffeine pom * Address review comments * Bump caffeine to 2.3.1 * Bump druid version to 0.9.2-SNAPSHOT * Make test not fail randomly. See https://github.com/ben-manes/caffeine/pull/93#issuecomment-227617998 for an explanation * Fix distribution and documentation * Add caffeine to extensions.md * Fix links in extensions.md * Lexicographic	2016-07-06 15:42:54 -07:00
Charles Allen	8b7d9750ee	Update extension docs for global lookup module (#3206 )	2016-06-29 12:51:52 -07:00
David Lim	b24425a280	update docs with new behavior (#3200 )	2016-06-28 16:17:04 -07:00
Gian Merlino	c12712e8b8	Move "libraries.md" out of docs, onto the main site. (#3159 )	2016-06-16 18:14:35 -07:00
michaelschiff	7294ea87c3	link to statsd metrics emitter docs from development/extensions.html doc page (#3125 )	2016-06-10 16:27:16 -07:00
Gian Merlino	99ee3f4dc3	Fixups, clarifications to lookup docs. (#3060 )	2016-06-07 10:43:35 -07:00
Charles Allen	fa41a6466a	Cleanup the base lookup cluster wide config docs (#3061 ) * Cleanup the base lookup cluster wide config docs * Add better examples in lookups-cached-global.md * Use actual valid stock lookups * Fixed maps with : * Add mix of lookups * Better examples in extension * Remove unneeded namespace requirement * Add extra line space * Add link to lookup tiers * Renamed header	2016-06-07 10:42:41 -07:00
Charles Allen	8cac710546	Async lookups-cached-global by default (#3074 ) * Async lookups-cached-global by default * Also better lookup docs * Fix test timeouts * Fix timing of deserialized test * Fix problem with 0 wait failing immediately	2016-06-03 15:58:10 -05:00
David Lim	a2290a8f05	support seamless config changes (#3051 )	2016-06-03 13:50:19 -07:00
Erik Dubbelboer	b4737336e5	Added info about Google Cloud Storage (#3056 )	2016-06-02 10:06:07 -07:00
David Lim	f6c39cc844	Kafka task minimum message time (#3035 ) * add KafkaIndexTask support for minimumMessageTime * add Kafka supervisor support for lateMessageRejectionPeriod	2016-05-31 11:37:00 -07:00
scusjs	ebb6831770	rm , of jobProperties. jackson can not parse it (#3012 )	2016-05-26 09:46:33 -07:00
Charles Allen	245077b47f	Fix formatting in lookups-cached-global.md (#3009 )	2016-05-24 17:28:39 -07:00
Charles Allen	c738c0e1cd	Silly Typo in docs	2016-05-24 13:31:58 -07:00
Charles Allen	8024b915e2	[QTL] Implement LookupExtractorFactory of namespaced lookup (#2926 ) * support LookupReferencesManager registration of namespaced lookup and eliminate static configurations for lookup from namespecd lookup extensions - druid-namespace-lookup and druid-kafka-extraction-namespace are modified - However, druid-namespace-lookup still has configuration about ON/OFF HEAP cache manager selection, which is not namespace wide configuration but node wide configuration as multiple namespace shares the same cache manager * update KafkaExtractionNamespaceTest to reflect argument signature changes * Add more synchronization functionality to NamespaceLookupExtractorFactory * Remove old way of using extraction namespaces * resolve compile error by supporting LookupIntrospectHandler * Remove kafka lookups * Remove unused stuff * Fix start and stop behavior to be consistent with new javadocs * Remove unused strings * Add timeout option * Address comments on configurations and improve docs * Add more options and update hash key and replaces * Move monitoring to the overriding classes * Add better start/stop logging * Remove old docs about namespace names * Fix bad comma * Add `@JsonIgnore` to lookup factory * Address code review comments * Remove ExtractionNamespace from module json registration * Fix problems with naming and initialization. Add tests * Optimize imports / reformat * Fix future not being properly cancelled on failed initial scheduling * Fix delete returns * Add more docs about whole introspection * Add `/version` introspection point for lookups * Add more tests and address comments * Add StaticMap extraction namespace for testing. Also add a bunch of tests * Move cache system property to `druid.lookup.namespace.cache.type` * Make VERSION lower case * Change poll period to 0ms for StaticMap * Move cache key to bytebuffer * Change hashCode and equals on static map extraction fn * Add more comments on StaticMap * Address comments * Make scheduleAndWait use a latch * Sanity renames and fix imports * Remove extra info in docs * Fix review comments * Strengthen failure on start from warn to error * Address comments * Rename namespace-lookup to lookups-cached-global * Fix injective mis-naming * Also add serde test	2016-05-24 10:56:40 -07:00
Nishant	dea4391a49	fix broken links (#3003 )	2016-05-23 06:38:21 -07:00
Fangjin Yang	00de26c76a	fix extensions docs (#2995 ) * fix extensions docs * fix mistakes	2016-05-19 14:01:06 -07:00
Slim	45b2e65d75	[QTL] adding listDelimiter to lookup parser spec (#2941 ) * adding listDelimiter to lookup parser spec * cleaning code	2016-05-10 15:41:16 +05:30
David Lim	b489f63698	Supervisor for KafkaIndexTask (#2656 ) * supervisor for kafka indexing tasks * cr changes	2016-05-04 23:13:13 -07:00
Gian Merlino	e680665f1c	Fix Avro parseSpec example, "type" should be "format". (#2918 )	2016-05-03 09:22:43 -07:00
Charles Allen	6b957aa072	[QTL] Make URI Exctraction Namespace take more sane arguments (#2738 ) * Make URI Exctraction Namespace take more sane arguments * Fixes https://github.com/druid-io/druid/issues/2669 * Update docs * Rename error message * Undo overzealous deletion of docs * Explain caching mechanism a bit more in docs	2016-05-02 12:54:34 -07:00
Charles Allen	54b717bdc3	[QTL] Move kafka-extraction-namespace to the Lookup framework. (#2800 ) * Move kafka-extraction-namespace to the Lookup framework. * Address comments * Fix missing kafka introspection * Fix tests to be less racy * Make testing a bit more leniant * Make tests even more forgiving * Add comments to kafka lookup cache method * Move startStopLock to just use started * Make start() and stop() idempotent * Forgot to update test after last change, test now accounts for idempotency * Add extra idempotency on stop check * Add more descriptive docs of behavior	2016-05-02 09:45:13 -07:00
michaelschiff	2203a812bc	statsd-emitter (#2410 )	2016-04-28 18:41:02 -07:00
Slim	58510d826b	fix emit wait time (#2869 )	2016-04-26 17:07:03 -07:00
Gaurav Kumar	f5822faca3	Fixed wrong parseSpec in Avro Hadoop Parser (#2846 ) `parseSpec` should contain `format` instead of `type`. It was wrongly defaulting to `tsv`	2016-04-16 11:34:54 -07:00
Gian Merlino	e320d13385	Fix various broken links in the docs. (#2833 )	2016-04-13 13:30:01 -07:00
Charles Allen	ed5377465a	add AirBnB Caravel to list of libraries (#2719 )	2016-04-12 12:53:50 -07:00
Charles Allen	2b99f717e4	Move lookup config doc to proper location	2016-04-08 08:15:38 -07:00
fjy	14dbc431ef	clean up for extensions docs	2016-03-30 17:14:58 -07:00
Fangjin Yang	a8b28879f1	Merge pull request #2369 from du00cs/master [Feature] Extension: Offline Ingestion with limited Parquet Support	2016-03-29 23:19:35 -07:00
DuNinglin [杜宁林]	0f67ff7dfb	reoganize code folder according to recent upstream folder changes, seperate it from avro code and take it into extensions-conrib. docs rewite too	2016-03-30 11:21:41 +08:00
r4ruchir	4bff008d65	Update libraries.md Adding embedded-druid information in helper libraries	2016-03-29 15:16:36 -07:00
fjy	c418a55638	cleanup distinct count agg	2016-03-28 17:29:41 -07:00
Fangjin Yang	62c1dc7a09	Merge pull request #2602 from binlijin/distinctcount implement special distinctcount	2016-03-28 17:20:17 -07:00
Gian Merlino	dbdfcd2443	Fix extension reference in Kafka namespaced lookup docs. The reference to io.druid.extensions:kafka-extraction-namespace is wrong (should be druid-kafka-extraction-namespace) and unnecessary (the extension id is written at the top of the doc file).	2016-03-28 09:23:24 -07:00
Fangjin Yang	a0216dcf7d	Merge pull request #2735 from metamx/fixlookupDocs Move lookup docs that are in druid-proper back into lookups.md	2016-03-26 15:38:48 -07:00
Charles Allen	ab324e4ac0	Move lookup docs that are in druid-proper back into lookups.md	2016-03-25 10:46:50 -07:00
Gian Merlino	6d18382fb2	Fix broken link in datasketches-aggregators.md.	2016-03-25 09:32:40 -07:00
binlijin	2729efca71	implement special distinctcount	2016-03-24 11:11:11 +08:00
fjy	943cbe6e76	refactor extensions into their own docs	2016-03-22 18:54:10 -07:00
Charles Allen	7b1bfbf704	Add documentation to modules about what should be excluded.	2016-03-10 10:18:33 -08:00
fjy	e3e932a4d4	refactor extensions into core and contrib	2016-03-08 17:12:09 -08:00
Fangjin Yang	4f300cfe49	Merge pull request #2526 from druid-io/b-slim-patch-1 fix docs about sketches	2016-02-23 10:23:53 -08:00
Slim	86c4900347	fix thetaSketch post aggregator doc	2016-02-23 10:43:54 -06:00
Himanshu Gupta	c7cb5bff14	fix thetaSketchSetOp doc	2016-02-23 09:17:49 -06:00
Himanshu Gupta	f7679dd5a9	updating thetaSketchSetOp post agg documentation to reflect the possibility of nesting	2016-02-22 09:38:58 -06:00
Bingkun Guo	9e4c908922	generate tarball by mvn package	2016-02-18 16:42:41 -06:00
fjy	7da6594bfe	more doc fixes	2016-02-17 09:43:47 -08:00
Fangjin Yang	f204dfbebe	Merge pull request #2413 from pdeva/patch-9 added note about including extension lib	2016-02-10 17:01:27 -08:00
Himanshu	f6eebf5884	Merge pull request #2422 from rasahner/docMinorFixes some minor doc changes	2016-02-09 10:03:22 -06:00
Robin	1d57e3267d	some minor doc changes	2016-02-09 08:20:53 -06:00
pdeva	b75862da7e	make 0.9 compatible	2016-02-08 17:25:34 -08:00
fjy	6fc5bcb1ef	fix docs	2016-02-08 13:40:53 -08:00
pdeva	525a911a3c	added note about including extension lib	2016-02-08 12:59:41 -08:00
fjy	9e2295aa61	whitespace fixes	2016-02-04 16:25:51 -08:00
fjy	b52e1e9161	fix spacing again	2016-02-04 16:13:12 -08:00
fjy	962e7bac14	fix rendering	2016-02-04 15:58:20 -08:00
fjy	003f54e268	add doc rendering	2016-02-04 14:21:59 -08:00
fjy	1aa363cea7	new quickstart	2016-02-04 09:37:38 -08:00
Sameer Al-Sakran	ee2a0e4afa	Update libraries.md	2016-02-01 11:47:50 -08:00
Fangjin Yang	bbfb8aa7dd	Merge pull request #2358 from druid-io/addCommunityExtensions Add Community Extensions	2016-01-31 17:45:11 -08:00
Erik Dubbelboer	246473c58a	Remove duplicate doc section	2016-01-30 13:50:32 +00:00
Charles Allen	5ec5c7221b	Add Community Extensions Add a "Community Extensions" section to the known libraries	2016-01-29 13:09:15 -08:00
navis.ryu	55a888ea2f	time-descending result of select queries	2016-01-29 10:06:05 +09:00
Robin	c9368702fa	do some editing of the instructions for using mysql for metadata	2016-01-21 10:37:30 -06:00
Himanshu Gupta	0d5f82aee7	document size attribute in thetaSketchSetOp post aggregator	2016-01-07 23:59:03 -06:00
fjy	d3d2ee03ce	minor fixes to docs	2016-01-03 11:37:06 -08:00
Gian Merlino	5a63c3dd63	Merge pull request #2186 from druid-io/dev-docs2 Add intro developer docs	2016-01-03 11:36:41 -05:00
fjy	88f6b9b5ad	Multiple improvements for docs	2016-01-02 21:54:54 -08:00
fjy	06a8e14820	Add intro developer docs	2016-01-02 14:44:45 -08:00
Bingkun Guo	89b477970f	DataSegmentFinder tool `insert-segment-to-db` is a tool that can insert segments into Druid metadata storage. It is intended to be used to update the segment table in metadata storage after people manually migrate segments from one place to another. It can also be used to insert missing segment into Druid, or even recover metadata storage by telling it where the segments are stored. Note: This tool expects users to have Druid cluster running in a "safe" mode, where there are no active tasks to interfere the segments being inserted. Users can optionally bring down the cluster to make 100% sure nothing is interfering.	2015-12-21 00:02:04 -06:00
Gian Merlino	8e594a2e72	Change service names in docs, examples to match defaults in the code.	2015-12-06 10:04:21 -08:00
Himanshu Gupta	fde9df2720	update to sketches-core-0.2.2 . adds support for "cardinality" aggregator. do not create sketch per event at ingestion time to make realtime ingestion faster	2015-11-19 01:05:59 -06:00
Himanshu Gupta	7788f7c2a1	update doc with new thetaSketch api	2015-11-12 00:04:34 -06:00
Himanshu Gupta	6c6a38cedb	adding datasketches aggregator to documentation	2015-11-12 00:04:33 -06:00
Bingkun Guo	b24eccfb9e	add doc for bundling custom extensions with other Druid extensions	2015-11-09 13:11:22 -06:00
Bingkun Guo	962f65cc76	fix metadata typo and rename default extension directory	2015-11-03 14:50:42 -06:00
Himanshu Gupta	c74a4490e1	add metamarket histogram post to approx-histo doc	2015-11-03 01:19:22 -06:00
Angel M de Miguel	a2510c9b0b	Update ruby-druid URL	2015-10-28 10:31:30 +01:00
Angel M de Miguel	04c5d0f8e2	Update Ruby libraries in docs	2015-10-28 09:08:26 +01:00
Bingkun Guo	4914925d65	New extension loading mechanism 1) Remove maven client from downloading extensions at runtime. 2) Provide a way to load Druid extensions and hadoop dependencies through file system. 3) Refactor pull-deps so that it can download extensions into extension directories. 4) Add documents on how to use this new extension loading mechanism. 5) Change the way how Druid tarball is generated. Now all the extensions + hadoop-client 2.3.0 are packaged within the Druid tarball.	2015-10-21 14:22:36 -05:00
Xavier Léauté	faf4c865d5	update R / Python clients	2015-10-01 13:42:09 -04:00
fjy	beab6fd487	add pivot as a UI	2015-09-15 14:58:32 -07:00
Himanshu Gupta	2e0dd1d792	adding UTs and addressing review comments to firehoseV2 addition to Realtime[Manager\|Plumber], essential segment metadata persist support, kafka-simple-consumer-firehose extension patch	2015-08-27 20:50:46 -05:00
lvjq	2237a8cf0f	kafka 8 simple consumer firehose	2015-08-27 20:50:46 -05:00
fjy	4055f9ca48	more docs for common questions	2015-08-25 17:49:04 -07:00
Himanshu Gupta	0daeb830b0	update approx-histogram document to explain how to ignore rows with no value at ingestion time	2015-08-19 15:20:37 -05:00
Xavier Léauté	f583cad2e2	disclaimer + more docs for approximate histograms	2015-08-10 13:13:49 -07:00
Maxime Beauchemin	db4928e73b	Documentation entry for Panormix (a druid UI)	2015-07-21 18:23:46 -07:00
Himanshu Gupta	4114b4902e	fixing the links to doc images	2015-06-29 12:17:42 -05:00
Xavier Léauté	d2346b6834	shorten links and file names * remove redundant parts in file names * delete unsupported "Druid-Personal-Demo-Cluster"	2015-05-29 20:55:42 -05:00
Himanshu Gupta	8edc2aaca3	renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well	2015-05-29 20:55:42 -05:00

1 2 3 4 5

213 Commits