druid

Commit Graph

Author	SHA1	Message	Date
Jonathan Wei	35601bb7a0	Add finalizeAsBase64Binary option to FixedBucketsHistogramAggregatorFactory (#7784 ) * Add finalizeAsBase64Binary option to FixedBucketsHistogramAggregatorFactory * Add finalizeAsBase64Binary option to ApproximateHistogramFactory * Update approx histogram doc	2019-06-21 18:00:19 -07:00
Clint Wylie	494b8ebe56	multi-value string column support for expressions (#7588 ) * array support for expression language for multi-value string columns * fix tests? * fixes * more tests * fixes * cleanup * more better, more test * ignore inspection * license * license fix * inspection * remove dumb import * more better * some comments * add expr rewrite for arrayfn args for more magic, tests * test stuff * more tests * fix test * fix test * castfunc can deal with arrays * needs more empty array * more tests, make cast to long array more forgiving * refactor * simplify ExprMacro Expr implementations with base classes in core * oops * more test * use Shuttle for Parser.flatten, javadoc, cleanup * fixes and more tests * unused import * fixes * javadocs, cleanup, refactors * fix imports * more javadoc * more javadoc * more * more javadocs, nonnullbydefault, minor refactor * markdown fix * adjustments * more doc * move initial filter out * docs * map empty arg lambda, apply function argument validation * check function args at parse time instead of eval time * more immutable * more more immutable * clarify grammar * fix docs * empty array is string test, we need a way to make arrays better maybe in the future, or define empty arrays as other types..	2019-06-19 13:57:37 -07:00
Clint Wylie	71997c16a2	switch links from druid.io to druid.apache.org (#7914 ) * switch links from druid.io to druid.apache.org * fix it	2019-06-18 09:06:27 -07:00
Vadim Ogievetsky	24dd4573da	Added the web console to the quickstart tutorials and docs (#7863 ) * added console to the quickstart tutorials * feedback fixes * feedback fixes * more typo fixes * moved reseting cluster section after load data * update images * stage -> step * feedback fixes * more feedback fixes	2019-06-17 18:00:54 -07:00
Himanshu	b3328b2785	endpoint to delete lookup tier and remove tier on last lookup deletion (#7852 )	2019-06-15 17:55:50 -07:00
Justin Borromeo	8e5003b01c	Scan Doc Change (#7903 )	2019-06-15 01:21:34 -07:00
Jihoon Son	3cd9a7507d	Fix script for dependencies report for extensions (#7899 )	2019-06-14 18:53:50 -07:00
Jihoon Son	a648e1548d	Add support of --exclude-extension argument for dependency report script (#7786 )	2019-06-14 15:18:59 -07:00
Xue Yu	456a3654ce	add PolygonBound and missing extentions list doc (#7885 )	2019-06-13 12:03:58 -07:00
Clint Wylie	8117222da3	use right port for kafka tutorial, reinfoce that tutorials assume you are using micro-quickstart single-server configuration (#7862 )	2019-06-11 08:50:52 -07:00
Xue Yu	ce591d1457	Support var_pop, var_samp, stddev_pop and stddev_samp etc in sql (#7801 ) * support var_pop, stddev_pop etc in sql * fix sql compatible * rebase on master * update doc	2019-06-10 09:40:09 -07:00
Clint Wylie	3fbb0a5e00	Supervisor list api with states and health (#7839 ) * allow optionally listing all supervisors with their state and health * docs * add state to full * clean * casing * format * spelling	2019-06-07 16:26:33 -07:00
Jihoon Son	61ec521135	Remove keepSegmentGranularity option for compaction (#7747 ) * Remove keepSegmentGranularity option from compaction * fix it test * clean up * remove from web console * fix test	2019-06-03 12:59:15 -07:00
Jihoon Son	e289820bbd	Add a script to find missing backports (#7817 )	2019-06-03 07:56:52 -07:00
Eyal Yurman	69e9b8a464	Enables SQL by default. (#7808 )	2019-05-31 20:53:42 -07:00
Justin Borromeo	8032c4add8	Add errors and state to stream supervisor status API endpoint (#7428 ) * Add state and error tracking for seekable stream supervisors * Fixed nits in docs * Made inner class static and updated spec test with jackson inject * Review changes * Remove redundant config param in supervisor * Style * Applied some of Jon's recommendations * Add transience field * write test * implement code review changes except for reconsidering logic of markRunFinishedAndEvaluateHealth() * remove transience reporting and fix SeekableStreamSupervisorStateManager impl * move call to stateManager.markRunFinished() from RunNotice to runInternal() for tests * remove stateHistory because it wasn't adding much value, some fixes, and add more tests * fix tests * code review changes and add HTTP health check status * fix test failure * refactor to split into a generic SupervisorStateManager and a specific SeekableStreamSupervisorStateManager * fixup after merge * code review changes - add additional docs * cleanup KafkaIndexTaskTest * add additional documentation for Kinesis indexing * remove unused throws class	2019-05-31 17:16:01 -07:00
Jonathan Wei	83152a7a00	Fix performance-faq and remove insert-segment-to-db redirects (#7759 )	2019-05-24 13:20:02 -07:00
Jonathan Wei	cfb7756c9b	Fix references to removed performance FAQ page (#7755 )	2019-05-24 11:52:40 -07:00
Jonathan Wei	eb0e1a056c	Add limit to timeseries docs (#7750 )	2019-05-23 19:41:52 -07:00
Jonathan Wei	f2e34a76bd	Fix TOC clustering example link (#7749 )	2019-05-23 19:41:27 -07:00
Jonathan Wei	ec4d09a02f	Remove obsolete isExcluded config from Kerberos authenticator (#7745 )	2019-05-23 16:00:05 -07:00
awelsh93	6964ac23a2	Adding influxdb emitter as a contrib extension (#7717 ) * Adding influxdb emitter as a contrib extension * addressing code review comments	2019-05-23 11:11:48 -07:00
Fangjin Yang	3dec5cd1e4	reorganizing the ToC (#7734 )	2019-05-23 09:24:38 -07:00
gocho1	bd899b9224	add s3 authentication method informations (#7674 ) * add s3 authentication method informations * add druid.s3.fileSessionCredentials related content * remove authentication parameters to avoid confusion as it is more detailed in S3 Deep Storage page * streamline s3 docs	2019-05-22 11:46:02 -07:00
Gian Merlino	cbbce955de	SQL: Allow NULLs in place of optional arguments in many functions. (#7709 ) * SQL: Allow NULLs in place of optional arguments in many functions. Also adjust SQL docs to describe how to make time literals using TIME_PARSE (which is now possible in a nicer way). * Be less forbidden.	2019-05-21 11:54:34 -07:00
Gian Merlino	b6941551ae	Upgrade various build and doc links to https. (#7722 ) * Upgrade various build and doc links to https. Where it wasn't possible to upgrade build-time dependencies to https, I kept http in place but used hardcoded checksums or GPG keys to ensure that artifacts fetched over http are verified properly. * Switch to https://apache.org.	2019-05-21 11:30:14 -07:00
Xue Yu	dd7dace70a	Add TIMESTAMPDIFF sql support (#7695 ) * add timestampdiff sql support * feedback address	2019-05-21 08:05:38 -07:00
Vadim Ogievetsky	156322932f	Update Druid Console docs for 0.15.0 (#7697 ) * Update Druid Console docs for 0.15.0 * SQL -> query * added links and fix typos	2019-05-21 04:00:42 -07:00
andrewluotechnologies	1add566411	Fix typo (ComplexMetricSerde class name was spelled incorrectly) (#7694 )	2019-05-19 09:49:54 -07:00
Jihoon Son	94721de141	Add auto tagging milestone script (#7677 ) * Add auto tagging milestone script * fix usage * missing newline * missing newline	2019-05-16 23:11:16 -07:00
Clint Wylie	939b417379	Update tutorial-kafka.md (#7678 )	2019-05-16 23:10:45 -07:00
Jonathan Wei	d99f77a01b	Add option to use YARN RM as fallback for JobHistory failure (#7673 ) * Add option to use YARN RM as fallback for job status * PR comments	2019-05-16 13:59:10 -07:00
Fangjin Yang	dc85a5309e	some more doc improvements (#7675 )	2019-05-16 13:17:21 -07:00
Jonathan Wei	d667655871	Add basic tuning guide, getting started page, updated clustering docs (#7629 ) * Add basic tuning guide, getting started page, updated clustering docs * Add note about caching, fix tutorial paths * Adjust hadoop wording * Add license * Tweak * Shrink overlord heaps, fix tutorial urls * Tweak xlarge peon, update peon sizing * Update Data peon buffer size * Fix cluster start scripts * Add upper level _common to classpath * Fix cluster data/query confs * Address PR comments * Elaborate on connection pools * PR comments * Increase druid.broker.http.maxQueuedBytes * Add guidelines for broker backpressure * PR comments	2019-05-16 11:13:48 -07:00
Benedict Jin	3df364c472	Fix broken links in api-reference.md (#7670 )	2019-05-15 18:53:34 -07:00
Clint Wylie	c2abbc24a7	minor web console doc fixes (#7668 )	2019-05-15 18:52:51 -07:00
Surekha	d3545f5086	Show all server types in sys.servers table (#7654 ) * update sys.servers table to show all servers * update docs * Fix integration test * modify test query for batch integration test * fix case in test queries * make the server_type lowercase * Apply suggestions from code review Co-Authored-By: Himanshu <g.himanshu@gmail.com> * Fix compilation from git suggestion * fix unit test	2019-05-15 16:54:02 -07:00
Gian Merlino	0352f450d7	Fix broken links in docs, add broken link checker. (#7658 ) Also adds back insert-segment-to-db.md with some docs about why and when it was removed (in #6911).	2019-05-15 14:49:50 -07:00
Surekha	917106985f	Update tutorial to delete data (#7577 ) * Update tutorial to delete data * update tutorial, remove old ways to drop data * PR comments	2019-05-15 14:40:06 -07:00
Jonathan Wei	e874da7cea	Add simpler permissions option to BasicAuthorizer GET APIs (#7635 ) * Add simpler permissions option to BasicAuthorizer GET APIs * Adjust log message Co-Authored-By: Himanshu <g.himanshu@gmail.com> * Adjust log message Co-Authored-By: Himanshu <g.himanshu@gmail.com>	2019-05-15 12:59:32 -07:00
Clint Wylie	b87c8f0314	fix lookup editor to use lookup tiers instead of historical tiers (#7647 ) * fix lookup editor to use lookup tiers instead of historical tiers * use default tier if empty response, fix if configured lookups is null * fixes * fix typo	2019-05-14 13:30:51 -07:00
Alexander Saydakov	ca1a6649f6	Datasketches quantiles more post-aggs (#7550 ) * rank and CDF post-aggs * added post-aggs to the module * added new post-aggs * moved post-agg IDs * moved post-agg IDs	2019-05-10 11:46:54 -07:00
Clint Wylie	402d76a10f	make-redirects.py requires python3, explicitly specify it (#7625 )	2019-05-09 21:32:58 -07:00
Clint Wylie	6a6c6d573d	Add plain text README.txt, use relative link from README.md to build.md (#7611 ) * use relative link to build instructions from top level readme * add textfile to readme * formatting * make README.BINARY plaintext, move LABELS.md to LABELS, README.txt to README * exclude README.BINARY still * remove jdk links/recommmendations * add script to use DRUIDVERSION in textfile README instead of latest, add links to recommended jdk to build.md * license * better readme template, links to latest if does not detect an apache release version * fix	2019-05-09 21:29:26 -07:00
Samarth Jain	b542bb9f34	TDigest backed sketch aggregators (#7331 ) * First set of changes for tDigest histogram * Add license * Address code review comments * Add a doc page for new T-Digest sketch aggregators. Minor code cleanup and comments. * Remove synchronization from BufferAggregators. Address code review comments * Fix typo	2019-05-09 17:22:55 -07:00
Magnus Henoch	2ac112151f	Fix formatting in scan query documentation (#7622 ) Escape underscores in `__time`, so they're not interpreted as bold formatting.	2019-05-09 11:32:37 -07:00
Jinseon Lee	0ef435a16c	add postgresql meta db table schema configuration property (#7137 ) (#7183 ) * add postgresql meta db table schema configuration property (#7137) If the postgresql db schema changes, you must set the configuration values. You do not need to set it if there is no change from the default schema 'public'. druid.metadata.postgres.dbTableSchema=public * create postgresql metadb table schema configuration property (#7137) If the postgresql db schema changes, you must set the configuration values. You do not need to set it if there is no change from the default schema 'public'. druid.metadata.postgres.dbTableSchema=public check PostgreSQLTablesConfig.java * modify postgresql readme file. - metadb table schema (#7137) If the postgresql db schema changes, you must set the configuration values. You do not need to set it if there is no change from the default schema 'public'. druid.metadata.postgres.dbTableSchema=public check PostgreSQLTablesConfig.java	2019-05-08 12:56:30 -07:00
Jonathan Wei	dadf6a2f11	Add tool for migrating from local deep storage/Derby metadata (#7598 ) * Add tool for migrating from local deep storage/Derby metadata * Split deep storage and metadata migration docs * Support import into Derby * Fix create tables cmd * Fix create tables cmd * Fix commands * PR comment * Add -p	2019-05-06 23:39:40 -07:00
Jonathan Wei	7c2ca474da	Add single-machine deployment example cfgs and scripts (#7590 ) * Add single-machine deployment example cfgs and scripts * Add (8u92+) * Use combined coordinator-overlord for single machine confs * RAT fix	2019-05-06 19:11:13 -07:00
Gian Merlino	727b65c7e5	Remove SQL experimental banner and other doc adjustments. (#7591 ) * Remove SQL experimental banner and other doc adjustments. Also, - Adjust the ToC and other docs a bit so SQL and native queries are presented on more equal footing. - De-emphasize querying historicals and peons directly in the native query docs. This is a really niche thing and may have been confusing to include prominently in the very first paragraph. - Remove DataSketches and Kafka indexing service from the experimental features ToC. They are not experimental any longer and were there in error. * More notes. * Slight tweak. * Remove extra extra word. * Remove RT node from ToC.	2019-05-06 12:31:51 -07:00
Samarth Jain	afbcb9c07f	Improve parallelism of zookeeper based segment change processing (#7088 ) * V1 - improve parallelism of zookeeper based segment change processing * Create zk nodes in batches. Address code review comments. Introduce various configs. * Add documentation for the newly added configs * Fix test failures * Fix more test failures * Remove prinstacktrace statements * Address code review comments * Use a single queue * Address code review comments Since we have a separate load peon for every historical, just having a single SegmentChangeProcessor task per historical is enough. This commit also gets rid of the associated config druid.coordinator.loadqueuepeon.curator.numCreateThreads * Resolve merge conflict * Fix compilation failure * Remove batching since we already have a dynamic config maxSegmentsInNodeLoadingQueue that provides that control * Fix NPE in test * Remove documentation for configs that are no longer needed * Address code review comments * Address more code review comments * Fix checkstyle issue * Address code review comments * Code review comments * Add back monitor node remove executor * Cleanup code to isolate null checks and minor refactoring * Change param name since it conflicts with member variable name	2019-05-03 15:58:42 +02:00
Jonathan Wei	a013350018	Adjust required permissions for system schema (#7579 ) * Adjust required permissions for system schema * PR comments, fix current_size handling * Checkstyle * Set curr_size instead of current_size * Adjust information schema docs * Fix merge conflict * Update tests	2019-05-02 07:18:02 -07:00
Surekha	15d19f3059	Add is_overshadowed column to sys.segments table (#7425 ) * Add is_overshadowed column to sys.segments table * update docs * Rename class and variables * PR comments * PR comments * remove unused variables in MetadataResource * move constants together * add getFullyOvershadowedSegments method to ImmutableDruidDataSource * Fix compareTo of SegmentWithOvershadowedStatus * PR comment * PR comments * PR comments * PR comments * PR comments * fix issue with already consumed stream * minor refactoring * PR comments	2019-05-01 18:00:57 +02:00
Gian Merlino	c648775b5b	SQL: Remove "useFallback" feature. (#7567 ) This feature allows Calcite's Bindable interpreter to be bolted on top of Druid queries and table scans. I think it should be removed for a few reasons: 1. It is not recommended for production anyway, because it generates unscalable query plans (e.g. it will plan a join into two table scans and then try to do the entire join in memory on the broker). 2. It doesn't work with Druid-specific SQL functions, like TIME_FLOOR, REGEXP_EXTRACT, APPROX_COUNT_DISTINCT, etc. 3. It makes the SQL planning code needlessly complicated. With SQL coming out of experimental status soon, it's a good opportunity to remove this feature.	2019-04-28 18:26:44 -07:00
Eyal Yurman	f02251ab2d	Contributing Moving-Average Query to open source. (#6430 ) * Contributing Moving-Average Query to open source. * Fix failing code inspections. * See if explicit types will invoke the correct comparison function. * Explicitly remove support for druid.generic.useDefaultValueForNull configuration parameter. * Update styling and headers for complience. * Refresh code with latest master changes: * Remove NullDimensionSelector. * Apply changes of RequestLogger. * Apply changes of TimelineServerView. * Small checkstyle fix. * Checkstyle fixes. * Fixing rat errors; Teamcity errors. * Removing support theta sketches. Will be added back in this pr or a following once DI conflicts with datasketches are resolved. * Implements some of the review fixes. * Contributing Moving-Average Query to open source. * Fix failing code inspections. * See if explicit types will invoke the correct comparison function. * Explicitly remove support for druid.generic.useDefaultValueForNull configuration parameter. * Update styling and headers for complience. * Refresh code with latest master changes: * Remove NullDimensionSelector. * Apply changes of RequestLogger. * Apply changes of TimelineServerView. * Small checkstyle fix. * Checkstyle fixes. * Fixing rat errors; Teamcity errors. * Removing support theta sketches. Will be added back in this pr or a following once DI conflicts with datasketches are resolved. * Implements some of the review fixes. * More fixes for review. * More fixes from review. * MapBasedRow is Unmodifiable. Create new rows instead of modifying existing ones. * Remove more changes related to datasketches support. * Refactor BaseAverager startFrom field and add a comment. * fakeEvents field: Refactor initialization and add comment. * Rename parameters (tiny change). * Fix variable name typo in test (JAN_4). * Fix styling of non camelCase fields. * Fix Preconditions.checkArgument for cycleSize. * Add more documentation to RowBucketIterable and other classes. * key/value comment on in MovingAverageIterable. * Fix anonymous makeColumnValueSelector returning null. * Replace IdentityYieldingAccumolator with Yielders.each(). * * internalNext() should return null instead of throwing exception. * Remove unused variables/prarameters. * Harden MovingAverageIterableTest (Switch anyOf to exact match). * Change internalNext() from recursion to iteration; Simplify next() and hasNext(). * Remove unused imports. * Address review comments. * Rename fakeEvents to emptyEvents. * Remove redundant parameter key from computeMovingAverage. * Check yielder as well in RowBucketIterable#hasNext() * Fix javadoc.	2019-04-26 17:07:48 -07:00
Adam Peck	ebdf07b69f	Add reload by interval API (#7490 ) * Add reload by interval API Implements the reload proposal of #7439 Added tests and updated docs * PR updates * Only build timeline with required segments Use 404 with message when a segmentId is not found Fix typo in doc Return number of segments modified. * Fix checkstyle errors * Replace String.format with StringUtils.format * Remove return value * Expand timeline to segments that overlap for intervals Restrict update call to only segments that need updating. * Only add overlapping enabled segments to the timeline * Some renames for clarity Added comments * Don't rely on cached poll data Only fetch required information from DB * Match error style * Merge and cleanup doc * Fix String.format call * Add unit tests * Fix unit tests that check for overshadowing	2019-04-26 16:01:50 -07:00
Clint Wylie	09b7700d13	fix docs (#7556 )	2019-04-25 22:00:37 -07:00
Justin Borromeo	012ab02bf4	Update select doc disclaimer (#7554 )	2019-04-25 19:23:39 -07:00
Surekha	8308ffef1f	API to drop data by interval (#7494 ) * Add api to drop data by interval * update to address comments * unused imports * PR comments + add tests in SQLMetadataSegmentManagerTest * update tests and docs	2019-04-25 14:24:40 -07:00
Jonathan Wei	658fb2b062	Fix bugs in milestone contributor script (#7545 ) * Only check PRs in milestone contributor script * Fix no-pagination bug	2019-04-24 22:11:57 -07:00
Jonathan Wei	8b1a4e18dd	Additional Apache branding doc updates (#7524 )	2019-04-23 14:39:16 -07:00
Xue Yu	2c8a71f883	Support LPAD and RPAD sql function (#7388 ) * lpad and rpad sql function * feedback address * feedback address * add doc and format * update docs	2019-04-22 14:51:32 -07:00
Jonathan Wei	3487663de9	Adjust approx agg deprecation wording (#7518 )	2019-04-19 19:31:50 -07:00
Jonathan Wei	74960e82bf	Add more Apache branding to docs (#7515 )	2019-04-19 15:52:26 -07:00
Slim Bouguerra	5463ecb979	Fix broken link due to Typo. (#7513 ) Change-Id: I5792f89ed6afe945f386058edd44f0400998460a	2019-04-19 09:58:54 -07:00
Jonathan Wei	8078f567aa	Update kafka version in tutorials (#7500 )	2019-04-17 14:56:29 -07:00
Kazuhito Takeuchi	7c19c92a81	Add ROUND function in druid-sql. (#7224 ) * Implement round function in druid-sql * Return value according to the type of argument * Fix codes for abnoraml inputs, updated math-expr.md * Fix assert text * Fix error messages and refactor codes * Fix compile error, update sql.md, refactor codes and format tests	2019-04-16 11:15:39 -07:00
Lucas Capistrant	8acad27d99	Enhance the Http Firehose to work with URIs requiring basic authentication (#7145 ) * Enhnace the HttpFirehose to work with both insecure URIs and URIs requiring basic authentication * Improve security of enhanced HttpFirehoseFactory by not logging auth credentials * Fix checkstyle failure in HttpFirehoseFactory.java * Update docs and fix TeamCity build with required noinspection * Indentation cleanup and logic modification for HttpFirehose object stream * Remove default Empty string password provider in http firehose * Add JavaDoc for MixIn describing its intended use * Reverting documentation notation for json code to be inline with rest of doc * Improve instantiation of ObjectMappers that require MixIn for redacting password from task logs * Add comment to clarify fully qualified references of Objects in SQLMetadataStorageActionHandler	2019-04-15 14:29:01 -07:00
Justin Borromeo	85f10ed0d0	Support querying realtime segments using time-ordered scan queries and fix broken scan queries without time column (#7454 ) * Update scan query runner factory to accept SpecificSegmentSpec * nit * Sorry travis * Improve logging and fix doc * Bug fix * Friendlier error msgs and tests to cover bug * Address Gian's comments * Fix doc * Added tests for empty and null column list * Style * Fix checking wrong order (looking at query param when it should be looking at the null-handled order) * Add test case for null order * Fix ScanQueryRunnerTest * Forbidden APIs fixed	2019-04-12 19:08:34 -07:00
zhaojiandong	1d9450da81	Some docs optimization (#6890 ) * some markdown docs optimization * markdown escape	2019-04-12 17:30:57 -07:00
Gian Merlino	2470b3279f	SQL: Fix docs for STRING_FORMAT. (#7455 )	2019-04-11 21:57:28 -07:00
Gian Merlino	a517f8ce49	Coordinator: Allow dropping all segments. (#7447 ) Removes the coordinator sanity check that prevents it from dropping all segments. It's useful to get rid of this, since the behavior is unintuitive for dev/testing clusters where users might regularly want to drop all their data to get back to a clean slate. But the sanity check was there for a reason: to prevent a race condition where the coordinator might drop all segments if it ran before the first metadata store poll finished. This patch addresses that concern differently, by allowing methods in MetadataSegmentManager to return null if a poll has not happened yet, and canceling coordinator runs in that case. This patch also makes the "dataSources" reference in SQLMetadataSegmentManager volatile. I'm not sure why it wasn't volatile before, but it seems necessary to me: it's not final, and it's dereferenced from multiple threads without synchronization.	2019-04-11 08:45:38 -07:00
Justin Borromeo	408e3e1b2a	Remove select execution code from SQL planner (#7416 ) * Removed select execution code from SQL planner * Update doc	2019-04-10 22:32:57 -07:00
Benjamin Hopp	78e6f6fb38	Updated Javascript Affinity config docs (#7441 ) Updated with hostname:port rather than IP Address.	2019-04-10 21:44:50 -07:00
Benedict Jin	2f64414ade	Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions (#7334 ) * Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions * Fix ImportOrder * Use RuntimeException instead of OutOfMemoryError according to "Effective Java" * Simplify * Patch suggestions	2019-04-10 11:46:29 +08:00
Clint Wylie	89bb43f382	'core' ORC extension (#7138 ) * orc extension reworked to use apache orc map-reduce lib, moved to core extensions, support for flattenSpec, tests, docs * change binary handling to be compatible with avro and parquet, Rows.objectToStrings now converts byte[] to base64, change date handling * better docs and tests * fix it * formatting * doc fix * fix it * exclude redundant dependencies * use latest orc-mapreduce, add hadoop jobProperties recommendations to docs * doc fix * review stuff and fix binaryAsString * cache for root level fields * more better	2019-04-09 09:03:26 -07:00
Justin Borromeo	799c66d9ac	Allow max rows and max segments for time-ordered scans to be overridden using the scan query JSON spec (#7413 ) * Initial changes * Fixed NPEs * Fixed failing spec test * Fixed failing Calcite test * Move configs to context * Validated and added docs * fixed weird indentation * Update default context vals in doc * Fixed allowable values	2019-04-07 20:12:52 -07:00
Clint Wylie	e28a15f9f5	fix expressions docs operator table (#7420 ) * fix expressions docs operator table * Update math-expr.md	2019-04-07 20:12:00 -07:00
Justin Borromeo	e23fd41fa7	Update SQL doc for planning change (#7415 )	2019-04-05 15:14:07 -07:00
Jonathan Wei	0f6cb1e7e0	Update theta/hll sketch doc comparison (#7407 )	2019-04-03 15:21:33 -07:00
Gian Merlino	8c104a115c	SQL: Add STRING_FORMAT function. (#7327 )	2019-04-03 17:09:54 -04:00
David Glasser	4e23c11345	Make IngestSegmentFirehoseFactory splittable for parallel ingestion (#7048 ) * Make IngestSegmentFirehoseFactory splittable for parallel ingestion * Code review feedback - Get rid of WindowedSegment - Don't document 'segments' parameter or support splitting firehoses that use it - Require 'intervals' in WindowedSegmentId (since it won't be written by hand) * Add missing @JsonProperty * Integration test passes * Add unit test * Remove two FIXME comments from CompactionTask I'd like to leave this PR in a potentially mergeable state, but I still would appreciate reviewer eyes on the questions I'm removing here. * Updates from code review	2019-04-02 14:59:17 -07:00
Xue Yu	78fd5aff21	support radians and degrees in sql (#7336 ) * support radians and degrees in sql * update test case	2019-04-02 12:47:49 -07:00
Qi Shu	134f71d1b4	Add documentation for Druid native query in SQL view of web console (#7381 ) * Add docmentation for Druid native query in SQL view of web console * Edit sentence	2019-04-02 12:20:51 -07:00
Michael Trelinski	347779b17a	Zookeeper loss (#6740 ) * Update init Fix bin/init to source from proper directory. * Fix for Proposal #6518: Shutdown druid processes upon complete loss of ZK connectivity * Zookeeper Loss: - Add feature documentation - Cosmetic refactors - Variable extractions - Remove getter * - Change config key name and reword documentation - Switch from Function<Void,Void> to Runnable/Lambda - try { … } finally { … } * Fix line length too long * - change to formatted string for logging - use System.err.println after lifecycle stops * commenting on makeEnsembleProvider()-created Zookeeper termination * Add javadoc * added java doc reference back to apache discussion thread. * move comment to other class * favor two-slash comments instead of multiline comments	2019-03-29 15:10:42 -07:00
Justin Borromeo	ad7862c58a	Time Ordering On Scans (#7133 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Stuff for time-ordered scan query * Move ScanResultValue timestamp comparator to a separate class for testing * Licensing stuff * Change benchmark * Remove todos * Added TimestampComparator tests * Change number of benchmark iterations * Added time ordering to the scan benchmark * Changed benchmark params * More param changes * Benchmark param change * Made Jon's changes and removed TODOs * Broke some long lines into two lines * nit * Decrease segment size for less memory usage * Wrote tests for heapsort scan result values and fixed bug where iterator wasn't returning elements in correct order * Wrote more tests for scan result value sort * Committing a param change to kick teamcity * Fixed codestyle and forbidden API errors * . * Improved conciseness * nit * Created an error message for when someone tries to time order a result set > threshold limit * Set to spaces over tabs * Fixing tests WIP * Fixed failing calcite tests * Kicking travis with change to benchmark param * added all query types to scan benchmark * Fixed benchmark queries * Renamed sort function * Added javadoc on ScanResultValueTimestampComparator * Unused import * Added more javadoc * improved doc * Removed unused import to satisfy PMD check * Small changes * Changes based on Gian's comments * Fixed failing test due to null resultFormat * Added config and get # of segments * Set up time ordering strategy decision tree * Refactor and pQueue works * Cleanup * Ordering is correct on n-way merge -> still need to batch events into ScanResultValues * WIP * Sequence stuff is so dirty :( * Fixed bug introduced by replacing deque with list * Wrote docs * Multi-historical setup works * WIP * Change so batching only occurs on broker for time-ordered scans Restricted batching to broker for time-ordered queries and adjusted tests Formatting Cleanup * Fixed mistakes in merge * Fixed failing tests * Reset config * Wrote tests and added Javadoc * Nit-change on javadoc * Checkstyle fix * Improved test and appeased TeamCity * Sorry, checkstyle * Applied Jon's recommended changes * Checkstyle fix * Optimization * Fixed tests * Updated error message * Added error message for UOE * Renaming * Finish rename * Smarter limiting for pQueue method * Optimized n-way merge strategy * Rename segment limit -> segment partitions limit * Added a bit of docs * More comments * Fix checkstyle and test * Nit comment * Fixed failing tests -> allow usage of all types of segment spec * Fixed failing tests -> allow usage of all types of segment spec * Revert "Fixed failing tests -> allow usage of all types of segment spec" This reverts commit `ec470288c7`. * Revert "Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge" This reverts commit `57033f36df`, reversing changes made to `8f01d8dd16`. * Check type of segment spec before using for time ordering * Fix bug in numRowsScanned * Fix bug messing up count of rows * Fix docs and flipped boolean in ScanQueryLimitRowIterator * Refactor n-way merge * Added test for n-way merge * Refixed regression * Checkstyle and doc update * Modified sequence limit to accept longs and added test for long limits * doc fix * Implemented Clint's recommendations	2019-03-28 14:37:09 -07:00
Surekha	be318f4de3	Add column type to sys table docs (#7359 ) * Add column type * oops should be used=1	2019-03-27 20:21:57 -07:00
Charles Allen	eeb3dbe79d	Move GCP to a core extension (#6953 ) * Move GCP to a core extension * Don't provide druid-core >.< * Keep AWS and GCP modules separate * Move AWSModule to its own module * Add aws ec2 extension and more modules in more places * Fix bad imports * Fix test jackson module * Include AWS and GCP core in server * Add simple empty method comment * Update version to 15 * One more 0.13.0-->0.15.0 change * Fix multi-binding problem * Grep for s3-extensions and update docs * Update extensions.md	2019-03-27 09:00:43 -07:00
Justin Borromeo	c7fea6ac8f	Added better QueryInterruptedException error message for UnsupportedOperationException (#7248 ) * Added error message for UOE * Updated docs * Doc change * Doc change	2019-03-26 15:20:24 -07:00
Gian Merlino	4ca5fe0f60	SQL: Add PARSE_LONG function. (#7326 ) * SQL: Add PARSE_LONG function. * Fix test.	2019-03-22 15:40:10 -07:00
Vadim Ogievetsky	e4f2dcacf2	Druid console docs (#7300 ) * console docs * fix typo	2019-03-21 00:37:33 -07:00
Justin Borromeo	ff94bd16e6	Fix conflicting information in configuration doc (#7299 ) * Doc fix * Fix typo	2019-03-19 14:55:58 -07:00
Qi Shu	5406aaa49d	Add SQL auto complete in druid console (#7244 ) * Add SQL auto complete in druid console * Add comment in sql.md to alert user to change create-sql-function-doc if sql.md format gets changed	2019-03-16 01:45:53 -07:00
Jihoon Son	892d1d35d6	Deprecate NoneShardSpec and drop support for automatic segment merge (#6883 ) * Deprecate noneShardSpec * clean up noneShardSpec constructor * revert unnecessary change * Deprecate mergeTask * add more doc * remove convert from indexMerger * Remove mergeTask * remove HadoopDruidConverterConfig * fix build * fix build * fix teamcity * fix teamcity * fix ServerModule * fix compilation * fix compilation	2019-03-15 23:29:25 -07:00
Atul Mohan	2daeb50008	Add support for optional client authentication on TLS (#7250 ) * Add optional client auth * Add docs	2019-03-15 15:14:34 -07:00
Hongze Zhang	f9d99b245b	Add missing doc link for operations/http-compression.html; Fix magic numbers in test cases using JettyServerInitUtils.wrapWithDefaultGzipHandler (#7110 )	2019-03-13 14:09:19 -07:00
Clint Wylie	3895914aa2	consolidate CompressionUtils.java since now in the same jar (#6908 )	2019-03-13 11:02:44 -04:00
Gian Merlino	9178793ab5	Further improve caching documentation. (#7236 ) Follow-up to #7223 that fixes a doc bug (a result-level cache property was misspelled), changes the recommended "small cluster" threshold from 20 to 5 servers, and clarifies behavior of the various caching options.	2019-03-11 17:57:00 -07:00
Pierre-Emile Ferron	a88fbcd5db	Improve caching doc (#7223 ) - Set correct default values for query context result cache parameters - Add details about broker cache impact on local historical merging	2019-03-11 20:06:28 -04:00
Venkatraman P	3118160387	Adding a tutorial in doc for using Kerberized Hadoop as deep storage. (#6863 ) * Adding a tutorial in doc for using Kerberized Hadoop as deep storage. * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md Fixed - to ~ in Apache License section. * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md	2019-03-11 11:39:15 -07:00

1 2 3 4 5 ...

1920 Commits