druid

Commit Graph

Author	SHA1	Message	Date
Clint Wylie	2ce7b3dcf4	bitwise math function expressions (#10605 ) * expressions: adding bitwise expressions * double handling and vectorization * move conversion to Evals * revert unintended changes * less magic, split convert functions, fix parser for funny exponent doubles * fix spelling exceptions list * more spelling * fix grammar, add more test, fix docs * fix docs Co-authored-by: Max Kaplan <max@maxkaplan.me>	2021-01-28 11:16:53 -08:00
Maytas Monsereenusorn	a46d561bd7	Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead (#10740 ) * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * fix checkstyle * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * fix test * fix test * add log * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * address comments * fix checkstyle * fix checkstyle * add config to skip overhead memory calculation * add test for the skipBytesInMemoryOverheadCheck config * add docs * fix checkstyle * fix checkstyle * fix spelling * address comments * fix travis * address comments	2021-01-27 00:34:56 -08:00
Himadri Singh	1c1b396eaa	AWS Web Identity / IRSA Support (#10541 ) * AWS Web Identity Support required for AWS IRSA * Update kinesis-ingestion.md * disabling coverage tests https://github.com/apache/druid/pull/10541#issuecomment-737558213 * exclude coverage * Update licenses.yaml	2021-01-25 18:44:02 +05:30
Charles Smith	99494e3d16	suggest index parallel for native batch reindexing > 1GB (#10788 )	2021-01-22 21:54:28 -08:00
zhangyue19921010	bf1d1d583b	modify (#10778 ) Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-01-22 09:20:13 -08:00
zhangyue19921010	2837a9b62f	[Minor Doc Fix] Correct the default value of `druid.server.http.gracefulShutdownTimeout` (#10661 ) * done * done * done Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-01-08 15:23:08 -08:00
Abhishek Agarwal	f66fdbfa5d	add offsetFetchPeriod to kinesis ingestion doc (#10734 )	2021-01-08 14:19:26 -08:00
Himanshu	c7b1212a43	AWS RDS token based password provider (#9518 ) * refresh db pwd * aws iam token password provider * fix analyze-dependencies build * fix doc build * add ut for BasicDataSourceExt * more doc updates * more doc update * moving aws token password provider to new extension * remove duplicate changes * make all config inline * extension docs * refresh db password in SQL Firehose code path as well * add ut * fix build * add new extension to distribution * rds lib is not provided * fix license build * add version to license * change parent version to 0.19.0-snapshot * address review comments * fix core/ code coverage * Update server/src/main/java/org/apache/druid/metadata/BasicDataSourceExt.java Co-authored-by: Clint Wylie <cjwylie@gmail.com> * address review comments * fix spellchecker * remove inadvertant website file change Co-authored-by: Clint Wylie <cjwylie@gmail.com>	2021-01-06 21:15:29 -08:00
Makdon	f9fc1892d1	Typo: missing comma in json (#10711 )	2021-01-06 13:49:50 -08:00
Jonathan Wei	68bb038b31	Multiphase segment merge for IndexMergerV9 (#10689 ) * Multiphase merge for IndexMergerV9 * JSON fix * Cleanup temp files * Docs * Address logging and add IT * Fix spelling and test unloader datasource name	2021-01-05 22:19:09 -08:00
Himanshu	d2e6240cac	k8s-int-test-build: zk-less druid cluster and http based segment/task managment (#10686 ) * zk-less druid cluster in k8s build * attempt to fix build and use http based remote task management * mm/router logs for debugging * add default account k8s role and binding for pod, configMap access * fix issue * change router port to 8088 for common readinessProbe * break build_run_k8s_cluster.sh into separate scripts * revert changes to K8sDruidNodeAnnouncer.java * k8s extension doc update * add license to new file * address review comments * do not try to load lookups at startup to improve cluster startup time	2021-01-05 18:51:47 -08:00
Charles Smith	797371598d	update syntax for golbal cached uri lookups (#10629 )	2020-12-24 09:49:01 -08:00
Xavier Léauté	b7a16d08a6	Update Apache Kafka to 2.7.0 (#10701 ) - align scala versions to match Kafka	2020-12-22 13:56:00 -08:00
Lucas Capistrant	58ce2e55d8	Add dynamic coordinator config that allows control over how many segments are considered when picking a segment to move. (#10284 ) * dynamic coord config adding more balancing control add new dynamic coordinator config, maxSegmentsToConsiderPerMove. This config caps the number of segments that are iterated over when selecting a segment to move. The default value combined with current balancing strategies will still iterate over all provided segments. However, setting this value to something > 0 will cap the number of segments visited. This could make sense in cases where a cluster has a very large number of segments and the admins prefer less iterations vs a thorough consideration of all segments provided. * fix checkstyle failure * Make doc more detailed for admin to understand when/why to use new config * refactor PR to use a % of segments instead of raw number * update the docs * remove bad doc line * fix typo in name of new dynamic config * update RservoirSegmentSampler to gracefully deal with values > 100% * add handler for <= 0 in ReservoirSegmentSampler * fixup CoordinatorDynamicConfigTest naming and argument ordering * fix items in docs after spellcheck flags * Fix lgtm flag on missing space in string literal * improve documentation for new config * Add default value to config docs and add advice in cluster tuning doc * Add percentOfSegmentsToConsiderPerMove to web console coord config dialog * update jest snapshot after console change * fix spell checker errors * Improve debug logging in getRandomSegmentBalancerHolder to cover all bad inputs for % of segments to consider * add new config back to web console module after merge with master * fix ReservoirSegmentSamplerTest * fix line breaks in coordinator console dialog * Add a test that helps ensure not regressions for percentOfSegmentsToConsiderPerMove * Make improvements based off of feedback in review * additional cleanup coming from review * Add a warning log if limit on segments to consider for move can't be calcluated * remove unused import * fix tests for CoordinatorDynamicConfig * remove precondition test that is redundant in CoordinatorDynamicConfig Builder class	2020-12-22 08:27:55 -08:00
Clint Wylie	da0eabaa01	integration test for coordinator and overlord leadership client (#10680 ) * integration test for coordinator and overlord leadership, added sys.servers is_leader column * docs * remove not needed * fix comments * fix compile heh * oof * revert unintended * fix tests, split out docker-compose file selection from starting cluster, use docker-compose down to stop cluster * fixes * style * dang * heh * scripts are hard * fix spelling * fix thing that must not matter since was already wrong ip, log when test fails * needs more heap * fix merge * less aggro	2020-12-17 22:50:12 -08:00
sthetland	6ae8059c09	cleaning up and fixing links (#10528 ) * cleaning up and fixing links * reverting local link * Update indexer.md * link checking * Fixing one more stale link for PostgreSQL	2020-12-17 13:37:43 -08:00
Himanshu	ac1882bf74	kubernetes based discovery druid extension to run Druid on K8S without Zookeeper (#10544 ) * honor zk enablement config in more places in druid code * kubernetes based discovery module * fix spotbugs check * fix intellij checks error * fix doc link to kubernetes.md from extension * make spellchecker happy * update license.yaml * fix dependency check errors * update extension coverage * UTs for BaseNodeRoleWatcher * fix forbidden-api check * update k8s module coverage ignores * add Bouncy Castle License being same as MIT License for license checking purposes * further update licenses.yaml * label/annotation pre-existence assumption * address review comment	2020-12-14 21:10:31 -08:00
Himanshu	be019760bb	document DynamicConfigProvider for kafka consumer properties (#10658 ) * document DynamicConfigProvider for kafka consumer properties * Update docs/development/extensions-core/kafka-ingestion.md Co-authored-by: Jihoon Son <jihoonson@apache.org> * Update docs/development/extensions-core/kafka-ingestion.md * fix doc build Co-authored-by: Jihoon Son <jihoonson@apache.org>	2020-12-10 08:24:33 -08:00
Atul Mohan	44df05b8b2	Clarify split hint spec behavior (#10656 )	2020-12-09 08:24:32 -06:00
Abhishek Agarwal	4ea1ab8531	Fix links in the grouping function doc (#10654 )	2020-12-09 14:56:32 +08:00
Gian Merlino	96a387d972	Fixes and tests related to the Indexer process. (#10631 ) * Fixes and tests related to the Indexer process. Three bugs fixed: 1) Indexers would not announce themselves as segment servers if they did not have storage locations defined. This used to work, but was broken in #9971. Fixed this by adding an "isSegmentServer" method to ServerType and updating SegmentLoadDropHandler to always announce if this method returns true. 2) Certain batch task types were written in a way that assumed "isReady" would be called before "run", which is not guaranteed. In particular, they relied on it in order to initialize "taskLockHelper". Fixed this by updating AbstractBatchIndexTask to ensure "isReady" is called before "run" for these tasks. 3) UnifiedIndexerAppenderatorsManager did not properly handle complex datasources. Introduced DataSourceAnalysis in order to fix this. Test changes: 1) Add a new "docker-compose.cli-indexer.yml" config that spins up an Indexer instead of a MiddleManager. 2) Introduce a "USE_INDEXER" environment variable that determines if docker-compose will start up an Indexer or a MiddleManager. 3) Duplicate all the jdk8 tests and run them in both MiddleManager and Indexer mode. 4) Various adjustments to encourage fail-fast errors in the Docker build scripts. 5) Various adjustments to speed up integration tests and reduce memory usage. 6) Add another Mac-specific approach to determining a machine's own IP. This was useful on my development machine. 7) Update segment-count check in ITCompactionTaskTest to eliminate a race condition (it was looking for 6 segments, which only exist together briefly, until the older 4 are marked unused). Javadoc updates: 1) AbstractBatchIndexTask: Added javadocs to determineLockGranularityXXX that make it clear when taskLockHelper will be initialized as a side effect. (Related to the second bug above.) 2) Task: Clarified that "isReady" is not guaranteed to be called before "run". It was already implied, but now it's explicit. 3) ZkCoordinator: Clarified deprecation message. 4) DataSegmentServerAnnouncer: Clarified deprecation message. * Fix stop_cluster script. * Fix sanity check in script. * Fix hashbang lines. * Test and doc adjustments. * Additional tests, and adjustments for tests. * Split ITs back out. * Revert change to druid_coordinator_period_indexingPeriod. * Set Indexer capacity to match MM. * Bump up Historical memory. * Bump down coordinator, overlord memory. * Bump up Broker memory.	2020-12-08 16:02:26 -08:00
frank chen	c410648630	fix injection failure of StorageLocationSelectorStrategy objects (#10363 ) * fix to allow customer storage location selector strategy * add test cases to check instance of selector strategy * update doc * code format * resolve code review comments * inject StorageLocation * fix CI * fix mismatched license item reported by CI * change property path from druid.segmentCache.locationSelectorStrategy.type to druid.segmentCache.locationSelector.strategy * using a helper method to bind to correct property path	2020-12-08 09:48:31 -08:00
Abhishek Agarwal	26d74b3580	Add grouping_id function (#10518 ) * First draft of grouping_id function * Add more tests and documentation * Add calcite tests * Fix travis failures * bit of a change * Add documentation * Fix typos * typo fix	2020-12-07 11:46:29 -08:00
zhangyue19921010	229b5f359f	Remove hard limitation that druid(after 0.15.0) only can consume Kafka version 0.11.x or better (#10551 ) * remove build in kafka consumer config : * modify druid docs of kafka indexing service * yuezhang * modify doc * modify docs * fix kafkaindexTaskTest.java * revert uncessary change * add more logs and modify docs * revert jdk version * modify docs * modify-kafka-version v2 * modify docs * modify docs * modify docs * modify docs * modify docs * done * remove useless import * change code and add UT Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2020-12-03 17:37:59 -08:00
zhangyue19921010	e7e07eab11	[Improve Doc] : Modify the disadvantages of the lazyLoadOnStart feature. (#10608 ) * modify docs * modify docs Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2020-12-01 18:33:22 -08:00
frank chen	24f1e35b5d	fix desc of 'required' for granularity property (#10616 )	2020-12-01 18:29:51 -08:00
Lucas Capistrant	2560bf0a19	Add new coordinator metrics for coordinator duty runtimes (#10603 ) * Add new coordinator metrics for duty runtimes * fix spelling for a constant variable value * add comment clarifying why the global runtime metric is emitted where it is * Remove duty alias in lieu of using the class name for metrics * fix docs * CoordinatorStats tests + add duty stats to accumulate() logic	2020-11-29 14:47:35 -08:00
frank chen	fe693a4f01	Improve doc and exception message for invalid user configurations (#10598 ) * improve doc and exception message * add spelling check rules and remove unused import * add a test to improve test coverage	2020-11-23 15:03:13 -08:00
Atul Mohan	111b431c07	Introduce query/timeout/count metric (#10567 ) * Add timeout metric * Add tests	2020-11-20 15:17:26 -08:00
sthetland	ba915b7f56	Security overview documentation (#10339 ) * initial file * initial file * security overview added * ldap added * spacing adjustments * nits * security graphics and doc review * Update docs/operations/security-overview.md Co-authored-by: Jonathan Wei <jon-wei@users.noreply.github.com> * Update docs/operations/security-user-auth.md Co-authored-by: Jonathan Wei <jon-wei@users.noreply.github.com> * Update docs/operations/security-overview.md Co-authored-by: Jonathan Wei <jon-wei@users.noreply.github.com> * Update docs/operations/security-overview.md Co-authored-by: Jonathan Wei <jon-wei@users.noreply.github.com> * updates frm review * review comments * finish up review and light edits * broken links * spell check Co-authored-by: Jonathan Wei <jon-wei@users.noreply.github.com>	2020-11-19 15:24:58 -08:00
michaelschiff	2f4d6da33f	Updates segment metadata query documentation (#10589 ) * updates segment metadata query documentation to be clearer about cardinality estimation * typo in documentation	2020-11-20 00:08:27 +05:30
zhangyue19921010	1272fb17e5	modify druid.historical.cache.maxEntrySize property in Unified format (#10590 ) Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2020-11-17 16:36:50 -06:00
Atul Mohan	21e3c4b39c	Add missing docs for timeout exceptions (#10554 ) * Add missing docs for timeout exceptions * Add info on auth failures	2020-11-13 08:45:40 -06:00
Gian Merlino	3436297354	Clarify how ORDER BY works with UNION ALL (#10561 ) Hopefully a bit clearer.	2020-11-05 20:12:03 -08:00
Mainak Ghosh	d8e5a159e8	Update index.md (#10549 ) Removing the extra `_` in the default for middlemanager category	2020-11-03 13:44:47 +05:30
Husky Zeng	9286153145	doc wrong description of configuration (#10546 )	2020-11-02 17:57:16 -08:00
Nishant Bangarwa	6b14bdb3a5	Add support for Blacklisting some domains for HTTPInputSource (#10535 ) fix inspections refactor class name change name add allowList as well distinguish between empty and null list Fix CI	2020-11-02 21:47:25 +05:30
Pierre Carrier	835b328851	docs/: use tuningConfig (#10540 )	2020-10-30 09:39:21 -05:00
Charles Smith	9c51047cc8	Document correlation between credential iterations and query latency (#10532 ) use link / heading instead of footnote	2020-10-29 12:47:24 -07:00
Atul Mohan	65a42f9eb1	Update overlord api docs (#10539 )	2020-10-29 11:19:12 -05:00
Clint Wylie	aa9c0ec650	update quickstart docker-compose example, add to release instructions (#10527 ) * update quickstart docker-compose example, add to release instructions * adjust * spelling	2020-10-26 23:14:07 -07:00
awelsh93	a966de5319	Add https to druid-influxdb-emitter extension (#9938 ) * Add https to druid-influxdb-emitter extension * address CI failures * increase test coverage * tests for being unable to load trustStore * fix EqualsVerifier test * fix intellij inspection error * use try-with-resources when loading trustStore	2020-10-26 19:49:26 -07:00
Abhishek Agarwal	04546b65ec	Additional documentation for query caching (#10503 ) * Add documentation for when caching is unsupported * Minor changes * Minor doc fix * Review comments * Add more details * Fix spelling check * Fix doc for union query * Trailing dot	2020-10-20 13:49:13 -07:00
Maytas Monsereenusorn	3538abd5d0	Make sure all fields in sys.segments are JSON-serialized (#10481 ) * fix JSON format * Change all columns in sys segments to be JSON * Change all columns in sys segments to be JSON * add tests * fix failing tests * fix failing tests	2020-10-14 13:49:46 -07:00
Maytas Monsereenusorn	9056d113d0	Add docs and integration tests for Auto-compaction snapshot status API (#10510 ) * add docs and IT for Auto-compaction snapshot status API * fix spellings * fix test * address comments	2020-10-14 06:42:22 -07:00
Jihoon Son	ad437dd655	Add shuffle metrics for parallel indexing (#10359 ) * Add shuffle metrics for parallel indexing * javadoc and concurrency test * concurrency * fix javadoc * Feature flag * doc * fix doc and add a test * checkstyle * add tests * fix build and address comments	2020-10-10 19:35:17 -07:00
Joseph Glanville	7ce9ac4548	Fix Avro support in Web Console (#10232 ) * Fix Avro OCF detection prefix and run formation detection on raw input * Support Avro Fixed and Enum types correctly * Check Avro version byte in format detection * Add test for AvroOCFReader.sample Ensures that the Sampler doesn't receive raw input that it can't serialize into JSON. * Document Avro type handling * Add TS unit tests for guessInputFormat	2020-10-07 21:08:22 -07:00
Mainak Ghosh	8168e14e92	Adding task slot count metrics to Druid Overlord (#10379 ) * Adding more worker metrics to Druid Overlord * Changing the nomenclature from worker to peon as that represents the metrics that we want to monitor better * Few more instance of worker usage replaced with peon * Modifying the peon idle count logic to only use eligible workers available capacity * Changing the naming to task slot count instead of peon * Adding some unit test coverage for the new test runner apis * Addressing Review Comments * Modifying the TaskSlotCountStatsProvider apis so that overlords which are not leader do not emit these metrics * Fixing the spelling issue in the docs * Setting the annotation Nullable on the TaskSlotCountStatsProvider methods	2020-09-28 23:50:38 -07:00
Clint Wylie	1d6cb624f4	add vectorizeVirtualColumns query context parameter (#10432 ) * add vectorizeVirtualColumns query context parameter * oops * spelling * default to false, more docs * fix test * fix spelling	2020-09-28 18:48:34 -07:00
Clint Wylie	b95bf444b2	add docs for kinesis lag metrics (#10435 )	2020-09-28 13:13:53 -07:00

2225 Commits