druid/docs/operations
Abhishek Radhakrishnan 38c1def95a
Kill tasks honor the buffer period of unused segments (#15710)
* Kill tasks should honor the buffer period of unused segments.

- The coordinator duty KillUnusedSegments computes an umbrella interval
 for each datasource and uses it as the kill interval. An umbrella interval can contain
multiple unused segments with different used_status_last_updated timestamps.
For example, consider one unused segment that is 30 days old and another that is 1 hour old. Currently,
a kill task issued after the 30-day mark kills both unused segments instead of retaining the
1-hour-old one.

- When a kill task is instantiated with this umbrella interval, it kills
all the unused segments in it regardless of their last updated timestamp. Kill
tasks and RetrieveUnusedSegmentsAction need to honor the bufferPeriod so that
unused segments in the kill interval are not killed prematurely.
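
A minimal sketch of the intended filtering, written in Python for illustration only. It is not Druid's implementation: `UnusedSegment` and `killable_segments` are hypothetical names, and the inclusive `<=` comparison against the cutoff is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class UnusedSegment:
    # Hypothetical stand-in for a segment row in the metadata store.
    segment_id: str
    used_status_last_updated: datetime


def killable_segments(segments, now, buffer_period):
    """Return only the unused segments whose buffer period has elapsed."""
    # Cutoff corresponding to the maxUsedStatusLastUpdatedTime idea:
    # anything updated after this instant is still inside the buffer.
    max_used_status_last_updated = now - buffer_period
    # Assumption: the comparison is inclusive of the cutoff itself.
    return [
        s for s in segments
        if s.used_status_last_updated <= max_used_status_last_updated
    ]


now = datetime(2024, 1, 18, tzinfo=timezone.utc)
segments = [
    UnusedSegment("old", now - timedelta(days=30)),
    UnusedSegment("recent", now - timedelta(hours=1)),
]

# Both segments fall in the same umbrella interval, but only the one
# marked unused 30 days ago is eligible with a 30-day buffer period.
killable = killable_segments(segments, now, timedelta(days=30))
print([s.segment_id for s in killable])  # ['old']
```

With a buffer period of zero, the same function returns both segments, which matches the earlier behavior of killing everything in the umbrella interval.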

* Clarify default behavior in docs.

* test comments

* fix canDutyRun()

* small updates.

* checkstyle

* forbidden api fix

* doc fix, unused import, codeql scan error, and cleanup logs.

* Address review comments

* Rename maxUsedFlagLastUpdatedTime to maxUsedStatusLastUpdatedTime

This is consistent with the column name `used_status_last_updated`.

* Apply suggestions from code review

Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>

* Make period Duration type

* Remove older variants of runKillTask() in OverlordClient interface

* Test can now run without waiting for canDutyRun().

* Remove previous variants of retrieveUnusedSegments from internal metadata storage coordinator interface.

Removes the following interface methods in favor of a new method added:
- retrieveUnusedSegmentsForInterval(String, Interval)
- retrieveUnusedSegmentsForInterval(String, Interval, Integer)

* Chain stream operations

* cleanup

* Pass in the lastUpdatedTime to markUnused test function and remove sleep.

---------

Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>
2024-01-18 22:23:50 -08:00
..
alerts.md Docusaurus build framework + ingestion doc refresh. (#8311) 2019-08-20 21:48:59 -07:00
auth-ldap.md docs: LDAP trust store property clarification (#15028) 2023-11-02 13:00:08 -07:00
auth.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
basic-cluster-tuning.md docs: Anchor link checker (#15624) 2024-01-08 15:19:05 -08:00
clean-metadata-store.md Kill tasks honor the buffer period of unused segments (#15710) 2024-01-18 22:23:50 -08:00
deep-storage-migration.md cleaning up and fixing links (#10528) 2020-12-17 13:37:43 -08:00
dump-segment.md Improve memory efficiency of WrappedRoaringBitmap. (#13889) 2023-03-09 15:48:02 -08:00
durable-storage.md Add MSQ Durable Storage Connector for Google Cloud Storage and change current Google Cloud Storage client library (#15398) 2023-12-14 07:34:49 +05:30
dynamic-config-provider.md Docs: Splits Kafka topic. Adds detailed example for kafka inputFormat (#11912) 2021-11-12 13:02:23 -08:00
export-metadata.md Update Kafka loading docs to use the streaming data loader (#8544) 2019-09-22 15:00:52 -07:00
high-availability.md fixes web console introduction and addresses linking issues (#11609) 2021-08-18 08:37:05 -07:00
http-compression.md De-incubation cleanup in code, docs, packaging (#9108) 2020-01-03 12:33:19 -05:00
insert-segment-to-db.md De-incubation cleanup in code, docs, packaging (#9108) 2020-01-03 12:33:19 -05:00
java.md Claim full support for Java 17. (#14384) 2023-07-07 12:52:35 -07:00
kubernetes.md Update kubernetes.md (#13858) 2023-02-28 11:20:24 -08:00
metadata-migration.md Clean up after add kill bufferPeriod (#14868) 2023-08-19 00:00:04 +05:30
metrics.md Clean up duty for non-overlapping eternity tombstones (#15281) 2023-12-11 08:57:15 -08:00
migrate-from-firehose-ingestion.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
mixed-workloads.md Enable query request queuing by default when total laning is turned on. (#15440) 2024-01-09 07:54:26 +05:30
other-hadoop.md Removes support for Hadoop 2 (#14763) 2023-08-09 17:47:52 +05:30
password-provider.md Security overview documentation (#10339) 2020-11-19 15:24:58 -08:00
pull-deps.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
request-logging.md Add sql planning time metric (#12923) 2022-08-22 11:09:44 +05:30
reset-cluster.md Update reset-cluster.md (#10990) 2021-03-29 20:38:35 -07:00
rolling-updates.md add links to release notes, light refactor of landing page (#11051) 2021-05-07 14:26:47 -07:00
rule-configuration.md Update retention rules doc (#15439) 2023-12-05 09:53:17 -08:00
security-overview.md remove references to Jupyter notebooks within the Druid repo (#15143) 2023-11-01 13:17:06 -07:00
security-user-auth.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
segment-optimization.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
single-server.md Docs: reword single server page (#13659) 2023-01-11 21:12:52 -08:00
tls-support.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
use_sbt_to_build_fat_jar.md Docusaurus build framework + ingestion doc refresh. (#8311) 2019-08-20 21:48:59 -07:00
web-console.md docs: Anchor link checker (#15624) 2024-01-08 15:19:05 -08:00