druid/docs/ingestion
Kashif Faraz d6949b1b79
Track input processedBytes with MSQ ingestion (#13559)
Follow up to #13520

Bytes processed are currently tracked for intermediate stages in MSQ ingestion.
This patch adds the capability to track the bytes processed by an MSQ controller
task while reading from an external input source or a segment source.

Changes:
- Track `processedBytes` for every `InputSource` read in `ExternalInputSliceReader`
- Update `ChannelCounters` with the above obtained `processedBytes` when incrementing
the input file count.
- Update task report structure in docs

The total input processed bytes can be obtained by summing the `processedBytes` as follows:

totalBytes = 0
for every root stage (i.e. a stage which does not have another stage as an input):
    for every worker in that stage:
        for every input channel: (i.e. channels with prefix "input", e.g. "input0", "input1", etc.)
            totalBytes += processedBytes
2022-12-16 02:20:01 +05:30
..
data-formats.md Update nested columns docs (#13461) 2022-12-01 10:47:32 -08:00
data-model.md Various documentation updates. (#13107) 2022-09-16 21:58:11 -07:00
faq.md Various documentation updates. (#13107) 2022-09-16 21:58:11 -07:00
hadoop.md clarify hadoop input paths (#11781) 2021-10-07 20:22:51 -07:00
index.md Add missing MSQ error code fields to docs (#13308) 2022-11-10 21:03:04 +05:30
ingestion-spec.md Updated docs on front coding (#13387) 2022-11-19 00:01:04 -08:00
migrate-from-firehose-ingestion.md Correction to firehose migration doc (#13423) 2022-11-28 10:24:27 +05:30
native-batch-firehose.md Firehose migration doc (#12981) 2022-11-21 11:17:12 -08:00
native-batch-input-source.md Use standard library to correctly glob and stop at the correct folder structure when filtering cloud objects (#13027) 2022-11-10 23:46:40 -08:00
native-batch-simple-task.md Various documentation updates. (#13107) 2022-09-16 21:58:11 -07:00
native-batch.md Update to native ingestion doc (#13482) 2022-12-07 15:08:19 +05:30
partitioning.md Add missing MSQ error code fields to docs (#13308) 2022-11-10 21:03:04 +05:30
rollup.md Add missing MSQ error code fields to docs (#13308) 2022-11-10 21:03:04 +05:30
schema-design.md Update nested columns docs (#13461) 2022-12-01 10:47:32 -08:00
standalone-realtime.md Reduce visibility of Tranquility documentation (#11134) 2021-05-03 16:48:24 -07:00
tasks.md Track input processedBytes with MSQ ingestion (#13559) 2022-12-16 02:20:01 +05:30
tranquility.md Reduce visibility of Tranquility documentation (#11134) 2021-05-03 16:48:24 -07:00