Clarify split hint spec behavior (#10656)

Atul Mohan 2020-12-09 08:24:32 -06:00 committed by GitHub
parent 4ea1ab8531
commit 44df05b8b2
1 changed file with 3 additions and 2 deletions

@@ -230,7 +230,7 @@ Note that each worker task processes a single input split. You can control the a
 #### Size-based Split Hint Spec
-The size-based split hint spec is respected by all splittable input sources except for the HTTP input source.
+The size-based split hint spec is respected by all splittable input sources except for the HTTP input source and SQL input source.
 |property|description|default|required?|
 |--------|-----------|-------|---------|
@@ -1135,7 +1135,7 @@ the [S3 input source](#s3-input-source) or the [Google Cloud Storage input sourc
 The HTTP input source is to support reading files directly
 from remote sites via HTTP.
 The HTTP input source is _splittable_ and can be used by the [Parallel task](#parallel-task),
-where each worker task of `index_parallel` will read only one file.
+where each worker task of `index_parallel` will read only one file. This input source does not support Split Hint Spec.
 Sample specs:
@@ -1340,6 +1340,7 @@ Only rows where `page` = `Druid` will be returned.
 The SQL input source is used to read data directly from RDBMS.
 The SQL input source is _splittable_ and can be used by the [Parallel task](#parallel-task), where each worker task will read from one SQL query from the list of queries.
+This input source does not support Split Hint Spec.
 Since this input source has a fixed input format for reading events, no `inputFormat` field needs to be specified in the ingestion spec when using this input source.
 Please refer to the Recommended practices section below before using this input source.