diff --git a/docs/_graphics/indexing_service.dot b/docs/_graphics/indexing_service.dot
index 41e70c36d6f..38505512998 100644
--- a/docs/_graphics/indexing_service.dot
+++ b/docs/_graphics/indexing_service.dot
@@ -27,7 +27,7 @@ digraph g {
   subgraph cluster_mm1 {
     style="dotted"
-    mm1 [shape = "box" label = "Middle Manager" ]
+    mm1 [shape = "box" label = "Middle Manager 1" ]
     peon_11[label = "peon"]
     peon_12[label = "peon"]
     peon_13[label = "peon"]
diff --git a/docs/content/Tasks.md b/docs/content/Tasks.md
index 6fce57d798c..cc6b5553023 100644
--- a/docs/content/Tasks.md
+++ b/docs/content/Tasks.md
@@ -72,7 +72,9 @@ The Hadoop Index Task is used to index larger data sets that require the paralle
 |property|description|required?|
 |--------|-----------|---------|
 |type|The task type, this should always be "index_hadoop".|yes|
-|config|See [Batch Ingestion](Batch-ingestion.html)|yes|
+|config|A Hadoop Index Config. See [Batch Ingestion](Batch-ingestion.html)|yes|
+|hadoopCoordinates|The Maven <groupId>:<artifactId>:<version> of Hadoop to use. The default is "org.apache.hadoop:hadoop-core:1.0.3".|no|
+
 
 The Hadoop Index Config submitted as part of an Hadoop Index Task is identical to the Hadoop Index Config used by the `HadoopBatchIndexer` except that three fields must be omitted: `segmentOutputPath`, `workingPath`, `updaterJobSpec`. The Indexing Service takes care of setting these fields internally.
 
diff --git a/docs/img/indexing_service.png b/docs/img/indexing_service.png
index 7a736760565..522936ec57c 100644
Binary files a/docs/img/indexing_service.png and b/docs/img/indexing_service.png differ
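
Reviewer note: the updated `Tasks.md` table implies a task payload with a required `type` and `config`, plus the new optional `hadoopCoordinates`. The sketch below is one possible way to assemble such a payload on the client side; it is not taken from the codebase. The helper name `build_hadoop_index_task` and the `placeholderField` key are hypothetical, and the real contents of a Hadoop Index Config are described in the Batch Ingestion docs rather than here. The only facts it relies on are the three property names from the table, the `"index_hadoop"` type value, and the three fields the docs say must be omitted.

```python
import json

# Fields the docs say must be omitted from the submitted config,
# because the Indexing Service sets them internally.
OMITTED_FIELDS = ("segmentOutputPath", "workingPath", "updaterJobSpec")


def build_hadoop_index_task(config, hadoop_coordinates=None):
    """Hypothetical helper: build a Hadoop Index Task payload as a dict.

    `config` is a Hadoop Index Config given as a dict. Any of the three
    omitted fields are dropped before the config is embedded in the task.
    `hadoop_coordinates` is optional; when it is None the key is left out
    so the documented default applies.
    """
    cleaned = {k: v for k, v in config.items() if k not in OMITTED_FIELDS}
    task = {"type": "index_hadoop", "config": cleaned}
    if hadoop_coordinates is not None:
        task["hadoopCoordinates"] = hadoop_coordinates
    return task


if __name__ == "__main__":
    # Placeholder config: the keys here are illustrative only.
    example_config = {"placeholderField": "value", "workingPath": "/tmp/ignored"}
    task = build_hadoop_index_task(
        example_config, "org.apache.hadoop:hadoop-core:1.0.3"
    )
    print(json.dumps(task, indent=2))
```

Leaving `hadoopCoordinates` out when no value is given keeps the payload minimal and defers to the default stated in the table, rather than duplicating that default in client code.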