From 21299a6f813dd1b243b2345190d06955c8beedb9 Mon Sep 17 00:00:00 2001
From: Igal Levy
Date: Fri, 21 Feb 2014 10:43:52 -0800
Subject: [PATCH 1/2] refactored hadoop intro since the replacement page for instructions also wasn't right and there are quite a few available that are right

---
 docs/content/Tutorial:-Loading-Your-Data-Part-2.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/content/Tutorial:-Loading-Your-Data-Part-2.md b/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
index 4fa9d98dcd2..d9abc75d88c 100644
--- a/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
+++ b/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
@@ -205,9 +205,9 @@ Issuing a [TimeBoundaryQuery](TimeBoundaryQuery.html) to the real-time node shou
 Batch Ingestion
 ---------------
 
-Druid is designed for large data volumes, and most real-world data sets require batch indexing be done through a Hadoop job.
+Druid is designed for large data volumes, and most real-world data sets require batch indexing be done through a Hadoop job.
 
-The setup for a single node, 'standalone' Hadoop cluster is available [here](http://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/SingleCluster.html).
+For this tutorial, we used [Hadoop 1.0.3](https://archive.apache.org/dist/hadoop/core/hadoop-1.0.3/). There are many pages on the Internet showing how to set up a single-node (standalone) Hadoop cluster, which is all that's needed for this example.
 
 For the purposes of this tutorial, we are going to use our very small and simple Wikipedia data set. This data can directly be ingested via other means as shown in the previous [tutorial](Tutorial%3A-Loading-Your-Data-Part-1.html), but we are going to use Hadoop here for demonstration purposes.
 
From 95c491db0f581559fab7c4789bf2c5f34a476c5c Mon Sep 17 00:00:00 2001
From: Igal Levy
Date: Wed, 26 Feb 2014 17:37:52 -0800
Subject: [PATCH 2/2] fixed broken link

---
 docs/content/Tutorial:-Loading-Your-Data-Part-2.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/content/Tutorial:-Loading-Your-Data-Part-2.md b/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
index d9abc75d88c..ff065a7f16f 100644
--- a/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
+++ b/docs/content/Tutorial:-Loading-Your-Data-Part-2.md
@@ -307,7 +307,7 @@ Examining the contents of the file, you should find:
 }
 ```
 
-If you are curious about what all this configuration means, see [here](Task.html).
+If you are curious about what all this configuration means, see [here](Tasks.html).
 
 To submit the task:
 