more exceptions

2025-02-17 07:25:02 +00:00 · 2014-01-10 18:06:42 -08:00 · 2014-01-10 18:06:42 -08:00 · f4e3f02c3b
commit f4e3f02c3b
parent 1ecc94cfb6 749634b8c3
5 changed files with 117 additions and 104 deletions
--- a/docs/content/Booting-a-production-cluster.md
+++ b/docs/content/Booting-a-production-cluster.md
@ -1,30 +1,84 @@
 ---
 layout: doc_page
 ---
-# Booting a Single Node Cluster

-[Loading Your Data](Tutorial%3A-Loading-Your-Data-Part-2.html) and [All About Queries](Tutorial%3A-All-About-Queries.html) contain recipes to boot a small druid cluster on localhost. Here we will boot a small cluster on EC2. You can checkout the code, or download a tarball from [here](http://static.druid.io/artifacts/druid-services-0.6.46-bin.tar.gz).
+# Booting a Druid Cluster
+[Loading Your Data](Tutorial%3A-Loading-Your-Data-Part-2.html) and [All About Queries](Tutorial%3A-All-About-Queries.html) contain recipes to boot a small druid cluster on localhost. However, when it's time to run a more realistic setup&mdash;for production or just for testing production&mdash;you'll want to find a way to start the cluster on multiple hosts. This document describes two different ways to do this: manually, or as a cloud service via Apache Whirr.

-The [ec2 run script](https://github.com/metamx/druid/blob/master/examples/bin/run_ec2.sh), run_ec2.sh, is located at 'examples/bin' if you have checked out the code, or at the root of the project if you've downloaded a tarball. The scripts rely on the [Amazon EC2 API Tools](http://aws.amazon.com/developertools/351), and you will need to set three environment variables:
+## Manually Booting a Druid Cluster
+You can provision individual servers, loading Druid onto each machine (or building it) and setting the required configuration for each type of node. You'll also have to set up required external dependencies. Then you'll have to start each node. This process is outlined in [Tutorial: The Druid Cluster](Tutorial:-The-Druid-Cluster.html).
+
+## Apache Whirr
+
+[Apache Whirr](http://whirr.apache.org/) is a set of libraries for launching cloud services. For Druid, Whirr serves as an easy way to launch a cluster in Amazon AWS by using simple commands and configuration files (called *recipes*).
+
+**NOTE:** Whirr will install Druid 0.5.x. At this time Whirr can launch Druid 0.5.x only, but in the near future will support Druid 0.6.x. You can download and install your own copy of Druid 0.5.x [here](http://static.druid.io/artifacts/releases/druid-services-0.5.7-bin.tar.gz).
+
+You'll need an AWS account, and an EC2 key pair from that account so that Whirr can connect to the cloud via the EC2 API. If you haven't generated a key pair, see the [AWS documentation](http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-key-pairs.html) or see this [Whirr FAQ](http://whirr.apache.org/faq.html#how-do-i-find-my-cloud-credentials).
+
+
+### Installing Whirr
+You must use a version of Whirr that includes and supports a Druid recipe. You can do it so in one of two ways:
+
+#### Build the Following Version of Whirr
+Clone the code from [https://github.com/rjurney/whirr/tree/trunk](https://github.com/rjurney/whirr/tree/trunk) and build Whirr:
+
+    git clone git@github.com:rjurney/whirr.git
+    cd whirr
+    git checkout trunk
+    mvn clean install -Dmaven.test.failure.ignore=true
+
+#### Build the Latest Version of Whirr
+Clone the code from the Whirr repository:
+
+    git clone git://git.apache.org/whirr.git
+    
+Then run `mvn install` from the root directory.
+
+### Configure Whirr
+The Whirr recipe for Druid is the configuration file `$WHIRR_HOME/recipies/druid.properties`. You can edit this file to suit your needs -- it is annotated and self-explanatory. Here are some hints about that file:
+
+* Set `whirr.location-id` to a specific AWS region (e.g., us-east-1) if desired, else one will be chosen for you.
+* You can choose the hardware used with `whirr.hardware-id` to a specific AWS region (e.g., m1.large). If you don't choose an image via `whirr.image-id` (image must be compatible with hardware), you'll get plain vanilla Linux.
+* SSH keys (not password protected) must exist for the local user. If they are in the default locations, `${sys:user.home}/.ssh/id_rsa` and `${sys:user.home}/.ssh/id_rsa.pub`, Whirr will find them. Otherwise, you'll have to specify them with `whirr.private-key-file` and `whirr.public-key-file`.
+* Be sure to specify the absolute path of the Druid realtime spec file `realtime.spec` in `whirr.druid.realtime.spec.path`.
+* Two Druid cluster templates (see `whirr.instance-templates`) are provided: a small cluster running on a single EC2 instance, and a larger cluster running on multiple instances. The first is a good test case to start with.
+
+The following AWS information must be set in `druid.properties`, as environment variables, or in the file `$WHIRR_HOME/conf/credentials`:
+
+    PROVIDER=aws-ec2
+    IDENTITY=<aws-id-key>
+    CREDENTIAL=<aws-private-key>
+    
+How to get the IDENTITY and CREDENTIAL keys is discussed above.
+
+### Start a Test Cluster With Whirr
+Run the following command:

 ```bash
-# Setup environment for ec2-api-tools
-export EC2_HOME=/path/to/ec2-api-tools-1.6.7.4/
-export PATH=$PATH:$EC2_HOME/bin
-export AWS_ACCESS_KEY=
-export AWS_SECRET_KEY=
+% $WHIRR_HOME/bin/whirr launch-cluster --config $WHIRR_HOME/recipes/druid.properties
 ```
+If Whirr starts without an errors, you should see the following message:

-Then, booting an ec2 instance running one node of each type is as simple as running the script, run_ec2.sh :)
+    Running on provider aws-ec2 using identity <your-aws-id-here>
+    
+You can then use the EC2 dashboard to locate the instance and confirm that it has started up.

+If both the instance and the Druid cluster launch successfully, a few minutes later other messages to STDOUT should follow with information returned from EC2, including the instance ID:
+
+<<<<<<< HEAD
 # Apache Whirr
+=======
+    Started cluster of 1 instances
+    Cluster{instances=[Instance{roles=[zookeeper, druid-mysql, druid-master, druid-broker, druid-compute, druid-realtime], publicIp= ...
+    
+The final message will contain login information for the instance.
+>>>>>>> master

-Apache Whirr is a set of libraries for launching cloud services. You can clone a version of Whirr that includes Druid as a service from git@github.com:rjurney/whirr.git:
+Note that the Whirr will return an exception if any of the nodes fail to launch, and the cluster will be destroyed. To destroy the cluster manually, run the following command:

 ```bash
-git clone git@github.com:rjurney/whirr.git
-cd whirr
-git checkout trunk
-mvn clean install -Dmaven.test.failure.ignore=true -Dcheckstyle.skip
-sp;bin/whirr launch-cluster --config recipes/druid.properties
+% $WHIRR_HOME/bin/whirr destroy-cluster --config $WHIRR_HOME/recipes/druid.properties
 ```
+
+
--- a/examples/bin/run_ec2.sh
+++ b/examples/bin/run_ec2.sh
@ -1,76 +0,0 @@
-# Before running, you will need to download the EC2 tools from http://aws.amazon.com/developertools/351
-# and then setup your EC2_HOME and PATH variables (or similar):
-# 
-# # Setup environment for ec2-api-tools
-# export EC2_HOME=/path/to/ec2-api-tools-1.6.7.4/
-# export PATH=$PATH:$EC2_HOME/bin
-# export AWS_ACCESS_KEY=
-# export AWS_SECRET_KEY=
-
-# Check for ec2 commands we require and die if they're missing
-type ec2-create-keypair >/dev/null 2>&1 || { echo >&2 "I require ec2-create-keypair but it's not installed.  Aborting."; exit 1; }
-type ec2-create-group >/dev/null 2>&1 || { echo >&2 "I require ec2-create-group but it's not installed.  Aborting."; exit 1; }
-type ec2-authorize >/dev/null 2>&1 || { echo >&2 "I require ec2-authorize but it's not installed.  Aborting."; exit 1; }
-type ec2-run-instances >/dev/null 2>&1 || { echo >&2 "I require ec2-run-instances but it's not installed.  Aborting."; exit 1; }
-type ec2-describe-instances >/dev/null 2>&1 || { echo >&2 "I require ec2-describe-instances but it's not installed.  Aborting."; exit 1; }
-
-# Create a keypair for our servers
-echo "Removing old keypair for druid..."
-ec2-delete-keypair druid-keypair
-echo "Creating new keypair for druid..."
-ec2-create-keypair druid-keypair > druid-keypair
-chmod 0600 druid-keypair
-mv druid-keypair ~/.ssh/
-
-# Create a security group for our servers
-echo "Creating a new security group for druid..."
-ec2-create-group druid-group -d "Druid Cluster"
-
-# Create rules that allow necessary services in our group
-echo "Creating new firewall rules for druid..."
-# SSH from outside
-ec2-authorize druid-group -P tcp -p 22
-# Enable all traffic within group
-ec2-authorize druid-group -P tcp -p 1-65535 -o druid-group
-ec2-authorize druid-group -P udp -p 1-65535 -o druid-group
-
-echo "Booting a single small instance for druid..."
-# Use ami ami-e7582d8e - Alestic Ubuntu 12.04 us-east
-INSTANCE_ID=$(ec2-run-instances ami-e7582d8e -n 1 -g druid-group -k druid-keypair --instance-type m1.small| awk '/INSTANCE/{print $2}')
-while true; do
-    sleep 1
-    INSTANCE_STATUS=$(ec2-describe-instances|grep INSTANCE|grep $INSTANCE_ID|cut -f6)
-    if [ $INSTANCE_STATUS == "running" ]
-        then
-            echo "Instance $INSTANCE_ID is status $INSTANCE_STATUS..."
-            break
-    fi
-done
-
-# Wait for the instance to come up
-echo "Waiting 60 seconds for instance $INSTANCE_ID to boot..."
-sleep 60
-
-# Get hostname and ssh with the key we created, and ssh there
-INSTANCE_ADDRESS=`ec2-describe-instances|grep 'INSTANCE'|grep $INSTANCE_ID|cut -f4`
-echo "Connecting to $INSTANCE_ADDRESS to prepare environment for druid..."
-scp -i ~/.ssh/druid-keypair -o StrictHostKeyChecking=no ./ec2/env.sh ubuntu@${INSTANCE_ADDRESS}:
-ssh -q -f -i ~/.ssh/druid-keypair -o StrictHostKeyChecking=no ubuntu@${INSTANCE_ADDRESS} 'chmod +x ./env.sh;./env.sh'
-
-echo "Prepared $INSTANCE_ADDRESS for druid."
-
-# Now to scp a tarball up that can run druid!
-if [ -f ../../services/target/druid-services-*-bin.tar.gz ];
-then
-  echo "Uploading druid tarball to server..."
-  scp -i ~/.ssh/druid-keypair -o StrictHostKeyChecking=no ../../services/target/druid-services-*-bin.tar.gz ubuntu@${INSTANCE_ADDRESS}:
-else
-  echo "ERROR - package not built!"
-fi
-
-# Now boot druid parts
-scp -i ~/.ssh/druid-keypair -o StrictHostKeyChecking=no ./ec2/run.sh ubuntu@${INSTANCE_ADDRESS}:
-ssh -q -f -i ~/.ssh/druid-keypair -o StrictHostKeyChecking=no ubuntu@${INSTANCE_ADDRESS} 'chmod +x ./run.sh;./run.sh'
-
-echo "Druid booting complete!"
-echo "ssh -i ~/.ssh/druid-keypair ubuntu@${INSTANCE_ADDRESS} #to connect"
--- a/indexing-service/src/main/java/io/druid/indexing/common/task/IndexTask.java
+++ b/indexing-service/src/main/java/io/druid/indexing/common/task/IndexTask.java
@ -31,6 +31,7 @@ import com.google.common.collect.Ordering;
 import com.google.common.collect.Sets;
 import com.google.common.collect.TreeMultiset;
 import com.google.common.primitives.Ints;
+import com.metamx.common.ISE;
 import com.metamx.common.guava.Comparators;
 import com.metamx.common.logger.Logger;
 import io.druid.data.input.Firehose;
@ -137,6 +138,10 @@ public class IndexTask extends AbstractFixedIntervalTask
    final Set<DataSegment> segments = Sets.newHashSet();

    final Set<Interval> validIntervals = Sets.intersection(granularitySpec.bucketIntervals(), getDataIntervals());
+    if (validIntervals.isEmpty()) {
+      throw new ISE("No valid data intervals found. Check out configs!");
+    }
+

    for (final Interval bucket : validIntervals) {
      final List<ShardSpec> shardSpecs;
--- a/server/src/main/java/io/druid/server/QueryResource.java
+++ b/server/src/main/java/io/druid/server/QueryResource.java
@ -128,6 +128,7 @@ public class QueryResource
              .setUser4(query.getType())
              .setUser5(query.getIntervals().get(0).toString())
              .setUser6(String.valueOf(query.hasFilters()))
+              .setUser7(req.getRemoteAddr())
              .setUser9(query.getDuration().toPeriod().toStandardMinutes().toString())
              .build("request/time", requestTime)
      );
--- a/server/src/test/java/io/druid/client/selector/ServerSelectorTest.java
+++ b/server/src/test/java/io/druid/client/selector/ServerSelectorTest.java
@ -17,35 +17,41 @@
 * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301, USA.
 */

-package io.druid.client.selector;
+package io.druid.client;

-import com.fasterxml.jackson.dataformat.smile.SmileFactory;
 import com.google.common.collect.Lists;
 import com.google.common.collect.Maps;
 import com.google.common.util.concurrent.SettableFuture;
+import com.metamx.common.guava.Sequence;
+import com.metamx.common.guava.Sequences;
 import com.metamx.http.client.HttpClient;
 import com.metamx.http.client.Request;
 import com.metamx.http.client.RequestBuilder;
-import io.druid.client.DirectDruidClient;
+import io.druid.client.selector.ConnectionCountServerSelectorStrategy;
+import io.druid.client.selector.QueryableDruidServer;
+import io.druid.client.selector.ServerSelector;
 import io.druid.jackson.DefaultObjectMapper;
 import io.druid.query.Druids;
 import io.druid.query.ReflectionQueryToolChestWarehouse;
+import io.druid.query.Result;
 import io.druid.query.timeboundary.TimeBoundaryQuery;
 import io.druid.timeline.DataSegment;
 import io.druid.timeline.partition.NoneShardSpec;
 import junit.framework.Assert;
 import org.easymock.EasyMock;
 import org.jboss.netty.handler.codec.http.HttpMethod;
+import org.jboss.netty.handler.timeout.ReadTimeoutException;
 import org.joda.time.DateTime;
 import org.joda.time.Interval;
 import org.junit.Before;
 import org.junit.Test;

+import java.io.ByteArrayInputStream;
+import java.io.InputStream;
 import java.net.URL;
+import java.util.List;

-/**
- */
-public class ServerSelectorTest
+public class DirectDruidClientTest
 {
  private HttpClient httpClient;

@ -56,10 +62,16 @@ public class ServerSelectorTest
  }

  @Test
-  public void testPick() throws Exception
+  public void testRun() throws Exception
  {
    RequestBuilder requestBuilder = new RequestBuilder(httpClient, HttpMethod.POST, new URL("http://foo.com"));
    EasyMock.expect(httpClient.post(EasyMock.<URL>anyObject())).andReturn(requestBuilder).atLeastOnce();
+
+    SettableFuture futureException = SettableFuture.create();
+
+    SettableFuture<InputStream> futureResult = SettableFuture.create();
+    EasyMock.expect(httpClient.go(EasyMock.<Request>anyObject())).andReturn(futureResult).times(1);
+    EasyMock.expect(httpClient.go(EasyMock.<Request>anyObject())).andReturn(futureException).times(1);
    EasyMock.expect(httpClient.go(EasyMock.<Request>anyObject())).andReturn(SettableFuture.create()).atLeastOnce();
    EasyMock.replay(httpClient);

@ -80,13 +92,13 @@ public class ServerSelectorTest

    DirectDruidClient client1 = new DirectDruidClient(
        new ReflectionQueryToolChestWarehouse(),
-        new DefaultObjectMapper(new SmileFactory()),
+        new DefaultObjectMapper(),
        httpClient,
        "foo"
    );
    DirectDruidClient client2 = new DirectDruidClient(
        new ReflectionQueryToolChestWarehouse(),
-        new DefaultObjectMapper(new SmileFactory()),
+        new DefaultObjectMapper(),
        httpClient,
        "foo2"
    );
@ -104,11 +116,28 @@ public class ServerSelectorTest

    TimeBoundaryQuery query = Druids.newTimeBoundaryQueryBuilder().dataSource("test").build();

-    client1.run(query);
-    client1.run(query);
-    client1.run(query);
+    Sequence s1 = client1.run(query);
+    Assert.assertEquals(1, client1.getNumOpenConnections());

-    Assert.assertTrue(client1.getNumOpenConnections() == 3);
+    // simulate read timeout
+    Sequence s2 = client1.run(query);
+    Assert.assertEquals(2, client1.getNumOpenConnections());
+    futureException.setException(new ReadTimeoutException());
+    Assert.assertEquals(1, client1.getNumOpenConnections());
+
+    // subsequent connections should work
+    Sequence s3 = client1.run(query);
+    Sequence s4 = client1.run(query);
+    Sequence s5 = client1.run(query);
+
+    Assert.assertTrue(client1.getNumOpenConnections() == 4);
+
+    // produce result for first connection
+    futureResult.set(new ByteArrayInputStream("[{\"timestamp\":\"2014-01-01T01:02:03Z\", \"result\": 42.0}]".getBytes()));
+    List<Result> results = Sequences.toList(s1, Lists.<Result>newArrayList());
+    Assert.assertEquals(1, results.size());
+    Assert.assertEquals(new DateTime("2014-01-01T01:02:03Z"), results.get(0).getTimestamp());
+    Assert.assertEquals(3, client1.getNumOpenConnections());

    client2.run(query);
    client2.run(query);