Edited and additions to pseudo-distributed section after trying it and finding what was there missing
git-svn-id: https://svn.apache.org/repos/asf/hbase/trunk@1521252 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
parent
87b4bfefa0
commit
65c68a146e
|
@ -39,7 +39,7 @@
|
||||||
</inlinemediaobject>
|
</inlinemediaobject>
|
||||||
</link>
|
</link>
|
||||||
</subtitle>
|
</subtitle>
|
||||||
<copyright><year>2012</year><holder>Apache Software Foundation.
|
<copyright><year>2013</year><holder>Apache Software Foundation.
|
||||||
All Rights Reserved. Apache Hadoop, Hadoop, MapReduce, HDFS, Zookeeper, HBase, and the HBase project logo are trademarks of the Apache Software Foundation.
|
All Rights Reserved. Apache Hadoop, Hadoop, MapReduce, HDFS, Zookeeper, HBase, and the HBase project logo are trademarks of the Apache Software Foundation.
|
||||||
</holder>
|
</holder>
|
||||||
</copyright>
|
</copyright>
|
||||||
|
|
|
@ -351,12 +351,8 @@ to ensure well-formedness of your document after an edit session.
|
||||||
<title>HBase run modes: Standalone and Distributed</title>
|
<title>HBase run modes: Standalone and Distributed</title>
|
||||||
|
|
||||||
<para>HBase has two run modes: <xref linkend="standalone" /> and <xref linkend="distributed" />. Out of the box, HBase runs in
|
<para>HBase has two run modes: <xref linkend="standalone" /> and <xref linkend="distributed" />. Out of the box, HBase runs in
|
||||||
standalone mode. To set up a distributed deploy, you will need to
|
standalone mode. Whatever your mode, you will need to configure HBase by editing files in the HBase <filename>conf</filename>
|
||||||
configure HBase by editing files in the HBase <filename>conf</filename>
|
directory. At a minimum, you must edit <code>conf/hbase-env.sh</code> to tell HBase which
|
||||||
directory.</para>
|
|
||||||
|
|
||||||
<para>Whatever your mode, you will need to edit
|
|
||||||
<code>conf/hbase-env.sh</code> to tell HBase which
|
|
||||||
<command>java</command> to use. In this file you set HBase environment
|
<command>java</command> to use. In this file you set HBase environment
|
||||||
variables such as the heapsize and other options for the
|
variables such as the heapsize and other options for the
|
||||||
<application>JVM</application>, the preferred location for log files,
|
<application>JVM</application>, the preferred location for log files,
|
||||||
|
@ -386,11 +382,12 @@ to ensure well-formedness of your document after an edit session.
|
||||||
comes from Hadoop.</para>
|
comes from Hadoop.</para>
|
||||||
</footnote>.</para>
|
</footnote>.</para>
|
||||||
|
|
||||||
<para>Distributed modes require an instance of the <emphasis>Hadoop
|
<para>Pseudo-distributed mode can run against the local filesystem or
|
||||||
Distributed File System</emphasis> (HDFS). See the Hadoop <link
|
it can run against an instance of the <emphasis>Hadoop
|
||||||
|
Distributed File System</emphasis> (HDFS). Fully-distributed mode can
|
||||||
|
ONLY run on HDFS. See the Hadoop <link
|
||||||
xlink:href="http://hadoop.apache.org/common/docs/r1.1.1/api/overview-summary.html#overview_description">
|
xlink:href="http://hadoop.apache.org/common/docs/r1.1.1/api/overview-summary.html#overview_description">
|
||||||
requirements and instructions</link> for how to set up a HDFS. Before
|
requirements and instructions</link> for how to set up HDFS.</para>
|
||||||
proceeding, ensure you have an appropriate, working HDFS.</para>
|
|
||||||
|
|
||||||
<para>Below we describe the different distributed setups. Starting,
|
<para>Below we describe the different distributed setups. Starting,
|
||||||
verification and exploration of your install, whether a
|
verification and exploration of your install, whether a
|
||||||
|
@ -399,45 +396,65 @@ to ensure well-formedness of your document after an edit session.
|
||||||
section that follows, <xref linkend="confirm" />. The same verification script applies to both
|
section that follows, <xref linkend="confirm" />. The same verification script applies to both
|
||||||
deploy types.</para>
|
deploy types.</para>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
<section xml:id="pseudo">
|
<section xml:id="pseudo">
|
||||||
<title>Pseudo-distributed</title>
|
<title>Pseudo-distributed</title>
|
||||||
|
|
||||||
<para>A pseudo-distributed mode is simply a distributed mode run on
|
<para>A pseudo-distributed mode is simply a fully-distributed mode run on
|
||||||
a single host. Use this configuration testing and prototyping on
|
a single host. Use this configuration testing and prototyping on
|
||||||
HBase. Do not use this configuration for production nor for
|
HBase. Do not use this configuration for production nor for
|
||||||
evaluating HBase performance.</para>
|
evaluating HBase performance.</para>
|
||||||
|
|
||||||
<para>First, setup your HDFS in <link xlink:href="http://hadoop.apache.org/docs/r1.0.3/single_node_setup.html">pseudo-distributed mode</link>.
|
<para>First, if you want to run on HDFS rather than on the local filesystem,
|
||||||
|
setup your HDFS. You can set up HDFS also in
|
||||||
|
<link xlink:href="http://hadoop.apache.org/docs/r1.0.3/single_node_setup.html">pseudo-distributed mode</link>.
|
||||||
|
Ensure you have a working HDFS before proceeding.
|
||||||
</para>
|
</para>
|
||||||
<para>Next, configure HBase. Below is an example <filename>conf/hbase-site.xml</filename>.
|
|
||||||
This is the file into
|
|
||||||
which you add local customizations and overrides for
|
|
||||||
<xref linkend="hbase_default_configurations" /> and <xref linkend="hdfs_client_conf" />.
|
|
||||||
Note that the <varname>hbase.rootdir</varname> property points to the
|
|
||||||
local HDFS instance.
|
|
||||||
</para>
|
|
||||||
|
|
||||||
<para>Now skip to <xref linkend="confirm" /> for how to start and verify your
|
<para>Next, configure HBase. Edit <filename>conf/hbase-site.xml</filename>.
|
||||||
pseudo-distributed install. <footnote>
|
This is the file into which you add local customizations and overrides.
|
||||||
<para>See <xref linkend="pseudo.extras">Pseudo-distributed
|
At a minimum, you must tell HBase to run in (pseudo-)distributed mode rather than
|
||||||
mode extras</xref> for notes on how to start extra Masters and
|
in default standalone mode. To do this, set the <varname>hbase.cluster.distributed</varname>
|
||||||
RegionServers when running pseudo-distributed.</para>
|
property to true (Its default is <varname>false</varname>). The absolute bare-minimum
|
||||||
</footnote></para>
|
<filename>hbase-site.xml</filename> is therefore as follows:
|
||||||
|
<programlisting>
|
||||||
|
<configuration>
|
||||||
|
<property>
|
||||||
|
<name>hbase.cluster.distributed</name>
|
||||||
|
<value>true</value>
|
||||||
|
</property>
|
||||||
|
</configuration>
|
||||||
|
</programlisting>
|
||||||
|
With this configuration, HBase will start up an HBase Master process, a ZooKeeper server,
|
||||||
|
and a RegionServer process running against the
|
||||||
|
local filesystem writing to wherever your operating system stores temporary files into a directory
|
||||||
|
named <filename>hbase-YOUR_USER_NAME</filename>.</para>
|
||||||
|
|
||||||
|
<para>Such a setup, using the local filesystem and
|
||||||
|
writing to the operating systems's temporary directory is an ephemeral setup; the Hadoop
|
||||||
|
local filesystem -- which is what HBase uses when it is writing the local filesytem does not
|
||||||
|
support <command>sync</command> so unless the system is shutdown properly, the data will be lost. Writing to
|
||||||
|
the operating system's temporary directory can also make for data loss when the machine
|
||||||
|
is restarted as this directory is usually cleared on reboot. For a more permanent
|
||||||
|
setup, see the next example where we make use of an instance of HDFS; HBase data will
|
||||||
|
be written to the Hadoop distributed filesystem rather than to the local filesystem's
|
||||||
|
tmp directory.</para>
|
||||||
|
<para>In this <filename>conf/hbase-site.xml</filename> example, the
|
||||||
|
<varname>hbase.rootdir</varname> property points to the local HDFS instance
|
||||||
|
homed on the node <varname>h-24-30.example.com</varname>.
|
||||||
<note>
|
<note>
|
||||||
|
<title>Let HBase create <filename>${hbase.rootdir}</filename></title>
|
||||||
<para>Let HBase create the <varname>hbase.rootdir</varname>
|
<para>Let HBase create the <varname>hbase.rootdir</varname>
|
||||||
directory. If you don't, you'll get warning saying HBase needs a
|
directory. If you don't, you'll get warning saying HBase needs a
|
||||||
migration run because the directory is missing files expected by
|
migration run because the directory is missing files expected by
|
||||||
HBase (it'll create them if you let it).</para>
|
HBase (it'll create them if you let it).</para>
|
||||||
</note>
|
</note>
|
||||||
|
|
||||||
<section xml:id="pseudo.config">
|
|
||||||
<title>Pseudo-distributed Configuration File</title>
|
|
||||||
<para>Below is a sample pseudo-distributed file for the node <varname>h-24-30.example.com</varname>.
|
|
||||||
<filename>hbase-site.xml</filename>
|
|
||||||
<programlisting>
|
<programlisting>
|
||||||
<configuration>
|
<configuration>
|
||||||
...
|
|
||||||
<property>
|
<property>
|
||||||
<name>hbase.rootdir</name>
|
<name>hbase.rootdir</name>
|
||||||
<value>hdfs://h-24-30.sfo.stumble.net:8020/hbase</value>
|
<value>hdfs://h-24-30.sfo.stumble.net:8020/hbase</value>
|
||||||
|
@ -446,16 +463,15 @@ to ensure well-formedness of your document after an edit session.
|
||||||
<name>hbase.cluster.distributed</name>
|
<name>hbase.cluster.distributed</name>
|
||||||
<value>true</value>
|
<value>true</value>
|
||||||
</property>
|
</property>
|
||||||
<property>
|
|
||||||
<name>hbase.zookeeper.quorum</name>
|
|
||||||
<value>h-24-30.sfo.stumble.net</value>
|
|
||||||
</property>
|
|
||||||
...
|
|
||||||
</configuration>
|
</configuration>
|
||||||
</programlisting>
|
</programlisting>
|
||||||
</para>
|
</para>
|
||||||
|
<para>Now skip to <xref linkend="confirm" /> for how to start and verify your
|
||||||
</section>
|
pseudo-distributed install. <footnote>
|
||||||
|
<para>See <xref linkend="pseudo.extras">Pseudo-distributed
|
||||||
|
mode extras</xref> for notes on how to start extra Masters and
|
||||||
|
RegionServers when running pseudo-distributed.</para>
|
||||||
|
</footnote></para>
|
||||||
|
|
||||||
<section xml:id="pseudo.extras">
|
<section xml:id="pseudo.extras">
|
||||||
<title>Pseudo-distributed Extras</title>
|
<title>Pseudo-distributed Extras</title>
|
||||||
|
@ -495,6 +511,10 @@ to ensure well-formedness of your document after an edit session.
|
||||||
|
|
||||||
</section>
|
</section>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
<section xml:id="fully_dist">
|
<section xml:id="fully_dist">
|
||||||
<title>Fully-distributed</title>
|
<title>Fully-distributed</title>
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue