HBASE-3901 Update documentation for ImportTsv to reflect recent features

git-svn-id: https://svn.apache.org/repos/asf/hbase/trunk@1125046 13f79535-47bb-0310-9956-ffa450edef68
This commit is contained in:
Michael Stack 2011-05-19 18:37:10 +00:00
parent 557ce312af
commit d5a6c9d1b1
2 changed files with 8 additions and 2 deletions

View File

@ -256,7 +256,8 @@ public class ImportTsv {
"as the row key for each imported record. You must specify exactly one column\n" +
"to be the row key.\n" +
"\n" +
"In order to prepare data for a bulk data load, pass the option:\n" +
"By default importtsv will load data directly into HBase. To instead generate\n" +
"HFiles of data to prepare for a bulk data load, pass the option:\n" +
" -D" + BULK_OUTPUT_CONF_KEY + "=/path/for/output\n" +
"\n" +
"Other options that may be specified with -D include:\n" +

View File

@ -100,12 +100,17 @@ column name HBASE_ROW_KEY is used to designate that this column should be used
as the row key for each imported record. You must specify exactly one column
to be the row key.
In order to prepare data for a bulk data load, pass the option:
By default importtsv will load data directly into HBase. To instead generate
HFiles of data to prepare for a bulk data load, pass the option:
-Dimporttsv.bulk.output=/path/for/output
Other options that may be specified with -D include:
-Dimporttsv.skip.bad.lines=false - fail if encountering an invalid line
-Dimporttsv.timestamp=currentTimeAsLong - use the specified timestamp for the import
'-Dimporttsv.separator=|' - eg separate on pipes instead of tabs
-Dimporttsv.timestamp=currentTimeAsLong - use the specified timestamp for the import
-Dimporttsv.mapper.class=my.Mapper - A user-defined Mapper to use instead of TsvImporterMapper
</pre></code>
</section>
<section name="Importing the prepared data using the completebulkload tool">