Added note on index size

git-svn-id: https://svn.apache.org/repos/asf/hbase/trunk@1127689 13f79535-47bb-0310-9956-ffa450edef68
Michael Stack 2011-05-25 21:23:20 +00:00
parent 9d197ae1ce
commit 1a628ccd42
1 changed file with 5 additions and 1 deletion


@@ -187,10 +187,11 @@ throws InterruptedException, IOException {
 </section>
 <section xml:id="keysize">
 <title>Try to minimize row and column sizes</title>
+<subtitle>Or why are my storefile indices large?</subtitle>
 <para>In HBase, values are always freighted with their coordinates; as a
 cell value passes through the system, it'll be accompanied by its
 row, column name, and timestamp. Always. If your rows and column names
-are large, especially compared o the size of the cell value, then
+are large, especially compared to the size of the cell value, then
 you may run up against some interesting scenarios. One such is
 the case described by Marc Limotte at the tail of
 <link xlink:url="https://issues.apache.org/jira/browse/HBASE-3551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=13005272#comment-13005272">HBASE-3551</link>
@@ -202,6 +203,9 @@ throws InterruptedException, IOException {
 entries in the store file index happen at a larger interval or
 modify the table schema so it makes for smaller rows and column
 names.
+Compression will also make for larger indices. See
+the thread <link xref:href="http://search-hadoop.com/m/hemBv1LiN4Q1/a+question+storefileIndexSize&amp;subj=a+question+storefileIndexSize">a question storefileIndexSize</link>
+up on the user mailing list.
 `</para>
 </section>
 <section xml:id="schema.versions">