Go to file
Mate Szalay-Beko 481662ab39
HBASE-25318 Config option for IntegrationTestImportTsv where to generate HFiles to bulkload (#2777)
IntegrationTestImportTsv is generating HFiles under the working directory of the
current hdfs user executing the tool, before bulkloading it into HBase.

Assuming you encrypt the HBase root directory within HDFS (using HDFS
Transparent Encryption), you can bulkload HFiles only if they sit in the same
encryption zone in HDFS as the HBase root directory itself.

When IntegrationTestImportTsv is executed against a real distributed cluster
and the working directory of the current user (e.g. /user/hbase) is not in the
same encryption zone as the HBase root directory (e.g. /hbase/data) then you
will get an exception:

```
ERROR org.apache.hadoop.hbase.regionserver.HRegion: There was a partial failure
due to IO when attempting to load d :
hdfs://mycluster/user/hbase/test-data/22d8460d-04cc-e032-88ca-2cc20a7dd01c/
IntegrationTestImportTsv/hfiles/d/74655e3f8da142cb94bc31b64f0475cc

org.apache.hadoop.ipc.RemoteException(java.io.IOException):
/user/hbase/test-data/22d8460d-04cc-e032-88ca-2cc20a7dd01c/
IntegrationTestImportTsv/hfiles/d/74655e3f8da142cb94bc31b64f0475cc
can't be moved into an encryption zone.
```

In this commit I make it configurable where the IntegrationTestImportTsv
generates the HFiles.

Co-authored-by: Mate Szalay-Beko <symat@apache.com>
Signed-off-by: Peter Somogyi <psomogyi@apache.org>
2021-01-05 09:24:24 +01:00
.idea HBASE-23707 Add IntelliJ check style plugin configuration (#1064) 2020-01-27 10:40:24 -08:00
bin HBASE-24620 : Add a ClusterManager which submits command to ZooKeeper and its Agent which picks and execute those Commands (#2299) 2020-12-21 15:33:36 +05:30
conf HBASE-24343 Document how to configure the http request log 2020-05-29 16:50:16 -07:00
dev-support HBASE-25437 Update refguide RC section; point at the dev-tools/create-releases/README.txt rather than repeat its content (#2804) 2021-01-02 11:34:55 -08:00
hbase-annotations HBASE-23253 Rewrite rsgroup related UTs with the new methods introduced in HBASE-22932 (#813) 2020-03-02 15:43:40 +08:00
hbase-archetypes HBAE-24507 Remove HTableDescriptor and HColumnDescriptor (#2186) 2020-08-04 10:31:42 +08:00
hbase-assembly HBASE-14067 bundle ruby files for hbase shell into a jar. 2020-10-13 11:12:07 -05:00
hbase-asyncfs HBASE-24912 Enlarge MemstoreFlusherChore/CompactionChecker period for unit test (#2285) 2020-08-21 12:41:20 +08:00
hbase-backup HBASE-24640 Purge use of VisibleForTesting (#2695) 2020-11-25 14:10:12 -08:00
hbase-balancer HBASE-24640 Purge use of VisibleForTesting (#2695) 2020-11-25 14:10:12 -08:00
hbase-build-configuration HBASE-25342 Upgrade error prone to 2.4.0 (#2725) 2020-12-02 22:23:03 +08:00
hbase-checkstyle HBASE-22749 Distributed MOB compactions 2020-02-19 16:06:38 -06:00
hbase-client HBASE-25457 Possible race in AsyncConnectionImpl between getChoreServ… (#2839) 2021-01-04 23:30:32 +08:00
hbase-common HBASE-25439 Add BYTE unit in PrettyPrinter.Unit (#2812) 2021-01-03 09:34:00 -08:00
hbase-endpoint HBASE-24640 Purge use of VisibleForTesting (#2695) 2020-11-25 14:10:12 -08:00
hbase-examples HBASE-25451 Upgrade commons-io to 2.8.0 (#2825) 2020-12-31 16:57:27 +08:00
hbase-external-blockcache HBASE-25292 Improve InetSocketAddress usage discipline (#2669) 2020-11-28 21:01:22 +08:00
hbase-hadoop-compat HBASE-24640 Purge use of VisibleForTesting (#2695) 2020-11-25 14:10:12 -08:00
hbase-hbtop HBASE-25452 Use MatcherAssert.assertThat instead of org.junit.Assert.assertThat (#2826) 2020-12-30 09:47:56 +08:00
hbase-http HBASE-25452 Use MatcherAssert.assertThat instead of org.junit.Assert.assertThat (#2826) 2020-12-30 09:47:56 +08:00
hbase-it HBASE-25318 Config option for IntegrationTestImportTsv where to generate HFiles to bulkload (#2777) 2021-01-05 09:24:24 +01:00
hbase-logging HBASE-24309 Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly (#1640) 2020-05-12 12:03:30 +08:00
hbase-mapreduce HBASE-25246 Backup/Restore hbase cell tags 2020-12-16 18:52:36 +05:30
hbase-metrics HBASE-24663 Add procedure process time statistics UI (#2035) 2020-07-10 15:03:52 +08:00
hbase-metrics-api HBASE-24640 Purge use of VisibleForTesting (#2695) 2020-11-25 14:10:12 -08:00
hbase-procedure HBASE-25447 remoteProc is suspended due to OOM ERROR (#2824) 2021-01-04 21:04:38 +05:30
hbase-protocol-shaded HBASE-25181 Add options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys. 2020-11-09 13:53:50 -06:00
hbase-replication HBASE-25452 Use MatcherAssert.assertThat instead of org.junit.Assert.assertThat (#2826) 2020-12-30 09:47:56 +08:00
hbase-resource-bundle HBASE-23834 HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch (#2222) 2020-08-25 12:05:52 +08:00
hbase-rest HBASE-25452 Use MatcherAssert.assertThat instead of org.junit.Assert.assertThat (#2826) 2020-12-30 09:47:56 +08:00
hbase-server HBASE-25447 remoteProc is suspended due to OOM ERROR (#2824) 2021-01-04 21:04:38 +05:30
hbase-shaded HBASE-23834 HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch (#2222) 2020-08-25 12:05:52 +08:00
hbase-shell HBASE-25439 Add BYTE unit in PrettyPrinter.Unit (#2812) 2021-01-03 09:34:00 -08:00
hbase-testing-util HBASE-24309 Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly (#1640) 2020-05-12 12:03:30 +08:00
hbase-thrift HBASE-25242 Add Increment/Append support to RowMutations (#2630) 2020-11-26 13:31:43 +09:00
hbase-zookeeper HBASE-25279 Make ZKWatcher ExecutorService launch daemon threads 2020-12-31 19:19:23 -05:00
src Shenshengli hbase-25450 The parameter "hbase.bucketcache.size" is misdescribed (#2821) 2021-01-03 09:27:08 -08:00
.asf.yaml HBASE-24261: (Addendum) Make notifications a toplevel object 2020-05-04 16:19:58 -07:00
.editorconfig HBASE-23767 Add JDK11 compilation and unit test support to Github precommit 2020-03-05 11:46:15 -08:00
.gitattributes HBASE-6816. [WINDOWS] line endings on checkout for .sh files 2013-01-23 19:30:14 +00:00
.gitignore HBASE-24271 Set values in `conf/hbase-site.xml` that enable running on `LocalFileSystem` out of the box 2020-05-14 13:18:43 -07:00
.pylintrc HBASE-21712 : Make submit-patch.py python3 compatible 2019-01-14 14:50:50 -08:00
.rubocop.yml HBASE-23943 Updated Rubocop configuration to current version 2020-03-18 11:09:36 +01:00
CHANGES.txt HBASE-18548 Move sources of website gen and check jobs into source control 2017-08-10 14:48:14 -07:00
LICENSE.txt HBASE-18548 Move sources of website gen and check jobs into source control 2017-08-10 14:48:14 -07:00
NOTICE.txt HBASE-24417 update copyright notices year to 2020 (#1760) 2020-05-25 12:58:29 +08:00
README.txt HBASE-14348 Update download mirror link 2018-04-04 14:30:06 -07:00
pom.xml HBASE-25451 Upgrade commons-io to 2.8.0 (#2825) 2020-12-31 16:57:27 +08:00

README.txt

Apache HBase [1] is an open-source, distributed, versioned, column-oriented
store modeled after Google' Bigtable: A Distributed Storage System for
Structured Data by Chang et al.[2]  Just as Bigtable leverages the distributed
data storage provided by the Google File System, HBase provides Bigtable-like
capabilities on top of Apache Hadoop [3].

To get started using HBase, the full documentation for this release can be
found under the doc/ directory that accompanies this README.  Using a browser,
open the docs/index.html to view the project home page (or browse to [1]).
The hbase 'book' at http://hbase.apache.org/book.html has a 'quick start'
section and is where you should being your exploration of the hbase project.

The latest HBase can be downloaded from an Apache Mirror [4].

The source code can be found at [5]

The HBase issue tracker is at [6]

Apache HBase is made available under the Apache License, version 2.0 [7]

The HBase mailing lists and archives are listed here [8].

The HBase distribution includes cryptographic software. See the export control
notice here [9].

1. http://hbase.apache.org
2. http://research.google.com/archive/bigtable.html
3. http://hadoop.apache.org
4. http://www.apache.org/dyn/closer.lua/hbase/
5. https://hbase.apache.org/source-repository.html
6. https://hbase.apache.org/issue-tracking.html
7. http://hbase.apache.org/license.html
8. http://hbase.apache.org/mail-lists.html
9. https://hbase.apache.org/export_control.html