HADOOP-11522. Update S3A Documentation. (Thomas Demoor via stevel)
This commit is contained in:
parent
701b96ca8e
commit
f52fcdc2e0
|
@ -202,6 +202,8 @@ Release 2.7.0 - UNRELEASED
|
||||||
|
|
||||||
HADOOP-11600. Fix up source codes to be compiled with Guava 17.0. (ozawa)
|
HADOOP-11600. Fix up source codes to be compiled with Guava 17.0. (ozawa)
|
||||||
|
|
||||||
|
HADOOP-11522. Update S3A Documentation. (Thomas Demoor via stevel)
|
||||||
|
|
||||||
OPTIMIZATIONS
|
OPTIMIZATIONS
|
||||||
|
|
||||||
HADOOP-11323. WritableComparator#compare keeps reference to byte array.
|
HADOOP-11323. WritableComparator#compare keeps reference to byte array.
|
||||||
|
|
|
@ -682,12 +682,12 @@ for ldap providers in the same way as above does.
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.access.key</name>
|
<name>fs.s3a.awsAccessKeyId</name>
|
||||||
<description>AWS access key ID. Omit for Role-based authentication.</description>
|
<description>AWS access key ID. Omit for Role-based authentication.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.secret.key</name>
|
<name>fs.s3a.awsSecretAccessKey</name>
|
||||||
<description>AWS secret key. Omit for Role-based authentication.</description>
|
<description>AWS secret key. Omit for Role-based authentication.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
|
@ -703,6 +703,46 @@ for ldap providers in the same way as above does.
|
||||||
<description>Enables or disables SSL connections to S3.</description>
|
<description>Enables or disables SSL connections to S3.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.endpoint</name>
|
||||||
|
<description>AWS S3 endpoint to connect to. An up-to-date list is
|
||||||
|
provided in the AWS Documentation: regions and endpoints. Without this
|
||||||
|
property, the standard region (s3.amazonaws.com) is assumed.
|
||||||
|
</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.host</name>
|
||||||
|
<description>Hostname of the (optional) proxy server for S3 connections.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.port</name>
|
||||||
|
<description>Proxy server port. If this property is not set
|
||||||
|
but fs.s3a.proxy.host is, port 80 or 443 is assumed (consistent with
|
||||||
|
the value of fs.s3a.connection.ssl.enabled).</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.username</name>
|
||||||
|
<description>Username for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.password</name>
|
||||||
|
<description>Password for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.domain</name>
|
||||||
|
<description>Domain for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.workstation</name>
|
||||||
|
<description>Workstation for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.attempts.maximum</name>
|
<name>fs.s3a.attempts.maximum</name>
|
||||||
<value>10</value>
|
<value>10</value>
|
||||||
|
@ -722,6 +762,33 @@ for ldap providers in the same way as above does.
|
||||||
directory listings at a time.</description>
|
directory listings at a time.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.max</name>
|
||||||
|
<value>256</value>
|
||||||
|
<description> Maximum number of concurrent active (part)uploads,
|
||||||
|
which each use a thread from the threadpool.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.core</name>
|
||||||
|
<value>15</value>
|
||||||
|
<description>Number of core threads in the threadpool.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.keepalivetime</name>
|
||||||
|
<value>60</value>
|
||||||
|
<description>Number of seconds a thread can be idle before being
|
||||||
|
terminated.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.max.total.tasks</name>
|
||||||
|
<value>1000</value>
|
||||||
|
<description>Number of (part)uploads allowed to the queue before
|
||||||
|
blocking additional uploads.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.multipart.size</name>
|
<name>fs.s3a.multipart.size</name>
|
||||||
<value>104857600</value>
|
<value>104857600</value>
|
||||||
|
|
|
@ -32,7 +32,7 @@ The specifics of using these filesystems are documented below.
|
||||||
|
|
||||||
## Warning: Object Stores are not filesystems.
|
## Warning: Object Stores are not filesystems.
|
||||||
|
|
||||||
Amazon S3 is an example of "an object store". In order to achieve scalalablity
|
Amazon S3 is an example of "an object store". In order to achieve scalability
|
||||||
and especially high availability, S3 has —as many other cloud object stores have
|
and especially high availability, S3 has —as many other cloud object stores have
|
||||||
done— relaxed some of the constraints which classic "POSIX" filesystems promise.
|
done— relaxed some of the constraints which classic "POSIX" filesystems promise.
|
||||||
|
|
||||||
|
@ -164,6 +164,46 @@ If you do any of these: change your credentials immediately!
|
||||||
<description>Enables or disables SSL connections to S3.</description>
|
<description>Enables or disables SSL connections to S3.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.endpoint</name>
|
||||||
|
<description>AWS S3 endpoint to connect to. An up-to-date list is
|
||||||
|
provided in the AWS Documentation: regions and endpoints. Without this
|
||||||
|
property, the standard region (s3.amazonaws.com) is assumed.
|
||||||
|
</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.host</name>
|
||||||
|
<description>Hostname of the (optional) proxy server for S3 connections.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.port</name>
|
||||||
|
<description>Proxy server port. If this property is not set
|
||||||
|
but fs.s3a.proxy.host is, port 80 or 443 is assumed (consistent with
|
||||||
|
the value of fs.s3a.connection.ssl.enabled).</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.username</name>
|
||||||
|
<description>Username for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.password</name>
|
||||||
|
<description>Password for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.domain</name>
|
||||||
|
<description>Domain for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.proxy.workstation</name>
|
||||||
|
<description>Workstation for authenticating with proxy server.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.attempts.maximum</name>
|
<name>fs.s3a.attempts.maximum</name>
|
||||||
<value>10</value>
|
<value>10</value>
|
||||||
|
@ -183,6 +223,33 @@ If you do any of these: change your credentials immediately!
|
||||||
directory listings at a time.</description>
|
directory listings at a time.</description>
|
||||||
</property>
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.max</name>
|
||||||
|
<value>256</value>
|
||||||
|
<description> Maximum number of concurrent active (part)uploads,
|
||||||
|
which each use a thread from the threadpool.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.core</name>
|
||||||
|
<value>15</value>
|
||||||
|
<description>Number of core threads in the threadpool.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.threads.keepalivetime</name>
|
||||||
|
<value>60</value>
|
||||||
|
<description>Number of seconds a thread can be idle before being
|
||||||
|
terminated.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
|
<property>
|
||||||
|
<name>fs.s3a.max.total.tasks</name>
|
||||||
|
<value>1000</value>
|
||||||
|
<description>Number of (part)uploads allowed to the queue before
|
||||||
|
blocking additional uploads.</description>
|
||||||
|
</property>
|
||||||
|
|
||||||
<property>
|
<property>
|
||||||
<name>fs.s3a.multipart.size</name>
|
<name>fs.s3a.multipart.size</name>
|
||||||
<value>104857600</value>
|
<value>104857600</value>
|
||||||
|
@ -231,6 +298,9 @@ If you do any of these: change your credentials immediately!
|
||||||
|
|
||||||
## Testing the S3 filesystem clients
|
## Testing the S3 filesystem clients
|
||||||
|
|
||||||
|
Due to eventual consistency, tests may fail without reason. Transient
|
||||||
|
failures, which no longer occur upon rerunning the test, should thus be ignored.
|
||||||
|
|
||||||
To test the S3* filesystem clients, you need to provide two files
|
To test the S3* filesystem clients, you need to provide two files
|
||||||
which pass in authentication details to the test runner
|
which pass in authentication details to the test runner
|
||||||
|
|
||||||
|
@ -256,7 +326,10 @@ each filesystem for its testing.
|
||||||
2. `test.fs.s3.name` : the URL of the bucket for "S3" tests
|
2. `test.fs.s3.name` : the URL of the bucket for "S3" tests
|
||||||
|
|
||||||
The contents of each bucket will be destroyed during the test process:
|
The contents of each bucket will be destroyed during the test process:
|
||||||
do not use the bucket for any purpose other than testing.
|
do not use the bucket for any purpose other than testing. Furthermore, for
|
||||||
|
s3a, all in-progress multi-part uploads to the bucket will be aborted at the
|
||||||
|
start of a test (by forcing fs.s3a.multipart.purge=true) to clean up the
|
||||||
|
temporary state of previously failed tests.
|
||||||
|
|
||||||
Example:
|
Example:
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue