HADOOP-13045. hadoop_add_classpath is not working in .hadooprc

This commit is contained in:
Allen Wittenauer 2016-04-30 12:23:51 -07:00
parent c7484eb541
commit 971af603ea
4 changed files with 48 additions and 11 deletions

View File

@ -110,7 +110,7 @@ hadoop_exec_userfuncs
# IMPORTANT! User provided code is now available! # IMPORTANT! User provided code is now available!
# #
hadoop_exec_hadooprc hadoop_exec_user_hadoopenv
hadoop_verify_confdir hadoop_verify_confdir
# do all the OS-specific startup bits here # do all the OS-specific startup bits here
@ -142,6 +142,10 @@ hadoop_shellprofiles_nativelib
hadoop_add_common_to_classpath hadoop_add_common_to_classpath
hadoop_shellprofiles_classpath hadoop_shellprofiles_classpath
# user API commands can now be run since the runtime
# environment has been configured
hadoop_exec_hadooprc
# #
# backwards compatibility. new stuff should # backwards compatibility. new stuff should
# call this when they are ready # call this when they are ready

View File

@ -383,6 +383,20 @@ function hadoop_exec_userfuncs
## @audience private ## @audience private
## @stability evolving ## @stability evolving
## @replaceable yes ## @replaceable yes
function hadoop_exec_user_hadoopenv
{
if [[ -f "${HOME}/.hadoop-env" ]]; then
hadoop_debug "Applying the user's .hadoop-env"
# shellcheck disable=SC1090
. "${HOME}/.hadoop-env"
fi
}
## @description Read the user's settings. This provides for users to
## @description run Hadoop Shell API after system bootstrap
## @audience private
## @stability evolving
## @replaceable yes
function hadoop_exec_hadooprc function hadoop_exec_hadooprc
{ {
if [[ -f "${HOME}/.hadooprc" ]]; then if [[ -f "${HOME}/.hadooprc" ]]; then

View File

@ -16,15 +16,15 @@
<!-- MACRO{toc|fromDepth=0|toDepth=3} --> <!-- MACRO{toc|fromDepth=0|toDepth=3} -->
Much of Hadoop's functionality is controlled via [the shell](CommandsManual.html). There are several ways to modify the default behavior of how these commands execute. Much of Apache Hadoop's functionality is controlled via [the shell](CommandsManual.html). There are several ways to modify the default behavior of how these commands execute.
## Important End-User Environment Variables ## Important End-User Environment Variables
Hadoop has many environment variables that control various aspects of the software. (See `hadoop-env.sh` and related files.) Some of these environment variables are dedicated to helping end users manage their runtime. Apache Hadoop has many environment variables that control various aspects of the software. (See `hadoop-env.sh` and related files.) Some of these environment variables are dedicated to helping end users manage their runtime.
### `HADOOP_CLIENT_OPTS` ### `HADOOP_CLIENT_OPTS`
This environment variable is used for almost all end-user operations. It can be used to set any Java options as well as any Hadoop options via a system property definition. For example: This environment variable is used for almost all end-user operations. It can be used to set any Java options as well as any Apache Hadoop options via a system property definition. For example:
```bash ```bash
HADOOP_CLIENT_OPTS="-Xmx1g -Dhadoop.socks.server=localhost:4000" hadoop fs -ls /tmp HADOOP_CLIENT_OPTS="-Xmx1g -Dhadoop.socks.server=localhost:4000" hadoop fs -ls /tmp
@ -34,7 +34,7 @@ will increase the memory and send this command via a SOCKS proxy server.
### `HADOOP_USER_CLASSPATH` ### `HADOOP_USER_CLASSPATH`
The Hadoop scripts have the capability to inject more content into the classpath of the running command by setting this environment variable. It should be a colon delimited list of directories, files, or wildcard locations. The Apache Hadoop scripts have the capability to inject more content into the classpath of the running command by setting this environment variable. It should be a colon delimited list of directories, files, or wildcard locations.
```bash ```bash
HADOOP_USER_CLASSPATH=${HOME}/lib/myjars/*.jar hadoop classpath HADOOP_USER_CLASSPATH=${HOME}/lib/myjars/*.jar hadoop classpath
@ -44,13 +44,13 @@ A user can provides hints to the location of the paths via the `HADOOP_USER_CLAS
### Auto-setting of Variables ### Auto-setting of Variables
If a user has a common set of settings, they can be put into the `${HOME}/.hadooprc` file. This file is always read to initialize and override any variables that the user may want to customize. It uses bash syntax, similar to the `.bashrc` file: If a user has a common set of settings, they can be put into the `${HOME}/.hadoop-env` file. This file is always read to initialize and override any variables that the user may want to customize. It uses bash syntax, similar to the `.bashrc` file:
For example: For example:
```bash ```bash
# #
# my custom Hadoop settings! # my custom Apache Hadoop settings!
# #
HADOOP_USER_CLASSPATH=${HOME}/hadoopjars/* HADOOP_USER_CLASSPATH=${HOME}/hadoopjars/*
@ -58,7 +58,7 @@ HADOOP_USER_CLASSPATH_FIRST=yes
HADOOP_CLIENT_OPTS="-Xmx1g" HADOOP_CLIENT_OPTS="-Xmx1g"
``` ```
The `.hadooprc` file can also be used to extend functionality and teach Hadoop new tricks. For example, to run hadoop commands accessing the server referenced in the environment variable `${HADOOP_SERVER}`, the following in the `.hadooprc` will do just that: The `.hadoop-env` file can also be used to extend functionality and teach Apache Hadoop new tricks. For example, to run hadoop commands accessing the server referenced in the environment variable `${HADOOP_SERVER}`, the following in the `.hadoop-env` will do just that:
```bash ```bash
@ -67,13 +67,15 @@ if [[ -n ${HADOOP_SERVER} ]]; then
fi fi
``` ```
One word of warning: not all of Unix Shell API routines are available or work correctly in `.hadoop-env`. See below for more information on `.hadooprc`.
## Administrator Environment ## Administrator Environment
There are many environment variables that impact how the system operates. By far, the most important are the series of `_OPTS` variables that control how daemons work. These variables should contain all of the relevant settings for those daemons. There are many environment variables that impact how the system operates. By far, the most important are the series of `_OPTS` variables that control how daemons work. These variables should contain all of the relevant settings for those daemons.
More, detailed information is contained in `hadoop-env.sh` and the other env.sh files. More, detailed information is contained in `hadoop-env.sh` and the other env.sh files.
Advanced administrators may wish to supplement or do some platform-specific fixes to the existing scripts. In some systems, this means copying the errant script or creating a custom build with these changes. Hadoop provides the capabilities to do function overrides so that the existing code base may be changed in place without all of that work. Replacing functions is covered later under the Shell API documentation. Advanced administrators may wish to supplement or do some platform-specific fixes to the existing scripts. In some systems, this means copying the errant script or creating a custom build with these changes. Apache Hadoop provides the capabilities to do function overrides so that the existing code base may be changed in place without all of that work. Replacing functions is covered later under the Shell API documentation.
## Developer and Advanced Administrator Environment ## Developer and Advanced Administrator Environment
@ -89,7 +91,7 @@ An example of a shell profile is in the libexec directory.
## Shell API ## Shell API
Hadoop's shell code has a [function library](./UnixShellAPI.html) that is open for administrators and developers to use to assist in their configuration and advanced feature management. These APIs follow the standard [Hadoop Interface Classification](./InterfaceClassification.html), with one addition: Replaceable. Apache Hadoop's shell code has a [function library](./UnixShellAPI.html) that is open for administrators and developers to use to assist in their configuration and advanced feature management. These APIs follow the standard [Apache Hadoop Interface Classification](./InterfaceClassification.html), with one addition: Replaceable.
The shell code allows for core functions to be overridden. However, not all functions can be or are safe to be replaced. If a function is not safe to replace, it will have an attribute of Replaceable: No. If a function is safe to replace, it will have the attribute of Replaceable: Yes. The shell code allows for core functions to be overridden. However, not all functions can be or are safe to be replaced. If a function is not safe to replace, it will have an attribute of Replaceable: No. If a function is safe to replace, it will have the attribute of Replaceable: Yes.
@ -98,3 +100,15 @@ In order to replace a function, create a file called `hadoop-user-functions.sh`
Functions that are marked Public and Stable are safe to use in shell profiles as-is. Other functions may change in a minor release. Functions that are marked Public and Stable are safe to use in shell profiles as-is. Other functions may change in a minor release.
### User-level API Access
In addition to `.hadoop-env`, which allows individual users to override `hadoop-env.sh`, user's may also use `.hadooprc`. This is called after the Apache Hadoop shell environment has been configured and allows the full set of shell API function calls.
For example:
```bash
hadoop_add_classpath /some/path/custom.jar
```
would go into `.hadooprc`

View File

@ -89,4 +89,9 @@ create_fake_dirs () {
[ ${unittest} = "hadooprc" ] [ ${unittest} = "hadooprc" ]
} }
@test "hadoop_exec_user_hadoopenv" {
HOME=${TMP}
echo "unittest=hadoopenv" > "${TMP}/.hadoop-env"
hadoop_exec_user_hadoopenv
[ ${unittest} = "hadoopenv" ]
}