| <noautolink> |
| |
| [[index][::Go back to Oozie Documentation Index::]] |
| |
| ---+!! Hadoop Configuration |
| |
| ---++ Hadoop Services Whitelisting |
| |
| Oozie supports whitelisting Hadoop services (JobTracker, HDFS), via 2 configuration properties: |
| |
| <verbatim> |
| ... |
| <property> |
| <name>oozie.service.HadoopAccessorService.jobTracker.whitelist</name> |
| <value> </value> |
| <description> |
| Whitelisted job tracker for Oozie service. |
| </description> |
| </property> |
| <property> |
| <name>oozie.service.HadoopAccessorService.nameNode.whitelist</name> |
| <value> </value> |
| <description> |
| Whitelisted job tracker for Oozie service. |
| </description> |
| </property> |
| ... |
| </verbatim> |
| |
| The value must follow the pattern =[AUTHORITY,...]=. Where =AUTHORITY= is the =HOST:PORT= of |
| the Hadoop service (JobTracker, HDFS). |
| |
| If the value is empty any HOST:PORT is accepted. Empty is the default value. |
| |
| ---++ Hadoop Default Configuration Values |
| |
| Oozie supports Hadoop configuration equivalent to the Hadoop =*-site.xml= files. |
| |
| The configuration property in the =oozie-site.xml= is =oozie.service.HadoopAccessorService.hadoop.configurations= |
| and its value must follow the pattern =[<AUTHORITY>=<HADOOP_CONF_DIR>,]*=. Where =<AUTHORITY>= is the =HOST:PORT= of |
| the Hadoop service (JobTracker, HDFS). The =<HADOOP_CONF_DIR>= is a Hadoop configuration directory. If the specified |
| directory is a relative path, it will be looked under the Oozie configuration directory. And absolute path can |
| also be specified. Oozie will load the Hadoop =*-site.xml= files in the following order: core-site.xml, hdfs-site.xml, |
| mapred-site.xml, yarn-site.xml, hadoop-site.xml, ssl-client.xml. |
| |
| In addition to explicit authorities, a '*' wildcard is supported. The configuration file associated with the wildcard |
| will be used as default if there is no configuration for the requested Hadoop service. |
| |
| For example, the configuration in the =oozie-site.xml= would look like: |
| |
| <verbatim> |
| ... |
| <property> |
| <name>oozie.service.HadoopAccessorService.hadoop.configurations</name> |
| <value>*=hadoop-conf,jt-bar:8021=bar-cluster,nn-bar:8020=bar-cluster</value> |
| </property> |
| ... |
| </verbatim> |
| |
| The Hadoop configuration files use the Hadoop configuration syntax. |
| |
| By default Oozie defines =*=hadoop-conf= and the default values of the =hadoop-site.xml= file are: |
| |
| <verbatim> |
| <configuration> |
| <property> |
| <name>mapreduce.jobtracker.kerberos.principal</name> |
| <value>mapred/_HOST@LOCALREALM</value> |
| </property> |
| <property> |
| <name>yarn.resourcemanager.principal</name> |
| <value>yarn/_HOST@LOCALREALM</value> |
| </property> |
| <property> |
| <name>dfs.namenode.kerberos.principal</name> |
| <value>hdfs/_HOST@LOCALREALM</value> |
| </property> |
| <property> |
| <name>mapreduce.framework.name</name> |
| <value>yarn</value> |
| </property> |
| </configuration> |
| </verbatim> |
| |
| ---++ Limitations |
| |
| All actions in a workflow application must interact with the same Hadoop JobTracker and NameNode. |
| |
| [[index][::Go back to Oozie Documentation Index::]] |
| |
| </noautolink> |