{% include JB/setup %}
Zeppelin on yarn means to run interpreter process in yarn container. The key benefit is the scalability, you won't run out of memory of the zeppelin server host if you run large amount of interpreter processes.
The following is required for yarn interpreter mode.
$HADOOP_HOME/bin
is put in PATH
. Because internally zeppelin will run command hadoop classpath
to get all the hadoop jars and put them in the classpath of Zeppelin.USE_HADOOP
as true
in zeppelin-env.sh
.Yarn interpreter mode needs to be set for each interpreter. You can set zeppelin.interpreter.launcher
to be yarn
to run it in yarn mode. Besides that, you can also specify other properties as following table.
There're several differences between yarn interpreter mode with non-yarn interpreter mode (local mode)
zeppelin.python
exist in all the nodes of yarn cluster. Because the python interpreter may launch in any node.