| --- |
| layout: page |
| title: "Apache Zeppelin on CDH" |
| description: "This document will guide you how you can build and configure the environment on CDH with Apache Zeppelin using docker scripts." |
| group: setup/deployment |
| --- |
| <!-- |
| Licensed under the Apache License, Version 2.0 (the "License"); |
| you may not use this file except in compliance with the License. |
| You may obtain a copy of the License at |
| |
| http://www.apache.org/licenses/LICENSE-2.0 |
| |
| Unless required by applicable law or agreed to in writing, software |
| distributed under the License is distributed on an "AS IS" BASIS, |
| WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
| See the License for the specific language governing permissions and |
| limitations under the License. |
| --> |
| {% include JB/setup %} |
| |
| # Apache Zeppelin on CDH |
| |
| <div id="toc"></div> |
| |
| ### 1. Import Cloudera QuickStart Docker image |
| |
| >[Cloudera](http://www.cloudera.com/) has officially provided CDH Docker Hub in their own container. Please check [this guide page](https://hub.docker.com/r/cloudera/quickstart/) for more information. |
| |
| You can import the Docker image by pulling it from Cloudera Docker Hub. |
| |
| ```bash |
| docker pull cloudera/quickstart:latest |
| ``` |
| |
| |
| ### 2. Run docker |
| |
| ```bash |
| docker run -it \ |
| -p 80:80 \ |
| -p 4040:4040 \ |
| -p 8020:8020 \ |
| -p 8022:8022 \ |
| -p 8030:8030 \ |
| -p 8032:8032 \ |
| -p 8033:8033 \ |
| -p 8040:8040 \ |
| -p 8042:8042 \ |
| -p 8088:8088 \ |
| -p 8480:8480 \ |
| -p 8485:8485 \ |
| -p 8888:8888 \ |
| -p 9083:9083 \ |
| -p 10020:10020 \ |
| -p 10033:10033 \ |
| -p 18088:18088 \ |
| -p 19888:19888 \ |
| -p 25000:25000 \ |
| -p 25010:25010 \ |
| -p 25020:25020 \ |
| -p 50010:50010 \ |
| -p 50020:50020 \ |
| -p 50070:50070 \ |
| -p 50075:50075 \ |
| -h quickstart.cloudera --privileged=true \ |
| agitated_payne_backup /usr/bin/docker-quickstart; |
| ``` |
| |
| ### 3. Verify running CDH |
| |
| To verify the application is running well, check the web UI for HDFS on `http://<hostname>:50070/` and YARN on `http://<hostname>:8088/cluster`. |
| |
| |
| ### 4. Configure Spark interpreter in Zeppelin |
| Set following configurations to `conf/zeppelin-env.sh`. |
| |
| ```bash |
| export HADOOP_CONF_DIR=[your_hadoop_conf_path] |
| export SPARK_HOME=[your_spark_home_path] |
| ``` |
| |
| `HADOOP_CONF_DIR`(Hadoop configuration path) is defined in `/scripts/docker/spark-cluster-managers/cdh/hdfs_conf`. |
| |
| Don't forget to set Spark `spark.master` as `yarn-client` in Zeppelin **Interpreters** setting page like below. |
| |
| <img src="{{BASE_PATH}}/assets/themes/zeppelin/img/docs-img/zeppelin_yarn_conf.png" /> |
| |
| ### 5. Run Zeppelin with Spark interpreter |
| After running a single paragraph with Spark interpreter in Zeppelin, |
| |
| <img src="{{BASE_PATH}}/assets/themes/zeppelin/img/docs-img/zeppelin_with_cdh.png" /> |
| |
| <br/> |
| |
| browse `http://<hostname>:8088/cluster/apps` to check Zeppelin application is running well or not. |
| |
| <img src="{{BASE_PATH}}/assets/themes/zeppelin/img/docs-img/cdh_yarn_applications.png" /> |
| |