Apache Livy

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. It supports executing snippets of code or programs in a Spark context that runs locally or in Apache Hadoop YARN.

  • Interactive Scala, Python and R shells
  • Batch submissions in Scala, Java, Python
  • Multiple users can share the same server (impersonation support)
  • Can be used for submitting jobs from anywhere with REST
  • Does not require any code change to your programs

Pull requests are welcome! Before you begin, please read the Contributing section on the Community page of our website.

Online Documentation

Guides and documentation on getting started using Livy, example code snippets, and Livy API documentation can be found at livy.incubator.apache.org.

Before Building Livy

To build Livy, you will need:

Debian/Ubuntu:

  • mvn (from maven package or maven3 tarball)
  • openjdk-8-jdk (or Oracle JDK 8)
  • Python 2.7+
  • R 3.x

Red Hat/CentOS:

  • mvn (from maven package or maven3 tarball)
  • java-1.8.0-openjdk (or Oracle JDK 8)
  • Python 2.7+
  • R 3.x

macOS:

  • Xcode command line tools
  • Oracle's JDK 1.8
  • Maven (Homebrew)
  • Python 2.7+
  • R 3.x

Required Python packages for building Livy:

  • cloudpickle
  • requests
  • requests-kerberos
  • flake8
  • flaky
  • pytest

To run Livy, you will also need a Spark installation. You can get Spark releases at https://spark.apache.org/downloads.html.

Livy requires Spark 2.4+. You can switch to a different version of Spark by setting the SPARK_HOME environment variable in the Livy server process, without needing to rebuild Livy.
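The following sketch shows one way to confirm that SPARK_HOME points at a usable Spark installation before starting the server. The check_spark_home function is a hypothetical helper, and the test for an executable bin/spark-submit is an assumption about what a valid installation looks like, not something Livy itself documents here.

```shell
# Sketch: validate SPARK_HOME before starting the Livy server.
# Assumption: a usable Spark installation has an executable bin/spark-submit.
check_spark_home() {
  if [ -z "${SPARK_HOME:-}" ]; then
    echo "SPARK_HOME is not set" >&2
    return 1
  fi
  if [ ! -x "${SPARK_HOME}/bin/spark-submit" ]; then
    echo "No executable bin/spark-submit under ${SPARK_HOME}" >&2
    return 1
  fi
  echo "Using Spark at ${SPARK_HOME}"
}

# Example usage (placeholder path; replace with your own Spark 2.4+ installation):
#   export SPARK_HOME=/opt/spark
#   check_spark_home && ./bin/livy-server start
```

Because the Spark location is read from the environment at startup, pointing SPARK_HOME at a different installation and restarting the server is enough to switch versions.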

Building Livy

Livy is built using Apache Maven. To check out and build Livy, run:

git clone https://github.com/apache/incubator-livy.git
cd incubator-livy
mvn package

You can also use the provided Dockerfile:

git clone https://github.com/apache/incubator-livy.git
cd incubator-livy
docker build -t livy-ci dev/docker/livy-dev-base/
docker run --rm -it -v "$(pwd)":/workspace -v "$HOME/.m2":/root/.m2 livy-ci mvn package

Note: The docker run command maps the container's Maven repository to your host machine's Maven cache, so subsequent runs do not need to download dependencies again.

By default, Livy is built against Apache Spark 2.4.5, but the version of Spark used when running Livy does not need to match the version used to build it. Livy handles the differences between Spark versions internally.

The Livy package itself does not contain a Spark distribution. It will work with any supported version of Spark without needing to rebuild.

Build Profiles

Flag           Purpose
-Phadoop2      Choose Hadoop2 based build dependencies (default configuration)
-Pspark2       Choose Spark 2.x based build dependencies (default configuration)
-Pspark3       Choose Spark 3.x based build dependencies
-Pscala-2.11   Choose Scala 2.11 based build dependencies (default configuration)
-Pscala-2.12   Choose Scala 2.12 based build dependencies
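
The profile flags above are passed to Maven on the command line. The sketch below assembles such a command for a chosen Spark and Scala profile and prints it rather than running it. The livy_build_cmd function is a hypothetical helper, not part of the Livy repository, and pairing spark3 with scala-2.12 is an assumption based on Spark 3.x being distributed for Scala 2.12; check the project's pom.xml for the combinations it actually supports.

```shell
# Sketch: assemble a Maven command from the profile flags listed above.
# livy_build_cmd is a hypothetical helper, not part of the Livy repository.
livy_build_cmd() {
  spark_profile="${1:-spark2}"      # spark2 (default) or spark3
  scala_profile="${2:-scala-2.11}"  # scala-2.11 (default) or scala-2.12
  echo "mvn package -P${spark_profile} -P${scala_profile}"
}

# Print the command for a Spark 3.x / Scala 2.12 build; run it from the repository root.
livy_build_cmd spark3 scala-2.12
```

Calling livy_build_cmd with no arguments prints the command for the default profiles.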