Mirror of Apache AsterixDB

Clone this repo:
  1. 67a2492 [ASTERIXDB-3430] Log schema as a one-liner JSON by preetham0202 · 32 hours ago master
  2. e3a06f7 [NO ISSUE][STO] Column cloud refactoring by Wail Alkowaileet · 11 hours ago
  3. f590735 [ASTERIXDB-3427][COMP] CBO not picking array index by Vijay Sarathy · 21 hours ago
  4. b7107ab [NO ISSUE][TEST] Change cloud test cache policy to selective by Wail Alkowaileet · 36 hours ago
  5. ff8a16a [ASTERIXDB-3435][STO] Use syncWrite instead of doSyncWrite on persist by Wail Alkowaileet · 2 days ago

What is AsterixDB?

AsterixDB is a BDMS (Big Data Management System) with a rich feature set that sets it apart from other Big Data platforms. Its feature set makes it well-suited to modern needs such as web data warehousing and social data storage and analysis. AsterixDB has:

  • Data model
    A semistructured NoSQL style data model (ADM) resulting from extending JSON with object database ideas

  • Query languages
    An expressive and declarative query language (SQL++ that supports a broad range of queries and analysis over semistructured data

  • Scalability
    A parallel runtime query execution engine, Apache Hyracks, that has been scale-tested on up to 1000+ cores and 500+ disks

  • Native storage
    Partitioned LSM-based data storage and indexing to support efficient ingestion and management of semistructured data

  • External storage
    Support for query access to externally stored data (e.g., data in HDFS) as well as to data stored natively by AsterixDB

  • Data types
    A rich set of primitive data types, including spatial and temporal data in addition to integer, floating point, and textual data

  • Indexing
    Secondary indexing options that include B+ trees, R trees, and inverted keyword (exact and fuzzy) index types

  • Transactions
    Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store

Learn more about AsterixDB at its website.

Build from source

To build AsterixDB from source, you should have a platform with the following:

  • A Unix-ish environment (Linux, OS X, will all do).
  • git
  • Maven 3.3.9 or newer.
  • JDK 11 or newer.
  • Python 3.6+ with pip and venv

Instructions for building the master:

  • Checkout AsterixDB master:

      $git clone https://github.com/apache/asterixdb.git
  • Build AsterixDB master:

      $cd asterixdb
      $mvn clean package -DskipTests

Run the build on your machine

Here are steps to get AsterixDB running on your local machine:

  • Start a single-machine AsterixDB instance:

      $cd asterixdb/asterix-server/target/asterix-server-*-binary-assembly/apache-asterixdb-*-SNAPSHOT
  • Good to go and run queries in your browser at:

  • Read more documentation to learn the data model, query language, and how to create a cluster instance.


To generate the documentation, run asterix-doc with the generate.rr profile in maven, e.g mvn -Pgenerate.rr ... Be sure to run mvn package beforehand or run mvn site in asterix-lang-sqlpp to generate some resources that are used in the documentation that are generated directly from the grammar.

Community support