An open source ML system for the end-to-end data science lifecycle

Clone this repo:
  1. 17803b1 [BWARE] Speed up frame-to-matrix conversion and harden number parsing (#2480) by Sebastian Baunsgaard · 3 days ago main
  2. c58e5b5 [BWARE] Add HashMapIntToInt primitive int-to-int hash map (#2478) by Sebastian Baunsgaard · 3 days ago
  3. d90ecf6 [MINOR] Fix CholeskyTest crash when residual is exactly zero (#2487) by Sebastian Baunsgaard · 3 days ago
  4. 37189bf [MINOR][CI] Retry Maven test-compile on transient repository download errors by Sebastian Baunsgaard · 3 days ago
  5. 1b991ae [MINOR] Reduce surefire per-fork test timeout from 1380s to 600s (#2485) by Sebastian Baunsgaard · 3 days ago

Apache SystemDS

Overview: Apache SystemDS is an open-source machine learning (ML) system for the end-to-end data science lifecycle from data preparation and cleaning, over efficient ML model training, to debugging and serving. ML algorithms or pipelines are specified in a high-level language with R-like syntax or related Python and Java APIs (with many builtin primitives), and the system automatically generates hybrid runtime plans of local, in-memory operations and distributed operations on Apache Spark. Additional backends exist for GPUs and federated learning.

ResourceLinks
Quick StartInstall, Quick Start and Hello World
Documentation:SystemDS Documentation
Python DocumentationPython SystemDS Documentation
Issue TrackerJira Dashboard

Status and Build: SystemDS is renamed from SystemML which is an Apache Top Level Project. To build from source visit SystemDS Install from source

Build Documentation LicenseCheck Java Tests codecov Python Test Total PyPI downloads Monthly PyPI downloads