Apache datasketches

Clone this repo:

Branches

  1. 0656269 Merge pull request #1 from apache/remove_travis by Lee Rhodes · 1 year, 3 months ago master
  2. 4f3c27e Delete .travis.yml by Jon Malkin · 1 year, 3 months ago
  3. c8932c4 Adding tools directory by Lee Rhodes · 3 years, 2 months ago
  4. 6c46f6f Update datasketches (parent) by Lee Rhodes · 3 years, 4 months ago
  5. 58d1892 Cleanout in preparation for a parent POM by Lee Rhodes · 3 years, 4 months ago

DataSketches is now Apache DataSketches.

DataSketches is an open source, high-performance library of stochastic streaming algorithms commonly called “sketches” in the data sciences. Sketches are small, stateful programs that process massive data as a stream and can provide approximate answers, with mathematical guarantees, to computationally difficult queries orders-of-magnitude faster than traditional, exact methods.

In 2019, after 8 years of development and 5 years as in Open Source, we began the important migration from a stand-alone GitHub site to being a member of the Apache Software Foundation community. As of December, 2020, we became an official Top-Level Project within the ASF.