tree: fd5ede29bb281e1f9af4f5f81dde3b678871447b [path history] [tgz]
  1. .gitignore
  2. .gitmodules
  3. CMakeLists.txt
  4. DISCLAIMER-WIP
  5. LICENSE
  6. MANIFEST.in
  7. Makefile
  8. NOTICE
  9. README.md
  10. cmake/
  11. common/
  12. config.mk
  13. cpc/
  14. fi/
  15. hll/
  16. kll/
  17. pyproject.toml
  18. python/
  19. setup.py
  20. theta/
README.md

This is a C++ version of the DataSketches core library. See Apache DataSketches home

Apache DataSketches is an open source, high-performance library of stochastic streaming algorithms commonly called “sketches” in the data sciences. Sketches are small, stateful programs that process massive data as a stream and can provide approximate answers, with mathematical guarantees, to computationally difficult queries orders-of-magnitude faster than traditional, exact methods.

This code requires C++11. It was tested with GCC 4.8.5 (standard in RedHat at the time of this writing), GCC 8.2.0 and Apple LLVM version 10.0.1 (clang-1001.0.46.4)

This includes Python bindings. For the Python interface, see the README notes in the python subdirectory.