Merge pull request #142 from apache/include_sstream

includes
tree: ff43e04bd4a72e510e115e150583e6f680a45f91
  1. .github/
  2. common/
  3. cpc/
  4. fi/
  5. hll/
  6. kll/
  7. python/
  8. sampling/
  9. theta/
  10. .gitignore
  11. .gitmodules
  12. CMakeLists.txt
  13. DISCLAIMER-WIP
  14. LICENSE
  15. MANIFEST.in
  16. NOTICE
  17. pyproject.toml
  18. README.md
  19. setup.py
README.md

This is a C++ version of the DataSketches core library. See Apache DataSketches home

Apache DataSketches is an open source, high-performance library of stochastic streaming algorithms commonly called “sketches” in the data sciences. Sketches are small, stateful programs that process massive data as a stream and can provide approximate answers, with mathematical guarantees, to computationally difficult queries orders-of-magnitude faster than traditional, exact methods.

This code requires C++11. It was tested with GCC 4.8.5 (standard in RedHat at the time of this writing), GCC 8.2.0 and Apple LLVM version 10.0.1 (clang-1001.0.46.4)

This includes Python bindings. For the Python interface, see the README notes in the python subdirectory.

This library is header-only. The build process provided is only for building unit tests and the python library.

Building the unit tests requires cmake 3.12.0 or higher.

Installing the latest cmake on OSX: brew install cmake

Building and running unit tests using cmake for OSX and Linux:

$ mkdir build
$ cd build
$ cmake ..
$ make
$ make test

Building and running unit tests using cmake for Windows from the command line:

$ mkdir build $ cd build $ cmake .. $ cd .. $ cmake --build build --config Release $ cmake --build build --config Release --target RUN_TESTS

How to Contact Us