Core C++ Sketch Library

Clone this repo:
  1. 982efcc Merge pull request #229 from fivepapertigers/patch-1 by Alexander Saydakov · 5 days ago master
  2. 03c56bf Fix typo in python/README.md by Jacob · 5 days ago
  3. 8dc88bb added 2021 by AlexanderSaydakov · 13 days ago
  4. b180b12 Merge pull request #228 from apache/fix_forwarding_iterators by Alexander Saydakov · 2 weeks ago
  5. e520318 no move if const_iterator by AlexanderSaydakov · 3 weeks ago

Apache DataSketches Core C++ Library Component

This is the core C++ component of the Apache DataSketches library. It contains all of the key sketching algorithms that are in the Java component and can be accessed directly from user applications.

This component is also a dependency of other components of the library that create adaptors for target systems, such as PostgreSQL.

Note that we have a parallel core component for Java implementations of the same sketch algorithms, datasketches-java.

Please visit the main Apache DataSketches website for more information.

If you are interested in making contributions to this site please see our Community page for how to contact us.


This code requires C++11.

This includes Python bindings. For the Python interface, see the README notes in the python subdirectory.

This library is header-only. The build process provided is only for building unit tests and the python library.

Building the unit tests requires cmake 3.12.0 or higher.

Installing the latest cmake on OSX: brew install cmake

Building and running unit tests using cmake for OSX and Linux:

	$ cd build
	$ cmake ..
	$ make
	$ make test

Building and running unit tests using cmake for Windows from the command line:

	$ cd build
	$ cmake ..
	$ cd ..
	$ cmake --build build --config Release
	$ cmake --build build --config Release --target RUN_TESTS