commit | 2d9edb993ac651477cc86bd7ce68a9c4d6807de1 | [log] [tgz] |
---|---|---|

author | AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com> | Fri Jan 29 14:31:12 2021 -0800 |

committer | AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com> | Fri Jan 29 14:31:12 2021 -0800 |

tree | 199a567e6fdc243e530da12d93b01bff4873a172 | |

parent | a707a00cd2ead9e9977320d38d13c29806acf2d4 [diff] |

promoted new implementation of theta sketch

- theta/CMakeLists.txt[diff]
- theta/include/bounds_on_ratios_in_sampled_sets.hpp[Renamed from tuple/include/bounds_on_ratios_in_sampled_sets.hpp - diff]
- theta/include/bounds_on_ratios_in_theta_sketched_sets.hpp[Renamed from tuple/include/bounds_on_ratios_in_theta_sketched_sets.hpp - diff]
- theta/include/theta_a_not_b.hpp[diff]
- theta/include/theta_a_not_b_impl.hpp[diff]
- theta/include/theta_comparators.hpp[Renamed from tuple/include/theta_comparators.hpp - diff]
- theta/include/theta_constants.hpp[Renamed from tuple/include/theta_constants.hpp - diff]
- theta/include/theta_helpers.hpp[Renamed from tuple/include/theta_helpers.hpp - diff]
- theta/include/theta_intersection.hpp[diff]
- theta/include/theta_intersection_base.hpp[Renamed from tuple/include/theta_intersection_base.hpp - diff]
- theta/include/theta_intersection_base_impl.hpp[Renamed from tuple/include/theta_intersection_base_impl.hpp - diff]
- theta/include/theta_intersection_impl.hpp[diff]
- theta/include/theta_jaccard_similarity.hpp[Copied from tuple/include/theta_constants.hpp - diff]
- theta/include/theta_jaccard_similarity_base.hpp[Renamed from tuple/include/jaccard_similarity.hpp - diff]
- theta/include/theta_set_difference_base.hpp[Renamed from tuple/include/theta_set_difference_base.hpp - diff]
- theta/include/theta_set_difference_base_impl.hpp[Renamed from tuple/include/theta_set_difference_base_impl.hpp - diff]
- theta/include/theta_sketch.hpp[diff]
- theta/include/theta_sketch_impl.hpp[diff]
- theta/include/theta_union.hpp[diff]
- theta/include/theta_union_base.hpp[Renamed from tuple/include/theta_union_base.hpp - diff]
- theta/include/theta_union_base_impl.hpp[Renamed from tuple/include/theta_union_base_impl.hpp - diff]
- theta/include/theta_union_impl.hpp[diff]
- theta/include/theta_update_sketch_base.hpp[Renamed from tuple/include/theta_update_sketch_base.hpp - diff]
- theta/include/theta_update_sketch_base_impl.hpp[Renamed from tuple/include/theta_update_sketch_base_impl.hpp - diff]
- theta/test/CMakeLists.txt[diff]
- theta/test/theta_jaccard_similarity_test.cpp[Renamed from tuple/test/theta_jaccard_similarity_test.cpp - diff]
- theta/test/theta_sketch_test.cpp[diff]
- tuple/CMakeLists.txt[diff]
- tuple/include/theta_a_not_b_experimental.hpp[Deleted - diff]
- tuple/include/theta_a_not_b_experimental_impl.hpp[Deleted - diff]
- tuple/include/theta_intersection_experimental.hpp[Deleted - diff]
- tuple/include/theta_intersection_experimental_impl.hpp[Deleted - diff]
- tuple/include/theta_sketch_experimental.hpp[Deleted - diff]
- tuple/include/theta_sketch_experimental_impl.hpp[Deleted - diff]
- tuple/include/theta_union_experimental.hpp[Deleted - diff]
- tuple/include/theta_union_experimental_impl.hpp[Deleted - diff]
- tuple/include/tuple_jaccard_similarity.hpp[Renamed from tuple/test/theta_union_experimental_test.cpp - diff]
- tuple/include/tuple_sketch.hpp[diff]
- tuple/include/tuple_sketch_impl.hpp[diff]
- tuple/test/CMakeLists.txt[diff]
- tuple/test/theta_a_not_b_experimental_test.cpp[Deleted - diff]
- tuple/test/theta_compact_empty_from_java.sk[Deleted - diff]
- tuple/test/theta_compact_estimation_from_java.sk[Deleted - diff]
- tuple/test/theta_compact_single_item_from_java.sk[Deleted - diff]
- tuple/test/theta_intersection_experimental_test.cpp[Deleted - diff]
- tuple/test/theta_sketch_experimental_test.cpp[Deleted - diff]
- tuple/test/tuple_a_not_b_test.cpp[diff]
- tuple/test/tuple_intersection_test.cpp[diff]
- tuple/test/tuple_jaccard_similarity_test.cpp[diff]
- tuple/test/tuple_sketch_allocation_test.cpp[diff]
- tuple/test/tuple_union_test.cpp[diff]

51 files changed

tree: 199a567e6fdc243e530da12d93b01bff4873a172

- .asf.yaml
- .github/
- .gitignore
- .gitmodules
- CMakeLists.txt
- LICENSE
- MANIFEST.in
- NOTICE
- README.md
- build/
- common/
- cpc/
- fi/
- hll/
- kll/
- pyproject.toml
- python/
- req/
- sampling/
- setup.py
- theta/
- tuple/

README.md

This is the core C++ component of the DataSketches library. It contains all of the key sketching algorithms that are in the Java component and can be accessed directly from user applications.

This component is also a dependency of other components of the library that create adaptors for target systems, such as PostgreSQL.

Note that we have a parallel core component for Java implementations of the same sketch algorithms, datasketches-java.

Please visit the main DataSketches website for more information.

If you are interested in making contributions to this site please see our Community page for how to contact us.

This code requires C++11. It was tested with GCC 4.8.5 (standard in RedHat at the time of this writing), GCC 8.2.0 and Apple LLVM version 10.0.1 (clang-1001.0.46.4)

This includes Python bindings. For the Python interface, see the README notes in the python subdirectory.

This library is header-only. The build process provided is only for building unit tests and the python library.

Building the unit tests requires cmake 3.12.0 or higher.

Installing the latest cmake on OSX: brew install cmake

Building and running unit tests using cmake for OSX and Linux:

$ cd build $ cmake .. $ make $ make test

Building and running unit tests using cmake for Windows from the command line:

$ cd build $ cmake .. $ cd .. $ cmake --build build --config Release $ cmake --build build --config Release --target RUN_TESTS