layout: doc_page

Features Matrix for Distinct Count Sketches

CategorySub-CategoryThetaTupleHLLCPC
Space Accuracy ConfigMin Log Precision (Min LgK)4444
Max Log Precision (Max LgK)26262126
Min Log Sparse Precision64642626
Max Log Sparce Precision64642626
Space Accuracy FactorsEntropy bits / slot = b646444.8
Error Coefficient = C111.040.69
HIP Error Coefficient = C0.830.59
Space Accuracy Merit1Merging644.332.31
Not Merging (HIP)2.781.66
Input TypesintYYYY
longYYYY
doubleYYYY
StringYYYY
byte[]YYYY
char[]YYY
int[]YYYY
long[]YYYY
Sketch resultsEstimate ()doubledoubledoubledouble
Upper Bound ( Std Dev )doubledoubledoubledouble
Lower Bound ( Std Dev )doubledoubledoubledouble
Estimate (numSubSetRows)double
Upper Bound (Std Dev, numSubset Rows)double
Lower Bound (Std Dev, numSubset Rows)double
Iterator()YYYY
Set OperationsUnionYYYY
IntersectionYY
DifferenceYY
Enables full set expressionsYY
Set Op Result TypeSketchSketchSketchSketch
Merge different LgKYYYY
Serialize OperationsTo Byte ArrayYYYY
Deserialize OperationsHeapify()YYYY
Wrap()YYY
WritableWrapYYY
LanguagesJavaYYYY
C++YYY
Python2YYY
Binary compatibility across LanguagesYYYY
Other Operations and ModesJaccard IndexY
Off-Heap OperationYYYY
Associative ColumnsY
Generic ExtensionsY
User Specified Hash SeedYY
Pre SamplingYY

1 Space Accuracy Merit (Lower is better) = b * C^2 = b * K * RSE^2
2 See Python Install Instructions