layout: toc_page
The Challenge
The Major Sketch Families
Sketch Origins
Sketch Elements
Key Features
Large Scale Computing
Architecture
Overview Slide Deck
Frequent Items Sketches
HLL Sketches
Memory Package
Quantiles Sketches
Sampling Sketches
Theta Sketches
Tuple Sketches
Other Information
Frequent Items Overview
Frequent Items Java Example
Frequent Items Pig UDFs
Frequent Items Hive UDFs
Frequent Items Error Table
Frequent Items References
HLL Sketch
HLL Map Sketch
Memory Package
Quantiles Overview
Quantiles Accuracy and Size
Quantiles Sketch Java Example
Quantiles Sketch Pig UDFs
Quantiles Sketch Hive UDFs
Optimal Quantile Approximation in Streams
Quantiles References
Reservoir Sampling
Reservoir Sampling Performance
Reservoir Sampling Java Example
Theta Sketch Framework
Theta Sketch Java Example
Theta Sketch Spark Example
The Inverse Estimate
Empty Sketch
First Estimator
Better Estimator
Rejection Rules
Update V(kth) Rule
Set Operations
Basic Accuracy
Accuracy Plots
Relative Error Table
SetOp Accuracy
Unions With Different k
Theta Sketch Size
Update Speed
Merge Speed
Theta Sketch Pig UDFs
Theta Sketch Hive UDFs
Integration with Druid
Memory Package
p-Sampling
Theta Sketch Framework (PDF)
Sketch Equations (PDF)
DataSketches (PDF)
Confidence Intervals Notes
Merging Algorithm Notes
Theta References
Tuple Sketch Overview
Tuple Sketch Java Example
Tuple Sketch Pig UDFs
Tuple Sketch Hive UDFs
Creating Command Line Executables
Who Uses
License