| <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> |
| <!-- NewPage --> |
| <html lang="en"> |
| <head> |
| <!-- Generated by javadoc (1.8.0_181-google-v7) on Mon Jan 27 16:42:31 PST 2020 --> |
| <title>HllCount (Apache Beam 2.20.0-SNAPSHOT)</title> |
| <meta name="date" content="2020-01-27"> |
| <link rel="stylesheet" type="text/css" href="../../../../../../stylesheet.css" title="Style"> |
| <script type="text/javascript" src="../../../../../../script.js"></script> |
| </head> |
| <body> |
| <script type="text/javascript"><!-- |
| try { |
| if (location.href.indexOf('is-external=true') == -1) { |
| parent.document.title="HllCount (Apache Beam 2.20.0-SNAPSHOT)"; |
| } |
| } |
| catch(err) { |
| } |
| //--> |
| var methods = {"i0":9}; |
| var tabs = {65535:["t0","All Methods"],1:["t1","Static Methods"],8:["t4","Concrete Methods"]}; |
| var altColor = "altColor"; |
| var rowColor = "rowColor"; |
| var tableTab = "tableTab"; |
| var activeTableTab = "activeTableTab"; |
| </script> |
| <noscript> |
| <div>JavaScript is disabled on your browser.</div> |
| </noscript> |
| <!-- ========= START OF TOP NAVBAR ======= --> |
| <div class="topNav"><a name="navbar.top"> |
| <!-- --> |
| </a> |
| <div class="skipNav"><a href="#skip.navbar.top" title="Skip navigation links">Skip navigation links</a></div> |
| <a name="navbar.top.firstrow"> |
| <!-- --> |
| </a> |
| <ul class="navList" title="Navigation"> |
| <li><a href="../../../../../../overview-summary.html">Overview</a></li> |
| <li><a href="package-summary.html">Package</a></li> |
| <li class="navBarCell1Rev">Class</li> |
| <li><a href="package-tree.html">Tree</a></li> |
| <li><a href="../../../../../../deprecated-list.html">Deprecated</a></li> |
| <li><a href="../../../../../../index-all.html">Index</a></li> |
| <li><a href="../../../../../../help-doc.html">Help</a></li> |
| </ul> |
| </div> |
| <div class="subNav"> |
| <ul class="navList"> |
| <li>Prev Class</li> |
| <li><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Extract.html" title="class in org.apache.beam.sdk.extensions.zetasketch"><span class="typeNameLink">Next Class</span></a></li> |
| </ul> |
| <ul class="navList"> |
| <li><a href="../../../../../../index.html?org/apache/beam/sdk/extensions/zetasketch/HllCount.html" target="_top">Frames</a></li> |
| <li><a href="HllCount.html" target="_top">No Frames</a></li> |
| </ul> |
| <ul class="navList" id="allclasses_navbar_top"> |
| <li><a href="../../../../../../allclasses-noframe.html">All Classes</a></li> |
| </ul> |
| <div> |
| <script type="text/javascript"><!-- |
| allClassesLink = document.getElementById("allclasses_navbar_top"); |
| if(window==top) { |
| allClassesLink.style.display = "block"; |
| } |
| else { |
| allClassesLink.style.display = "none"; |
| } |
| //--> |
| </script> |
| </div> |
| <div> |
| <ul class="subNavList"> |
| <li>Summary: </li> |
| <li><a href="#nested.class.summary">Nested</a> | </li> |
| <li><a href="#field.summary">Field</a> | </li> |
| <li>Constr | </li> |
| <li><a href="#method.summary">Method</a></li> |
| </ul> |
| <ul class="subNavList"> |
| <li>Detail: </li> |
| <li><a href="#field.detail">Field</a> | </li> |
| <li>Constr | </li> |
| <li><a href="#method.detail">Method</a></li> |
| </ul> |
| </div> |
| <a name="skip.navbar.top"> |
| <!-- --> |
| </a></div> |
| <!-- ========= END OF TOP NAVBAR ========= --> |
| <!-- ======== START OF CLASS DATA ======== --> |
| <div class="header"> |
| <div class="subTitle">org.apache.beam.sdk.extensions.zetasketch</div> |
| <h2 title="Class HllCount" class="title">Class HllCount</h2> |
| </div> |
| <div class="contentContainer"> |
| <ul class="inheritance"> |
| <li>java.lang.Object</li> |
| <li> |
| <ul class="inheritance"> |
| <li>org.apache.beam.sdk.extensions.zetasketch.HllCount</li> |
| </ul> |
| </li> |
| </ul> |
| <div class="description"> |
| <ul class="blockList"> |
| <li class="blockList"> |
| <hr> |
| <br> |
| <pre><a href="../../../../../../org/apache/beam/sdk/annotations/Experimental.html" title="annotation in org.apache.beam.sdk.annotations">@Experimental</a> |
| public final class <span class="typeNameLabel">HllCount</span> |
| extends java.lang.Object</pre> |
| <div class="block"><code>PTransform</code>s to compute HyperLogLogPlusPlus (HLL++) sketches on data streams based on the |
| <a href="https://github.com/google/zetasketch">ZetaSketch</a> implementation. |
| |
| <p>HLL++ is an algorithm implemented by Google that estimates the count of distinct elements in a |
| data stream. HLL++ requires significantly less memory than the linear memory needed for exact |
| computation, at the cost of a small error. Cardinalities of arbitrary breakdowns can be computed |
| using the HLL++ sketch. See this <a |
| href="http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/40671.pdf">published |
| paper</a> for details about the algorithm. |
| |
| <p>HLL++ functions are also supported in <a |
| href="https://cloud.google.com/bigquery/docs/reference/standard-sql/hll_functions">Google Cloud |
| BigQuery</a>. The <code>HllCount PTransform</code>s provided here produce and consume sketches |
| compatible with BigQuery. |
| |
| <p>For detailed design of this class, see https://s.apache.org/hll-in-beam. |
| |
| <h3>Examples</h3> |
| |
| <h4>Example 1: Create long-type sketch for a <code>PCollection<Long></code> and specify precision</h4> |
| |
| <pre><code> |
| PCollection<Long> input = ...; |
| int p = ...; |
| PCollection<byte[]> sketch = input.apply(HllCount.Init.forLongs().withPrecision(p).globally()); |
| </code></pre> |
| |
| <h4>Example 2: Create bytes-type sketch for a <code>PCollection<KV<String, byte[]>></code></h4> |
| |
| <pre><code> |
| PCollection<KV<String, byte[]>> input = ...; |
| PCollection<KV<String, byte[]>> sketch = input.apply(HllCount.Init.forBytes().perKey()); |
| </code></pre> |
| |
| <h4>Example 3: Merge existing sketches in a <code>PCollection<byte[]></code> into a new one</h4> |
| |
| <pre><code> |
| PCollection<byte[]> sketches = ...; |
| PCollection<byte[]> mergedSketch = sketches.apply(HllCount.MergePartial.globally()); |
| </code></pre> |
| |
| <h4>Example 4: Estimates the count of distinct elements in a <code>PCollection<String></code></h4> |
| |
| <pre><code> |
| PCollection<String> input = ...; |
| PCollection<Long> countDistinct = |
| input.apply(HllCount.Init.forStrings().globally()).apply(HllCount.Extract.globally()); |
| </code></pre> |
| |
| Note: Currently HllCount does not work on FnAPI workers. See <a |
| href="https://issues.apache.org/jira/browse/BEAM-7879">Jira ticket [BEAM-7879]</a>.</div> |
| </li> |
| </ul> |
| </div> |
| <div class="summary"> |
| <ul class="blockList"> |
| <li class="blockList"> |
| <!-- ======== NESTED CLASS SUMMARY ======== --> |
| <ul class="blockList"> |
| <li class="blockList"><a name="nested.class.summary"> |
| <!-- --> |
| </a> |
| <h3>Nested Class Summary</h3> |
| <table class="memberSummary" border="0" cellpadding="3" cellspacing="0" summary="Nested Class Summary table, listing nested classes, and an explanation"> |
| <caption><span>Nested Classes</span><span class="tabEnd"> </span></caption> |
| <tr> |
| <th class="colFirst" scope="col">Modifier and Type</th> |
| <th class="colLast" scope="col">Class and Description</th> |
| </tr> |
| <tr class="altColor"> |
| <td class="colFirst"><code>static class </code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Extract.html" title="class in org.apache.beam.sdk.extensions.zetasketch">HllCount.Extract</a></span></code> |
| <div class="block">Provides <code>PTransform</code>s to extract the estimated count of distinct elements (as <code>Long</code>s) from each HLL++ sketch.</div> |
| </td> |
| </tr> |
| <tr class="rowColor"> |
| <td class="colFirst"><code>static class </code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.html" title="class in org.apache.beam.sdk.extensions.zetasketch">HllCount.Init</a></span></code> |
| <div class="block">Provides <code>PTransform</code>s to aggregate inputs into HLL++ sketches.</div> |
| </td> |
| </tr> |
| <tr class="altColor"> |
| <td class="colFirst"><code>static class </code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.MergePartial.html" title="class in org.apache.beam.sdk.extensions.zetasketch">HllCount.MergePartial</a></span></code> |
| <div class="block">Provides <code>PTransform</code>s to merge HLL++ sketches into a new sketch.</div> |
| </td> |
| </tr> |
| </table> |
| </li> |
| </ul> |
| <!-- =========== FIELD SUMMARY =========== --> |
| <ul class="blockList"> |
| <li class="blockList"><a name="field.summary"> |
| <!-- --> |
| </a> |
| <h3>Field Summary</h3> |
| <table class="memberSummary" border="0" cellpadding="3" cellspacing="0" summary="Field Summary table, listing fields, and an explanation"> |
| <caption><span>Fields</span><span class="tabEnd"> </span></caption> |
| <tr> |
| <th class="colFirst" scope="col">Modifier and Type</th> |
| <th class="colLast" scope="col">Field and Description</th> |
| </tr> |
| <tr class="altColor"> |
| <td class="colFirst"><code>static int</code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.html#DEFAULT_PRECISION">DEFAULT_PRECISION</a></span></code> |
| <div class="block">The default <code>precision</code> value used in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 15.</div> |
| </td> |
| </tr> |
| <tr class="rowColor"> |
| <td class="colFirst"><code>static int</code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.html#MAXIMUM_PRECISION">MAXIMUM_PRECISION</a></span></code> |
| <div class="block">The maximum <code>precision</code> value you can set in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 24.</div> |
| </td> |
| </tr> |
| <tr class="altColor"> |
| <td class="colFirst"><code>static int</code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.html#MINIMUM_PRECISION">MINIMUM_PRECISION</a></span></code> |
| <div class="block">The minimum <code>precision</code> value you can set in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 10.</div> |
| </td> |
| </tr> |
| </table> |
| </li> |
| </ul> |
| <!-- ========== METHOD SUMMARY =========== --> |
| <ul class="blockList"> |
| <li class="blockList"><a name="method.summary"> |
| <!-- --> |
| </a> |
| <h3>Method Summary</h3> |
| <table class="memberSummary" border="0" cellpadding="3" cellspacing="0" summary="Method Summary table, listing methods, and an explanation"> |
| <caption><span id="t0" class="activeTableTab"><span>All Methods</span><span class="tabEnd"> </span></span><span id="t1" class="tableTab"><span><a href="javascript:show(1);">Static Methods</a></span><span class="tabEnd"> </span></span><span id="t4" class="tableTab"><span><a href="javascript:show(8);">Concrete Methods</a></span><span class="tabEnd"> </span></span></caption> |
| <tr> |
| <th class="colFirst" scope="col">Modifier and Type</th> |
| <th class="colLast" scope="col">Method and Description</th> |
| </tr> |
| <tr id="i0" class="altColor"> |
| <td class="colFirst"><code>static byte[]</code></td> |
| <td class="colLast"><code><span class="memberNameLink"><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.html#getSketchFromByteBuffer-java.nio.ByteBuffer-">getSketchFromByteBuffer</a></span>(java.nio.ByteBuffer bf)</code> |
| <div class="block">Converts the passed-in sketch from <code>ByteBuffer</code> to <code>byte[]</code>, mapping <code>null |
| ByteBuffer</code>s (representing empty sketches) to empty <code>byte[]</code>s.</div> |
| </td> |
| </tr> |
| </table> |
| <ul class="blockList"> |
| <li class="blockList"><a name="methods.inherited.from.class.java.lang.Object"> |
| <!-- --> |
| </a> |
| <h3>Methods inherited from class java.lang.Object</h3> |
| <code>clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait</code></li> |
| </ul> |
| </li> |
| </ul> |
| </li> |
| </ul> |
| </div> |
| <div class="details"> |
| <ul class="blockList"> |
| <li class="blockList"> |
| <!-- ============ FIELD DETAIL =========== --> |
| <ul class="blockList"> |
| <li class="blockList"><a name="field.detail"> |
| <!-- --> |
| </a> |
| <h3>Field Detail</h3> |
| <a name="MINIMUM_PRECISION"> |
| <!-- --> |
| </a> |
| <ul class="blockList"> |
| <li class="blockList"> |
| <h4>MINIMUM_PRECISION</h4> |
| <pre>public static final int MINIMUM_PRECISION</pre> |
| <div class="block">The minimum <code>precision</code> value you can set in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 10.</div> |
| <dl> |
| <dt><span class="seeLabel">See Also:</span></dt> |
| <dd><a href="../../../../../../constant-values.html#org.apache.beam.sdk.extensions.zetasketch.HllCount.MINIMUM_PRECISION">Constant Field Values</a></dd> |
| </dl> |
| </li> |
| </ul> |
| <a name="MAXIMUM_PRECISION"> |
| <!-- --> |
| </a> |
| <ul class="blockList"> |
| <li class="blockList"> |
| <h4>MAXIMUM_PRECISION</h4> |
| <pre>public static final int MAXIMUM_PRECISION</pre> |
| <div class="block">The maximum <code>precision</code> value you can set in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 24.</div> |
| <dl> |
| <dt><span class="seeLabel">See Also:</span></dt> |
| <dd><a href="../../../../../../constant-values.html#org.apache.beam.sdk.extensions.zetasketch.HllCount.MAXIMUM_PRECISION">Constant Field Values</a></dd> |
| </dl> |
| </li> |
| </ul> |
| <a name="DEFAULT_PRECISION"> |
| <!-- --> |
| </a> |
| <ul class="blockListLast"> |
| <li class="blockList"> |
| <h4>DEFAULT_PRECISION</h4> |
| <pre>public static final int DEFAULT_PRECISION</pre> |
| <div class="block">The default <code>precision</code> value used in <a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Init.Builder.html#withPrecision-int-"><code>HllCount.Init.Builder.withPrecision(int)</code></a> is |
| 15.</div> |
| <dl> |
| <dt><span class="seeLabel">See Also:</span></dt> |
| <dd><a href="../../../../../../constant-values.html#org.apache.beam.sdk.extensions.zetasketch.HllCount.DEFAULT_PRECISION">Constant Field Values</a></dd> |
| </dl> |
| </li> |
| </ul> |
| </li> |
| </ul> |
| <!-- ============ METHOD DETAIL ========== --> |
| <ul class="blockList"> |
| <li class="blockList"><a name="method.detail"> |
| <!-- --> |
| </a> |
| <h3>Method Detail</h3> |
| <a name="getSketchFromByteBuffer-java.nio.ByteBuffer-"> |
| <!-- --> |
| </a> |
| <ul class="blockListLast"> |
| <li class="blockList"> |
| <h4>getSketchFromByteBuffer</h4> |
| <pre>public static byte[] getSketchFromByteBuffer(<a href="https://static.javadoc.io/com.google.code.findbugs/jsr305/3.0.2/javax/annotation/Nullable.html?is-external=true" title="class or interface in javax.annotation">@Nullable</a> |
| java.nio.ByteBuffer bf)</pre> |
| <div class="block">Converts the passed-in sketch from <code>ByteBuffer</code> to <code>byte[]</code>, mapping <code>null |
| ByteBuffer</code>s (representing empty sketches) to empty <code>byte[]</code>s. |
| |
| <p>Utility method to convert sketches materialized with ZetaSQL/BigQuery to valid inputs for |
| Beam <code>HllCount</code> transforms.</div> |
| </li> |
| </ul> |
| </li> |
| </ul> |
| </li> |
| </ul> |
| </div> |
| </div> |
| <!-- ========= END OF CLASS DATA ========= --> |
| <!-- ======= START OF BOTTOM NAVBAR ====== --> |
| <div class="bottomNav"><a name="navbar.bottom"> |
| <!-- --> |
| </a> |
| <div class="skipNav"><a href="#skip.navbar.bottom" title="Skip navigation links">Skip navigation links</a></div> |
| <a name="navbar.bottom.firstrow"> |
| <!-- --> |
| </a> |
| <ul class="navList" title="Navigation"> |
| <li><a href="../../../../../../overview-summary.html">Overview</a></li> |
| <li><a href="package-summary.html">Package</a></li> |
| <li class="navBarCell1Rev">Class</li> |
| <li><a href="package-tree.html">Tree</a></li> |
| <li><a href="../../../../../../deprecated-list.html">Deprecated</a></li> |
| <li><a href="../../../../../../index-all.html">Index</a></li> |
| <li><a href="../../../../../../help-doc.html">Help</a></li> |
| </ul> |
| </div> |
| <div class="subNav"> |
| <ul class="navList"> |
| <li>Prev Class</li> |
| <li><a href="../../../../../../org/apache/beam/sdk/extensions/zetasketch/HllCount.Extract.html" title="class in org.apache.beam.sdk.extensions.zetasketch"><span class="typeNameLink">Next Class</span></a></li> |
| </ul> |
| <ul class="navList"> |
| <li><a href="../../../../../../index.html?org/apache/beam/sdk/extensions/zetasketch/HllCount.html" target="_top">Frames</a></li> |
| <li><a href="HllCount.html" target="_top">No Frames</a></li> |
| </ul> |
| <ul class="navList" id="allclasses_navbar_bottom"> |
| <li><a href="../../../../../../allclasses-noframe.html">All Classes</a></li> |
| </ul> |
| <div> |
| <script type="text/javascript"><!-- |
| allClassesLink = document.getElementById("allclasses_navbar_bottom"); |
| if(window==top) { |
| allClassesLink.style.display = "block"; |
| } |
| else { |
| allClassesLink.style.display = "none"; |
| } |
| //--> |
| </script> |
| </div> |
| <div> |
| <ul class="subNavList"> |
| <li>Summary: </li> |
| <li><a href="#nested.class.summary">Nested</a> | </li> |
| <li><a href="#field.summary">Field</a> | </li> |
| <li>Constr | </li> |
| <li><a href="#method.summary">Method</a></li> |
| </ul> |
| <ul class="subNavList"> |
| <li>Detail: </li> |
| <li><a href="#field.detail">Field</a> | </li> |
| <li>Constr | </li> |
| <li><a href="#method.detail">Method</a></li> |
| </ul> |
| </div> |
| <a name="skip.navbar.bottom"> |
| <!-- --> |
| </a></div> |
| <!-- ======== END OF BOTTOM NAVBAR ======= --> |
| </body> |
| </html> |