blob: 044aae7f9b1a20a3e7e54c2be1fe5080a9c00d7b [file] [log] [blame]
<!--
~ Licensed to the Apache Software Foundation (ASF) under one
~ or more contributor license agreements. See the NOTICE file
~ distributed with this work for additional information
~ regarding copyright ownership. The ASF licenses this file
~ to you under the Apache License, Version 2.0 (the
~ "License"); you may not use this file except in compliance
~ with the License. You may obtain a copy of the License at
~
~ http://www.apache.org/licenses/LICENSE-2.0
~
~ Unless required by applicable law or agreed to in writing, software
~ distributed under the License is distributed on an "AS IS" BASIS,
~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
~ See the License for the specific language governing permissions and
~ limitations under the License.
-->
<html>
<h2>Combine - Combine PerKey</h2>
<p>
After creating a keyed PCollection (for example, by using a GroupByKey transform), a common
pattern is to combine the collection of values associated with each key into a single, merged
value. This pattern of a GroupByKey followed by merging the collection of values is equivalent to
Combine PerKey transform. The combine function you supply to Combine PerKey must be an associative
reduction function or a subclass of CombineFn.
</p>
<p>
<b>Kata:</b> Implement the sum of scores per player using
<a href="https://beam.apache.org/releases/pydoc/current/apache_beam.transforms.core.html#apache_beam.transforms.core.CombinePerKey">
CombinePerKey</a>.
</p>
<br>
<div class="hint">
Use <a href="https://beam.apache.org/releases/pydoc/current/apache_beam.transforms.core.html#apache_beam.transforms.core.CombinePerKey">
CombinePerKey(CombineFn)</a>.
</div>
<div class="hint">
Extend the
<a href="https://beam.apache.org/releases/pydoc/current/apache_beam.transforms.core.html#apache_beam.transforms.core.CombineFn">
CombineFn</a> class that counts the sum of the number.
</div>
<div class="hint">
Refer to the Beam Programming Guide
<a href="https://beam.apache.org/documentation/programming-guide/#combining-values-in-a-keyed-pcollection">
"Combining values in a keyed PCollection"</a> section for more information.
</div>
</html>