title: “Apache Beam 2.52.0” date: 2023-11-17 09:00:00 -0400 categories:
We are happy to present the new 2.52.0 release of Beam. This release includes both improvements and new functionality. See the download page for this release.
For more information on changes in 2.52.0, check out the detailed release notes.
beam-sdks-java-extensions-avro
instead. This will allow to easily update Avro version in user code without potential breaking changes in Beam “core” since the Beam Avro extension already supports the latest Avro versions and should handle this. (#25252).UseDataStreamForBatch
pipeline option to the Flink runner. When it is set to true, Flink runner will run batch jobs using the DataStream API. By default the option is set to false, so the batch jobs are still executed using the DataSet API.upload_graph
as one of the Experiments options for DataflowRunner is no longer required when the graph is larger than 10MB for Java SDK (PR#28621).--max_cache_memory_usage_mb=X
to provide cache size for the user state API and side inputs. (Python) (#28770).org.apache.beam.sdk.io.CountingSource.CounterMark
uses custom CounterMarkCoder
as a default coder since all Avro-dependent classes finally moved to extensions/avro
. In case if it's still required to use AvroCoder
for CounterMark
, then, as a workaround, a copy of “old” CountingSource
class should be placed into a project code and used directly (#25252).host
to firestoreHost
in FirestoreOptions
to avoid potential conflict of command line arguments (Java) (#29201).watch_file_pattern
arg of the RunInference arg had no effect prior to 2.52.0. To use the behavior of arg watch_file_pattern
prior to 2.52.0, follow the documentation at https://beam.apache.org/documentation/ml/side-input-updates/ and use WatchFilePattern
PTransform as a SideInput. (#28948)MLTransform
doesn't output artifacts such as min, max and quantiles. Instead, MLTransform
will add a feature to output these artifacts as human readable format - #29017. For now, to use the artifacts such as min and max that were produced by the eariler MLTransform
, use read_artifact_location
of MLTransform
, which reads artifacts that were produced earlier in a different MLTransform
(#29016)According to git shortlog, the following people contributed to the 2.52.0 release. Thank you to all contributors!
Ahmed Abualsaud
Ahmet Altay
Aleksandr Dudko
Alexey Romanenko
Anand Inguva
Andrei Gurau
Andrey Devyatkin
BjornPrime
Bruno Volpato
Bulat
Chamikara Jayalath
Damon
Danny McCormick
Devansh Modi
Dominik Dębowczyk
Ferran Fernández Garrido
Hai Joey Tran
Israel Herraiz
Jack McCluskey
Jan Lukavský
JayajP
Jeff Kinard
Jeffrey Kinard
Jiangjie Qin
Jing
Joar Wandborg
Johanna Öjeling
Julien Tournay
Kanishk Karanawat
Kenneth Knowles
Kerry Donny-Clark
Luís Bianchin
Minbo Bae
Pranav Bhandari
Rebecca Szper
Reuven Lax
Ritesh Ghorse
Robert Bradshaw
Robert Burke
RyuSA
Shunping Huang
Steven van Rossum
Svetak Sundhar
Tony Tang
Vitaly Terentyev
Vivek Sumanth
Vlado Djerek
Yi Hu
aku019
brucearctor
caneff
damccorm
ddebowczyk92
dependabot[bot]
dpcollins-google
edman124
gabry.wu
illoise
johnjcasey
jonathan-lemos
kennknowles
liferoad
magicgoody
martin trieu
nancyxu123
pablo rodriguez defino
tvalentyn