learning/prompts/documentation-lookup/15_advanced

Prompt: What is a multi-language pipeline in Apache Beam?

Response: Apache Beam lets you combine transforms written in any supported SDK language (currently, Java and Python) and use them in one multi-language pipeline. For example, a multi-language pipeline might read from a Python source, process the data using a Java transform, and write the data to a Python sink.

For example, you can use the Apache Kafka connector and SQL transform from the Java SDK in Python pipelines.

To learn how to create a multi-language pipeline, see the quickstart examples for Java and Python.

Depending on the SDK language of the pipeline, you can use a high-level SDK wrapper class or a low-level transform class to access a cross-language transform. For more information, see Using cross-language transforms.

To develop a cross-language transform, you need to define a Uniform Resource Name(URN) for registering the transform with an expansion service. For more information, see Defining a URN.