blob: 933fb8f2751582cd1fafb57df4eba45b2c2ea7fc [file] [log] [blame]
[[xtokenize-language]]
= XML Tokenize Language
:page-source: core/camel-jaxp/src/main/docs/xtokenize-language.adoc
*Available as of Camel version 2.14*
The xml tokenizer language is a built-in language in camel-core, which
is a truly XML-aware tokenizer that can be used with the Splitter as the
conventional Tokenizer to efficiently and
effectively tokenize XML documents. XMLTokenizer is capable of not only
recognizing XML namespaces and hierarchical structures of the document
but also more efficiently tokenizing XML documents than the conventional
Tokenizer.
For more details see Splitter.
== XML Tokenizer Options
// language options: START
The XML Tokenize language supports 4 options, which are listed below.
[width="100%",cols="2,1m,1m,6",options="header"]
|===
| Name | Default | Java Type | Description
| headerName | | String | Name of header to tokenize instead of using the message body.
| mode | | String | The extraction mode. The available extraction modes are: i - injecting the contextual namespace bindings into the extracted token (default) w - wrapping the extracted token in its ancestor context u - unwrapping the extracted token to its child content t - extracting the text content of the specified element
| group | | Integer | To group N parts together
| trim | true | Boolean | Whether to trim the value to remove leading and trailing whitespaces and line breaks
|===
// language options: END