blob: 22743f107504f040690eaf3c4be67e51234f17ef [file] [log] [blame]
<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE s1 SYSTEM 'dtd/document.dtd'>
<s1 title='XNI Samples'>
<s2 title='Overview'>
<p>
The Xerces Native Interface (XNI) is an internal API that is
independent of other XML APIs and is used to implement the
Xerces family of parsers. XNI allows a wide variety of parsers
to be written in an easy and modular fashion. The XNI samples
included with Xerces are simple examples of how to program
using the XNI API. However, for information on how to take full
advantage of this powerful framework, refer to the
<link idref='xni'>XNI Manual</link>.
</p>
<p>Basic XNI samples:</p>
<ul>
<li><link anchor='Counter'>xni.Counter</link></li>
<li><link anchor='DocumentTracer'>xni.DocumentTracer</link></li>
<li><link anchor='Writer'>xni.Writer</link></li>
<li><link anchor='PSVIWriter'>xni.PSVIWriter</link></li>
</ul>
<ul>
<li><link anchor='PassThroughFilter'>xni.PassThroughFilter</link></li>
<li><link anchor='UpperCaseFilter'>xni.UpperCaseFilter</link></li>
</ul>
<p>Parser configuration samples:</p>
<ul>
<li><link anchor='NonValidatingParserConfiguration'>xni.parser.NonValidatingParserConfiguration</link></li>
<li><link anchor='AbstractConfiguration'>xni.parser.AbstractConfiguration</link></li>
<!--
- REVISIT: Add in this sample once the proper interfaces have been
- designed and implemented in the parser. *And* after the
- sample code has been written.
-
<li><link anchor='DynamicParserConfiguration'>xni.parser.DynamicParserConfiguration</link></li>
-->
</ul>
<ul>
<li><link anchor='CSVConfiguration'>xni.parser.CSVConfiguration</link></li>
<li><link anchor='CSVParser'>xni.parser.CSVParser</link></li>
<li><link anchor='PSVIConfiguration'>xni.parser.PSVIConfiguration</link></li>
<li><link anchor='PSVIParser'>xni.parser.PSVIParser</link></li>
</ul>
<p>
Most of the XNI samples have a command line option that allows the
user to specify a different XNI parser configuration to use. In
order to supply another parser configuration besides the default
Xerces <code>StandardParserConfiguration</code>, the configuration
must implement the
<code>org.apache.xerces.xni.parser.XMLParserConfiguration</code>
interface.
</p>
</s2>
<anchor name='Counter'/>
<s2 title='Sample xni.Counter'>
<p>
A sample XNI counter. The output of this program shows the time
and count of elements, attributes, ignorable whitespaces, and
characters appearing in the document.
</p>
<p>
This class is useful as a "poor-man's" performance tester to
compare the speed and accuracy of various parser configurations.
However, it is important to note that the first parse time of a
parser will include both VM class load time and parser
initialization that would not be present in subsequent parses
with the same file.
</p>
<note>
The results produced by this program should never be accepted as
true performance measurements.
</note>
<s3 title='usage'>
<source>java xni.Counter (options) uri ...</source>
</s3>
<s3 title='options'>
<table>
<tr><th>Option</th><th>Description</th></tr>
<tr><td>-p name</td><td>Select parser configuration by name.</td></tr>
<tr><td>-x number</td><td>Select number of repetitions.</td></tr>
<tr><td>-n | -N</td><td>Turn on/off namespace processing.</td></tr>
<tr>
<td>-np | -NP</td>
<td>
Turn on/off namespace prefixes.<br/>
<strong>NOTE:</strong> Requires use of -n.
</td>
</tr>
<tr><td>-v | -V</td><td>Turn on/off validation.</td></tr>
<tr>
<td>-s | -S</td>
<td>
Turn on/off Schema validation support.<br/>
<strong>NOTE:</strong> Not supported by all parser configurations.
</td>
</tr>
<tr>
<td>-f | -F</td>
<td>
Turn on/off Schema full checking.<br/>
<strong>NOTE:</strong> Requires use of -s and not supported by all parsers.
</td>
</tr>
<tr><td>-m | -M</td><td>Turn on/off memory usage report.</td></tr>
<tr><td>-t | -T</td><td>Turn on/off \"tagginess\" report.</td></tr>
<tr>
<td>--rem text</td>
<td>Output user defined comment before next parse.</td>
</tr>
<tr><td>-h</td><td>Display help screen.</td></tr>
</table>
</s3>
<s3 title='notes'>
<p>
The speed and memory results from this program should NOT be used
as the basis of parser performance comparison! Real analytical
methods should be used. For better results, perform multiple
document parses within the same virtual machine to remove class
loading from parse time and memory usage.
</p>
<p>
The "tagginess" measurement gives a rough estimate of the percentage
of markup versus content in the XML document. The percent tagginess
of a document is equal to the minimum amount of tag characters
required for elements, attributes, and processing instructions
divided by the total amount of characters (characters, ignorable
whitespace, and tag characters) in the document.
</p>
<p>
Not all features are supported by different parser configurations.
</p>
</s3>
</s2>
<anchor name='DocumentTracer'/>
<s2 title='Sample xni.DocumentTracer'>
<p>
Provides a complete trace of XNI document and DTD events for
files parsed.
</p>
<s3 title='usage'>
<source>java xni.DocumentTracer (options) uri ...</source>
</s3>
<s3 title='options'>
<table>
<tr><th>Option</th><th>Description</th></tr>
<tr><td>-p name</td><td>Specify parser configuration by name.</td></tr>
<tr><td>-n | -N</td><td>Turn on/off namespace processing.</td></tr>
<tr><td>-v | -V</td><td>Turn on/off validation.</td></tr>
<tr>
<td>-s | -S</td>
<td>
Turn on/off Schema validation support.<br/>
<strong>NOTE:</strong> Not supported by all parser configurations.
</td>
</tr>
<tr><td>-c | -C</td><td>Turn on/off character notifications");</td></tr>
<tr><td>-h</td><td>Display help screen.</td></tr>
</table>
</s3>
</s2>
<anchor name='Writer'/>
<s2 title='Sample xni.Writer'>
<p>
A sample XNI writer. This sample program illustrates how to
take receive XMLDocumentHandler callbacks in order to print
a document that is parsed.
</p>
<s3 title='usage'>
<source>java xni.Writer (options) uri ...</source>
</s3>
<s3 title='options'>
<table>
<tr><th>Option</th><th>Description</th></tr>
<tr><td>-p name</td><td>Select parser configuration by name.</td></tr>
<tr><td>-n | -N</td><td>Turn on/off namespace processing.</td></tr>
<tr><td>-v | -V</td><td>Turn on/off validation.</td></tr>
<tr>
<td>-s | -S</td>
<td>
Turn on/off Schema validation support.<br/>
<strong>NOTE:</strong> Not supported by all parser configurations.
</td>
</tr>
<!--
<tr>
<td>-c | -C</td>
<td>
Turn on/off Canonical XML output.<br/>
<strong>NOTE:</strong> This is not W3C canonical output.
</td>
</tr>
-->
<tr><td>-h</td><td>Display help screen.</td></tr>
</table>
</s3>
</s2>
<anchor name='PSVIWriter'/>
<s2 title='Sample xni.PSVIWriter'>
<p>
This is an example of a component that intercepts document handler events and gathers PSVI information. The PSVI output is written as XML.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a component. See <link anchor='PSVIParser'>xni.parser.PSVIParser</link>.
</p>
</s2>
<anchor name='PassThroughFilter'/>
<s2 title='Sample xni.PassThroughFilter'>
<p>
This sample demonstrates how to implement a simple pass-through
filter for the document "streaming" information set using XNI.
This filter could be used in a pipeline of XNI parser components
that communicate document events.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a document
handler.
</p>
</s2>
<anchor name='UpperCaseFilter'/>
<s2 title='Sample xni.UpperCaseFilter'>
<p>
This sample demonstrates how to create a filter for the document
"streaming" information set that turns element names into upper
case.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a document
handler.
</p>
</s2>
<anchor name='NonValidatingParserConfiguration'/>
<s2 title='Sample xni.parser.NonValidatingParserConfiguration'>
<p>Non-validating parser configuration.</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
configuration using XNI. You can use this parser configuration
by specifying the fully qualified class name to all of the XNI
samples that accept a parser configuration using the
<code>-p</code> option. For example:
</p>
<source>java xni.Counter -p xni.parser.NonValidatingParserConfiguration document.xml</source>
</s2>
<anchor name='AbstractConfiguration'/>
<s2 title='Sample xni.parser.AbstractConfiguration'>
<p>
This abstract parser configuration simply helps manage components,
features and properties, and other tasks common to all parser
configurations. In order to subclass this configuration and use
it effectively, the subclass is required to do the following:
</p>
<ul>
<li>
Add all configurable components using the <code>addComponent</code>
method,</li>
<li>Implement the <code>parse</code> method, and</li>
<li>Call the <code>resetComponents</code> before parsing.</li>
</ul>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
configuration using XNI.
</p>
</s2>
<!--
- REVISIT: Add in this sample once the proper interfaces have been
- designed and implemented in the parser. *And* after the
- sample code has been written.
-
<anchor name='DynamicParserConfiguration'/>
<s2 title='Sample xni.parser.DynamicParserConfiguration'>
</s2>
-->
<anchor name='CSVConfiguration'/>
<s2 title='Sample xni.parser.CSVConfiguration'>
<p>
This example is a very simple parser configuration that can
parse files with comma-separated values (CSV) to generate XML
events. For example, the following CSV document:
</p>
<source>Andy Clark,16 Jan 1973,Cincinnati</source>
<p>
produces the following XML "document" as represented by the
XNI streaming document information:
</p>
<source><![CDATA[<?xml version='1.0' encoding='UTF-8' standalone='true'?>
<!DOCTYPE csv [
<!ELEMENT csv (row)*>
<!ELEMENT row (col)*>
<!ELEMENT col (#PCDATA)>
]>
<csv>
<row>
<col>Andy Clark</col>
<col>16 Jan 1973</col>
<col>Cincinnati</col>
</row>
</csv>]]></source>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
configuration using XNI. You can use this parser configuration
by specifying the fully qualified class name to all of the XNI
samples that accept a parser configuration using the
<code>-p</code> option. For example:
</p>
<source>java xni.Counter -p xni.parser.CSVConfiguration document.xml</source>
</s2>
<anchor name='CSVParser'/>
<s2 title='Samples xni.parser.CSVParser'>
<p>
This parser class implements a SAX parser that can parse simple
comma-separated value (CSV) files.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
using XNI. You can use this parser
by specifying the fully qualified class name to all of the SAX
samples that accept a parser using the
<code>-p</code> option. For example:
</p>
<source>java sax.Counter -p xni.parser.CSVParser document.xml</source>
</s2>
<anchor name='PSVIConfiguration'/>
<s2 title='Sample xni.parser.PSVIConfiguration'>
<p>
This example is a parser configuration that can output post schema validation infoset. The configuration infludes: DTD validator, Namespace binder, XML Schema validators and PSVIWriter component.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
configuration using XNI. You can use this parser configuration
by specifying the fully qualified class name to all of the XNI
samples that accept a parser configuration using the
<code>-p</code> option:
</p>
<source>java xni.Counter -v -s -p xni.parser.PSVIConfiguration personal-schema.xml</source>
<note><link idref='features' anchor="validation">Validation</link> and <link idref='features' anchor="validation.schema">schema validation</link> features must be set to true to receive the correct PSVI output.</note>
</s2>
<anchor name='PSVIParser'/>
<s2 title='Samples xni.parser.PSVIParser'>
<p>
This parser class implements a SAX parser that outputs post schema validation infoset as XML.
</p>
<p>
This class can <strong>NOT</strong> be run as a standalone
program. It is only an example of how to write a parser
using XNI. You can use this parser
by specifying the fully qualified class name to all of the SAX
samples that accept a parser using the
<code>-p</code> option. For example:
</p>
<source>java sax.Counter -v -s -p xni.parser.PSVIParser personal-schema.xml</source>
<note><link idref='features' anchor="validation">Validation</link> and <link idref='features' anchor="validation.schema">schema validation</link> features must be set to true to receive the correct PSVI output.</note>
</s2>
</s1>