| ========================================================================== |
| Copyright 2008-2010 Digital Enterprise Research Institute (DERI) |
| |
| Licensed under the Apache License, Version 2.0 (the "License"); |
| you may not use this file except in compliance with the License. |
| You may obtain a copy of the License at |
| |
| http://www.apache.org/licenses/LICENSE-2.0 |
| |
| Unless required by applicable law or agreed to in writing, software |
| distributed under the License is distributed on an "AS IS" BASIS, |
| WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
| See the License for the specific language governing permissions and |
| limitations under the License. |
| |
| ========================================================================== |
| |
| $Id$ |
| |
| Any23 0.2.2 |
| Release Notes |
| |
| * Fixed dependency management on Maven. A second level dependency of Xerces |
| introduced a conflict on the java.xml.transform API causing wrong XSLT |
| transformations within RDFa extractor. |
| |
| ========================================================================== |
| |
| Any23 0.2.1 |
| Release Notes |
| |
| * Major fix on Tika configuration management. This fix solves the auto detection |
| of the main Semantic Web related formats. |
| |
| ========================================================================== |
| |
| Any23 0.2 |
| Release Notes |
| |
| ============ |
| Introduction |
| ============ |
| |
| This release features a redesigned API and incorporating enhancements and |
| bug fixes that have accumulated since the 0.1 release. |
| Apart from some new or changed dependencies on the underlying libraries, |
| this version comes with an improved unit test coverage and other features |
| like the automatic charset encoding detection and an improved documentation. |
| Maven build system has been introduced. |
| |
| |
| ================================== |
| Summary of major changes since 0.1 |
| ================================== |
| |
| * Redesigned Java API |
| - Input from string, stream, file, or URI |
| - Allow choosing which extractors to use |
| - Report origin of triples (document/extractor) to client processors |
| - Various processors/serializers for extracted triples |
| * Added flexible command-line tool for easy testing |
| * Vastly improved website and documentation |
| * Media type and encoding detection via Apache Tika |
| * Switched RDF library from Jena to Sesame |
| * Added Maven build |
| * Better RDF extraction from Microformats |
| * Extractors now come with an example file to document typical in- and output |
| * Major refactoring |
| * Lots and lots of bugfixes |
| |
| ================= |
| Supported formats |
| ================= |
| |
| * RDF/XML |
| * Notation3 and Turtle |
| * N-Triples |
| * RDFa |
| |
| Various microformats, see http://sindice.com/developers/microformat on Sindice Microformats support. |
| |
| =================== |
| Dependency Upgrade |
| =================== |
| |
| CyberNeko Html parser has been upgraded to 1.9.14. |
| |
| Apache Tika 0.3 has been replaced with 0.6, with the |
| new support for the automatic encoding detection. |
| |
| EOF |
| |