| ============= |
| Any23 Plugins |
| ============= |
| |
| This is the root dir of the Any23 Plugins module. |
| |
| A plugin is an extension of the Any23 core and can be plugged using |
| the Plugin Manager capabilities. |
| |
| Plugins |
| ======= |
| |
| basic-crawler |
| ------------- |
| |
| A CLI tool which extends the Rover CLI adding crawler specific |
| capabilities. |
| |
| html-scraper |
| ------------ |
| |
| The HTML scraper is able to convert any HTML page to triples |
| containing the text scraped from the page. |
| |
| office-scraper |
| -------------- |
| |
| The Office scraper is able to convert the main MS Office compatible |
| formats and convert them to triples. |
| |
| integration-test |
| ---------------- |
| |
| This module contains the integration tests for all the defined plugins. |
| |
| Generate Plugin Packaging |
| ========================= |
| |
| To generate the desired plugin package, navigate to the plugin directory and execute 'mvn package' |
| e.g. to generate the basic-crawler plugin package |
| |
| $cd $ANY23-HOME/plugins/basic-crawler |
| $ mvn package |
| |
| From the basic-crawler directory this generates |
| |
| . |
| |-- pom.xml |
| |-- src |
| | |-- main |
| | | |-- assembly |
| | | `-- java |
| | `-- test |
| `-- target |
| |-- any23-basic-crawler-${version}.jar |
| |-- apache-any23-basic-crawler-${version}-bin.tar.gz <<< |
| |-- apache-any23-basic-crawler-${version}-bin.zip <<< |
| |-- archive-tmp |
| |-- classes |
| | |-- META-INF |
| | `-- org |
| |-- generated-sources |
| |-- maven-archiver |
| |-- maven-shared-archive-resources |
| |-- surefire |
| |-- surefire-reports |
| `-- test-classes |
| ... |
| |
| Plugin specific README's can be found in either ./target/*.tar.gz || ./target/*.zip (annotated above with '<<<'), where much more detailed information sources can be located. |
| |