This BundleList includes three modules that bring basic language support for Chinese to Apache Stanbol.
See comments in the lists.xml for more details.
When you plan to use the Smartcn Analyzer to process Chinese texts it is important to also properly configure the Solr schema.xml used by the Entityhub SolrYard.
For that you will need to add two things:
A fieldType specification for Chinese
:::xml
A dynamic field using this field type that matches against Chinese language literals
:::xml
The smartcn.solrindex.zip is identical with the default configuration but uses the above fieldType and dynamicField specification.
As an alternative to (2) you can also explicitly configure the name of the solr config as value to the “solrConf:smartcn” of SolrYardIndexingDestination.
:::text indexingDestination=org.apache.stanbol.entityhub.indexing.destination.solryard.SolrYardIndexingDestination,solrConf:smartcn,boosts:fieldboosts
If you want to create an empty SolrYard instance using the smartcn.solrindex.zip configuration you will need to
If you want to use the smartcn.solrindex.zip as default you can rename the file in the datafilee folder to “default.solrindex.zip” and the enable the “Use default SolrCore configuration” (org.apache.stanbol.entityhub.yard.solr.useDefaultConfig) when you configure a SolrYard instance.
See also the documentation on how to configure a managed site).