The Entity co-reference Engine perform co-reference resolution of Named Entities in a given text. The co-references will be noun phrases which refer to those Named Entities by having a minimal set of attributes which match contextual information (rdf:type of the entity and spatial and object function giving info) from entity repositories such as Dbpedia and Yago for that Named Entity.
We have the following text as an example : “Microsoft has posted its 2013 earnings. The software company did better than expected. ... The Redmond-based company will hire 500 new developers this year.” The enhancement engine will link “Microsoft” with “The software company” and “The Redmond-based company”.
This index will contain the yago rdf:types and several spatial/org membership and functional properties from the DBpedia index. NOTE: At the moment the index is available only for english.
http://downloads.dbpedia.org/3.9/dbpedia_3.9.owl http://downloads.dbpedia.org/3.9/en/labels_en.nt.bz2 http://downloads.dbpedia.org/3.9/en/instance_types_en.nt.bz2 http://downloads.dbpedia.org/3.9/en/mappingbased_properties_en.nt.bz2 http://downloads.dbpedia.org/3.9/links/yago_types.nt.bz2
rdfs:label | d=entityhub:text rdf:type | d=entityhub:ref dbp-ont:birthPlace | d=entityhub:ref dbp-ont:region | d=entityhub:ref dbp-ont:foundationPlace | d=entityhub:ref dbp-ont:locationCity | d=entityhub:ref dbp-ont:location | d=entityhub:ref dbp-ont:hometown | d=entityhub:ref dbp-ont:country | d=entityhub:ref
which contains the labels of the yago types.
aforementioned yago types labels. After you init the indexer go through the following steps:
name=dbpedia description=DBpedia.org
of the generic rdf indexing at point ### (3).
The results of all these steps will be the dbpedia.solrindex.zip archive which should be used as described in entityhub/indexing/dbpedia/README.md.
TODO
In order to run the engine you need to add it to a chain that also contains the following engine types: