Context-Aware Representations for Knowledge Base Relation Extraction

We provide a subcorpus of Wikipedia that was annotated with Wikidata relations using a distant supervision procedure. The corpus contains two types of annotations: entities and relations. Entity annotations were extracted from the Wikipedia linkes in the article text. Each link was converted to a Wikidata identifier using the mappings from the Wikidata itself. Additional entities were recognised using a named entity recognizer and were later linked to Wikidata. For each pair of entities in each sentence we searched for Wikidata relations that connect this pair of entities and stored all unambigious instances (only one relation is possible).

Identifier
Source https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2776
Related Identifier https://doi.org/10.18653/v1/D17-1188
Metadata Access https://tudatalib.ulb.tu-darmstadt.de/oai/openairedata?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:tudatalib.ulb.tu-darmstadt.de:tudatalib/2776
Provenance
Creator Sorokin, Daniil; Gurevych, Iryna
Publisher TU Darmstadt
Contributor TU Darmstadt
Publication Year 2017
Rights Creative Commons Attribution Share-Alike 4.0; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://tudatalib.ulb.tu-darmstadt.de/page/contact
Representation
Language English
Resource Type Dataset
Format application/zip
Discipline Other