Affixoid Dataset (DE)

DOI

The dataset contains the manual annotations for the COLING 2018 submission "Distinguishing affixoid formations from compounds" by Josef Ruppenhofer, Michael Wiegand, Rebecca Wilm and Katja Markert.

1788 complex words containing one of 7 German suffixoid candidates (e.g. -hai, -gott) were annotated manually as to whether the complex forms represent regular compounds or affixoid formations. The main experiments in the paper use automatically extracted features of the complex forms in trying to correctly make this distinction.

Additionally, the words were labeled for five properties related to any intensifying and evaluative meaning potentially associated with the whole word and its components. These manual feature annotations were used to establish the upper-bound performance of a classifier trained to distinguish affixoid formations from regular compounds.

Identifier
DOI https://doi.org/10.11588/data/QKF4LT
Related Identifier https://www.aclweb.org/anthology/C18-1325
Metadata Access https://heidata.uni-heidelberg.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.11588/data/QKF4LT
Provenance
Creator Ruppenhofer, Josef
Publisher heiDATA
Contributor Ruppenhofer, Josef
Publication Year 2019
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact Ruppenhofer, Josef (Leibniz Institute for the German Language)
Representation
Resource Type textual data, CSV text file format; Dataset
Format text/tab-separated-values; text/plain
Size 63075; 758
Version 1.0
Discipline Humanities
Spatial Coverage Institute for German Language Mannheim, Heidelberg University