The MSC Data Set


From this page you can download the resources we created for modal sense classification, as reported in Zhou et al. (2015), Marasović et al. (2016) and Marasović and Frank (2016) (see "Related Publication" below):

Heuristically sense-annotated training data acquired from EUROPARL and OpenSubtitles (EPOS_E, English). The dataset was used for:

the EMNLP 2015 Workshop submission "Semantically enriched models for modal sense classification" by Mengfei Zhou, Anette Frank, Annemarie Friedrich, and Alexis Palmer;
the LiLT submission "Modal Sense Classification At Large: Paraphrase-Driven Sense Projection, Semantically Enriched Classification Models and Cross-Genre Evaluations" by Ana Marasović, Mengfei Zhou, Alexis Palmer, and Anette Frank;
the RepL4NLP submission "Multilingual Modal Sense Classification using a Convolutional Neural Network" by Ana Marasović and Anette Frank.

Composition of the training and test data used for the classification experiments. The dataset was used for:

the EMNLP 2015 Workshop submission "Semantically enriched models for modal sense classification" by Mengfei Zhou, Anette Frank, Annemarie Friedrich, and Alexis Palmer;
the RepL4NLP submission "Multilingual Modal Sense Classification using a Convolutional Neural Network" by Ana Marasović and Anette Frank.

Manually annotated subsection of MASC (English). The dataset was used for the LiLT submission "Modal Sense Classification At Large: Paraphrase-Driven Sense Projection, Semantically Enriched Classification Models and Cross-Genre Evaluations" by Ana Marasović, Mengfei Zhou, Alexis Palmer, and Anette Frank.

Heuristically modal-sense-annotated training data and manually annotated test data from EUROPARL and OpenSubtitles (EPOS_G, German). The dataset was used for the RepL4NLP submission "Multilingual Modal Sense Classification using a Convolutional Neural Network" by Ana Marasović and Anette Frank.

 

Identifier
DOI https://doi.org/10.11588/data/JEESIQ
Related Identifier https://www.aclweb.org/anthology/W15-2705
Related Identifier http://csli-lilt.stanford.edu/ojs/index.php/LiLT/article/view/65/65
Related Identifier https://www.aclweb.org/anthology/W16-1613
Metadata Access https://heidata.uni-heidelberg.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.11588/data/JEESIQ
Provenance
Creator Marasović, Ana; Zhou, Mengfei; Frank, Anette
Publisher heiDATA
Contributor Marasović, Ana
Publication Year 2019
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact Marasović, Ana
Representation
Resource Type textual data; Dataset
Format application/zip
Size 6543509 bytes
Version 1.0
Discipline Humanities
Spatial Coverage Heidelberg University
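
The Metadata Access URL listed above follows the OAI-PMH GetRecord request pattern. As a minimal sketch, the Python snippet below rebuilds that URL from the DOI. The endpoint, verb, metadata prefix and identifier are taken from the record itself; the function name oai_getrecord_url is hypothetical and used only for illustration. The snippet only constructs the URL and does not contact the server.

```python
from urllib.parse import urlencode

# Endpoint taken from the "Metadata Access" entry of this record.
OAI_ENDPOINT = "https://heidata.uni-heidelberg.de/oai"


def oai_getrecord_url(doi, metadata_prefix="oai_datacite"):
    """Build an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the colon and slashes of the identifier unescaped,
    # so the result has the same form as the URL listed in the record.
    return f"{OAI_ENDPOINT}?{urlencode(params, safe=':/')}"


url = oai_getrecord_url("10.11588/data/JEESIQ")
print(url)
```

The resulting URL can be opened in a browser or passed to any HTTP client to retrieve the DataCite metadata for the data set.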