Polish multi-word lexical unit recognition

PID

A dataset of Polish multi-word expressions manually annotated with respect to their lexicality status. We show annotators' decisions with respect to two criteria: terminology (that is whether a given word combination can be classified as 'term', and 'paraphrase' (that is whether a given word combination can be can be easily paraphrased). In the last column, we present lexicographers' decision with respect to their lexicality status: "tak" - 'yes' means a given word combination is a multi-word lexical unit, "nie" - 'no' means it is not.

Identifier
PID http://hdl.handle.net/11321/940
Metadata Access https://clarin-pl.eu/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin-pl.eu:11321/940
Provenance
Creator Rudnicka, Ewa; Maziarz, Marek; Grabowski, Łukasz; Pasternak, Simone; Przybysz, Zuzanna; Czerepowicka, Monika
Publisher Wrocław University of Science and Technology
Publication Year 2024
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); http://creativecommons.org/licenses/by-sa/4.0/; CC
OpenAccess true
Contact clarin-pl(at)pwr.edu.pl
Representation
Language Polish
Resource Type lexicalConceptualResource
Format text/plain; charset=utf-8; application/pdf; downloadable_files_count: 1
Discipline Linguistics