Czech word and MWE lists

This post contains word and multi-word expression (MWE) lists used to operationalize some of the linguistic features in the multi-dimensional analysis (MDA) of Czech, a project carried out at the Czech National Corpus. The MDA procedure requires identifying and operationalizing linguistic features relevant to register variation in the language under study. In the Czech MDA project, some of these features were operationalized by compiling lists of words and multi-word expressions, which can then be matched against a text to identify occurrences. Compiling such a list is tedious and error-prone work, which is why we provide ours as a resource for other linguists, either to adopt wholesale or to use as a starting point to build on.
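As a rough illustration of what matching a list against a text can look like, here is a minimal Python sketch. It is not the procedure used in the Czech MDA project: the tokenizer is deliberately naive, the function names are hypothetical, and the three example expressions are ordinary Czech items chosen for illustration, not entries taken from the published lists.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (a deliberately naive tokenizer)."""
    return re.findall(r"\w+", text.lower())


def count_matches(tokens, expressions):
    """Count occurrences of each expression (one or more space-separated words) in tokens."""
    # Group expressions by their length in tokens, so each n-gram size is scanned once.
    by_length = {}
    for expr in expressions:
        parts = tuple(expr.lower().split())
        if parts:  # skip empty entries
            by_length.setdefault(len(parts), set()).add(parts)

    counts = Counter()
    for n, wanted in by_length.items():
        # Slide a window of n tokens over the text and look it up in the set.
        for i in range(len(tokens) - n + 1):
            window = tuple(tokens[i:i + n])
            if window in wanted:
                counts[" ".join(window)] += 1
    return counts


# Hypothetical list entries, for illustration only (not taken from the dataset).
example_list = ["protože", "i když", "na rozdíl od"]
example_text = "I když pršelo, šli jsme ven, protože jsme chtěli. Na rozdíl od nich, i když..."
counts = count_matches(tokenize(example_text), example_list)
```

The sketch matches surface word forms only and counts overlapping expressions independently. Because Czech is highly inflected, a real application would likely need to match against lemmas or morphological tags from an annotated corpus, depending on how the entries in a given list are defined.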

Identifier
DOI https://doi.org/10.18710/PGDWXC
Related Identifier IsCitedBy https://doi.org/10.1007/s10579-020-09487-4
Related Identifier IsCitedBy https://doi.org/10.1515/cllt-2018-0020
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/PGDWXC
Provenance
Creator Cvrček, Václav
Publisher DataverseNO
Contributor Lukeš, David; Czech National Corpus; Cvrček, Václav; Komrsková, Zuzana; Poukarová, Petra; Řehořková, Anna; Zasina, Adrian Jan; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2020
Funding Reference European Regional Development Fund CZ.02.1.01/0.0/0.0/16_013/0001758
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Lukeš, David (Czech National Corpus)
Representation
Resource Type corpus data; Dataset
Format application/vnd.openxmlformats-officedocument.wordprocessingml.document; application/pdf; text/plain
Size 34412; 415505; 6453; 661; 376; 9758; 1441; 1268; 1591; 3611; 4166; 797; 13261; 39474; 20048; 678; 774; 663; 1261; 389; 10252; 6891; 8296; 1595; 2002; 2310; 314; 1058; 613; 18675; 203
Version 1.1
Discipline Humanities
Spatial Coverage Prague