Data for "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role"

Dataset

DOI

This dataset makes available the sample of clauses used in the study "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role". It includes 751 clauses from the fiction subcorpus of the University of Tartu’s Balanced Corpus of Written Estonian (cl.ut.ee/korpused) and 758 clauses from the Corpus of Spoken Estonian, maintained by the University of Tartu’s research group of Spoken Estonian (not publicly available at the time of publication). The spoken language selection derives from a subset of everyday (face-to-face and telephone) conversations. The dataset includes both the randomly selected clauses and manual coding, described in the paper.

Identifier
DOI	https://datadoi.ee/handle/33/567
Metadata Access	https://datadoi.ee/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:datadoi.ee:33/567

Provenance
Creator	Vihman, Virve-Anneli; Miljan, Merilin
Publisher	University of Tartu, Institute of Estonian and General Linguistics
Publication Year	2023
Rights	info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-nc-nd/4.0/
OpenAccess	true
Contact	University of Tartu, Institute of Estonian and General Linguistics

Representation
Language	Estonian
Resource Type	info:eu-repo/semantics/dataset
Format	CSV; text/plain; text/csv; application/pdf
Discipline	Other