Replication Data for: A long birth: The development of gender-specific paucal constructions in Russian

DOI

The databases and scripts for statistical analysis included in this TROLLing post concern the so-called paucal construction in Russian where a numeral (dva ‘two’, tri ‘three’, chetyre ‘four’) is followed by an adjective and a noun. There are two versions of the database, one with examples in Cyrillic and one without. The version without examples can be used for statistical analysis, since some statistical software has problems with Cyrillic.

Article abstract: This article investigates the diachronic development of Russian numeral constructions consisting of a paucal numeral (dva ‘two’, tri ‘three’, chetyre ‘four’) followed by an adjective and a noun. Based on statistical analysis of more than 6,000 corpus examples, it is shown that a split took place in the second half of the twentieth century when feminine nouns developed a different agreement pattern from that of masculine and neuter nouns. This split is argued to represent the final step in a long “birth process” of gender-specific paucal constructions that started with the loss of the dual in the Middle Ages. It is suggested that we are witnessing a cascading effect, whereby the feminine pattern develops when the pattern for masculine and neuter nouns are approaching stabilization. The article furthermore includes a discussion of the hypothesis that “S-curves” represent a template for language change. While the documented changes resemble S-curves, the proposed analysis also addresses some general problems with testing the S-curve hypothesis empirically.

Identifier
DOI https://doi.org/10.18710/54ZJGQ
Related Identifier IsCitedBy https://doi.org/10.1075/dia
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/54ZJGQ
Provenance
Creator Nesset, Tore ORCID logo
Publisher DataverseNO
Contributor Nesset, Tore; UiT The Arctic University of Norway; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2020
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Nesset, Tore (UiT The Arctic University of Norway)
Representation
Resource Type corpus data; Dataset
Format text/tab-separated-values; text/plain; type/x-r-syntax
Size 764453; 628592; 1717681; 1530049; 11727; 4318
Version 1.1
Discipline Humanities