Replication Data for: Russian verbal borrowings in Udmurt

This is the dataset used in a study of Russian verbal loans in Udmurt. The files contain lists of Russian verbs found in the Udmurt social media corpus (http://udmurt.web-corpora.net/index_en.html), manually annotated for several features, such as aspect and frequency in different corpora.

Abstract: In Udmurt, a Uralic language that has experienced long and extensive contact with the dominant Russian language, all four typologically relevant strategies of verbal borrowing are attested. This is unusual both cross-linguistically and for the Uralic family. The paper investigates these strategies and the factors that govern their choice. It turns out that, although free variation plays a major role in the distribution of strategies, there are also several important morphological, stylistic and areal factors. By analyzing these factors and the available historical data, I propose a diachronic explanation of the currently observed distribution. The study is mostly based on corpus data collected from contemporary Udmurt-language social media.

Identifier
DOI	https://doi.org/10.18710/5N34CG
Related Identifier	https://doi.org/10.1515/flin-2019-2019
Metadata Access	https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/5N34CG
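
The Metadata Access URL above is an OAI-PMH GetRecord request. As a minimal sketch, assuming only Python's standard library, the same request can be assembled from the DOI and fetched as raw XML. The function names `oai_getrecord_url` and `fetch_record_xml` are illustrative and not part of any DataverseNO tooling, and nothing is assumed here about the structure of the returned record.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the Metadata Access URL in this record.
OAI_ENDPOINT = "https://dataverse.no/oai"


def oai_getrecord_url(doi: str, metadata_prefix: str = "oai_datacite") -> str:
    """Build an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # Keep ':' and '/' unescaped so the URL matches the form shown in the record.
    return f"{OAI_ENDPOINT}?{urlencode(params, safe=':/')}"


def fetch_record_xml(doi: str, timeout: float = 30.0) -> str:
    """Download the metadata record as an XML string (requires network access)."""
    with urlopen(oai_getrecord_url(doi), timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    print(oai_getrecord_url("10.18710/5N34CG"))
```

Calling `fetch_record_xml("10.18710/5N34CG")` would return the record as XML text, which can then be passed to any XML parser.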

Provenance
Creator	Arkhangelskiy, Timofey
Publisher	DataverseNO
Contributor	Arkhangelskiy, Timofey; Universität Hamburg; The Tromsø Repository of Language and Linguistics
Publication Year	2019
Funding Reference	Alexander von Humboldt Foundation
Rights	CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess	true
Contact	Arkhangelskiy, Timofey (Universität Hamburg)

Representation
Resource Type	corpus data; Dataset
Format	text/plain
Size	5483; 73005; 23757
Version	1.1
Discipline	Humanities
Spatial Coverage	Hamburg