Replication Data for: From machine learning to classroom learning: mobile vowels and the Russian preposition v ‘in(to)’

DOI

The present study reports on a machine learning experiment concerning mobile vowels in the Russian preposition v ‘in(to)’. Data are extracted from the Russian National Corpus.

It is shown that a neural network is able to predict mobile vowels in 97.4% of the cases in our dataset, and a decision tree is used to extract a set of three rules that a language learner can use to achieve nearly the same level of accuracy. We argue that these rules are valuable from the perspective of language pedagogy, but that some adjustments are necessary in order to make the rules simpler and more precise. Our study lends support to earlier analyses which emphasize the capacity of mobile vowels to prevent sequences of identical segments. We advance the Word Onset Hierarchy, which enables us to evaluate the relative importance of phonological features for mobile vowels and highlights the gradient and asymmetric nature of mobile vowels. It is suggested that machine learning represents a valuable tool for language pedagogy, not only for mobile vowels, but also for other areas of Russian grammar that are challenging for students of Russian as a foreign language.

Identifier
DOI https://doi.org/10.18710/ZCDX1B
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/ZCDX1B
Provenance
Creator Nesset, Tore ORCID logo; Xavier, Kevin
Publisher DataverseNO
Contributor Nesset, Tore; UiT The Arctic University of Norway; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2023
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Nesset, Tore (UiT The Arctic University of Norway)
Representation
Resource Type annotated corpus data; Dataset
Format text/comma-separated-values; text/plain
Size 4752823; 5205
Version 1.0
Discipline Humanities