Replication data for: Chunking or predicting – frequency information and reduction in the perception of multi-word sequences - Dataset

Dataset

Replication data for: Chunking or predicting – frequency information and reduction in the perception of multi-word sequences

DOI

This is the data and code from a word-monitoring task, in which participants responded to the word 'to' in verb + to-infinitive structures (V-to-Vinf) in English, where 'to' could occur in a full or reduced pronunciation. Accuracy and response times were analysed with mixed-effects generalized additive models (GAMM); the code also includes visualisations of these models. The paper is accepted for publication in Cognitive Linguistics. The experiment was run with OpenSesame (version 3.0.7 for Mac, cf. Mathôt et al. 2012). The data include information on frequencies of occurrence of words and bigrams; this was extracted from the Corpus of Contemporary American English (COCA, Davies 2008–). We used R (R Core Team 2017) for all data analyses, hence the code can best be replicated in R.

Abstract: Frequently used linguistic structures become entrenched in memory; this is often assumed to make their consecutive parts more predictable, as well as fuse them into a single unit (chunking). High frequency moreover leads to a propensity for phonetic reduction. We present a word recognition experiment which tests how frequency information (string frequency, transitional probability) interacts with reduction in speech perception. Detection of the element to is tested in V-to-Vinf sequences in English (e.g. need to Vinf), where to can undergo reduction (“needa”). Results show that reduction impedes recognition, but this can be mitigated by the predictability of the item. Recognition generally benefits from surface frequency, while a modest chunking effect is found in delayed responses to reduced forms of high-frequency items. Transitional probability shows a facilitating effect on reduced but not on full forms. Reduced forms also pose more difficulty when the phonological context obscures the onset of to. We conclude that listeners draw on frequency information in a predictive manner to cope with reduction. High-frequency structures are not inevitably perceived as chunks, but depend on cues in the phonetic form – reduction leads to perceptual prominence of the whole over the parts and thus promotes a holistic access.

OpenSesame, 3.0.7.

Identifier
DOI	https://doi.org/10.18710/7TSABU
Related Identifier	IsCitedBy https://doi.org/10.1515/cog-2017-0138
Metadata Access	https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/7TSABU

Provenance
Creator	Lorenz, David (ORCID: 0000-0002-7451-099X); Tizón-Couto, David
Publisher	DataverseNO
Contributor	Lorenz, David; University of Freiburg; University of Vigo; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year	2019
Funding Reference	Spanish Ministry of Economy and Competitiveness FFI2016-77018-P ; European Regional Development Fund IJCI-2015-25843 ; Xunta de Galicia ED431C 2017/50 ; Wissenschaftliche Gesellschaft Freiburg
Rights	CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess	true
Contact	Lorenz, David (University of Rostock)

Representation
Resource Type	experimental data; Dataset
Format	text/plain; application/x-rlang-transport; text/tab-separated-values
Size	8470; 25648; 316412; 31345; 575470; 369845; 599815; 352431
Version	1.3
Discipline	Humanities
Spatial Coverage	Freiburg; Vigo