Replication data for: Chunking or predicting – frequency information and reduction in the perception of multi-word sequences

This dataset contains the data and code from a word-monitoring task in which participants responded to the word 'to' in English verb + to-infinitive structures (V-to-Vinf), where 'to' could occur in a full or a reduced pronunciation. Accuracy and response times were analysed with mixed-effects generalized additive models (GAMMs); the code also includes visualisations of these models. The paper has been accepted for publication in Cognitive Linguistics. The experiment was run with OpenSesame (version 3.0.7 for Mac, cf. Mathôt et al. 2012). The data include frequencies of occurrence of words and bigrams, which were extracted from the Corpus of Contemporary American English (COCA, Davies 2008–). We used R (R Core Team 2017) for all data analyses, so the code is best run in R.

Abstract: Frequently used linguistic structures become entrenched in memory; this is often assumed to make their consecutive parts more predictable, as well as fuse them into a single unit (chunking). High frequency moreover leads to a propensity for phonetic reduction. We present a word recognition experiment which tests how frequency information (string frequency, transitional probability) interacts with reduction in speech perception. Detection of the element to is tested in V-to-Vinf sequences in English (e.g. need to Vinf), where to can undergo reduction (“needa”). Results show that reduction impedes recognition, but this can be mitigated by the predictability of the item. Recognition generally benefits from surface frequency, while a modest chunking effect is found in delayed responses to reduced forms of high-frequency items. Transitional probability shows a facilitating effect on reduced but not on full forms. Reduced forms also pose more difficulty when the phonological context obscures the onset of to. We conclude that listeners draw on frequency information in a predictive manner to cope with reduction. High-frequency structures are not inevitably perceived as chunks, but depend on cues in the phonetic form – reduction leads to perceptual prominence of the whole over the parts and thus promotes a holistic access.

OpenSesame, 3.0.7.

Identifier
DOI https://doi.org/10.18710/7TSABU
Related Identifier IsCitedBy https://doi.org/10.1515/cog-2017-0138
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/7TSABU
Provenance
Creator Lorenz, David (ORCID: 0000-0002-7451-099X); Tizón-Couto, David
Publisher DataverseNO
Contributor Lorenz, David; University of Freiburg; University of Vigo; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2019
Funding Reference Spanish Ministry of Economy and Competitiveness FFI2016-77018-P ; European Regional Development Fund IJCI-2015-25843 ; Xunta de Galicia ED431C 2017/50 ; Wissenschaftliche Gesellschaft Freiburg
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Lorenz, David (University of Rostock)
Representation
Resource Type experimental data; Dataset
Format text/plain; application/x-rlang-transport; text/tab-separated-values
Size 8470; 25648; 316412; 31345; 575470; 369845; 599815; 352431
Version 1.3
Discipline Humanities
Spatial Coverage Freiburg; Vigo
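
The Metadata Access entry above is an OAI-PMH GetRecord request. As a minimal sketch of how that URL is composed, the Python snippet below rebuilds it from the endpoint, metadata prefix, and DOI given in this record. The function name build_getrecord_url is hypothetical and chosen only for illustration; the snippet makes no network request.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Endpoint copied from the Metadata Access entry of this record.
OAI_BASE = "https://dataverse.no/oai"


def build_getrecord_url(doi, metadata_prefix="oai_datacite", base=OAI_BASE):
    """Compose an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the DOI readable instead of percent-encoding it,
    # which matches the form of the URL listed in this record.
    return f"{base}?{urlencode(params, safe=':/')}"


url = build_getrecord_url("10.18710/7TSABU")
print(url)

# Parsing the query string back recovers the three request arguments.
query = parse_qs(urlsplit(url).query)
print(query["verb"][0], query["metadataPrefix"][0], query["identifier"][0])
```

Fetching the resulting URL with any HTTP client should return the record's metadata as XML; the exact structure of that response is not described in this record and is not assumed here.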