LifeLUCID Corpus: Recordings of Speakers Aged 8 to 85 Years Engaged in Interactive Task in the Presence of Energetic and Informational Masking, 2017-2020

DOI

This speech corpus contains recordings for 104 monolingual native southern British English speakers aged between 8 and 85 years old while they engaged in a problem-solving picture-based ‘spot the difference’ task (Diapix) with a conversational partner in four listening conditions. In NORM (quiet, no masking), participants heard each other normally. In SPSN (speech-shaped noise), participants communicated in speech shaped noise at an approximate signal-to-noise ratio of 0 dB. In IMRE (informational masking related picture), they communicated while three voices in the background talked about the same picture. In IMUR (information masking unrelated picture), they communicated while three voices in the background talked about a different Diapix picture. Participants were recorded in sex and age band matched pairs; they were unfamiliar with one another. They were seated in separate sound-treated booths and communicated via headphones. One participant in the pair took the lead in the interaction and participants switched roles, each completing four conditions in each role. For each recording, there is a stereo audio file (.wav format) with each participant on a separate channel and two textgrid files, with time-aligned orthographic transcripts for each of the participants.Our ability to communicate successfully with others can be strongly affected by the presence of noise and other voices in the environment, and children and older adults are more greatly affected than young adults in these situations. Even greater disruption is experienced for populations with hearing or language impairments, or even healthy adults who are non-native speakers. Some of the disruption is due to physical masking by interfering sounds (energetic masking EM) but if the disrupting sound can be understood, this also causes further difficulty (informational masking IM). Previous work suggests that informational masking causes relatively more disruption for children and older adults than for young adults but these findings are based on laboratory tests that are far from realistic communication. Although the impact of adverse conditions on speech communication has been the object of studies in different age groups, no study to date has taken a full lifespan view, looking at the relative impact of IM and EM on participants aged from 8 to 85 using a common experimental design. Also, many studies have focused on the impact of EM and IM on speech perception using recorded sentences or words; this ignores the fact that speakers make dynamic adaptations in speech communication to counter the effects of masking. We evaluate the impact of adverse conditions in an interactive task using measures reflecting speech communication efficiency (e.g., task transaction time, rate of dysfluencies). Finally, there is little evidence to date as to whether laboratory-based evaluations reflect the level of difficulty experienced in everyday life. The proposed project will, for the first time, relate evaluations of speech communication difficulties in adverse listening conditions as measured in the laboratory with real-life ratings of difficulty collected in real time over a two week period. It will also test whether primarily informational masking causes greater interference for some age groups (e.g. children, older adults), and if the underlying reasons for the interference differ between children and adults. In Study 1, 120 individuals aged 8 to 85 yrs will be tested in pairs while they carry out a problem-solving task in conditions varying in the degree of informational and energetic masking present. A secondary task will be added to make the task more cognitively demanding, thus reflecting real-life multitasking situations. Baseline measures of hearing, speech perception and cognitive function and a standardised questionnaire of auditory disability (SSQ) will also be collected. In Study 2, the same participants will be asked to report perceived communication difficulty and their listening environment on 6-7 occasions per day during a 2-week period using a smartphone-based app. Data from this study will be related to measures of communication effectiveness and SSQ data collected in Study 1. This project will lead to a better understanding of how the impact of adverse listening conditions changes across the lifespan, of the relative effects of IM and EM in different age groups, evaluated in realistic communicative conditions, and of the true ecological validity of laboratory-based evaluations. It will also provide normative data for a set of acoustic-phonetic measures across a 8-85 year age range. This benchmark will be of use for practitioners such as speech and language therapists and audiologists who work on aspects of communication with individuals of all ages. Importantly, this research will also contribute to our basic understanding of speech perception and production development across the lifespan.

104 monolingual native speakers of Standard Southern English participants are included in this corpus; a further 10 were also recorded and included in our study but did not give permission for their recordings to be used beyond the project. Participants were aged between 8-12 (Young Children, CH-Y, M=10.34 years), 13-17 (Older Children, CH-O, M=15.94 years), 18-29 years (Younger Adults, YA, M=21.82 years), 30-49 years (Middle Aged, MA, M=42.98 years), 50-64 (Older Middle Aged, OMA, M=59.30 years) and 65-85 years (Older Adults, OA, M=71.19 years). In the original study, there were 20 participants (10F) in each of the six age bands, apart from the 13-17 band, with only 4 males. Participants were recruited via a variety of means (adverts, social media, university participant pool). They were screened by phone for their language background and were then matched with another participant of the same sex and age band; participant pairs were therefore unknown to one another. All participants were classified as normal hearing, achieving a better ear average of <20 dB across the 0.25-4 kHz octave frequencies. They reported no history of speech and language impairments or neurological trauma. All participants aged over 65 passed the Montreal Cognitive Assessment (MoCA; Nasreddine et al. 2005) screening test. Ethical approval was obtained from the University College London (UCL) Research Ethics Committee, and informed written consent was obtained from each participant and a parent/guardian of each child. Participants completed a range of sensory and cognitive background tests and were then recorded while they completed a 'spot the difference' picture task (diapix) in four different conditions, both as 'Speaker A' (speaking taking the lead in the interaction) and as 'Speaker B' (taking a more passive role). Each of the tasks lasted for a maximum of ten minutes and recordings typically took place over two sessions.

Identifier
DOI https://doi.org/10.5255/UKDA-SN-854350
Metadata Access https://datacatalogue.cessda.eu/oai-pmh/v0/oai?verb=GetRecord&metadataPrefix=oai_ddi25&identifier=743351790f5eb233df9c2c03207f90ba4860774d6184376df175d1be48001600
Provenance
Creator Tuomainen, O, UCL; Taschenberger, L, UCL; Hazan, V, UCL
Publisher UK Data Service
Publication Year 2021
Funding Reference Economic and Social Research Council
Rights Valerie Hazan, UCL. Outi Tuomainen, UCL; The Data Collection is available for download to users registered with the UK Data Service. All requests are subject to the permission of the data owner or his/her nominee. Please email the contact person for this data collection to request permission to access the data, explaining your reason for wanting access to the data, then contact our Access Helpdesk.
OpenAccess true
Representation
Resource Type Text; Audio
Discipline Humanities; Linguistics; Psychology; Social and Behavioural Sciences
Spatial Coverage London; United Kingdom