DigiDiaDem Speech-Cognitive Dataset (DSCD-CZ-2)

This updated and expanded version of the dataset was created to investigate speech and cognitive performance in people with varying degrees of cognitive impairment, primarily dementia. It contains the results of standardized neuropsychological tests (RBANS, ALBA, POBAV, MASTCZ), speech tasks focused on comprehension, memory, naming, and repetition, and demographic data (age, gender, education).

Participants were divided into four groups based on clinical assessment: healthy individuals, healthy individuals with possible mild cognitive impairment, patients with mild cognitive impairment, and patients with dementia. All recordings and examinations were conducted as part of routine clinical practice in the neurological outpatient clinic – Memory Clinic at the Department of Neurology at the Faculty Hospital Královské Vinohrady. The dataset, which contains 371 examinations, was divided into a training part and a test part using stratification by clinical group, age, gender, and level of education, so that these key characteristics are evenly distributed across both parts.
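The record does not describe the exact splitting procedure, so the following is only a minimal sketch of how a split stratified by clinical group, age, gender, and education could be reproduced. The field names (`group`, `age`, `gender`, `education`), the 10-year age bands, and the 20% test fraction are all assumptions made for illustration and are not taken from the dataset.

```python
import random
from collections import defaultdict


def age_band(age, width=10):
    """Bin age into bands (assumed 10 years wide) to keep strata from being too sparse."""
    return (age // width) * width


def stratified_split(records, test_fraction=0.2, seed=0):
    """Split records into (train, test), sampling separately within each stratum.

    A stratum is one combination of clinical group, age band, gender, and
    education level. All field names here are hypothetical.
    """
    strata = defaultdict(list)
    for rec in records:
        key = (rec["group"], age_band(rec["age"]), rec["gender"], rec["education"])
        strata[key].append(rec)

    rng = random.Random(seed)  # fixed seed makes the split reproducible
    train, test = [], []
    for key in sorted(strata):
        members = strata[key]
        rng.shuffle(members)
        n_test = round(len(members) * test_fraction)
        test.extend(members[:n_test])
        train.extend(members[n_test:])
    return train, test
```

Very small strata may contribute no examinations to the test part under this scheme, so with only 371 examinations a coarser age or education binning may be needed in practice.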

Compared with the previous version of the dataset, this version additionally includes manually engineered features and scores.

The aim of the dataset is to support the development of methods for automated detection of cognitive disorders based on speech analysis and cognitive performance. The data are suitable for research in the areas of clinical neuropsychology, computational linguistics, and machine learning. The dataset is intended for non-commercial research purposes.

Identifier
PID http://hdl.handle.net/11234/1-6043
Related Identifier http://hdl.handle.net/11234/1-5912
Related Identifier https://starfos.tacr.cz/en/projekty/TQ01000332
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-6043
Provenance
Creator Šmídl, Luboš; Kompasová, Marie; Zapletalová, Michaela; Polák, Filip; Zajícová, Lucie; Švec, Jan; Víta, Martin; Bartoš, Aleš
Publisher University of West Bohemia, Department of Cybernetics
Publication Year 2025
Rights Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0); http://creativecommons.org/licenses/by-nc-nd/4.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Czech
Resource Type corpus
Format application/zip; application/octet-stream; text/plain; charset=utf-8; downloadable_files_count: 2
Discipline Linguistics