CopCo: The Copenhagen Corpus of Eye-Tracking Recordings from Natural Reading

CopCo is an eye-tracking corpus designed for both psycholinguistics and natural language processing. The goal is to investigate how different populations read Danish texts. To this end, we record the eye movements of participants reading continuous Danish texts at their own pace. The CopCo corpus contains eye movement data from native speakers of Danish, both readers with dyslexia and readers without dyslexia. Additionally, there is a set of non-native speakers.

The data consist of one CSV file per participant, containing the computed eye-tracking metrics for each word. Each file lists the text read by that participant, with one line per word. In addition to the words, word IDs, sentence IDs and text IDs, the files contain the following eye-tracking features for each word: landing position, first fixation duration, first pass duration, go-past time, mean fixation duration, total fixation duration, number of fixations, mean saccade duration and peak saccade velocity.
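
The word-per-line layout described above lends itself to simple aggregation. The following is a minimal sketch of how one participant file might be read and summarised, not a description of the official tooling. The column names (text_id, sentence_id, word_id, word, total_fixation_duration, number_of_fixations) are assumptions chosen for illustration and may not match the actual CSV headers; the sample rows are invented values, not CopCo data.

```python
import csv
import io
from collections import defaultdict

# Invented sample rows with HYPOTHETICAL column names; the real CopCo
# headers may differ, so check the actual files before reusing this.
SAMPLE = """text_id,sentence_id,word_id,word,total_fixation_duration,number_of_fixations
1,1,1,Det,180,1
1,1,2,var,0,0
1,1,3,engang,420,2
1,2,1,Hun,150,1
1,2,2,læste,310,2
"""


def load_words(handle):
    """Read one participant file (one word per line) into a list of dicts."""
    rows = []
    for row in csv.DictReader(handle):
        # Convert the numeric fields used below; the IDs stay as strings.
        row["total_fixation_duration"] = float(row["total_fixation_duration"])
        row["number_of_fixations"] = int(row["number_of_fixations"])
        rows.append(row)
    return rows


def sentence_reading_times(rows):
    """Sum total fixation duration over the words of each sentence."""
    totals = defaultdict(float)
    for row in rows:
        key = (row["text_id"], row["sentence_id"])
        totals[key] += row["total_fixation_duration"]
    return dict(totals)


def skip_rate(rows):
    """Fraction of words that received no fixation at all."""
    skipped = sum(1 for row in rows if row["number_of_fixations"] == 0)
    return skipped / len(rows)


words = load_words(io.StringIO(SAMPLE))
times = sentence_reading_times(words)
rate = skip_rate(words)
```

For a real file, the in-memory sample would be replaced by something like open(path, newline="", encoding="utf-8"), with the column names adjusted to the headers actually present.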

This project has been approved by the Ethics Commission of the Faculty of Humanities of the University of Copenhagen.

Identifier
PID http://hdl.handle.net/20.500.12115/48
Related Identifier https://aclanthology.org/2022.lrec-1.182/
Related Identifier https://osf.io/ud8s5/
Metadata Access http://repository.clarin.dk/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:repository.clarin.dk:20.500.12115/48
Provenance
Creator Hollenstein, Nora; Björnsdóttir, Marina; Barrett, Maria
Publisher Centre for Language Technology, NorS, University of Copenhagen
Publication Year 2022
Rights Creative Commons - Attribution 4.0 International (CC BY 4.0); http://creativecommons.org/licenses/by/4.0/; PUB
OpenAccess true
Contact info(at)clarin.dk
Representation
Language Danish
Resource Type corpus
Format application/octet-stream; text/plain; text/plain; charset=utf-8; downloadable_files_count: 58
Discipline Linguistics