ChiSCor: Children's Story Corpus

DOI

ChiSCor is a new corpus containing 619 fantasy stories, told freely by 442 Dutch children aged 4-12. ChiSCor was compiled for studying how children render character perspectives, and unravelling language and cognition in development, with computational tools. ChiSCor hosts text, audio, and annotations for character complexity and linguistic complexity. Additional metadata (e.g. education of caregivers) is available for one third of the Dutch children. ChiSCor also includes a small set of 62 English stories for which the same kinds of annotations are available, as well as detailed background information for a smaller subset of English-speaking children.

This is the corpus accompanying the publication "ChiSCor: A Corpus of Freely Told Fantasy Stories by Dutch Children for Computational Linguistics and Cognitive Science", presented at the Conference for Natural Language Learning (CoNLL) 2023 in Singapore.

Link to the paper accompanying ChiSCor: https://aclanthology.org/2023.conll-1.23/ Authors: Bram M.A. van Dijk, Max J. van Duijn, Suzan Verberne, Marco R. Spruit * indicates equal contribution

Note: if you use this corpus, please cite the paper mentioned above!

Note: read the corpus manual, also if you are looking for a quick overview of ChiSCor's contents.

Note: the corpus (structure) is best viewed in the 'tree view' mode.

Identifier
DOI https://doi.org/10.17026/SS/TGPDJF
Metadata Access https://ssh.datastations.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.17026/SS/TGPDJF
Provenance
Creator van Dijk, Bram; van Duijn, Max J.; Verberne, S.; Spruit, Marco R.
Publisher DANS Data Station Social Sciences and Humanities
Contributor van Dijk, Bram; Duijn, Max J.
Publication Year 2024
Rights CC-BY-NC-4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-nc/4.0
OpenAccess true
Contact van Dijk, Bram (Leiden University); Duijn, Max J. (Leiden University)
Representation
Resource Type Dataset
Format application/zip; text/csv; application/x-ipynb+json; image/png; application/vnd.oasis.opendocument.spreadsheet; text/tab-separated-values; text/plain; application/pdf; application/octet-stream
Size 176739; 186638; 8571; 3610562; 162; 96; 235537; 323710; 46936; 39117; 19173; 44497; 51029; 52248; 85522; 715; 144; 731549; 382201; 15958; 12282; 2727; 14254; 9146; 6816955; 502555; 165078; 170357
Version 2.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Linguistics; Social Sciences; Social and Behavioural Sciences; Soil Sciences