RAC - Recovery from Ana/Anorexia Corpus

RAC (Recovery from Ana/Anorexia Corpus) is a collection of Italian eating-disorder (ED) recovery community content downloaded from TikTok. It consists of 1000 videos from 27 TikTok channels (26 female users and 1 male user). Given the wide variety of features and formatting styles that characterize TikTok videos, we organized the data into 4 categories: 1) "Speech-only" videos, in which the user speaks with no background music or written text. 2) "Playback" videos, in which the user sings over a song played in the background. 3) "Text-only" videos, which contain only written text, with neither background music nor the user speaking. 4) "Mixed" videos, in which the above-mentioned features appear in various combinations. "Speech-only" and "Playback" videos were transcribed automatically using the Google Web Speech API, and the transcriptions were then checked manually. "Text-only" and "Mixed" videos were transcribed manually.
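
The mapping from video category to transcription route described above can be written down as a small lookup. This is only an illustrative sketch: the names `Category`, `TRANSCRIPTION_ROUTE`, and `transcription_route` are hypothetical and are not part of the corpus distribution, and the lowercase labels are an assumed normalization of the category names.

```python
from enum import Enum


class Category(Enum):
    """The four video categories named in the corpus description."""
    SPEECH_ONLY = "speech-only"
    PLAYBACK = "playback"
    TEXT_ONLY = "text-only"
    MIXED = "mixed"


# Route labels (hypothetical wording; the routes themselves come from the description).
AUTOMATIC_THEN_CHECKED = "automatic (Google Web Speech API), then manually checked"
MANUAL = "manual"

# Which transcription route each category followed, per the description.
TRANSCRIPTION_ROUTE = {
    Category.SPEECH_ONLY: AUTOMATIC_THEN_CHECKED,
    Category.PLAYBACK: AUTOMATIC_THEN_CHECKED,
    Category.TEXT_ONLY: MANUAL,
    Category.MIXED: MANUAL,
}


def transcription_route(label: str) -> str:
    """Return the transcription route for a category label (case-insensitive)."""
    return TRANSCRIPTION_ROUTE[Category(label.strip().lower())]
```

For example, `transcription_route("Playback")` returns the automatic-then-checked route, while `transcription_route("Mixed")` returns `"manual"`. An unknown label raises `ValueError`.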

Identifier
PID http://hdl.handle.net/20.500.11752/OPEN-997
Related Identifier https://amsacta.unibo.it/id/eprint/7248/
Related Identifier https://site.unibo.it/metaphan/en
Metadata Access http://dspace-clarin-it.ilc.cnr.it/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/OPEN-997
Provenance
Creator Donati, Melissa; Vernillo, Paola; Polidori, Ludovica; Gagliardi, Gloria
Publisher Alma Mater Studiorum – Università di Bologna
Publication Year 2023
OpenAccess true
Contact dspace-clarin-it-ilc-help(at)ilc.cnr.it
Representation
Language Italian
Resource Type corpus
Format downloadable_files_count: 0
Discipline Linguistics
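
The Metadata Access link above is an OAI-PMH `GetRecord` request with the `oai_dc` metadata prefix. A minimal sketch of how such a request could be built and its Dublin Core fields read with the Python standard library follows. The function names are hypothetical, the exact shape of the repository's response is assumed to follow the standard `oai_dc` layout, and the identifier is percent-encoded here, unlike in the link as listed.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Values taken from the Metadata Access field of this record.
BASE_URL = "http://dspace-clarin-it.ilc.cnr.it/repository/oai/request"
IDENTIFIER = "oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/OPEN-997"

# Standard Dublin Core element namespace used by oai_dc records.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_get_record_url(base_url=BASE_URL, identifier=IDENTIFIER):
    """Build an OAI-PMH GetRecord URL requesting oai_dc metadata."""
    query = urllib.parse.urlencode(
        {"verb": "GetRecord", "metadataPrefix": "oai_dc", "identifier": identifier}
    )
    return f"{base_url}?{query}"


def parse_dc_fields(xml_text):
    """Collect Dublin Core elements from an OAI-PMH response into a dict of lists."""
    root = ET.fromstring(xml_text)
    prefix = "{" + DC_NS + "}"
    fields = {}
    for elem in root.iter():
        if elem.tag.startswith(prefix):
            name = elem.tag[len(prefix):]
            text = (elem.text or "").strip()
            if text:
                fields.setdefault(name, []).append(text)
    return fields


def fetch_record(url=None, timeout=30):
    """Download the record and return its Dublin Core fields (needs network access)."""
    with urllib.request.urlopen(url or build_get_record_url(), timeout=timeout) as resp:
        return parse_dc_fields(resp.read().decode("utf-8"))
```

Calling `fetch_record()` would be expected to return keys such as `title` and `creator` if the repository is reachable and answers with a standard `oai_dc` record. Repeated elements, such as multiple creators, are kept as lists in document order.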