RAC - Recovery from Ana/Anorexia Corpus is a collection of Italian ED-recovery community content downloaded from TikTok. It consists of 1000 videos from 27 TikTok channels (26 females and 1 male).
Given the wide variety of features and formatting styles that characterize TikTok videos, we organized the data into 4 categories:
1) "Speech-only" videos, in which the user was talking in the absence of background music and/or written text.
2) "Playback" videos, in which the user sings over a song that is played in the background.
3) "Text-only" videos, in which there is neither background music nor the users themselves speaking, but only written text.
4) "Mixed" videos, in which the above-mentioned features are present in various combinations.
"Speech-only" and "playback" videos were transcribed automatically using the Google Web Speech API. Afterward, transcriptions were manually checked. "Text-only" and "mixed" videos underwent manual transcription.