-
kharaman-czypionka-eulitz-2025
This data package contains the data described in the article "Event-related potentials and oscillatory brain activity reflect a complex interplay of syntactic, semantic and... -
The YouTube Corpus of Singapore English Podcasts
The YouTube Corpus of Singapore English Podcasts (YCSEP) contains transcripts from 620 hours of over 1,300 podcast episodes by Singapore-based content creators. The dataset,... -
Udmurt dialectal dataset: discourse particles and other clitics
This is a dataset that contains sentences in various dialects of Udmurt (Permic < Uralic; ISO 639-3 code udm). It mainly contains questionnaire responses collected for the...
