-
Oral History Resource: Lithuanian Testimonies of Siberian Deportations
The oral history resource includes: (1) Audio recordings (recorded in 2009-2010) of personal narratives by siblings Pranas Šuminskas and Vladislava Šuminskaitė about their... -
ParCzech 4.0
The ParCzech 4.0 corpus consists of stenographic protocols that record the Chamber of Deputies' meetings in the 7th term (2013-2017), the 8th term (2017-2021) and the current... -
Spoken corpus of Karel Makoň (2020-11-16)
Talks given by Karel Makoň to his friends from the late 1960s through the early 1990s. The topic is mostly Christian mysticism. -
English TTS speech corpus of air traffic (pilot) messages - Taiwanese accent
The corpus contains recordings of a male native speaker of Taiwanese speaking English. The sentences read by the speaker originate in the domain of air traffic... -
English TTS speech corpus of air traffic (pilot) messages - Serbian accent
The corpus contains recordings of a male native speaker of Serbian speaking English. The sentences read by the speaker originate in the domain of air traffic control... -
Corpus bilingüe d’alternança de llengües (codeswitching)
Eight interactive recordings of group dynamics. Bilingual speakers (L1 -> English; L1 -> Catalan/Spanish). -
Oasis Numbers
A spoken, monolingual, manually segmented, domain-specific corpus of numbers containing 5857 recorded words. -
Balaxan Corpus of Kurmanji
Balaxan is the first speech corpus of Kurmanji Kurdish, with 58 utterances by speakers of Kurmanji. Utterances are divided into 4 categories based on their sentence structures:... -
Phonetic Corpus of Estonian Spontaneous Speech (online search engine)
Studio recordings of spontaneous Estonian, phonetically segmented at the word, sound, and other linguistic levels. The current size is about 22 hours of speech (155 000 words). Online search... -
A Speech Test Set of Practice Business Presentations with Additional Relevant...
We present a test corpus of audio recordings and transcriptions of presentations of students' enterprises, together with their slides and web pages. The corpus is intended for... -
UFAL Speech Corpus of North Levantine Arabic 1.0 - Part 2
The corpus contains recordings by native speakers of North Levantine Arabic (apc), acquired during 2020, 2021, and 2023 in Prague, Paris, Kabardia, and St. Petersburg.... -
UFAL Speech Corpus of North Levantine Arabic 1.0 - Part 3
The corpus contains recordings by native speakers of North Levantine Arabic (apc), acquired during 2020, 2021, and 2023 in Prague, Paris, Kabardia, and St. Petersburg.... -
English TTS speech corpus of air traffic (pilot) messages - Czech accent
The corpus contains recordings of a male native speaker of Czech speaking English. The sentences read by the speaker originate in the domain of air traffic control... -
Vystadial 2013 – English data
Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems.... -
Czech Senior COMPANION Expressive Speech Corpus
The corpus contains Czech expressive speech recorded using a scenario-based approach by a professional female speaker. The scenario was created on the basis of previously recorded... -
UFAL Speech Corpus of North Levantine Arabic 1.0 - Part 1
The corpus contains recordings by native speakers of North Levantine Arabic (apc), acquired during 2020, 2021, and 2023 in Prague, Paris, Kabardia, and St. Petersburg.... -
Database of speech corpora of Czech laryngectomy patients
The corpus contains Czech speech of laryngectomy patients, recorded before the surgery that causes them to lose their voice, in order to preserve the voice, which can later be used for... -
ORAL2013: balanced corpus of informal spoken Czech (transcriptions & audio)
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (private environment, spontaneity, unpreparedness etc.) in the area of the whole... -
English TTS speech corpus of air traffic (pilot) messages - German accent
The corpus contains recordings of a male native speaker of German speaking English. The sentences read by the speaker originate in the domain of air traffic control... -
STAZKA – Speech recordings from vehicles
The database contains two sets of recordings, both made in moving or stationary vehicles (passenger cars or trucks). All data were recorded within the project...
