-
NeMo Conformer CTC BPE E2E Automated Speech Recognition service RSDO-DS2-ASR-...
Automated Speech Recognition service for NeMo Conformer CTC BPE E2E models. For more details about building such models, see the official NVIDIA NeMo documentation... -
Speech Recognition System for Polish: Studio Quality
This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on studio quality speech. The system is based on the Kaldi... -
STAZKA – Speech recordings from vehicles
The database actually contains two sets of recordings, both recorded in the moving or stationary vehicles (passenger cars or trucks). All data were recorded within the project... -
Speech Processing, Recognition and Automatic Annotation Kit (SPRAAK)
SPRAAK (also Dutch for 'speech') is a speech recognition package. As such it is useful for transcription of speech, alignment of spoken and written language, annotation of... -
A Speech Test Set of Practice Business Presentations with Additional Relevant...
We present a test corpus of audio recordings and transcriptions of presentations of students' enterprises together with their slides and web-pages. The corpus is intended for... -
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated... -
Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0)
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consisting of 742,316 tokens and 73,835 sentences, representing 7,324 minutes... -
Prague DaTabase of Spoken Czech 1.0
PDTSC 1.0 is a multi-purpose corpus of spoken language. 768,888 tokens, 73,374 sentences and 7,324 minutes of spontaneous dialog speech have been recorded, transcribed and...