Dataset - B2FIND

SNABI database for continuous speech recognition 1.2

The SNABI speech database can be used to train continuous speech recognition for Slovene language. The database comprises 1530 sentences, 150 words and the alphabet. 132...

NeMo Conformer CTC BPE E2E Automated Speech Recognition service RSDO-DS2-ASR-...

Automated Speech Recognition service for NeMo Conformer CTC BPE E2E models. For more details about building such models, see the official NVIDIA NeMo documentation...

Parliamentary spoken corpus of Serbian ParlaSpeech-RS 1.0

The ParlaSpeech-RS dataset is built from the transcripts of parliamentary proceedings available in the Serbian part of the ParlaMint (ParlaMint-RS) corpus, and the parliamentary...

ASR training dataset for Serbian JuzneVesti-SR v1.0

The JuzneVesti-SR dataset consists of audio recordings and manual transcripts from the Južne Vesti website and its host show called '15 minuta'...

ASR training dataset for Croatian ParlaSpeech-HR v1.0

The ParlaSpeech-HR dataset is built from parliamentary proceedings available in the Croatian part of the ParlaMint corpus and the parliamentary recordings available from the...

Spoken corpus Gos VideoLectures 4.1 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. It can be used for training...

Parliamentary spoken corpus of Czech ParlaSpeech-CZ 1.0

The ParlaSpeech-CZ dataset is built from the transcripts of parliamentary proceedings available in the Czech part of the ParlaMint corpus, and the parliamentary recordings...

Slovene Conformer CTC BPE E2E Automated Speech Recognition model RSDO-DS2-ASR...

This Conformer CTC BPE E2E Automated Speech Recognition model was trained following the NVIDIA NeMo Conformer-CTC recipe (for details see the official NVIDIA NeMo NMT...

Spoken corpus Gos VideoLectures 4.0 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Spoken corpus Gos VideoLectures 4.2 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. It can be used for training...

Parliamentary spoken corpus of Polish ParlaSpeech-PL 1.0

The ParlaSpeech-PL dataset is built from the transcripts of parliamentary proceedings available in the Polish part of the ParlaMint corpus, and the parliamentary recordings...

Spoken corpus Gos VideoLectures 2.0 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Parliamentary spoken corpus of Croatian ParlaSpeech-HR 2.0

The ParlaSpeech-HR dataset is built from the transcripts of parliamentary proceedings available in the Croatian part of the ParlaMint corpus, and the parliamentary recordings...

Spoken corpus Gos VideoLectures 4.0 (audio)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Speech Recognition System for Polish: Polish Film Chronicles

This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on recording of the Polish Film Chronicles. The system is based...

DiaBiz ASR benchmark

An evaluation report with accompanying datasets benchmarking the performance of commercially available ASR services of Polish on the DiaBiz corpus.

Speech Recognition System for Polish: Parliamentary Speech

This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on Polish Parliament speeches. The system is based on the Kaldi...

Speech Recognition System for Polish: Studio Quality

This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on studio quality speech. The system is based on the Kaldi...

Acoustic Data Building Toolset

This folder contains data and software tools (in python) that can be used in experiments with phoneme recognition in speech samples recorder in Polish. Acoustic data used here...

Modelling word learning and recognition using visually grounded speech

A set of recorded isolated nouns, verbs and image annotations used for testing the word recognition performance of our speech2image model. We trained a word recognition model...

40 datasets found