Dataset - B2FIND

Modelling word learning and recognition using visually grounded speech

A set of recorded isolated nouns, verbs and image annotations used for testing the word recognition performance of our speech2image model. We trained a word recognition model...

Spoken corpus Gos VideoLectures 4.2 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. It can be used for training...

Spoken corpus Gos VideoLectures 4.0 (audio)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Acoustic Data Building Toolset

This folder contains data and software tools (in Python) that can be used in experiments with phoneme recognition in speech samples recorded in Polish. Acoustic data used here...

Speech Recognition System for Polish: Parliamentary Speech

This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on Polish Parliament speeches. The system is based on the Kaldi...

Speech Recognition System for Polish: Polish Film Chronicles

This resource contains dockerized models and scripts of an automatic speech recognition system for Polish trained on recordings of the Polish Film Chronicles. The system is based...

DiaBiz ASR benchmark

An evaluation report with accompanying datasets benchmarking the performance of commercially available ASR services for Polish on the DiaBiz corpus.

Spoken corpus Gos VideoLectures 4.1 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. It can be used for training...

Spoken corpus Gos VideoLectures 4.0 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Spoken corpus Gos VideoLectures 3.0 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Spoken corpus Gos VideoLectures 2.0 (audio)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

SNABI database for continuous speech recognition 1.2

The SNABI speech database can be used to train continuous speech recognition for the Slovene language. The database comprises 1530 sentences, 150 words and the alphabet. 132...

Spoken corpus Gos VideoLectures 3.0 (audio)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

Speech Database of Spoken Flight Information Enquiries SOFES 1.0

The SOFES speech database (Spoken Flight Enquiries in Slovene) is a collection of transcribed and segmented audio recordings of spoken flight-information enquiries in Slovene....

Spoken corpus Gos VideoLectures 1.0 (audio)

Gos VideoLectures is an add-on to the Gos reference speech corpus of Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos Videolectures...

ASR training dataset for Serbian JuzneVesti-SR v1.0

The JuzneVesti-SR dataset consists of audio recordings and manual transcripts from the Južne Vesti website and its host show called '15 minuta'...

Slovene Conformer CTC BPE E2E Automated Speech Recognition model RSDO-DS2-ASR...

This Conformer CTC BPE E2E Automated Speech Recognition model was trained following the NVIDIA NeMo Conformer-CTC recipe (for details see the official NVIDIA NeMo NMT...

ASR training dataset for Croatian ParlaSpeech-HR v1.0

The ParlaSpeech-HR dataset is built from parliamentary proceedings available in the Croatian part of the ParlaMint corpus and the parliamentary recordings available from the...

Spoken corpus Gos VideoLectures 1.0 (transcription)

Gos Videolectures is an add-on to the Gos reference speech corpus of Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos Videolectures...

Spoken corpus Gos VideoLectures 2.0 (transcription)

Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus...

28 datasets found