Dataset card for common voice corpus 17.0 dataset summary the common voice dataset consists of a unique mp3 and corresponding text file. A multilingual dataset for 166. This will result in the largest representative (age, gender, accents, etc) voice dataset that anyone can use to build innovative voice technology solutions which can work for every luganda.
Scratch Off Prizes Remaining Michigan at Mabel Singer blog
The dataset contains 725175 clips representing 1119 hours of recorded speech. Many of the 31175 recorded hours in the. We selected data from a single speaker with the most utterances for luganda and hausa.
The audio and transcripts were sourced from mozilla common voice (luganda v12.0 and kiswahili v15.0) and curated for voice consistency and quality.
Alffa project [1] developed tts and asr technologies and. This dataset is designed for. The audio and transcripts were sourced from mozilla common voice (luganda v12.0 and kiswahili v15.0) and curated for voice consistency and quality. The dataset currently consists of 22,642 validated hours in 137 languages, but we’re always adding more voices and languages.
A swahili dataset for language modeling and additional datasets for swahili syllabic alphabet and swahili word analogy. This datasheet is for version 23.0 of the the mozilla common voice scripted speech dataset for swahili (sw). Take a look at our languages page to request a language.