Mozilla Open Speech Recognition Library now includes 18 languages

Over the last year, Mozilla has expanded its Common Voice initiative to include open-source speech recognition data sets in more languages. The organization has now published the largest available collection of human voice recordings, spanning 18 languages, including Dutch, Hakha-Chin, Esperanto, Farsi, Basque, Spanish, French, Mandarin Chinese (Traditional), Welsh and Kabyle. The collection consists of 1,400 hours of recorded voice clips from 42,000 contributors. Some contributors are volunteers who simply wanted to help, while others are linguists and professionals working in voice technologies.
