Topic: voice dataset

Mozilla: Common Voice is now the largest publicly available transcribed voice dataset

Following through on its goal of producing the world’s most diverse voice dataset, Mozilla believes it has now released what is now the largest transcribed voice dataset available publicly. The Common Voice project was started by the company as a way to make voice recognition available to everyone. “Most of the data used by large … continue reading

Mozilla open sources speech recognition model DeepSpeech

Mozilla announced a mission to help developers create speech-to-text applications earlier this year by making voice recognition and deep learning algorithms available to everyone. Today, the company’s machine learning group is one step closer to completing that mission with the initial open source release of its speech recognition model and voice dataset. “There are only … continue reading Protection Status