Stream

Our Deep Speech system for speech recognition attains 16.5% error on Switchboard (Hub5'00), outperforming previous published results. We also focus on realistic noisy environments (speech in a noisy crowd, car, etc.) In this regime Deep Speech significantly outperforms commercial systems. Key to this approach were (i) Our scalable multi-GPU infrastructure for training an RNN, (ii) Using 7,000 hours of clean speech data, and using that to synthesize a massive 100,000 hours of data (by adding the clean data to different types of noise) to train the models. I think end-to-end deep learning is the future of speech. Paper here: http://arxiv.org/abs/1412.5567﻿

The text of a 19-page, international trade agreement being drafted in secret was published by WikiLeaks on Thursday as the transparency group’s editor commemorated his two-year anniversary confined to the Ecuadorian Embassy in London.