Tech Showcase: Customizing Speech Recognition for Higher Accuracy Transcriptions

Play Tech Showcase: Customizing Speech Recognition for Higher Accuracy Transcriptions
Sign in to queue


Two of the most important components of speech recognition systems are the acoustic model and the language model. Those models behind Microsoft's speech recognition engine have been optimized for certain usage scenarios, such as interacting with Cortana on a smart phone, searching the web by voice, or sending text messages to a friend. But if a user has specific needs, such as recognizing domain-specific vocabulary or the ability to understand accents, then the acoustic and language models need to be customized. This demo shows the benefits of customizing acoustic and language models to improve the accuracy of speech recognition for lectures. Using the Custom Speech Service (Cognitive Service) technics, the demo shows how the technology can tune speech recognition for specific topic and lecturers.



Download this episode

Download captions

The Discussion

Add Your 2 Cents