Abstract
Machine learning has proven to be a very effective tool in automatic speech recognition. This paper gives a broad overview of how various machine learning approaches are applied to speech recognition, with special reference to deep learning and CMU Sphinx. Deep learning in speech recognition is a relatively recent development, whereas CMU Sphinx, an open-source software package, has been used for this purpose for considerably longer. A CNN, a deep learning algorithm, learns invariant features that help it differentiate between words and word sequences. CMU Sphinx uses a GMM-HMM model to predict the phonemes in an utterance and thereby determine the word, or sequence of continuous words, that was spoken.
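The GMM-HMM idea mentioned in the abstract can be illustrated with a small sketch: each hidden state (standing in for a phoneme) scores an acoustic frame with a Gaussian mixture, and Viterbi decoding picks the most likely state sequence. This is not CMU Sphinx code or its API. The two states, the one-dimensional frames, and every number below are assumptions invented for illustration; a real recognizer works on multi-dimensional acoustic features and far larger models.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def log_gmm(x, components):
    """Log density of a 1-D Gaussian mixture given as [(weight, mean, var), ...]."""
    terms = [math.log(w) + log_gauss(x, m, v) for w, m, v in components]
    top = max(terms)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(t - top) for t in terms))

def viterbi(frames, states, log_init, log_trans, emissions):
    """Return the most likely state sequence for the observed frames."""
    # best[s]: best log score of any path that ends in state s at the current frame
    best = {s: log_init[s] + log_gmm(frames[0], emissions[s]) for s in states}
    backptrs = []
    for x in frames[1:]:
        prev, best, ptr = best, {}, {}
        for s in states:
            p = max(states, key=lambda r: prev[r] + log_trans[r][s])
            best[s] = prev[p] + log_trans[p][s] + log_gmm(x, emissions[s])
            ptr[s] = p
        backptrs.append(ptr)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: best[s])]
    for ptr in reversed(backptrs):
        path.append(ptr[path[-1]])
    return path[::-1]

# Hypothetical toy model: two phoneme-like states with made-up parameters.
states = ["AH", "S"]
log_init = {s: math.log(0.5) for s in states}
log_trans = {
    "AH": {"AH": math.log(0.8), "S": math.log(0.2)},
    "S": {"AH": math.log(0.2), "S": math.log(0.8)},
}
emissions = {
    "AH": [(0.5, 0.0, 1.0), (0.5, 1.0, 1.0)],  # two-component mixture
    "S": [(1.0, 5.0, 1.0)],                    # single component
}
frames = [0.2, 0.7, 0.4, 5.1, 4.8]  # invented 1-D "acoustic" frames

path = viterbi(frames, states, log_init, log_trans, emissions)
print(path)
```

With these toy values the first three frames sit near the "AH" mixture and the last two near the "S" component, so the decoded path is AH, AH, AH, S, S. Mapping such state sequences to words would additionally require a pronunciation dictionary and a language model, which this sketch leaves out.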
Original language | English |
---|---|
Title of host publication | 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 2296-2301 |
Number of pages | 6 |
Volume | 2017-January |
ISBN (Electronic) | 9781509063673 |
Publication status | Published - 30-11-2017 |
Externally published | Yes |
Event | 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 - Manipal, Mangalore, India; Duration: 13-09-2017 → 16-09-2017 |
Conference
Conference | 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 |
---|---|
Country/Territory | India |
City | Manipal, Mangalore |
Period | 13-09-17 → 16-09-17 |
All Science Journal Classification (ASJC) codes
- Computer Networks and Communications
- Computer Science Applications
- Information Systems