Study of deep learning and CMU Sphinx in automatic speech recognition

Abhishek Dhankar

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

Machine learning has proven to be a very effective tool in automatic speech recognition. This paper gives a broad overview of how various machine learning approaches are applied to speech recognition, with special reference to deep learning and CMU Sphinx. Deep learning in speech recognition is a relatively recent development, whereas CMU Sphinx, an open-source software package, has been used for this purpose for considerably longer. A convolutional neural network (CNN), a deep learning model, learns invariant features that help it differentiate between words and word sequences. CMU Sphinx uses a GMM-HMM (Gaussian mixture model, hidden Markov model) to predict the phonemes in an utterance and thereby determine the word or sequence of continuous words that was spoken.

Original language: English
Title of host publication: 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 2296-2301
Number of pages: 6
Volume: 2017-January
ISBN (Electronic): 9781509063673
DOIs: https://doi.org/10.1109/ICACCI.2017.8126189
Publication status: Published - 30-11-2017
Externally published: Yes
Event: 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 - Manipal, Mangalore, India
Duration: 13-09-2017 to 16-09-2017

Conference

Conference: 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017
Country: India
City: Manipal, Mangalore
Period: 13-09-17 to 16-09-17

Fingerprint

Speech recognition
Learning systems
Learning algorithms
Deep learning

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Cite this

Dhankar, A. (2017). Study of deep learning and CMU Sphinx in automatic speech recognition. In 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017 (Vol. 2017-January, pp. 2296-2301). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICACCI.2017.8126189