Characterization of aspirated and unaspirated sounds in speech

Pravin Bhaskar Ramteke, Anmol Sadanand, Shashidhar G. Koolagudi, Vidya Pai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In this work, consonant aspiration and unaspiration phenomena are studied. It is known that, pronunciation of aspiration and unaspiration is characterized by the 'puff of air' released at the place of constriction in the vocal tract which is known as burst. Here, the properties of vowel immediately after the burst are studied for characterization of the burst. Excitation source signal estimated from the speech linear prediction residual is used for the task. The signal characteristics such as glottal pulse, duration of open, closed & return phases, slope of open & return phases, duration of burst, ratio of highest and lowest energies of signal and voice onset time (VOT) are explored to characterize aspiration and unaspiration. TIMIT English speech corpus is used to test the proposed approach. Random forest (RF) and support vector machine (SVMs) are used as classifiers to test the effectiveness of the features used for the task. An accuracy of 99.93% and 94.03% is achieved respectively. From the results, it is observed that the proposed features are robust in classifying the aspirated and unaspirated consonants.

Original languageEnglish
Title of host publicationTENCON 2017 - 2017 IEEE Region 10 Conference
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2840-2845
Number of pages6
Volume2017-December
ISBN (Electronic)9781509011339
DOIs
Publication statusPublished - 19-12-2017
Externally publishedYes
Event2017 IEEE Region 10 Conference, TENCON 2017 - Penang, Malaysia
Duration: 05-11-201708-11-2017

Conference

Conference2017 IEEE Region 10 Conference, TENCON 2017
Country/TerritoryMalaysia
CityPenang
Period05-11-1708-11-17

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Characterization of aspirated and unaspirated sounds in speech'. Together they form a unique fingerprint.

Cite this