TY - GEN
T1 - Neural Network based Speech Assistance tool to enhance the fluency of adults who stutter
AU - Narasimhan, Sharan
AU - Rao, Rohini R.
PY - 2019/8
Y1 - 2019/8
N2 - Millions of adults suffer from a condition called stuttering or stammering. The authors propose the use of a Speech Assistance tool, which helps stuttered speakers achieve higher fluency and a slower rate of speech. The fluency is achieved by adhering to the proposed fluency enhancing technique. The fluency enhancing technique (FET) is inspired by fluency shaping methods and requires the speaker to use a rhythmic method called gentle onset with words and a slower rate of speech. In the training mode, the Speech assistance tool trains an artificial neural network to identify the speaker's FET based words vs. the non-FET or normal words. The audio features are represented using Mel-Frequency Cepstral Coefficients (MFCC), which captures the prosody of the spoken words. In the real-life conversation mode, the speaker gets visual cues to ensure that the speaker adheres to the proposed FET technique. The tool also performs disfluency analysis and provides feedback to users, in terms of FET words ratio, the disfluency score for a hundred words, and the speech rate. The tool also logs the disfluencies periodically to help the speaker track his/her fluency over time. The DTW analysis of MFCC features proven that there is a clear difference in the prosody of the FET and non-FET words. While using the proposed FET based tool, the fluency of the speaker increases and slower speech rate is also achieved. The Speech assistance tool can be used along with Cognitive Behavior Therapy to help rehabilitate adults who stutter.
AB - Millions of adults suffer from a condition called stuttering or stammering. The authors propose the use of a Speech Assistance tool, which helps stuttered speakers achieve higher fluency and a slower rate of speech. The fluency is achieved by adhering to the proposed fluency enhancing technique. The fluency enhancing technique (FET) is inspired by fluency shaping methods and requires the speaker to use a rhythmic method called gentle onset with words and a slower rate of speech. In the training mode, the Speech assistance tool trains an artificial neural network to identify the speaker's FET based words vs. the non-FET or normal words. The audio features are represented using Mel-Frequency Cepstral Coefficients (MFCC), which captures the prosody of the spoken words. In the real-life conversation mode, the speaker gets visual cues to ensure that the speaker adheres to the proposed FET technique. The tool also performs disfluency analysis and provides feedback to users, in terms of FET words ratio, the disfluency score for a hundred words, and the speech rate. The tool also logs the disfluencies periodically to help the speaker track his/her fluency over time. The DTW analysis of MFCC features proven that there is a clear difference in the prosody of the FET and non-FET words. While using the proposed FET based tool, the fluency of the speaker increases and slower speech rate is also achieved. The Speech assistance tool can be used along with Cognitive Behavior Therapy to help rehabilitate adults who stutter.
UR - http://www.scopus.com/inward/record.url?scp=85081995491&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081995491&partnerID=8YFLogxK
U2 - 10.1109/DISCOVER47552.2019.9008034
DO - 10.1109/DISCOVER47552.2019.9008034
M3 - Conference contribution
T3 - 2019 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2019 - Proceedings
BT - 2019 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2019
Y2 - 11 August 2019 through 12 August 2019
ER -