Nitk Kids' speech corpus

Pravin Bhaskar Ramteke, Sujata Supanekar, Pradyoth Hegde, Hanna Nelson, Venkataraja Aithal, Shashidhar G. Koolagudi

Research output: Contribution to journalConference article

Abstract

This paper introduces speech database for analyzing children's speech. The proposed database of children is recorded in Kannada language (one of the South Indian languages) from children between age 2 12 to 6 12 years. The database is named as National Institute of Technology Karnataka Kids' Speech Corpus (NITK Kids' Speech Corpus). The relevant design considerations for the database collection are discussed in detail. It is divided into four age groups with an interval of 1 year between each age group. The speech corpus includes nearly 10 hours of speech recordings from 160 children. For each age range, the data is recorded from 40 children (20 male and 20 female). Further, the effect of developmental changes on the speech from 2 12 to 6 12 years are analyzed using pitch and formant analysis. Some of the potential applications, of the NITK Kids' Speech Corpus, such as, systematic study on the language learning ability of children, phonological process analysis and children speech recognition are discussed.

Original languageEnglish
Pages (from-to)331-335
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2019-September
DOIs
Publication statusPublished - 01-01-2019
Event20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 - Graz, Austria
Duration: 15-09-201919-09-2019

Fingerprint

Corpus
Speech
Speech Recognition
Children
Speech recognition
Interval
Data Base
Range of data
Language
Design
Language Acquisition
Formants
Phonological Processes
Indian Languages

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Cite this

Ramteke, Pravin Bhaskar ; Supanekar, Sujata ; Hegde, Pradyoth ; Nelson, Hanna ; Aithal, Venkataraja ; Koolagudi, Shashidhar G. / Nitk Kids' speech corpus. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019 ; Vol. 2019-September. pp. 331-335.
@article{26c87be9a54f477ba4fcc0bbe93e8ef0,
title = "Nitk Kids' speech corpus",
abstract = "This paper introduces speech database for analyzing children's speech. The proposed database of children is recorded in Kannada language (one of the South Indian languages) from children between age 2 12 to 6 12 years. The database is named as National Institute of Technology Karnataka Kids' Speech Corpus (NITK Kids' Speech Corpus). The relevant design considerations for the database collection are discussed in detail. It is divided into four age groups with an interval of 1 year between each age group. The speech corpus includes nearly 10 hours of speech recordings from 160 children. For each age range, the data is recorded from 40 children (20 male and 20 female). Further, the effect of developmental changes on the speech from 2 12 to 6 12 years are analyzed using pitch and formant analysis. Some of the potential applications, of the NITK Kids' Speech Corpus, such as, systematic study on the language learning ability of children, phonological process analysis and children speech recognition are discussed.",
author = "Ramteke, {Pravin Bhaskar} and Sujata Supanekar and Pradyoth Hegde and Hanna Nelson and Venkataraja Aithal and Koolagudi, {Shashidhar G.}",
year = "2019",
month = "1",
day = "1",
doi = "10.21437/Interspeech.2019-2061",
language = "English",
volume = "2019-September",
pages = "331--335",
journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
issn = "2308-457X",

}

Nitk Kids' speech corpus. / Ramteke, Pravin Bhaskar; Supanekar, Sujata; Hegde, Pradyoth; Nelson, Hanna; Aithal, Venkataraja; Koolagudi, Shashidhar G.

In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2019-September, 01.01.2019, p. 331-335.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Nitk Kids' speech corpus

AU - Ramteke, Pravin Bhaskar

AU - Supanekar, Sujata

AU - Hegde, Pradyoth

AU - Nelson, Hanna

AU - Aithal, Venkataraja

AU - Koolagudi, Shashidhar G.

PY - 2019/1/1

Y1 - 2019/1/1

N2 - This paper introduces speech database for analyzing children's speech. The proposed database of children is recorded in Kannada language (one of the South Indian languages) from children between age 2 12 to 6 12 years. The database is named as National Institute of Technology Karnataka Kids' Speech Corpus (NITK Kids' Speech Corpus). The relevant design considerations for the database collection are discussed in detail. It is divided into four age groups with an interval of 1 year between each age group. The speech corpus includes nearly 10 hours of speech recordings from 160 children. For each age range, the data is recorded from 40 children (20 male and 20 female). Further, the effect of developmental changes on the speech from 2 12 to 6 12 years are analyzed using pitch and formant analysis. Some of the potential applications, of the NITK Kids' Speech Corpus, such as, systematic study on the language learning ability of children, phonological process analysis and children speech recognition are discussed.

AB - This paper introduces speech database for analyzing children's speech. The proposed database of children is recorded in Kannada language (one of the South Indian languages) from children between age 2 12 to 6 12 years. The database is named as National Institute of Technology Karnataka Kids' Speech Corpus (NITK Kids' Speech Corpus). The relevant design considerations for the database collection are discussed in detail. It is divided into four age groups with an interval of 1 year between each age group. The speech corpus includes nearly 10 hours of speech recordings from 160 children. For each age range, the data is recorded from 40 children (20 male and 20 female). Further, the effect of developmental changes on the speech from 2 12 to 6 12 years are analyzed using pitch and formant analysis. Some of the potential applications, of the NITK Kids' Speech Corpus, such as, systematic study on the language learning ability of children, phonological process analysis and children speech recognition are discussed.

UR - http://www.scopus.com/inward/record.url?scp=85074735128&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85074735128&partnerID=8YFLogxK

U2 - 10.21437/Interspeech.2019-2061

DO - 10.21437/Interspeech.2019-2061

M3 - Conference article

AN - SCOPUS:85074735128

VL - 2019-September

SP - 331

EP - 335

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

SN - 2308-457X

ER -