论文信息 - Phonetic and Prosodically Rich Transcribed speech corpus in Indian languages: Bengali and Odia

Phonetic and Prosodically Rich Transcribed speech corpus in Indian languages: Bengali and Odia

In this paper, we introduce a speech corpus in Indian languages namely Bengali and Odia, which provides phonetic and prosodic information. Phonetics and prosody are vital parameters in human speech perception, hence systematically studying them will help in performing various speech processing tasks. Motivated by this, we have developed Phonetic and Prosodically Rich Transcribed (PPRT) Speech corpus in Bengali and Oriya languages. In this speech corpus ten hours of read speech, five hours of conversation speech and five hours of extempore speech have been collected. The database has been transcribed using International Phonetic Alphabet (IPA) for representing all possible phoneme variations. Along with the phonetic transcription, prosodic information such as duration patterns of syllables, intonation patterns of phrases and break patterns within and across phrases are represented.

K. Sreenivasa Rao | Shakti Kumar | Debadatta Pati

[1] Shashidhar G. Koolagudi,et al. Emotion recognition from speech using source, system, and prosodic features , 2012, Int. J. Speech Technol..

[2] Pabitra Mitra,et al. Developing Bengali Speech Corpus for Phone Recognizer Using Optimum Text Selection Technique , 2011, 2011 International Conference on Asian Language Processing.

[3] Bayya Yegnanarayana,et al. Extraction and representation of prosodic features for language and speaker recognition , 2008, Speech Commun..

[4] Shashidhar G. Koolagudi,et al. Emotion recognition from speech using global and local prosodic features , 2013, Int. J. Speech Technol..

[5] Pabitra Mitra,et al. Bengali speech corpus for continuous auutomatic speech recognition system , 2011, 2011 International Conference on Speech Database and Assessments (Oriental COCOSDA).

[6] K. Sreenivasa Rao,et al. Robust Speaker Recognition in Noisy Environments , 2014 .