论文信息 - Duration Modelling Using Neural Networks for Hindi TTS System Considering Position of Syllable in a Word

Duration Modelling Using Neural Networks for Hindi TTS System Considering Position of Syllable in a Word

Abstract The main criterion in duration modeling is to model the duration pattern of the natural speech, considering various features that affect the pattern. Proper estimation of segmental durations plays a vital role in natural sounding text-to-speech (TTS) synthesis. The primary reason for choosing the syllable as a basic unit is that the Indian languages are syllable centered. This paper presents a novel text processing and a syllable based data driven modelling of segmental duration for Hindi, using feed forward neural networks. The effectiveness of the system is demonstrated by synthesizing natural sounding speech for Hindi, national language of India.

[1] Katarina Bartkova,et al. A model of segmental duration for speech synthesis in French , 1987, Speech Commun..

[2] Bayya Yegnanarayana,et al. Modeling syllable duration in Indian languages using neural networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] D. Klatt. Linguistic uses of segmental duration in English: acoustic and perceptual evidence. , 1976, The Journal of the Acoustical Society of America.

[4] T Shreekanth,et al. Development of Speech Database for Hindi Text-To-Speech System Considering Syllable as a Basic Unit , 2014 .

[5] B. Yegnanarayana,et al. Artificial Neural Networks , 2004 .

[6] D. J. Ravi,et al. A Novel Approach to Develop Speech Database for Kannada Text-to-Speech System , 2011 .

[7] Sin-Horng Chen,et al. A new duration modeling approach for Mandarin speech , 2003, IEEE Trans. Speech Audio Process..

[8] Mahendra Caturvedī,et al. A Practical Hindi-English dictionary = व्यावहारिक हिंदी-अँग्रेज़ी कोश , 1975 .

[9] Hema A. Murthy,et al. Duration modeling of Indian languages Hindi and Telugu , 2004, SSW.

[10] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .