A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems

Several text-to-speech (TTS) systems are available today for languages such as English, Japanese, and Chinese, but Indian languages still lag behind in the quality of synthesized speech. Although almost all Indian languages share a common phonetic base, no usable TTS system yet covers all official Indian languages. Existing speech synthesis techniques have also been found to be less effective for the script format of Indian languages. Considering the intelligibility of the speech produced and the growing memory requirements of Indian language TTS systems, this paper proposes a rule-based concatenative technique for speech synthesis in Indian languages. We compare it with the existing technique, and our experimental results show that the proposed technique outperforms it.
