A Unified Parser for Developing Indian Language Text to Speech Synthesizers

This paper describes the design of a language independent parser for text-to-speech synthesis in Indian languages. Indian languages come from 5–6 different language families of the world. Most Indian languages have their own scripts. This makes parsing for text to speech systems for Indian languages a difficult task. In spite of the number of different families which leads to divergence, there is a convergence owing to borrowings across language families. Most importantly Indian languages are more or less phonetic and can be considered to consist broadly of about 35–38 consonants and 15–18 vowels. In this paper, an attempt is made to unify the languages based on this broad list of phones. A common label set is defined to represent the various phones in Indian languages. A uniform parser is designed across all the languages capitalising on the syllable structure of Indian languages. The proposed parser converts UTF-8 text to common label set, applies letter-to-sound rules and generates the corresponding phoneme sequences. The parser is tested against the custom-built parsers for multiple Indian languages. The TTS results show that the accuracy of the phoneme sequences generated by the proposed parser is more accurate than that of language specific parsers.

[1]  Keiichi Tokuda,et al.  Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[2]  Anil Kumar Singh A Computational Phonetic Model for Indian Language Scripts , 2006 .

[3]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[4]  Kishore Prahallad,et al.  Global syllable set for building speech synthesis in Indian languages , 2008, 2008 IEEE Spoken Language Technology Workshop.

[5]  Tony Mason,et al.  Lex & Yacc , 1992 .

[6]  Dan Flickinger,et al.  An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG , 2000, LREC.

[7]  Hema A. Murthy,et al.  A common attribute based unified HTS framework for speech synthesis in Indian languages , 2013, SSW.

[8]  Pushpraj Shukla,et al.  sAT k A Unified Computational Lexicon for Hindi-English Code-Switching , 2004 .

[9]  Prahallad Kishore,et al.  A simple approach for building transliteration editors for Indian languages , 2005 .

[10]  M. Ramasubba Reddy,et al.  An approach to building language-independent text-to-speech synthesis for Indian languages , 2014, 2014 Twentieth National Conference on Communications (NCC).

[11]  Kishore Prahallad,et al.  Building sleek synthesizers for multi-lingual screen reader , 2008, INTERSPEECH.

[12]  Rohit Kumar,et al.  A Data-Driven Synthesis Approach for Indian Language using Syllable as Basic Unit , 2002 .