An advanced NLP framework for high-quality Text-to-Speech synthesis

In order to build a TTS (Text-to-Speech) synthesis system one must provide two key components: a NLP (Natural Language Processing) stage, which essentially operates on the input text, and a speech generation stage to produce the desired output. These two distinct levels must exchange both data and commands to produce intelligible and natural speech. As the complete TTS task relies on many distinct scientific areas, any achievement toward standardization can minimize the effort and increase the dynamic of the results. This paper gives an overview of the NLP stage in the TTS system for Romanian language built by our collective, and describes the integration into the system of SSML (Speech Synthesis Markup Language), as a nowadays well recognized standard for TTS document authoring and inter-modules communication.

[1]  Stefan-Adrian Toma,et al.  Automatic rule-based syllabication for Romanian , 2009, 2009 Proceedings of the 5-th Conference on Speech Technology and Human-Computer Dialogue.

[2]  Rada Mihalcea,et al.  Letter Level Learning for Language Independent Diacritics Restoration , 2002, CoNLL.

[3]  C. Negrescu,et al.  AUTOMATIC DIACRITIC RESTORATION FOR A TTS-BASED E-MAIL READER APPLICATION , 2008 .

[4]  Dragos Burileanu,et al.  Prosody modeling for an embedded TTS system implementation , 2006, 2006 14th European Signal Processing Conference.

[5]  Dragos Burileanu,et al.  Basic Research and Implementation Decisions for a Text-to-Speech Synthesis System in Romanian , 2002, Int. J. Speech Technol..

[6]  Dragos Burileanu,et al.  A statistical approach to lexical stress assignment for TTS synthesis , 2009, Int. J. Speech Technol..

[7]  Vladimir Popescu,et al.  HYBRID SYLLABIFICATION AND LETTER-TO-PHONE CONVERSION FOR TTS SYNTHESIS , 2011 .

[8]  Cristian Negrescu,et al.  RECENT ADVANCES IN ROMANIAN LANGUAGE TEXT-TO-SPEECH SYNTHESIS , 2010 .

[9]  Melania Duma,et al.  Enhanced Rule-Based Phonetic Transcription for the Romanian Language , 2009, 2009 11th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[10]  D. Tufi Automatic Diacritics Insertion in Romanian Texts , 1999 .

[11]  Lianhong Cai,et al.  A Unified Framework for Multilingual Text-to-Speech Synthesis with SSML Specification as Interface * , 2009 .

[12]  Simon King,et al.  The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate , 2011, Speech Commun..

[13]  Eugeniu Oancea,et al.  Stressed Syllable Determination for Romanian Words within Speech Synthesis Applications , 2002, Int. J. Speech Technol..