论文信息 - Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems

Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems

Text normalization is the process of converting non-standard words (NSWs) such as numbers, and abbreviations into standard words so that their pronunciations can be derived by a typical means (usually lexicon lookups). Text normalization is, thus, an important component of any text-to-speech (TTS) system. Without text normalization, the resulting voice may sound unintelligent. In this paper, we describe an approach to develop rule-based text normalization. We also describe our open source repository containing text normalization grammars and tests for Bangla, Javanese, Khmer, Nepali, Sinhala and Sundanese. Finally, we present a recipe for utilizing the grammars in a TTS system.

[1] Shankar Kumar,et al. Normalization of non-standard words , 2001, Comput. Speech Lang..

[2] Masao Utiyama,et al. Khmer Word Segmentation Using Conditional Random Fields , 2015 .

[3] Paul Taylor,et al. Text-to-Speech Synthesis , 2009 .

[4] Richard Sproat,et al. TTS for Low Resource Languages: A Bangla Synthesizer , 2016, LREC.

[5] Kumudu Gamage,et al. Festival-si: A Sinhala Text-to-Speech System , 2007, TSD.

[6] Richard Sproat,et al. The Kestrel TTS text normalization system , 2014, Natural Language Engineering.

[7] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[8] Brian Roark,et al. The OpenGrm open-source finite-state grammar software libraries , 2012, ACL.

[9] Richard Sproat,et al. Minimally Supervised Number Normalization , 2016, TACL.

[10] Firoj Alam,et al. Text normalization system for Bangla , 2008 .