CHULA TTS: A Modularized Text-To-Speech Framework

Spoken and written languages evolve constantly through everyday use. Combined with the practical expectation that synthetic speech be generated automatically for a variety of domains, this means that Text-to-Speech (TTS) systems for living languages need to be extensible, so that handlers for new language phenomena can be added and the system can be customized to the domain in which it is deployed. ChulaTTS was designed and implemented around a modular concept. Its framework lets the components of a typical TTS system work together, and their combinations are customized using simple human-readable configurations. Under the .NET development framework, new text processing and signal synthesis components can be built, while existing components can simply be wrapped in .NET dynamic-link libraries that expose the expected methods governed by a predefined programming interface. An implementation of ChulaTTS and sample applications are also discussed in this paper.
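
To make the modular idea concrete, the following is a minimal sketch of configuration-driven component selection, not the ChulaTTS implementation. It is written in Python rather than .NET for brevity, and every name in it (TextProcessor, Synthesizer, REGISTRY, CONFIG, build_pipeline, and the two stand-in components) is a hypothetical illustration; the actual ChulaTTS interface and configuration format are not given in this abstract.

```python
from __future__ import annotations

import json
from abc import ABC, abstractmethod


class TextProcessor(ABC):
    """Hypothetical interface: turns raw text into synthesis units."""

    @abstractmethod
    def process(self, text: str) -> list[str]: ...


class Synthesizer(ABC):
    """Hypothetical interface: turns synthesis units into audio bytes."""

    @abstractmethod
    def synthesize(self, units: list[str]) -> bytes: ...


class WhitespaceTokenizer(TextProcessor):
    """Stand-in text processor that splits on whitespace."""

    def process(self, text: str) -> list[str]:
        return text.split()


class SilenceSynthesizer(Synthesizer):
    """Stand-in synthesizer: one zero byte per character, not real audio."""

    def synthesize(self, units: list[str]) -> bytes:
        return b"\x00" * sum(len(u) for u in units)


# Assumed registry mapping configuration names to component classes.
REGISTRY = {"whitespace": WhitespaceTokenizer, "silence": SilenceSynthesizer}

# Assumed human-readable configuration; the real format may differ.
CONFIG = '{"text_processor": "whitespace", "synthesizer": "silence"}'


def build_pipeline(config_text: str):
    """Instantiate the components named in the configuration and chain them."""
    config = json.loads(config_text)
    processor = REGISTRY[config["text_processor"]]()
    synthesizer = REGISTRY[config["synthesizer"]]()

    def speak(text: str) -> bytes:
        return synthesizer.synthesize(processor.process(text))

    return speak
```

In this sketch, swapping a component only requires registering a new class that satisfies the interface and changing one name in the configuration; the pipeline code itself stays untouched.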
