Optimizing HMM Speech Synthesis for Low-Resource Devices
