Paper information - Optimizing HMM Speech Synthesis for Low-Resource Devices - 字舞流文

Optimizing HMM Speech Synthesis for Low-Resource Devices

Géza Németh | Bálint Tóth

[1] Keiichi Tokuda, et al. Speech parameter generation algorithms for HMM-based speech synthesis, 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[2] Alan W. Black, et al. Flite: a small fast run-time synthesis engine, 2001, SSW.

[3] Barbara Leporini, et al. Accessing Google Docs via Screen Reader, 2010, ICCHP.

[4] Inma Hernáez, et al. HNM-based MFCC+F0 extractor applied to statistical speech synthesis, 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5] Phil D. Green, et al. Speech technology for e-inclusion of people with physical disabilities and disordered speech, 2005, INTERSPEECH.

[6] Heiga Zen, et al. Statistical Parametric Speech Synthesis, 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7] Géza Németh, et al. Corpus-Based Unit Selection TTS for Hungarian, 2006, TSD.

[8] Robert W. Brodersen, et al. An automated floating-point to fixed-point conversion methodology, 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings (ICASSP '03).

[9] D. Klatt, et al. Analysis, synthesis, and perception of voice quality variations among female and male talkers, 1990, The Journal of the Acoustical Society of America.

[10] Aimilios Chalamandaris, et al. Embedded unit selection text-to-speech synthesis for mobile devices, 2009, IEEE Transactions on Consumer Electronics.

[11] Géza Németh, et al. Cross Platform Solution of Communication and Voice/Graphical User Interface for Mobile Devices in Vehicles, 2007.

[12] Jong-Jin Kim, et al. HMM-based Korean speech synthesis system for hand-held devices, 2006, IEEE Transactions on Consumer Electronics.

[13] Douglas D. O'Shaughnessy, et al. Diphone speech synthesis, 1988, Speech Commun.

[14] Jozef Juhar, et al. Speech and mobile technologies for cognitive communication and information systems, 2011, 2011 2nd International Conference on Cognitive Infocommunications (CogInfoCom).

[15] Tamás Gábor Csapó, et al. Spemoticons: Text to Speech Based Emotional Auditory Cues, 2011.

[16] Koichi Shinoda, et al. MDL-based context-dependent subword modeling for speech recognition, 2000.

[17] Heiga Zen, et al. Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems, 2009, INTERSPEECH.

[18] S. Imai, et al. Mel Log Spectrum Approximation (MLSA) filter for speech synthesis, 1983.

[19] Bernd Möbius. Corpus-based speech synthesis: Methods and challenges, 2000.

[20] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proc. IEEE.

[21] Keiichi Tokuda, et al. Mixed excitation for HMM-based speech synthesis, 2001, INTERSPEECH.

[22] Heiga Zen, et al. Recent development of the HMM-based speech synthesis system (HTS), 2009.

[23] Peter Baranyi, et al. Cognitive infocommunications: CogInfoCom, 2010, 2010 11th International Symposium on Computational Intelligence and Informatics (CINTI).