Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics
暂无分享,去创建一个
Tomoki Toda | Satoshi Nakamura | Graham Neubig | Sakriani Sakti | Shinnosuke Takamichi | Yuji Oshima | Graham Neubig | S. Sakti | T. Toda | Satoshi Nakamura | Shinnosuke Takamichi | Yuji Oshima
[1] Keikichi Hirose,et al. CART-based factor analysis of intelligibility reduction in Japanese English , 2003, INTERSPEECH.
[2] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..
[3] Takao Kobayashi,et al. Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training , 2007, IEICE Trans. Inf. Syst..
[4] Tomoki Toda,et al. Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[5] Hermann Ney,et al. Text-Independent Voice Conversion Based on Unit Selection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[6] Heiga Zen,et al. Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Frank K. Soong,et al. A frame mapping based HMM approach to cross-lingual voice transformation , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Keiichi Tokuda,et al. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.
[9] Keiichi Tokuda,et al. Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[10] Masato Akagi,et al. Speaker individualities in speech spectral envelopes , 1994, ICSLP.
[11] Takayuki Arai,et al. Development of a Japanese and English Speech Synthesis System Based on HMM Using Voice Conversion for the People with Speech Communication Disorder , 2008 .
[12] Tomoki Toda,et al. Cross-language voice conversion based on eigenvoices , 2009, INTERSPEECH.
[13] Alan W. Black. Speech synthesis for educational technology , 2007, SLaTE.
[14] Ricardo Gutierrez-Osuna,et al. Can voice conversion be used to reduce non-native accents? , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Robert A. J. Clark,et al. Native and non-native speaker judgements on the quality of synthesized speech , 2010, INTERSPEECH.
[16] Daniel Erro,et al. Frame alignment method for cross-lingual voice conversion , 2007, INTERSPEECH.
[17] 高本 捨三郎. Applied English phonology : teaching of English pronunciation to the native Japanese speaker , 1965 .
[18] Keiichi Tokuda,et al. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis , 2013, Comput. Speech Lang..
[19] Frank K. Soong,et al. A cross-language state mapping approach to bilingual (Mandarin-English) TTS , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[20] Hiroshi Suzuki,et al. In search of a method to improve the prosodic features of English spoken by Japanese , 1990, ICSLP.
[21] Masato Akagi,et al. Speaker Individualities in Speech Spectral Envelopes and Fundamental Frequency Contours , 2007, Speaker Classification.
[22] Yoshihiko Nankaku,et al. State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis , 2009, INTERSPEECH.
[23] Heiga Zen,et al. Hidden Semi-Markov Model Based Speech Synthesis System , 2006 .
[24] Levent M. Arslan,et al. Application of voice conversion for cross-language rap singing transformation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[25] Tomoki Toda,et al. Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation , 2006, INTERSPEECH.
[26] Hideki Kawahara,et al. Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT , 2001, MAVEBA.
[27] Nobuaki Minematsu,et al. Development of English Speech Database Read by Japanese to Support CALL Research , 2004 .
[28] Tomoki Toda,et al. Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation , 2011, INTERSPEECH.
[29] Heiga Zen,et al. Speech Synthesis Based on Hidden Markov Models , 2013, Proceedings of the IEEE.