Comparing ANN and GMM in a voice conversion framework
暂无分享,去创建一个
[1] Shashidhar G. Koolagudi,et al. Voice Transformation by Mapping the Features at Syllable Level , 2007, PReMI.
[2] H. Ney,et al. VTLN-based cross-language voice conversion , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[3] Antonio Bonafonte,et al. Including dynamic and phonetic information in voice conversion systems , 2004, INTERSPEECH.
[4] Hermann Ney,et al. A study on residual prediction techniques for voice conversion , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[5] Bayya Yegnanarayana,et al. Transformation of formants for voice conversion using artificial neural networks , 1995, Speech Commun..
[6] K. Shikano,et al. Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[7] Hui Ye,et al. High quality voice morphing , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[8] Moncef Gabbouj,et al. LSF mapping for voice conversion with very small training sets , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[9] H. Hoge,et al. Residual prediction based on unit selection , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..
[10] Werner Verhelst,et al. Voice conversion using partitions of spectral feature space , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[11] Bayya Yegnanarayana,et al. Prosody modification using instants of significant excitation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[12] K. Sreenivasa Rao,et al. Voice conversion by mapping the speaker-specific features using pitch synchronous approach , 2010, Comput. Speech Lang..
[13] Yoshinori Sagisaka,et al. Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks , 1995, Speech Commun..
[14] K.-S. Lee,et al. Statistical Approach for Voice Personality Transformation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[15] Kishore Prahallad,et al. Source and system features for speaker recognition using AANN models , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[16] Bayya Yegnanarayana,et al. Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs , 2010, IEEE Signal Processing Letters.
[17] Alexander Kain,et al. Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[18] Eric Moulines,et al. Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..
[19] Masato Akagi,et al. Speaker individualities in fundamental frequency contours and its control , 1995, EUROSPEECH.
[20] S. R. Mahadeva Prasanna,et al. Extraction of speaker-specific excitation information from linear prediction residual of speech , 2006, Speech Commun..
[21] B. Yegnanarayana,et al. Artificial Neural Networks , 2004 .
[22] Yoshinori Sagisaka,et al. Acoustic characteristics of speaker individuality: Control and conversion , 1995, Speech Commun..
[23] Satoshi Nakamura,et al. Voice conversion through vector quantization , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.
[24] Alexander Kain,et al. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[25] Satoshi Nakamura,et al. Speaker adaptation and voice conversion by codebook mapping , 1991, 1991., IEEE International Sympoisum on Circuits and Systems.
[26] Levent M. Arslan,et al. Speaker Transformation Algorithm using Segmental Codebooks (STASC) , 1999, Speech Commun..
[27] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .
[28] Jia Liu,et al. Voice conversion with smoothed GMM and MAP adaptation , 2003, INTERSPEECH.
[29] John H. L. Hansen,et al. Speaker-specific pitch contour modeling and modification , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[30] Levent M. Arslan,et al. Robust processing techniques for voice conversion , 2006, Comput. Speech Lang..
[31] Kishore Prahallad,et al. Spectral Mapping Using Artificial Neural Networks for Voice Conversion , 2010, IEEE Transactions on Audio, Speech, and Language Processing.