Speaker-adaptive-trainable Boltzmann machine and its application to non-parallel voice conversion
暂无分享,去创建一个
[1] Tomoki Toda,et al. Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech , 2012, Speech Commun..
[2] Athanasios Mouchtaris,et al. Nonparallel training for voice conversion based on a parameter adaptation approach , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Alexander Kain,et al. Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[4] Chung-Hsien Wu,et al. Map-based adaptation for speech conversion using adaptation data selection and non-parallel training , 2006, INTERSPEECH.
[5] Hermann Ney,et al. Vocal tract normalization equals linear transformation in cepstral space , 2001, IEEE Transactions on Speech and Audio Processing.
[6] Keikichi Hirose,et al. One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space , 2011, INTERSPEECH.
[7] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.
[8] L. Rabiner,et al. The acoustics, speech, and signal processing society - A historical perspective , 1984, IEEE ASSP Magazine.
[9] Tomoki Toda,et al. Eigenvoice conversion based on Gaussian mixture model , 2006, INTERSPEECH.
[10] Kishore Prahallad,et al. Voice conversion using Artificial Neural Networks , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[11] Koichi Shinoda,et al. Vocal tract length normalization using rapid maximum-likelihood estimation for speech recognition , 2002, Systems and Computers in Japan.
[12] Tomoki Toda,et al. Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[13] Tetsuya Takiguchi,et al. Exemplar-based voice conversion in noisy environment , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[14] Haizhou Li,et al. Conditional restricted Boltzmann machine for voice conversion , 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing.
[15] Xu Shao,et al. Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model , 2002, INTERSPEECH.
[16] Nobuaki Minematsu,et al. Japanese Dictation Toolkit-1997 version- , 1999 .
[17] Keikichi Hirose,et al. Application of matrix variate Gaussian mixture model to statistical voice conversion , 2014, INTERSPEECH.
[18] Keikichi Hirose,et al. Speech generation from hand gestures based on space mapping , 2009, INTERSPEECH.
[19] Tetsuya Takiguchi,et al. Noise-robust voice conversion based on spectral mapping on sparse space , 2013, SSW.
[20] R. Gray,et al. Vector quantization , 1984, IEEE ASSP Magazine.
[21] Tsuyoshi Masuda,et al. Cost Reduction of Training Mapping Function Based on Multistep Voice Conversion , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[22] Keikichi Hirose,et al. Voice conversion based on matrix variate Gaussian mixture model , 2014 .
[23] Eric Moulines,et al. Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..
[24] Moncef Gabbouj,et al. Voice Conversion Using Partial Least Squares Regression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[25] Li-Rong Dai,et al. Joint spectral distribution modeling using restricted boltzmann machines for voice conversion , 2013, INTERSPEECH.
[26] Tetsuya Takiguchi,et al. Voice conversion using speaker-dependent conditional restricted Boltzmann machine , 2015, EURASIP Journal on Audio, Speech, and Music Processing.
[27] Richard M. Schwartz,et al. A compact model for speaker-adaptive training , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[28] Tapani Raiko,et al. Improved Learning of Gaussian-Bernoulli Restricted Boltzmann Machines , 2011, ICANN.
[29] Tomoki Toda,et al. Many-to-many eigenvoice conversion with reference voice , 2009, INTERSPEECH.
[30] Li Deng,et al. High-performance robust speech recognition using stereo training data , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[31] R. Salakhutdinov. Learning and Evaluating Boltzmann Machines , 2008 .
[32] Tetsuya Takiguchi,et al. Voice Conversion Using RNN Pre-Trained by Recurrent Temporal Restricted Boltzmann Machines , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[33] Eric Moulines,et al. Voice transformation using PSOLA technique , 1991, Speech Commun..
[34] Thomas Schaaf,et al. VTLN in the MFCC Domain: Band-Limited versus Local Interpolation , 2011, INTERSPEECH.
[35] Tetsuya Takiguchi,et al. Voice conversion in high-order eigen space using deep belief nets , 2013, INTERSPEECH.