Voice conversion using deep neural networks with speaker-independent pre-training
暂无分享,去创建一个
[1] Seyed Hamidreza Mohammadi,et al. Transmutative voice conversion , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[2] Shashidhar G. Koolagudi,et al. Voice Transformation by Mapping the Features at Syllable Level , 2007, PReMI.
[3] Haizhou Li,et al. Conditional restricted Boltzmann machine for voice conversion , 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing.
[4] Alexander Kain,et al. High-resolution voice transformation , 2001 .
[5] Tomoki Toda,et al. Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[6] Alexander Kain,et al. Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[7] Michael D. Buhrmester,et al. Amazon's Mechanical Turk , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.
[8] Razvan Pascanu,et al. M L ] 2 0 A ug 2 01 3 Pylearn 2 : a machine learning research library , 2014 .
[9] Kishore Prahallad,et al. Spectral Mapping Using Artificial Neural Networks for Voice Conversion , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[10] Tetsuya Takiguchi,et al. Voice conversion in high-order eigen space using deep belief nets , 2013, INTERSPEECH.
[11] S. King,et al. Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis , 2013, SSW.
[12] Pascal Vincent,et al. Contractive Auto-Encoders: Explicit Invariance During Feature Extraction , 2011, ICML.
[13] Rabul Hussain Laskar,et al. Comparing ANN and GMM in a voice conversion framework , 2012, Appl. Soft Comput..
[14] Elias Azarov,et al. Real-time voice conversion using artificial neural networks with rectified linear units , 2013, INTERSPEECH.
[15] Heiga Zen,et al. Statistical parametric speech synthesis using deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[16] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[17] A. F. Smith,et al. Statistical analysis of finite mixture distributions , 1986 .
[18] Peter Glöckner,et al. Why Does Unsupervised Pre-training Help Deep Learning? , 2013 .
[19] Dong Yu,et al. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[20] Tsuneo Nitta,et al. Voice conversion for arbitrary speakers using articulatory-movement to vocal-tract parameter mapping , 2013, 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).
[21] Li-Rong Dai,et al. Joint spectral distribution modeling using restricted boltzmann machines for voice conversion , 2013, INTERSPEECH.
[22] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.
[23] H. B. Mann,et al. On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .
[24] Pascal Vincent,et al. Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..
[25] Li-Rong Dai,et al. Using bidirectional associative memories for joint spectral envelope modeling in voice conversion , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Bayya Yegnanarayana,et al. Transformation of formants for voice conversion using artificial neural networks , 1995, Speech Commun..
[27] Eric Moulines,et al. Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..
[28] Tara N. Sainath,et al. FUNDAMENTAL TECHNOLOGIES IN MODERN SPEECH RECOGNITION Digital Object Identifier 10.1109/MSP.2012.2205597 , 2012 .
[29] Thierry Dutoit,et al. Towards a Voice Conversion System Based on Frame Selection , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.