Attention-based Wavenet Autoencoder for Universal Voice Conversion
暂无分享,去创建一个
[1] Li-Rong Dai,et al. WaveNet Vocoder with Limited Training Data for Voice Conversion , 2018, INTERSPEECH.
[2] Lauri Juvela,et al. Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] Peng Song,et al. Text-independent voice conversion using speaker model alignment method from non-parallel speech , 2014, INTERSPEECH.
[4] Sercan Ömer Arik,et al. Deep Voice 2: Multi-Speaker Neural Text-to-Speech , 2017, NIPS.
[5] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[6] Lior Wolf,et al. VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop , 2017, ICLR.
[7] Junichi Yamagishi,et al. The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods , 2018, Odyssey.
[8] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.
[9] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[10] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[11] S. King,et al. The Blizzard Challenge 2011 , 2011 .
[12] Sepp Hochreiter,et al. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.
[13] Adam Coates,et al. Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.
[14] Lior Wolf,et al. A Universal Music Translation Network , 2018, ICLR.
[15] Navdeep Jaitly,et al. Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Lior Wolf,et al. Fitting New Speakers Based on a Short Untranscribed Sample , 2018, ICML.
[17] Samy Bengio,et al. Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model , 2017, ArXiv.
[18] Haifeng Li,et al. A KL Divergence and DNN-Based Approach to Voice Conversion without Parallel Training Sentences , 2016, INTERSPEECH.
[19] Karen Simonyan,et al. Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders , 2017, ICML.
[20] Yoshua Bengio,et al. Char2Wav: End-to-End Speech Synthesis , 2017, ICLR.
[21] S. King,et al. The Blizzard Challenge 2013 , 2013, The Blizzard Challenge 2013.
[22] Heiga Zen,et al. Parallel WaveNet: Fast High-Fidelity Speech Synthesis , 2017, ICML.
[23] Daniel Erro,et al. INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[24] Sercan Ömer Arik,et al. Deep Voice 3: 2000-Speaker Neural Text-to-Speech , 2017, ICLR 2018.
[25] Bayya Yegnanarayana,et al. Voice conversion , 1989, Speech Commun..
[26] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..
[27] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[28] Tsao Yu,et al. Voice conversion from non-parallel corpora using variational auto-encoder , 2016 .
[29] Yu Tsao,et al. Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks , 2017, INTERSPEECH.