Speech Super-Resolution Using Parallel WaveNet
暂无分享,去创建一个
Dong Yu | Dan Su | Jia Jia | Xixin Wu | Zhiyong Wu | Helen M. Meng | Mu Wang | Shiyin Kang
[1] Sercan Ömer Arik,et al. Deep Voice 2: Multi-Speaker Neural Text-to-Speech , 2017, NIPS.
[2] Matthias Bethge,et al. A note on the evaluation of generative models , 2015, ICLR.
[3] Joan Bruna,et al. Super-Resolution with Deep Convolutional Sufficient Statistics , 2015, ICLR.
[4] Li-Rong Dai,et al. Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[5] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[6] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[7] Heiga Zen,et al. Parallel WaveNet: Fast High-Fidelity Speech Synthesis , 2017, ICML.
[8] Daniel Rueckert,et al. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Stefano Ermon,et al. Audio Super Resolution using Neural Networks , 2017, ICLR.
[10] A. Gray,et al. Distance measures for speech processing , 1976 .
[11] Navdeep Jaitly,et al. Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Junichi Yamagishi,et al. SUPERSEDED - CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit , 2016 .
[13] Minh N. Do,et al. Time-Frequency Networks for Audio Super-Resolution , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).