Jasper Snoek | Tim Salimans | Nal Kalchbrenner | Rianne van den Berg | Alexey A. Gritsenko
[1] Kainan Peng,et al. WaveFlow: A Compact Flow-based Model for Raw Audio , 2020, ICML.
[2] Yuichi Yoshida,et al. Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.
[3] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] DeLiang Wang,et al. A New Framework for CNN-Based Speech Enhancement in the Time Domain , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[5] Richard S. Zemel,et al. Generative Moment Matching Networks , 2015, ICML.
[6] Kenji Fukumizu,et al. Equivalence of distance-based and RKHS-based statistics in hypothesis testing , 2012, ArXiv.
[7] Cencheng Shen,et al. The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing , 2018, ArXiv.
[8] Samy Bengio,et al. Density estimation using Real NVP , 2016, ICLR.
[9] Erich Elsen,et al. High Fidelity Speech Synthesis with Adversarial Networks , 2019, ICLR.
[10] Yoshua Bengio,et al. MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis , 2019, NeurIPS.
[11] Erich Elsen,et al. Efficient Neural Audio Synthesis , 2018, ICML.
[12] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[13] Chong Wang,et al. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[14] Bernhard Schölkopf,et al. A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res.
[15] Heiga Zen,et al. Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[16] Wei Ping,et al. ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech , 2018, ICLR.
[17] Heiga Zen,et al. Parallel WaveNet: Fast High-Fidelity Speech Synthesis , 2017, ICML.
[18] Ryuichi Yamamoto,et al. Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Han Zhang,et al. Improving GANs Using Optimal Transport , 2018, ICLR.
[20] Xin Wang,et al. Neural Source-filter-based Waveform Model for Statistical Parametric Speech Synthesis , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Sungwon Kim,et al. FloWaveNet: A Generative Flow for Raw Audio , 2018, ICML.
[22] Max Welling,et al. Improved Variational Inference with Inverse Autoregressive Flow , 2016, NIPS 2016.
[23] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[24] Chenjie Gu,et al. DDSP: Differentiable Digital Signal Processing , 2020, ICLR.
[25] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.
[26] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[27] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[29] Zoubin Ghahramani,et al. Training generative neural networks via Maximum Mean Discrepancy optimization , 2015, UAI.
[30] Boris Ginsburg,et al. OpenSeq2Seq: Extensible Toolkit for Distributed and Mixed Precision Training of Sequence-to-Sequence Models , 2018, ArXiv.
[31] A. Raftery,et al. Strictly Proper Scoring Rules, Prediction, and Estimation , 2007.
[32] Steven Greenberg,et al. Robust speech recognition using the modulation spectrogram , 1998, Speech Commun.
[33] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.
[34] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.
[35] Ilya Sutskever,et al. Jukebox: A Generative Model for Music , 2020, ArXiv.
[36] Sebastian Nowozin,et al. DISCO Nets: DISsimilarity COefficients Networks , 2016, NIPS.
[37] Ryan Prenger,et al. WaveGlow: A Flow-based Generative Network for Speech Synthesis , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).