Sungwon Kim | Sungroh Yoon | Jaehyeon Kim | Jungil Kong
[1] Shuang Liang, et al. Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow, 2020, ICASSP 2020.
[2] Sercan Ömer Arik, et al. Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning, 2017, ICLR.
[3] Renjie Zheng, et al. Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework, 2020, EMNLP.
[4] Jürgen Schmidhuber, et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, 2006, ICML.
[5] Max Welling, et al. Emerging Convolutions for Generative Normalizing Flows, 2019, ICML.
[6] Yoshua Bengio, et al. NICE: Non-linear Independent Components Estimation, 2014, ICLR.
[7] Sungwon Kim, et al. FloWaveNet: A Generative Flow for Raw Audio, 2018, ICML.
[8] Heiga Zen, et al. WaveNet: A Generative Model for Raw Audio, 2016, SSW.
[9] Navdeep Jaitly, et al. Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions, 2017, ICASSP 2018.
[10] Heiga Zen, et al. Parallel WaveNet: Fast High-Fidelity Speech Synthesis, 2017, ICML.
[11] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proceedings of the IEEE.
[12] Alexander M. Rush, et al. Sequence-Level Knowledge Distillation, 2016, EMNLP.
[13] Ryan Prenger, et al. WaveGlow: A Flow-based Generative Network for Speech Synthesis, 2018, ICASSP 2019.
[14] Yuxuan Wang, et al. Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron, 2018, ICML.
[15] Xu Tan, et al. FastSpeech: Fast, Robust and Controllable Text to Speech, 2019, NeurIPS.
[16] Patrick Nguyen, et al. Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis, 2018, NeurIPS.
[17] Heiga Zen, et al. LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech, 2019, INTERSPEECH.
[18] Victor O. K. Li, et al. Non-Autoregressive Neural Machine Translation, 2017, ICLR.
[19] Prafulla Dhariwal, et al. Glow: Generative Flow with Invertible 1x1 Convolutions, 2018, NeurIPS.
[20] Soroosh Mariooryad, et al. Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis, 2020, ICASSP 2020.
[21] Samy Bengio, et al. Density estimation using Real NVP, 2016, ICLR.
[22] Erich Elsen, et al. Efficient Neural Audio Synthesis, 2018, ICML.
[23] Ryan Prenger, et al. Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis, 2020, ICLR.
[24] Iain Murray, et al. Neural Spline Flows, 2019, NeurIPS.
[25] Oriol Vinyals, et al. Neural Discrete Representation Learning, 2017, NIPS.
[26] Heiga Zen, et al. Speech Synthesis Based on Hidden Markov Models, 2013, Proceedings of the IEEE.
[27] Yuxuan Wang, et al. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis, 2018, ICML.
[28] Shujie Liu, et al. Neural Speech Synthesis with Transformer Network, 2018, AAAI.
[29] Sercan Ömer Arik, et al. Deep Voice 2: Multi-Speaker Neural Text-to-Speech, 2017, NIPS.
[30] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[31] Zhao Song, et al. Parallel Neural Text-to-Speech, 2019, ArXiv.
[32] Lukasz Kaiser, et al. Attention Is All You Need, 2017, NIPS.
[33] Joan Serra, et al. Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion, 2019, NeurIPS.
[34] Ashish Vaswani, et al. Self-Attention with Relative Position Representations, 2018, NAACL.
[35] Wei Ping, et al. Non-Autoregressive Neural Text-to-Speech, 2020, ICML.