James Glass | Junichi Yamagishi | Shiyu Chang | Yang Zhang | Yung-Sung Chuang | Alexander H. Liu | Erica Cooper | Kaizhi Qian | David Cox | Cheng-I Jeff Lai | Yi-Lun Liao
[1] Alexei Baevski, et al. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations, 2020, NeurIPS.
[2] Tao Qin, et al. FastSpeech 2: Fast and High-Quality End-to-End Text to Speech, 2021, ICLR.
[3] Sanjeev Khudanpur, et al. Librispeech: An ASR corpus based on public domain audio books, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Erich Elsen, et al. The State of Sparsity in Deep Neural Networks, 2019, ArXiv.
[5] Ryan Prenger, et al. WaveGlow: A Flow-based Generative Network for Speech Synthesis, 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Ryuichi Yamamoto, et al. TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis, 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Michael Carbin, et al. The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks, 2018, ICLR.
[8] Shujie Liu, et al. Neural Speech Synthesis with Transformer Network, 2018, AAAI.
[9] Tao Qin, et al. A Survey on Neural Speech Synthesis, 2021, ArXiv.
[10] Jose Javier Gonzalez Ortiz, et al. What is the State of Neural Network Pruning?, 2020, MLSys.
[11] Chenjie Gu, et al. DDSP: Differentiable Digital Signal Processing, 2020, ICLR.
[12] Heiga Zen, et al. Statistical Parametric Speech Synthesis, 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[13] Ding Zhao, et al. Dynamic Sparsity Neural Networks for Automatic Speech Recognition, 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] Heiga Zen, et al. Parallel WaveNet: Fast High-Fidelity Speech Synthesis, 2017, ICML.
[15] Liang Qiao, et al. Optimizing Speech Recognition For The Edge, 2019, ArXiv.
[16] Heiga Zen, et al. WaveNet: A Generative Model for Raw Audio, 2016, SSW.
[17] Enhong Chen, et al. LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search, 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Navdeep Jaitly, et al. Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions, 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Xu Tan, et al. FastSpeech: Fast, Robust and Controllable Text to Speech, 2019, NeurIPS.
[20] Shuang Liang, et al. EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture, 2020, ICML.
[21] Wei Ping, et al. DiffWave: A Versatile Diffusion Model for Audio Synthesis, 2020, ICLR.
[22] Kainan Peng, et al. WaveFlow: A Compact Flow-based Model for Raw Audio, 2020, ICML.
[23] James Glass, et al. PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition, 2021, ArXiv.
[24] Wei Ping, et al. Non-Autoregressive Neural Text-to-Speech, 2020, ICML.
[25] Heiga Zen, et al. WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis, 2021, Interspeech.
[26] Yoshua Bengio, et al. MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis, 2019, NeurIPS.
[27] Erich Elsen, et al. Efficient Neural Audio Synthesis, 2018, ICML.
[28] Ryuichi Yamamoto, et al. Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram, 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] Shuang Liang, et al. Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow, 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] Wei Ping, et al. ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech, 2018, ICLR.
[31] Dong Yu, et al. Exploiting sparseness in deep neural networks for large vocabulary speech recognition, 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[32] Yiming Wang, et al. A Pruned RNNLM Lattice-Rescoring Algorithm for Automatic Speech Recognition, 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Charalampos Saitis, et al. Neural Waveshaping Synthesis, 2021, ArXiv.
[34] Jaehyeon Kim, et al. HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis, 2020, NeurIPS.
[35] Heiga Zen, et al. WaveGrad: Estimating Gradients for Waveform Generation, 2021, ICLR.
[36] Kurt Keutzer, et al. SqueezeWave: Extremely Lightweight Vocoders for On-device Speech Synthesis, 2020, ArXiv.