A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Jean-Marc Valin | Karim Helwani | Umut Isik | Arvindh Krishnaswamy | Neerad Phansalkar | Ritwik Giri
[1] Methods for subjective determination of transmission quality , 2022 .
[2] A. Spanias,et al. Perceptual coding of digital audio , 2000, Proceedings of the IEEE.
[3] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .
[4] Subjective evaluation of speech quality with a crowdsourcing approach , 2022 .
[5] DeLiang Wang,et al. A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement , 2018, INTERSPEECH.
[6] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[7] Brian C J Moore,et al. Asymmetry of masking between complex tones and noise: partial loudness. , 2003, The Journal of the Acoustical Society of America.
[8] Jean-Marc Valin,et al. A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement , 2017, 2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP).
[9] Hakan Erdogan,et al. Investigations on Data Augmentation and Loss Functions for Deep Learning Based Speech-Background Separation , 2018, INTERSPEECH.
[10] Xavier Serra,et al. A Wavenet for Speech Denoising , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Paris Smaragdis,et al. Experiments on deep learning for speech denoising , 2014, INTERSPEECH.
[12] Xin Wang,et al. Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech , 2016 .
[13] Tillman Weyde,et al. Improved Speech Enhancement with the Wave-U-Net , 2018, ArXiv.
[14] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[15] John Princen,et al. Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..
[16] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..
[17] Junichi Yamagishi,et al. Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech , 2016, SSW.
[18] Sebastian Braun,et al. Weighted Speech Distortion Losses for Neural-Network-Based Real-Time Speech Enhancement , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Allen Gersho,et al. Adaptive postfiltering for quality enhancement of coded speech , 1995, IEEE Trans. Speech Audio Process..
[20] Antonio Bonafonte,et al. SEGAN: Speech Enhancement Generative Adversarial Network , 2017, INTERSPEECH.
[21] Tao Zhang,et al. DNN-based enhancement of noisy and reverberant speech , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Koen Vos,et al. Voice Coding with Opus , 2013 .
[23] DeLiang Wang,et al. Ideal ratio mask estimation using deep neural networks for robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[24] E. Owens,et al. An Introduction to the Psychology of Hearing , 1997 .
[25] Johannes Gehrke,et al. The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results , 2020, INTERSPEECH.
[26] DeLiang Wang,et al. Complex Ratio Masking for Monaural Speech Separation , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[27] David Talkin,et al. A Robust Algorithm for Pitch Tracking (RAPT) , 2005 .
[28] Tao Zhang,et al. Late Reverberation Suppression Using Recurrent Neural Networks with Long Short-Term Memory , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).