Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms
暂无分享,去创建一个
Chengshi Zheng | Renhua Peng | Andong Li | Yuxuan Ke | Xiaodong Li | C. Zheng | Xiaodong Li | Renhua Peng | Andong Li | Yuxuan Ke
[1] Kuldip K. Paliwal,et al. The importance of phase in speech enhancement , 2011, Speech Commun..
[2] Jean-Marc Valin,et al. A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech , 2020, INTERSPEECH.
[3] J. Hess,et al. Analysis of variance , 2018, Transfusion.
[4] Changchun Bao,et al. Speech enhancement methods based on binaural cue coding , 2019 .
[5] Philipos C. Loizou,et al. A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..
[6] Yi Hu,et al. Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.
[7] DeLiang Wang,et al. Gated Residual Networks With Dilated Convolutions for Monaural Speech Enhancement , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[8] DeLiang Wang,et al. Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises. , 2016, The Journal of the Acoustical Society of America.
[9] Jonathan G. Fiscus,et al. DARPA TIMIT:: acoustic-phonetic continuous speech corpus CD-ROM, NIST speech disc 1-1.1 , 1993 .
[10] Ke Tan,et al. Complex Spectral Mapping with a Convolutional Recurrent Network for Monaural Speech Enhancement , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] B. Atal,et al. Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .
[12] Xiaodong Li,et al. A Recursive Network with Dynamic Attention for Monaural Speech Enhancement , 2020, INTERSPEECH.
[13] Angel Manuel Gomez,et al. A Deep Learning Loss Function Based on the Perceptual Evaluation of the Speech Quality , 2018, IEEE Signal Processing Letters.
[14] DeLiang Wang,et al. Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[15] On reliability of log-spectral distortion measure in speech quality estimation , 2017, 2017 IEEE 4th International Conference Actual Problems of Unmanned Aerial Vehicles Developments (APUAVD).
[16] Simon J. Godsill,et al. Efficient Alternatives to the Ephraim and Malah Suppression Rule for Audio Signal Enhancement , 2003, EURASIP J. Adv. Signal Process..
[17] Jacob Benesty,et al. Nonlinear Kronecker product filtering for multichannel noise reduction , 2019, Speech Commun..
[18] Nils L. Westhausen,et al. Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression , 2020, INTERSPEECH.
[19] Deepen Sinha,et al. Low bit rate transparent audio compression using adapted wavelets , 1993, IEEE Trans. Signal Process..
[20] B. Atal,et al. Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .
[21] DeLiang Wang,et al. Real-time Speech Enhancement Using an Efficient Convolutional Recurrent Network for Dual-microphone Mobile Phones in Close-talk Scenarios , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Richard C. Hendriks,et al. Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[23] DeLiang Wang,et al. Complex Ratio Masking for Monaural Speech Separation , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[24] Jacob Benesty,et al. Quadratic approach for single-channel noise reduction , 2020, EURASIP J. Audio Speech Music. Process..
[25] DeLiang Wang,et al. Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech Enhancement , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[26] Andries P. Hekstra,et al. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[27] I. Cohen. Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator , 2002, IEEE Signal Processing Letters.
[28] Nathalie Virag,et al. Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..
[29] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[30] DeLiang Wang,et al. On Training Targets for Supervised Speech Separation , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[31] James D. Johnston,et al. Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..
[32] Hilde van der Togt,et al. Publisher's Note , 2003, J. Netw. Comput. Appl..
[33] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .
[34] Jesper Jensen,et al. MMSE based noise PSD tracking with low complexity , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[35] DeLiang Wang,et al. A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement , 2018, INTERSPEECH.
[36] I. Cohen,et al. Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.
[37] Radu Horaud,et al. Online Monaural Speech Enhancement Using Delayed Subband LSTM , 2020, INTERSPEECH.
[38] Kuldip K. Paliwal,et al. Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator , 2012, Speech Commun..
[39] P. Mahadevan,et al. An overview , 2007, Journal of Biosciences.
[40] Paris Smaragdis,et al. Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments , 2012, INTERSPEECH.
[41] Rainer Martin,et al. Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..
[42] Lei Xie,et al. DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement , 2020, INTERSPEECH.
[43] Yu Tsao,et al. Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality , 2019, IEEE Signal Processing Letters.
[44] DeLiang Wang,et al. Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in The Time Domain , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[45] Israel Cohen,et al. Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..
[46] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[47] DeLiang Wang,et al. Two-Stage Deep Learning for Noisy-Reverberant Speech Enhancement , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[48] Xiaodong Li,et al. Speech enhancement using progressive learning-based convolutional recurrent neural network , 2020 .