A Recursive Network with Dynamic Attention for Monaural Speech Enhancement
暂无分享,去创建一个
Xiaodong Li | Renhua Peng | Cunhang Fan | Andong Li | Chengshi Zheng | C. Zheng | Xiaodong Li | Renhua Peng | Andong Li | Cunhang Fan
[1] N. Kraus,et al. A dynamic auditory-cognitive system supports speech-in-noise perception in older adults , 2013, Hearing Research.
[2] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[3] DeLiang Wang,et al. On Training Targets for Supervised Speech Separation , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[4] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .
[5] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[6] Seunghoon Hong,et al. Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[7] Roberto Cipolla,et al. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling , 2015, CVPR 2015.
[8] Loïc Le Folgoc,et al. Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.
[9] Lee M. Miller,et al. Auditory attentional control and selection during cocktail party listening. , 2010, Cerebral cortex.
[10] Yanpeng Li,et al. Improving deep neural networks using softplus units , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).
[11] Xiaodong Li,et al. Speech enhancement using progressive learning-based convolutional recurrent neural network , 2020 .
[12] Shadi Pirhosseinloo,et al. Monaural Speech Enhancement with Dilated Convolutions , 2019, INTERSPEECH.
[13] Jun Du,et al. SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement , 2016, INTERSPEECH.
[14] M. R. Jones,et al. Time, our lost dimension: toward a new theory of perception, attention, and memory. , 1976, Psychological review.
[15] DeLiang Wang,et al. Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises. , 2016, The Journal of the Acoustical Society of America.
[16] DeLiang Wang,et al. A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement , 2018, INTERSPEECH.
[17] Jonathan G. Fiscus,et al. DARPA TIMIT:: acoustic-phonetic continuous speech corpus CD-ROM, NIST speech disc 1-1.1 , 1993 .
[18] Xiaodong Li,et al. Convolutional Recurrent Neural Network Based Progressive Learning for Monaural Speech Enhancement , 2019, ArXiv.
[19] Sepp Hochreiter,et al. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.
[20] Christopher Joseph Pal,et al. Delving Deeper into Convolutional Networks for Learning Video Representations , 2015, ICLR.
[21] D. Markle,et al. Hearing Aids , 1936, The Journal of Laryngology & Otology.
[23] Yu Tsao,et al. Complex spectrogram enhancement by convolutional neural network with multi-metrics learning , 2017, 2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP).
[24] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[25] DeLiang Wang,et al. Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[26] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[27] Mounya Elhilali,et al. Modelling auditory attention , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.
[28] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[29] DeLiang Wang,et al. TCNN: Temporal Convolutional Neural Network for Real-time Speech Enhancement in the Time Domain , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] Xiaodong Li,et al. A Time-domain Monaural Speech Enhancement with Recursive Learning , 2020, ArXiv.
[31] Jesper Jensen,et al. A short-time objective intelligibility measure for time-frequency weighted noisy speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[32] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[33] DeLiang Wang,et al. Gated Residual Networks With Dilated Convolutions for Monaural Speech Enhancement , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.