论文信息 - DNN-Based Calibrated-Filter Models for Speech Enhancement - 字舞流文

DNN-Based Calibrated-Filter Models for Speech Enhancement

Benoit Champagne | Yazid Attabi | Wei-Ping Zhu

[1] Kohei Yamashita,et al. Spectral subtraction iterated with weighting factors , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[2] Kah-Chye Tan,et al. Postprocessing method for suppressing musical noise generated by spectral subtraction , 1998, IEEE Trans. Speech Audio Process..

[3] Kurt S. Riedel,et al. Minimum bias multiple taper spectral estimation , 2018, IEEE Trans. Signal Process..

[4] Herman J. M. Steeneken,et al. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[5] DeLiang Wang,et al. Complex Ratio Masking for Monaural Speech Separation , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6] Yi Hu,et al. A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[7] Nam Soo Kim,et al. NMF-Based Speech Enhancement Using Bases Update , 2015, IEEE Signal Processing Letters.

[8] Changchun Bao,et al. Wiener filtering based speech enhancement with Weighted Denoising Auto-encoder and noise classification , 2014, Speech Commun..

[9] Nam Soo Kim,et al. NMF-based speech enhancement incorporating deep neural network , 2014, INTERSPEECH.

[10] Sofia Ben Jebara. A Perceptual Approach to Reduce Musical Noise Phenomenon with Wiener Denoising Technique , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11] Jesper Jensen,et al. An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[12] Sofia Ben Jebara,et al. Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds , 2007, INTERSPEECH.

[13] Tuomas Virtanen,et al. Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[14] Yi Hu,et al. Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[15] T. Hasan,et al. Iterative noise power subtraction technique for improved speech quality , 2008, 2008 International Conference on Electrical and Computer Engineering.

[16] Sven Nordholm,et al. Spectral subtraction using reduced delay convolution and adaptive averaging , 2001, IEEE Trans. Speech Audio Process..

[17] Jonathan Le Roux,et al. Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18] Thomas Esch,et al. Efficient musical noise suppression for speech enhancement system , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19] DeLiang Wang,et al. Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20] Kiyohiro Shikano,et al. Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21] Ying Zhang,et al. Attention-Based Multi-NMF Deep Neural Network with Multimodality Data for Breast Cancer Prognosis Model , 2019, BioMed research international.

[22] Pascal Scalart,et al. Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[23] D. Thomson,et al. Spectrum estimation and harmonic analysis , 1982, Proceedings of the IEEE.

[24] Philipos C. Loizou,et al. A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[26] Simona Halunga,et al. Nonlinear spectral subtraction method for colored noise reduction using multi-band Bark scale , 2008, Signal Process..

[27] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .

[28] Nancy Bertin,et al. Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[29] Kiyohiro Shikano,et al. Automatic optimization scheme of spectral subtraction based on musical noise assessment via higher-order statistics , 2008 .

[30] Wei-Ping Zhu,et al. NMF-based speech enhancement using multitaper spectrum estimation , 2018, 2018 International Conference on Signals and Systems (ICSigSys).

[31] Sven Nordholm,et al. Bayesian noise estimation in the modulation domain , 2018, Speech Commun..

[32] A.V. Oppenheim,et al. Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[33] Eric Plourde,et al. Auditory-Based Spectral Amplitude Estimators for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[34] Nathalie Virag,et al. Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[35] Hanwook Chung,et al. Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement , 2017, Speech Commun..

[36] Antonio Bonafonte,et al. SEGAN: Speech Enhancement Generative Adversarial Network , 2017, INTERSPEECH.

[37] Sven Nordholm,et al. Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement , 2013, Speech Commun..

[38] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[39] Olivier Cappé,et al. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[40] Sheng Li,et al. Iterative spectral subtraction method for millimeter-wave conducted speech enhancement , 2010 .

[41] Mark D. Plumbley,et al. Assessment of musical noise using localization of isolated peaks in time-frequency domain , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[42] Mohsen Rahmani,et al. An objective measure for the musical noise assessment in noise reduction systems , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[43] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[44] Rémi Gribonval,et al. Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[45] Kiyohiro Shikano,et al. Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[46] Robert B. Dunn,et al. Speech enhancement based on auditory spectral change , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[47] Paris Smaragdis,et al. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[48] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[49] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[50] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[51] Richard C. Hendriks,et al. Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[52] DeLiang Wang,et al. Ideal ratio mask estimation using deep neural networks for robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.