DNN-Based Calibrated-Filter Models for Speech Enhancement

[1]  Kohei Yamashita,et al.  Spectral subtraction iterated with weighting factors , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[2]  Kah-Chye Tan,et al.  Postprocessing method for suppressing musical noise generated by spectral subtraction , 1998, IEEE Trans. Speech Audio Process..

[3]  Kurt S. Riedel,et al.  Minimum bias multiple taper spectral estimation , 2018, IEEE Trans. Signal Process..

[4]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[5]  DeLiang Wang,et al.  Complex Ratio Masking for Monaural Speech Separation , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[7]  Nam Soo Kim,et al.  NMF-Based Speech Enhancement Using Bases Update , 2015, IEEE Signal Processing Letters.

[8]  Changchun Bao,et al.  Wiener filtering based speech enhancement with Weighted Denoising Auto-encoder and noise classification , 2014, Speech Commun..

[9]  Nam Soo Kim,et al.  NMF-based speech enhancement incorporating deep neural network , 2014, INTERSPEECH.

[10]  Sofia Ben Jebara A Perceptual Approach to Reduce Musical Noise Phenomenon with Wiener Denoising Technique , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Sofia Ben Jebara,et al.  Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds , 2007, INTERSPEECH.

[13]  Tuomas Virtanen,et al.  Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Yi Hu,et al.  Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[15]  T. Hasan,et al.  Iterative noise power subtraction technique for improved speech quality , 2008, 2008 International Conference on Electrical and Computer Engineering.

[16]  Sven Nordholm,et al.  Spectral subtraction using reduced delay convolution and adaptive averaging , 2001, IEEE Trans. Speech Audio Process..

[17]  Jonathan Le Roux,et al.  Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Thomas Esch,et al.  Efficient musical noise suppression for speech enhancement system , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  DeLiang Wang,et al.  Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20]  Kiyohiro Shikano,et al.  Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Ying Zhang,et al.  Attention-Based Multi-NMF Deep Neural Network with Multimodality Data for Breast Cancer Prognosis Model , 2019, BioMed research international.

[22]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[23]  D. Thomson,et al.  Spectrum estimation and harmonic analysis , 1982, Proceedings of the IEEE.

[24]  Philipos C. Loizou,et al.  A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[26]  Simona Halunga,et al.  Nonlinear spectral subtraction method for colored noise reduction using multi-band Bark scale , 2008, Signal Process..

[27]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[28]  Nancy Bertin,et al.  Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[29]  Kiyohiro Shikano,et al.  Automatic optimization scheme of spectral subtraction based on musical noise assessment via higher-order statistics , 2008 .

[30]  Wei-Ping Zhu,et al.  NMF-based speech enhancement using multitaper spectrum estimation , 2018, 2018 International Conference on Signals and Systems (ICSigSys).

[31]  Sven Nordholm,et al.  Bayesian noise estimation in the modulation domain , 2018, Speech Commun..

[32]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[33]  Eric Plourde,et al.  Auditory-Based Spectral Amplitude Estimators for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[34]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[35]  Hanwook Chung,et al.  Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement , 2017, Speech Commun..

[36]  Antonio Bonafonte,et al.  SEGAN: Speech Enhancement Generative Adversarial Network , 2017, INTERSPEECH.

[37]  Sven Nordholm,et al.  Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement , 2013, Speech Commun..

[38]  Li-Rong Dai,et al.  A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[39]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[40]  Sheng Li,et al.  Iterative spectral subtraction method for millimeter-wave conducted speech enhancement , 2010 .

[41]  Mark D. Plumbley,et al.  Assessment of musical noise using localization of isolated peaks in time-frequency domain , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[42]  Mohsen Rahmani,et al.  An objective measure for the musical noise assessment in noise reduction systems , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[43]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[44]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[45]  Kiyohiro Shikano,et al.  Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[46]  Robert B. Dunn,et al.  Speech enhancement based on auditory spectral change , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[47]  Paris Smaragdis,et al.  Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[48]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[49]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[50]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[51]  Richard C. Hendriks,et al.  Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[52]  DeLiang Wang,et al.  Ideal ratio mask estimation using deep neural networks for robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.