SEDA: A tunable Q-factor wavelet-based noise reduction algorithm for multi-talker babble

We introduce a new wavelet-based algorithm to enhance the quality of speech corrupted by multi-talker babble noise. The algorithm comprises three stages: The first stage classifies short frames of the noisy speech as speech-dominated or noise-dominated. We design this classifier specifically for multi-talker babble noise. The second stage performs preliminary de-nosing of noisy speech frames using oversampled wavelet transforms and parallel group thresholding. The final stage performs further denoising by attenuating residual high frequency components in the signal produced by the second stage. A significant improvement in intelligibility and quality was observed in evaluation tests of the algorithm with cochlear implant users.

[1]  Bruce J. Gantz,et al.  United States multicenter clinical trial of the cochlear nucleus hybrid implant system , 2015, The Laryngoscope.

[2]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[3]  Ivan W. Selesnick,et al.  Wavelet Transform With Tunable Q-Factor , 2011, IEEE Transactions on Signal Processing.

[4]  W. M. Rabinowitz,et al.  Standardization of a test of speech perception in noise. , 1979, Journal of speech and hearing research.

[5]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[6]  S. Soli,et al.  Development of the Hearing in Noise Test for the measurement of speech reception thresholds in quiet and in noise. , 1994, The Journal of the Acoustical Society of America.

[7]  Robert C. Bilger,et al.  Standardization of a Test of Speech Perception in Noise , 1984 .

[8]  Ivan W. Selesnick,et al.  Sparse signal representations using the tunable Q-factor wavelet transform , 2011, Optical Engineering + Applications.

[9]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[10]  Yi Hu,et al.  Subspace algorithms for noise reduction in cochlear implants. , 2005, The Journal of the Acoustical Society of America.

[11]  Yi Hu,et al.  Use of a sigmoidal-shaped function for noise attenuation in cochlear implants. , 2007, The Journal of the Acoustical Society of America.

[12]  Stefan J. Mauger,et al.  Cochlear implant optimized noise reduction , 2012, Journal of neural engineering.

[13]  E. Domico,et al.  Speech Recognition in Background Noise of Cochlear Implant Patients , 2002, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[14]  David G. Stork,et al.  Pattern Classification , 1973 .

[15]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[16]  Ruth Bentler,et al.  Digital Noise Reduction: An Overview , 2006, Trends in amplification.

[17]  DeLiang Wang,et al.  An algorithm to improve speech recognition in noise for hearing-impaired listeners. , 2013, The Journal of the Acoustical Society of America.

[18]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[19]  A. Lobo,et al.  Subspace and envelope subtraction algorithms for noise reduction in cochlear implants , 2003, Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439).

[20]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[21]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[22]  Stefan J. Mauger,et al.  Clinical Evaluation of Signal-to-Noise Ratio–Based Noise Reduction in Nucleus® Cochlear Implant Recipients , 2011, Ear and hearing.

[23]  Q. Fu,et al.  Spectral subtraction-based speech enhancement for cochlear implant patients in background noise. , 2005, The Journal of the Acoustical Society of America.

[24]  H.H. Yue,et al.  Weighted principal component analysis and its applications to improve FDC performance , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[25]  John H. L. Hansen,et al.  Babble Noise: Modeling, Analysis, and Applications , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Philipos C Loizou,et al.  Use of S-Shaped Input-Output Functions for Noise Suppression in Cochlear Implants , 2007, Ear and hearing.

[27]  Fan-Gang Zeng,et al.  Utilizing advanced hearing aid technologies as pre-processors to enhance cochlear implant performance , 2004, Cochlear implants international.

[28]  H Rudert,et al.  Effects of noise on speech discrimination in cochlear implant patients. , 1995, The Annals of otology, rhinology & laryngology. Supplement.

[29]  Jessica J. M. Monaghan,et al.  Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users , 2017, Hearing Research.

[30]  T L Wiley,et al.  Word recognition performance in various background competitors. , 1997, Journal of the American Academy of Audiology.

[31]  Douglas A. Reynolds Gaussian Mixture Models , 2009, Encyclopedia of Biometrics.

[32]  M. Wallace,et al.  Individual differences in the multisensory temporal binding window predict susceptibility to audiovisual illusions. , 2012, Journal of experimental psychology. Human perception and performance.

[33]  D. Rom A sequentially rejective test procedure based on a modified Bonferroni inequality , 1990 .

[34]  M. Davies,et al.  Endovascular treatment of tracheoinnominate artery fistula: a case report. , 2006, Vascular and endovascular surgery.

[35]  Jiawei Han,et al.  Generalized Fisher Score for Feature Selection , 2011, UAI.

[36]  Ivan W. Selesnick A new sparsity-enabled signal separation method based on signal resonance , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[37]  David R Friedland,et al.  Case-control analysis of cochlear implant performance in elderly patients. , 2010, Archives of otolaryngology--head & neck surgery.

[38]  Oldooz Hazrati,et al.  Blind binary masking for reverberation suppression in cochlear implants. , 2013, The Journal of the Acoustical Society of America.

[39]  Sigfrid D. Soli,et al.  Development of the Hearing In Noise Test (HINT) in Spanish , 2002 .

[40]  Sperry Jl,et al.  Word Recognition Performance in Various Background Competitors , 1997 .

[41]  Pam W. Dawson,et al.  A Wavelet-Based Noise Reduction Algorithm and Its Clinical Evaluation in Cochlear Implants , 2013, PloS one.

[42]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[43]  Christine Brenner,et al.  Postlingual adult performance in noise with HiRes 120 and ClearVoice Low, Medium, and High , 2013, Cochlear implants international.