Noise reduction in speech signals using a cochlear model

Smart systems and artificial intelligence technology are becoming increasingly popular and are continuously finding more applications in real-life situations. Many systems require human-computer interaction and the natural language interface. One of the major issues in speech recognition systems is their performance in real world (noisy) environment. Over the past decades many techniques for noise reduction were developed. Motivated by human auditory processing, and it is well known that humans are remarkably good at detecting speech in the background noise, we propose a noise reduction technique based on a biophysical cochlear model. Using a model of signal reconstruction from the cochlear output, we observed an improvement in the quality of noisy speech and a significant increase in speech recognition performance.

[1]  F Mammano,et al.  Biophysics of the cochlea. II: Stationary nonlinear phenomenology. , 1996, The Journal of the Acoustical Society of America.

[2]  Peter Vary,et al.  Noise suppression by spectral magnitude estimation —mechanism and theoretical limits— , 1985 .

[3]  Mladen Russo,et al.  Biophysical Cochlear Model: Time-Frequency Analysis and Signal Reconstruction , 2011 .

[4]  Richard Lippmann,et al.  Speech recognition by machines and humans , 1997, Speech Commun..

[5]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Ehud Weinstein,et al.  Iterative and sequential Kalman filter-based speech enhancement algorithms , 1998, IEEE Trans. Speech Audio Process..

[7]  Jacob Benesty,et al.  New insights into the noise reduction Wiener filter , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Bernard Widrow,et al.  Adaptive Signal Processing , 1985 .

[9]  F. Mammano,et al.  Biophysics of the cochlea: linear approximation. , 1993, The Journal of the Acoustical Society of America.

[10]  Hamid Sheikhzadeh,et al.  HMM-based strategies for enhancement of speech signals embedded in nonstationary noise , 1998, IEEE Trans. Speech Audio Process..

[11]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[12]  Biing-Hwang Juang,et al.  On the application of hidden Markov models for enhancing noisy speech , 1989, IEEE Trans. Acoust. Speech Signal Process..

[13]  Biing-Hwang Juang,et al.  Auditory perception and cognition , 2008, IEEE Signal Processing Magazine.

[14]  Y. Ephraim,et al.  Extension of the signal subspace speech enhancement approach to colored noise , 2003, IEEE Signal Processing Letters.

[15]  Hendrikus Duifhuis,et al.  Modelling an Active, Nonlinear Cochlea , 1983 .

[16]  Renato Nobili,et al.  Otoacoustic Emissions from Residual Oscillations of the Cochlear Basilar Membrane in a Human Ear Model , 2003, Journal of the Association for Research in Otolaryngology.

[17]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..