Evaluation of model-based versus non-parametric monaural noise-reduction approaches for hearing aids

Abstract Objective: Single channel noise reduction has been well investigated and seems to have reached its limits in terms of speech intelligibility improvement, however, the quality of such schemes can still be advanced. This study tests to what extent novel model-based processing schemes might improve performance in particular for non-stationary noise conditions. Design: Two prototype model-based algorithms, a speech-model-based, and a auditory-model-based algorithm were compared to a state-of-the-art non-parametric minimum statistics algorithm. A speech intelligibility test, preference rating, and listening effort scaling were performed. Additionally, three objective quality measures for the signal, background, and overall distortions were applied. For a better comparison of all algorithms, particular attention was given to the usage of the similar Wiener-based gain rule. Study sample: The perceptual investigation was performed with fourteen hearing-impaired subjects. Results: The results revealed that the non-parametric algorithm and the auditory model-based algorithm did not affect speech intelligibility, whereas the speech-model-based algorithm slightly decreased intelligibility. In terms of subjective quality, both model-based algorithms perform better than the unprocessed condition and the reference in particular for highly non-stationary noise environments. Conclusion: Data support the hypothesis that model-based algorithms are promising for improving performance in non-stationary noise conditions.

[1]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[2]  Monique Boymans,et al.  Field Trials Using a Digital Hearing Aid with Active Noise Reduction and Dual-Microphone Directionality: Estudios de campo utilizando un audifono digital con reduccion activa del ruido y micrófono de direccionalidad dual , 2000, Audiology : official organ of the International Society of Audiology.

[3]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[4]  Marc Moonen,et al.  Evaluation of signal enhancement algorithms for hearing instruments , 2008, 2008 16th European Signal Processing Conference.

[5]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[6]  Hans Werner Strube,et al.  Recognition of isolated words based on psychoacoustics and neurobiology , 1990, Speech Commun..

[7]  Emmanuel Vincent,et al.  Subjective and Objective Quality Assessment of Audio Source Separation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Philipos C. Loizou,et al.  A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..

[9]  Henning Puder,et al.  Integrating recursive minimum tracking and codebook-based noise estimation for improved reduction of non-stationary noise , 2012, Signal Process..

[10]  Birger Kollmeier,et al.  Speech pause detection for noise spectrum estimation by tracking power envelope dynamics , 2002, IEEE Trans. Speech Audio Process..

[11]  Birger Kollmeier,et al.  SNR estimation based on amplitude modulation analysis with applications to noise suppression , 2003, IEEE Trans. Speech Audio Process..

[12]  Volker Hohmann,et al.  Single-channel noise suppression based on a statistical source-model for speech , 2007 .

[13]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[14]  Yang Lu,et al.  An algorithm that improves speech intelligibility in noise for normal-hearing listeners. , 2009, The Journal of the Acoustical Society of America.

[15]  Henning Puder,et al.  Improving Robustness of Codebook-Based Noise Estimation Approaches With Delta Codebooks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  B. Kollmeier,et al.  Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction. , 1994, The Journal of the Acoustical Society of America.

[17]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[18]  Antony William Rix,et al.  Perceptual evaluation of speech quality (PESQ): The new ITU standard for end-to-end speech quality a , 2002 .

[19]  Fei Xie,et al.  A comparative study of speech detection methods , 1997, EUROSPEECH.

[20]  W. Bastiaan Kleijn,et al.  Codebook-Based Bayesian Speech Enhancement for Nonstationary Environments , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[22]  W. Bastiaan Kleijn,et al.  Estimation of the short-term predictor parameters of speech under noisy conditions , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Rainer Martin,et al.  A Noise Reduction Preprocessor for Mobile Voice Communication , 2004, EURASIP J. Adv. Signal Process..

[24]  Ruth Bentler,et al.  Digital Noise Reduction: An Overview , 2006, Trends in amplification.

[25]  Dafydd Gibbon,et al.  EUROM - a spoken language resource for the EU - the SAM projects , 1995, EUROSPEECH.

[26]  R. Gribonval,et al.  Proposals for Performance Measurement in Source Separation , 2003 .

[27]  Volker Hohmann,et al.  Sub-band SNR estimation using auditory feature processing , 2003, Speech Commun..

[28]  W. Dreschler,et al.  ICRA noises: artificial noise signals with speech-like spectral and temporal properties for hearing instrument assessment. International Collegium for Rehabilitative Audiology. , 2001, Audiology : official organ of the International Society of Audiology.

[29]  Philipos C. Loizou,et al.  Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Wouter A Dreschler,et al.  Modeling speech intelligibility in quiet and noise in listeners with normal and impaired hearing. , 2010, The Journal of the Acoustical Society of America.

[31]  W. Bastiaan Kleijn,et al.  Codebook driven short-term predictor parameter estimation for speech enhancement , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[32]  Yariv Ephraim,et al.  Statistical-model-based speech enhancement systems , 1992, Proc. IEEE.

[33]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[34]  Jae Lim,et al.  Signal estimation from modified short-time Fourier transform , 1984 .

[35]  Rainer Martin,et al.  NOISE POWER SPECTRAL DENSITY ESTIMATION ON HIGHLY CORRELATED DATA , 2006 .

[36]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[37]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[38]  Giso Grimm,et al.  The master hearing Aid : A PC-based platform for algorithm development and evaluation , 2006 .

[39]  Hans-Günter Hirsch,et al.  Noise estimation techniques for robust speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[40]  M. Davies,et al.  Endovascular treatment of tracheoinnominate artery fistula: a case report. , 2006, Vascular and endovascular surgery.

[41]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[42]  Tammo Houtgast,et al.  Recognition of digits in different types of noise by normal-hearing and hearing-impaired listeners , 2007, International journal of audiology.

[43]  George S. Kang,et al.  Quality improvement of LPC-processed noisy speech by using spectral subtraction , 1989, IEEE Trans. Acoust. Speech Signal Process..

[44]  Giso Grimm,et al.  Multicenter evaluation of signal enhancement algorithms for hearing aids. , 2010, The Journal of the Acoustical Society of America.

[45]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..

[46]  Ian T. Nabney,et al.  Netlab: Algorithms for Pattern Recognition , 2002 .

[47]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[48]  R. Plomp Auditory handicap of hearing impairment and the limited benefit of hearing aids , 1977 .

[49]  J. Bortz,et al.  Verteilungsfreie Methoden in der Biostatistik , 1982 .

[50]  Rainer Martin,et al.  A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[51]  G. Camp,et al.  The Complexity of Age-Related Hearing Impairment: Contributing Environmental and Genetic Factors , 2007 .

[52]  Jae S. Lim,et al.  Speech enhancement , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[53]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[54]  Rainer Martin,et al.  Bias compensation methods for minimum statistics noise power spectral density estimation , 2006, Signal Process..

[55]  Thomas Rohdenburg,et al.  Development and objective perceptual quality assessment of monaural and binaural noise reduction schemes for hearing aids , 2008 .

[56]  R Plomp,et al.  Auditory handicap of hearing impairment and the limited benefit of hearing aids. , 1978, The Journal of the Acoustical Society of America.

[57]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[58]  Yi Hu,et al.  A comparative intelligibility study of single-microphone noise reduction algorithms. , 2007, The Journal of the Acoustical Society of America.

[59]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..