Incorporating Auditory Masking Properties for Speech Enhancement in presence of Near-end Noise

mobile devices, perceived speech signal degrades significantly in the presence of background noise as it reaches directly at the listener"s ears. There is a need to improve the intelligibility and quality of the received speech signal in noisy environments by incorporating speech enhancement algorithms. This paper focuses on speech enhancement method including auditory masking properties of the human ear to improve the intelligibility and quality of the speech signal in the presence of near-end noise. Implemented by dynamically enhancing the speech signal when the near-end noise dominates. Intelligibility and quality of enhanced speech signal are measured using SII and PESQ. Experimental results show improvement in the intelligibility and quality of the enhanced speech signal with the proposed approach over the unprocessed speech signal. This particular approach is far more efficient in overcoming the degradation of speech signals in noisy environments. KeywordsMasking, Near-end noise, Speech enhancement, Speech

[1]  Tom E. Bishop,et al.  Blind Image Restoration Using a Block-Stationary Signal Model , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[3]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[4]  B. V. Uma,et al.  Low Complexity Speech Enhancement Algorithm for Improved Perception in Mobile Devices , 2013 .

[5]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[6]  Eliathamby Ambikairajah,et al.  FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT , 2010 .

[7]  Sang-min Lee,et al.  A speech enhancement algorithm to reduce noise and compensate for partial masking effect , 2011 .

[8]  Peter Vary,et al.  NEAR END LISTENING ENHANCEMENT CONSIDERING THERMAL LIMIT OF MOBILE PHONE LOUDSPEAKERS , 2011 .

[9]  Jesper Jensen,et al.  On Optimal Linear Filtering of Speech for Near-End Listening Enhancement , 2013, IEEE Signal Processing Letters.

[10]  Peter Vary,et al.  Near End Listening Enhancement: Speech Intelligibility Improvement in Noisy Environments , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  B. V. Uma,et al.  Speech Enhancement Algorithm to Reduce the Effect of Background Noise in Mobile Phones , 2013 .

[12]  Jesper Jensen,et al.  SII-based speech preprocessing for intelligibility improvement in noise , 2013, INTERSPEECH.

[13]  Malihe hassani,et al.  Speech enhancement based on spectral subtraction in wavelet domain , 2011, 2011 IEEE 7th International Colloquium on Signal Processing and its Applications.

[14]  E. Ambikairajah,et al.  Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters , 2004 .

[15]  Yi Hu,et al.  Incorporating a psychoacoustical model in frequency domain speech enhancement , 2004, IEEE Signal Processing Letters.