Perceptual subspace speech enhancement with SSDR normalization

In this paper a perceptual subspace speech enhancement method using masking property of human auditory system with signal to spectral deviation ratio (SSDR) normalization is presented. The masking property of the human auditory system is used while deciding the gain parameters for the algorithm. Spectral Domain Constrained estimator was employed in determining the filter coefficients and colored noise was handled by replacing the noise variance by Rayleigh quotient. SSDR normalization is further done to reduce the spectral distortion, making the output more intelligible. The objective measures SNRLoss and weep were chosen for performance evaluation based on their efficiency in determining the intelligibility of the output. The results show an improved performance of the proposed method over some of the existing speech enhancement methods in terms of intelligibility.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Benoît Champagne,et al.  Incorporating the human hearing properties in the signal subspace approach for speech enhancement , 2003, IEEE Trans. Speech Audio Process..

[4]  Ching-Ta Lu Reduction of musical residual noise for speech enhancement using masking properties and optimal smoothing , 2007, Pattern Recognit. Lett..

[5]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[6]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[7]  Philipos C. Loizou,et al.  SNR loss: A new objective measure for predicting the intelligibility of noise-suppressed speech , 2011, Speech Commun..

[8]  Perceptual subspace speech enhancement using variance of the reconstruction error , 2014, Digit. Signal Process..

[9]  Nam C. Phamdo,et al.  Signal/noise KLT based approach for enhancing speech degraded by colored noise , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[10]  Xiuhua Geng,et al.  A signal subspace approach for speech enhancement , 2014 .

[11]  Yi Hu,et al.  A subspace approach for enhancing speech corrupted by colored noise , 2002, IEEE Signal Processing Letters.

[12]  Saeed Gazor,et al.  An adaptive KLT approach for speech enhancement , 2001, IEEE Trans. Speech Audio Process..

[13]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[14]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[15]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..