Perceptual Subspace Speech Enhancement with Variance Normalization

In this paper a perceptual subspace speech enhancement method using masking property of human auditory system with variance normalization is presented. The masking property of the human auditory system is used while deciding the gain parameters for the algorithm. Spectral Domain Constrained estimator was employed in determining the filter coefficients and colored noise was handled by replacing the noise variance by Rayleigh quotient. Variance normalization is further done to remove the spikes in the values so as to avoid abrupt increase or decrease in power of the output samples making the output more intelligible. The objective measures SNRLoss and SNRLESC were chosen for performance evaluation based on their efficiency in determining the intelligibility of the output. The results show an improved performance of the proposed method over some of the existing speech enhancement methods in terms of intelligibility.

[1]  Susanto Rahardja,et al.  Audible Noise Reduction in Eigendomain for Speech Enhancement , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[3]  Hsiao-Chuan Wang,et al.  Enhancement of single channel speech based on masking property and wavelet transform , 2003, Speech Commun..

[4]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[5]  Perceptual subspace speech enhancement using variance of the reconstruction error , 2014, Digit. Signal Process..

[6]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Alexander A. Petrovsky,et al.  Signal subspace approach for psychoacoustically motivated speech enhancement , 2011, Speech Commun..

[8]  Ching-Ta Lu Reduction of musical residual noise for speech enhancement using masking properties and optimal smoothing , 2007, Pattern Recognit. Lett..

[9]  Marco Matassoni,et al.  A perceptual masking approach for noise robust speech recognition , 2012, EURASIP J. Audio Speech Music. Process..

[10]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[11]  Robert M. Gray,et al.  On the asymptotic eigenvalue distribution of Toeplitz matrices , 1972, IEEE Trans. Inf. Theory.

[12]  Nam C. Phamdo,et al.  Signal/noise KLT based approach for enhancing speech degraded by colored noise , 2000, IEEE Trans. Speech Audio Process..

[13]  Y. Ephraim,et al.  Extension of the signal subspace speech enhancement approach to colored noise , 2003, IEEE Signal Processing Letters.

[14]  Philipos C. Loizou,et al.  SNR loss: A new objective measure for predicting the intelligibility of noise-suppressed speech , 2011, Speech Commun..

[15]  Michael Small,et al.  Extension of the local subspace method to enhancement of speech with colored noise , 2008, Signal Process..

[16]  Yi Hu,et al.  A subspace approach for enhancing speech corrupted by colored noise , 2002, IEEE Signal Processing Letters.

[17]  Saeed Gazor,et al.  An adaptive KLT approach for speech enhancement , 2001, IEEE Trans. Speech Audio Process..

[18]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[19]  Benoît Champagne,et al.  Incorporating the human hearing properties in the signal subspace approach for speech enhancement , 2003, IEEE Trans. Speech Audio Process..