A Sparse Representation Speech Denoising Method Based on Adapted Stopping Residue Error

Abstract—A sparse representation speech denoising method based on adapted stopping residue error was presented in this paper. Firstly, the cross-correlation between the clean speech spectrum and the noise spectrum was analyzed, and an estimation method was proposed. In the denoising method, an over-complete dictionary of the clean speech power spectrum was learned with the K-singular value decomposition (K-SVD) algorithm. In the sparse representation stage, the stopping residue error was adaptively achieved according to the estimated cross-correlation and the adjusted noise spectrum, and the orthogonal matching pursuit (OMP) approach was applied to reconstruct the clean speech spectrum from the noisy speech. Finally, the clean speech was re-synthesised via the inverse Fourier transform with the reconstructed speech spectrum and the noisy speech phase. The experiment results show that the proposed method outperforms the conventional methods in terms of subjective and objective measure.

[1]  Joachim M. Buhmann,et al.  Speech Enhancement Using Generative Dictionary Learning , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Ina Kodrasi,et al.  Curvature-based optimization of the trade-off parameter in the speech distortion weighted multichannel wiener filter , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Philipos C. Loizou,et al.  A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..

[4]  Vidyavati M. Gaikwad,et al.  Survey on Quality and Intelligibility Offered by Speech Enhancement Algorithms , 2015, 2015 International Conference on Computing Communication Control and Automation.

[5]  Bo Wang,et al.  A Speech Enhancement Method Employing Sparse Representation of Power Spectral Density , 2013 .

[6]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[7]  Aïcha Bouzid,et al.  Sparse Representations for Single Channel Speech Enhancement Based on Voiced/Unvoiced Classification , 2017, Circuits Syst. Signal Process..

[8]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[9]  Brendt Wohlberg,et al.  Efficient Algorithms for Convolutional Sparse Representations , 2016, IEEE Transactions on Image Processing.

[10]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice, Second Edition , 2013 .

[11]  Jiqing Han,et al.  A solution to residual noise in speech denoising with sparse representation , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Nicholas W. D. Evans,et al.  An Assessment on the Fundamental Limitations of Spectral Subtraction , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[13]  Michael Elad,et al.  Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[14]  Koichi Shinoda,et al.  Feature normalization based on non-extensive statistics for speech recognition , 2013, Speech Commun..

[15]  Y. C. Pati,et al.  Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[16]  Yang Zhen Speech Enhancement Based on Data-Driven Dictionary and Sparse Representation , 2011 .

[17]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..