A speech enhancement method based on sparse reconstruction of power spectral density

Display Omitted The approximation K-singular value decomposition algorithm with nonnegative constraint is used to train PSD dictionary.The least angle regression algorithm with a new termination rule is applied to obtain the sparse representation.The termination rule is related to the noise level and nonzero cross terms of the speech and noise spectra.The enhanced speech signal is obtained by using the estimated PSD and subspace approach. Using sparse representation of power spectral density (PSD) approximated by magnitude-squared spectrum, a new speech enhancement method is presented. The approximation K-singular value decomposition (K-SVD) algorithm with nonnegative constraint is used to train an overcomplete dictionary of the clean speech PSD. The least angle regression algorithm (LARS) with a termination rule based on the ? 2 norm of the sum of the noise PSD and cross term between the clean speech and noise spectra is applied to estimate the clean speech PSD. Combining the estimated PSD with the signal subspace approach based on the short-time spectral amplitude (SSB-STSA), the enhanced speech signal is obtained. The simulation results show that the new method can yield better performance in most of noise conditions.

[1]  Shi-Wen Deng,et al.  Statistical voice activity detection based on sparse representation over learned dictionary , 2013, Digit. Signal Process..

[2]  Michael Elad,et al.  Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[3]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[4]  Yang Lu,et al.  A geometric approach to spectral subtraction , 2008, Speech Commun..

[5]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[6]  Nurgun Erdol,et al.  The Effect of Spectral Estimation on Speech Enhancement Performance , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[8]  Christophe Beaugeant,et al.  Noise reduction for dual-microphone mobile phones exploiting power level differences , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Mark D. Plumbley,et al.  Fast Dictionary Learning for Sparse Representations of Speech Signals , 2011, IEEE Journal of Selected Topics in Signal Processing.

[11]  Joachim M. Buhmann,et al.  Speech Enhancement Using Generative Dictionary Learning , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Mohammed Bennamoun,et al.  Sparse Representation for Speaker Identification , 2010, 2010 20th International Conference on Pattern Recognition.

[13]  Georgios B. Giannakis,et al.  Compressed Sensing for Wideband Cognitive Radios , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[14]  J. Yin,et al.  Direction-of-Arrival Estimation Using a Sparse Representation of Array Covariance Vectors , 2011, IEEE Transactions on Signal Processing.

[15]  Yang Lu,et al.  Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Tuomas Virtanen,et al.  Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[18]  Michael Elad,et al.  Double Sparsity: Learning Sparse Dictionaries for Sparse Signal Approximation , 2010, IEEE Transactions on Signal Processing.

[19]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[20]  Yi-Hsuan Yang,et al.  Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation , 2012, IEEE Transactions on Multimedia.

[21]  Joachim M. Buhmann,et al.  Speech enhancement with sparse coding in learned dictionaries , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[22]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.