Speech Enhancement Based on LWT and Artificial Neural Network and Using MMSE Estimate of Spectral Amplitude

In this chapter, we will detail a new speech enhancement technique based on Lifting Wavelet Transform (LWT) and Artifitial Neural Network (ANN). This technique also uses the MMSE Estimate of Spectral Amplitude. It consists at the first step in applying the LWTto the noisy speech signal in order to obtain two noisy details coefficients, cD1 and cD2 and one approximation coefficient, cA2. After that, cD1 and cD2 are denoised by soft thresholding and for their thresholding, we need to use suitable thresholds, thrj,1≤j≤2. Those thresholds, thrj,1≤j≤2, are determined by using an Artificial Neural Network (ANN). The soft thresholding of those coefficients, cD1 and cD2, is performed in order to obtain two denoised coefficients, cDd1 and cDd2 . Then the denoising technique based on MMSE Estimate of Spectral Amplitude is applied to the noisy approximation cA2 in order to obtain a denoised coefficient, cAd2. Finally, the enhanced speech signal is obtained from the application of the inverse of LWT, LWT−1 to cDd1, cDd2 and cAd2. The performance of the proposed speech enhancement technique is justified by the computations of the Signal to Noise Ratio (SNR), Segmental SNR (SSNR) and Perceptual Evaluation of Speech Quality (PESQ).

[1]  Richard C. Hendriks,et al.  Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  J. Dunlop,et al.  Improved adaptive noise cancellation in the presence of signal leakage on the noise reference channel , 1989 .

[3]  S. Selva Nidhyananthan,et al.  A review on speech enhancement algorithms and why to combine with environment classification , 2014 .

[4]  Mohamed Djendi,et al.  A wavelet-based forward BSS algorithm for acoustic noise reduction and speech enhancement , 2016 .

[5]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[7]  Raymond L. Goldsworthy Two-Microphone Spatial Filtering Improves Speech Reception for Cochlear-Implant Users in Reverberant Conditions With Multiple Noise Sources , 2014, Trends in hearing.

[8]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[9]  Marc Moonen,et al.  GSVD-based optimal filtering for single and multimicrophone speech enhancement , 2002, IEEE Trans. Signal Process..

[10]  Mohamed Djendi,et al.  Noise Cancellation using Two Closely Spaced Microphones: Experimental Study witha Specific Model and Two Adaptive Algorithms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Meir Feder,et al.  Multi-channel signal separation by decorrelation , 1993, IEEE Trans. Speech Audio Process..

[12]  Simon J. Godsill,et al.  Efficient Alternatives to the Ephraim and Malah Suppression Rule for Audio Signal Enhancement , 2003, EURASIP J. Adv. Signal Process..

[13]  Noureddine Ellouze,et al.  Speech enhancement based on wavelet packet of an improved principal component analysis , 2016, Comput. Speech Lang..

[14]  Zhongfu Ye,et al.  A Higher Order Subspace Algorithm for Multichannel Speech Enhancement , 2015, IEEE Signal Processing Letters.

[15]  Mohamed Djendi,et al.  Analysis of two-sensors forward BSS structure with post-filters in the presence of coherent and incoherent noise , 2013, Speech Commun..

[16]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[17]  Carlos E. Davila A subspace approach to estimation of autoregressive parameters from noisy measurements , 1998, IEEE Trans. Signal Process..

[18]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[19]  Jacob Benesty,et al.  Multichannel Signal Enhancement in the STFT Domain , 2016 .

[20]  Yi Zhang,et al.  Real and imaginary modulation spectral subtraction for speech enhancement , 2013, Speech Commun..

[21]  S. B. Dhonde,et al.  A review on speech enhancement techniques , 2015, 2015 International Conference on Pervasive Computing (ICPC).

[22]  Mohamed Djendi,et al.  A Frequency Domain Adaptive Decorrelating Algorithm for Speech Enhancement , 2015, SPECOM.

[23]  Y. Ephraim,et al.  A Brief Survey of Speech Enhancement 1 , 2018, Microelectronics.

[24]  Pascal Scalart,et al.  Speech enhancement using harmonic regeneration , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[25]  Mohamed Djendi,et al.  A new regularized forward blind source separation algorithm for automatic speech quality enhancement , 2016 .

[26]  Kong-Aik Lee,et al.  Improving convergence of the NLMS algorithm using constrained subband updates , 2004, IEEE Signal Processing Letters.

[27]  Christian Jutten,et al.  Blind source separation for convolutive mixtures , 1995, Signal Process..

[28]  Jin-Ho Cho,et al.  Wavelet speech enhancement algorithm using exponential semi-soft mask filtering , 2016, Bioengineered.

[29]  Dirk Van Compernolle,et al.  Feedforward and feedback in a symmetric adaptive noise canceler : “stability analysis in a simplified case” , 1992 .

[30]  Nam Soo Kim,et al.  Spectro-Temporal Filtering for Multichannel Speech Enhancement in Short-Time Fourier Transform Domain , 2014, IEEE Signal Processing Letters.

[31]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[32]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[33]  Y. Itoh,et al.  Noise Reduction Technique , 2011 .

[34]  Mohamed Djendi An efficient wavelet-based adaptive filtering algorithm for automatic blind speech enhancement , 2018, Int. J. Speech Technol..

[35]  Mohamed Djendi,et al.  A frequency co-channel adaptive algorithm for speech quality enhancement , 2016, 2016 International Conference on Engineering & MIS (ICEMIS).