Speech enhancement in discontinuous transmission systems using the constrained-stability least-mean-squares algorithm.

In this paper a novel constrained-stability least-mean-squares (LMS) algorithm for filtering speech sounds is proposed in the adaptive noise cancellation (ANC) problem. It is based on the minimization of the squared Euclidean norm of the weight vector change under a stability constraint over the a posteriori estimation errors. To this purpose, the Lagrangian methodology has been used in order to propose a nonlinear adaptation in terms of the product of differential input and error. Convergence analysis is also studied in terms of the evolution of the natural modes to the optimal Wiener-Hopf solution so that the stability performance depends exclusively on the adaptation parameter mu and the eigenvalues of the difference matrix DeltaR(1). The algorithm shows superior performance over the referenced algorithms in the ANC problem of speech discontinuous transmission systems, which are characterized by rapid transitions of the desired signal. The experimental analysis carried out on the AURORA 3 speech databases provides an extensive performance evaluation together with an exhaustive comparison to the standard LMS algorithms, i.e., the normalized LMS (NLMS), and other recently reported LMS algorithms such as the modified NLMS, the error nonlinearity LMS, or the normalized data nonlinearity LMS adaptation.

[1]  Zayed M. Ramadan,et al.  An Adaptive Noise Canceller Using Error Nonlinearities in the LMS Adaptation , 2004, IEEE SoutheastCon, 2004. Proceedings..

[2]  J. C. Segura,et al.  Improved MO-LRT VAD based on bispectra Gaussian model , 2005 .

[3]  Jae S. Lim,et al.  A new application of adaptive noise cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[4]  S. Douglas A family of normalized LMS algorithms , 1994, IEEE Signal Processing Letters.

[5]  Deniz Erdogmus,et al.  Error whitening criterion for adaptive filtering: theory and algorithms , 2005, IEEE Transactions on Signal Processing.

[6]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .

[7]  L. Breiman,et al.  Variable Kernel Estimates of Multivariate Densities , 1977 .

[8]  Juan Manuel Górriz,et al.  Speech/non-speech discrimination based on contextual information integrated bispectrum LRT , 2006, IEEE Signal Processing Letters.

[10]  D. W. Scott,et al.  Variable Kernel Density Estimation , 1992 .

[11]  Monson H. Hayes,et al.  Statistical Digital Signal Processing and Modeling , 1996 .

[12]  C G Puntonet,et al.  An effective cluster-based model for robust speech detection and speech recognition in noisy environments. , 2006, The Journal of the Acoustical Society of America.

[13]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[14]  Alfred O. Hero,et al.  Partial update LMS algorithms , 2005, IEEE Transactions on Signal Processing.

[15]  Seungjin Choi Differential learning algorithms for decorrelation and independent component analysis , 2006, Neural Networks.

[16]  Harold J. Kushner,et al.  Approximation and Weak Convergence Methods for Random Processes , 1984 .

[17]  Khalid Choukri,et al.  SPEECHDAT-CAR. A Large Speech Database for Automotive Environments , 2000, LREC.

[18]  Deniz Erdoğmuş,et al.  Online entropy manipulation: stochastic information gradient , 2003, IEEE Signal Processing Letters.

[20]  J M Górriz,et al.  Statistical voice activity detection based on integrated bispectrum likelihood ratio tests for robust speech recognition. , 2007, The Journal of the Acoustical Society of America.

[21]  Régine Le Bouquin-Jeannès,et al.  Study of a voice activity detector and its influence on a noise reduction system , 1995, Speech Commun..

[22]  Juan Manuel Górriz,et al.  Generalized LRT-Based Voice Activity Detector , 2006, IEEE Signal Processing Letters.

[23]  R C Waag,et al.  Time-shift compensation of ultrasonic pulse focus degradation using least-mean-square error estimates of arrival time. , 1992, The Journal of the Acoustical Society of America.

[24]  Rui Seara,et al.  On the LMS algorithm with constant and variable leakage factor in a nonlinear environment , 2006, IEEE Transactions on Signal Processing.

[25]  Julie E. Greenberg,et al.  Modified LMS algorithms for speech processing with an adaptive noise canceller , 1998, IEEE Trans. Speech Audio Process..

[26]  Bruce E. Hajek,et al.  Review of 'Approximation and Weak Convergence Methods for Random Processes, with Applications to Stochastic Systems Theory' (Kushner, H.J.; 1984) , 1985, IEEE Transactions on Information Theory.

[27]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[28]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[29]  Dirk T. M. Slock,et al.  On the convergence behavior of the LMS and the normalized LMS algorithms , 1993, IEEE Trans. Signal Process..

[30]  Juan Manuel Górriz,et al.  Hard C-means clustering for voice activity detection , 2006, Speech Commun..

[31]  S. Bolanowski,et al.  Individual magnitude estimation functions and their relation to the Pacinian (P) and non‐Pacinian (NP) channels , 1992 .

[32]  Neil J. Bershad On the optimum data nonlinearity in LMS adaptation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[33]  Teresa H. Y. Meng,et al.  Stochastic gradient adaptation under general error criteria , 1994, IEEE Trans. Signal Process..

[34]  Victor E. DeBrunner,et al.  Hirschman Optimal Transform Block LMS Adaptive Filter , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[35]  Neil J. Bershad,et al.  Analysis of the normalized LMS algorithm with Gaussian inputs , 1986, IEEE Trans. Acoust. Speech Signal Process..