Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets

In this paper, we develop a system for enhancement of the speech signal with highest energy from a linear convolutive mixture of n statistically independent sound sources recorded by m microphones, where m<n. In this system we use the concept of independent component analysis (ICA) along with adaptive auditory filter banks and pitch tracking. Computer simulations and real-world experiments carried out in an actual room and measured through objective and subjective measures confirm the validity of the proposed algorithm.

[1]  John H. L. Hansen,et al.  An effective quality evaluation protocol for speech enhancement algorithms , 1998, ICSLP.

[2]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[3]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[4]  Shun-ichi Amari,et al.  Adaptive blind signal processing-neural network approaches , 1998, Proc. IEEE.

[5]  D. Chakrabarti,et al.  A fast fixed - point algorithm for independent component analysis , 1997 .

[6]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[7]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[8]  Allan Kardec Barros,et al.  Amplitude Estimation of Quasi-Periodic Physiological Signals by Wavelets , 2000 .

[9]  Clifford J. Weinstein,et al.  Automatic talker activity labeling for co-channel talker interference suppression , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10]  Mitchel Weintraub,et al.  A theory and computational model of auditory monaural sound separation , 1985 .

[11]  Allan Kardec Barros,et al.  Estimation of speech embedded in a reverberant environment with multiple sources of noise , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  A. de Cheveigné,et al.  The auditory system as a separation machine , 2001 .

[13]  Eric Moulines,et al.  A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[14]  Yunxin Zhao,et al.  Adaptive co-channel speech separation and recognition , 1999, IEEE Trans. Speech Audio Process..

[15]  Shiro Ikeda,et al.  A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[16]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[17]  T. W. Parsons Separation of speech from interfering speech by means of harmonic selection , 1976 .

[18]  Nathalie Delfosse,et al.  Adaptive blind separation of independent sources: A deflation approach , 1995, Signal Process..

[19]  Allan Kardec Barros,et al.  Extraction of Specific Signals with Temporal Structure , 2001, Neural Computation.

[20]  Schuster,et al.  Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.