论文信息 - Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets - 字舞流文

Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets

In this paper, we develop a system for enhancement of the speech signal with highest energy from a linear convolutive mixture of n statistically independent sound sources recorded by m microphones, where m<n. In this system we use the concept of independent component analysis (ICA) along with adaptive auditory filter banks and pitch tracking. Computer simulations and real-world experiments carried out in an actual room and measured through objective and subjective measures confirm the validity of the proposed algorithm.

Allan Kardec Barros | Noboru Ohnishi | Fumitada Itakura | Tomasz M. Rutkowski | A. K. Barros | F. Itakura | N. Ohnishi | A. Barros

[1] John H. L. Hansen,et al. An effective quality evaluation protocol for speech enhancement algorithms , 1998, ICSLP.

[2] Terrence J. Sejnowski,et al. An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[3] S. Haykin,et al. Adaptive Filter Theory , 1986 .

[4] Shun-ichi Amari,et al. Adaptive blind signal processing-neural network approaches , 1998, Proc. IEEE.

[5] D. Chakrabarti,et al. A fast fixed - point algorithm for independent component analysis , 1997 .

[6] Nathalie Virag,et al. Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[7] Pierre Comon,et al. Independent component analysis, A new concept? , 1994, Signal Process..

[8] Allan Kardec Barros,et al. Amplitude Estimation of Quasi-Periodic Physiological Signals by Wavelets , 2000 .

[9] Clifford J. Weinstein,et al. Automatic talker activity labeling for co-channel talker interference suppression , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10] Mitchel Weintraub,et al. A theory and computational model of auditory monaural sound separation , 1985 .

[11] Allan Kardec Barros,et al. Estimation of speech embedded in a reverberant environment with multiple sources of noise , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12] A. de Cheveigné,et al. The auditory system as a separation machine , 2001 .

[13] Eric Moulines,et al. A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[14] Yunxin Zhao,et al. Adaptive co-channel speech separation and recognition , 1999, IEEE Trans. Speech Audio Process..

[15] Shiro Ikeda,et al. A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[16] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[17] T. W. Parsons. Separation of speech from interfering speech by means of harmonic selection , 1976 .

[18] Nathalie Delfosse,et al. Adaptive blind separation of independent sources: A deflation approach , 1995, Signal Process..

[19] Allan Kardec Barros,et al. Extraction of Specific Signals with Temporal Structure , 2001, Neural Computation.

[20] Schuster,et al. Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.