Paper information - Multimodal speech enhancement in noisy environment

This paper presents a new architecture for multimodal speech enhancement in noisy environments. Real-time experimental results and simulation results are included.
