Moving-Talker, Speaker-Independent Feature Study, and Baseline Results Using the CUAVE Multimodal Speech Corpus
Sabri Gurbuz | Zekeriya Tufekci | John N. Gowdy | Eric K. Patterson
[1] Gerasimos Potamianos, et al. An image transform approach for HMM based automatic lipreading, 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).
[2] Sabri Gurbuz, et al. Noise-based audio-visual fusion for robust speech recognition, 2001, AVSP.
[3] Javier R. Movellan, et al. Visual Speech Recognition with Stochastic Networks, 1994, NIPS.
[4] D. Massaro, et al. Perceiving Talking Faces, 1995.
[5] Gerasimos Potamianos, et al. Speaker independent audio-visual database for bimodal ASR, 1997, AVSP.
[6] Sabri Gurbuz, et al. Application of affine-invariant Fourier descriptors to lipreading for audio-visual speech recognition, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[7] J. Luettin, et al. Audio-visual Speech Recognition Workshop 2000 Final Report, 2000.
[8] Farzin Deravi, et al. Design issues for a digital audio-visual integrated database, 1996.
[9] Giridharan Iyengar, et al. Large-vocabulary audio-visual speech recognition by machines and humans, 2001, INTERSPEECH.
[10] Chalapathy Neti, et al. Improved ROI and within frame discriminant features for lipreading, 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).
[11] Iain Matthews, et al. Features for Audio-Visual Speech Recognition, 1998.
[12] B. Ripley, et al. Pattern Recognition, 1968, Nature.
[13] Zekeriya Tufekci, et al. Mel-scaled discrete wavelet coefficients for speech recognition, 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[14] Jenq-Neng Hwang, et al. Lipreading from color video, 1997, IEEE Trans. Image Process.
[15] Alan C. Bovik, et al. Computer lipreading for improved accuracy in automatic speech recognition, 1996, IEEE Trans. Speech Audio Process.
[16] Wesley E. Snyder, et al. Application of Affine-Invariant Fourier Descriptors to Recognition of 3-D Objects, 1990, IEEE Trans. Pattern Anal. Mach. Intell.
[17] E. Petajan, et al. An improved automatic lipreading system to enhance speech recognition, 1988, CHI '88.
[18] David F. Rogers, et al. An Introduction to NURBS, 2000.
[19] Jean-Luc Schwartz, et al. Comparing models for audiovisual fusion in a noisy-vowel recognition task, 1999, IEEE Trans. Speech Audio Process.
[20] G. Plant. Perceiving Talking Faces: From Speech Perception to a Behavioral Principle, 1999.
[21] Jiri Matas, et al. XM2VTSDB: The Extended M2VTS Database, 1999.
[22] Q. Summerfield, et al. Lipreading and audio-visual speech perception, 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.