Multi-Modal Speech Recognition Using Optical-Flow Analysis for Lip Images
暂无分享,去创建一个
[1] Stephen E. Levinson,et al. Speaker independent audio-visual speech recognition , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).
[2] Keiichi Tokuda,et al. Audio-visual speech recognition using MCE-based hmms and model-dependent stream weights , 2000, INTERSPEECH.
[3] Sonya A. H. McMullen,et al. Mathematical Techniques in Multisensor Data Fusion (Artech House Information Warfare Library) , 2004 .
[4] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.
[5] Sadaoki Furui,et al. Ubiquitous speech processing , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[6] Chalapathy Neti,et al. Audio-visual large vocabulary continuous speech recognition in the broadcast domain , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).
[7] Yochai Konig,et al. "Eigenlips" for robust speech recognition , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Gerasimos Potamianos,et al. Speaker independent audio-visual database for bimodal ASR , 1997, AVSP.
[9] Alex Pentland,et al. Automatic lipreading by optical-flow analysis , 1989 .
[10] Sadaoki Furui,et al. Speech recognition technology in the ubiquitous/wearable computing environment , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[11] D. L. Hall,et al. Mathematical Techniques in Multisensor Data Fusion , 1992 .
[12] Koji Iwano. Bimodal speech recognition using lip movement measured by optical flow analysis , 2001 .
[13] Satoshi Nakamura,et al. Stream weight optimization of speech and lip image sequence for audio-visual speech recognition , 2000, INTERSPEECH.