3d Lip Tracking and Co-inertia Analysis for Improved Robustness of Audio-video Automatic Speech Recognition
暂无分享,去创建一个
[1] Jean Thioulouse,et al. CO‐INERTIA ANALYSIS AND THE LINKING OF ECOLOGICAL DATA TABLES , 2003 .
[2] Chalapathy Neti,et al. Recent advances in the automatic recognition of audiovisual speech , 2003, Proc. IEEE.
[3] Alexander Zelinsky,et al. Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing , 2000 .
[4] Alexander Zelinsky,et al. Real-time stereo tracking for head pose and gaze estimation , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).
[5] Roland Göcke,et al. The audio-video australian English speech data corpus AVOZES , 2012, INTERSPEECH.
[6] Timothy F. Cootes,et al. Extraction of Visual Features for Lipreading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..
[7] Roland Göcke,et al. Statistical analysis of the relationship between audio and video speech parameters for Australian English , 2003, AVSP.
[8] S. Dolédec,et al. Co‐inertia analysis: an alternative method for studying species–environment relationships , 1994 .
[9] K. Ruben Gabriel,et al. A permutation test of association between configurations by means of the rv coefficient , 1998 .
[10] Michael Wagner,et al. Aspects of speaking-face data corpus design methodology , 2004, INTERSPEECH.