Audio-visual speech recognition incorporating facial depth information captured by the Kinect
Fillia Makedon | Gerasimos Potamianos | Georgios Galatas
[1] Chalapathy Neti, et al. Recent advances in the automatic recognition of audiovisual speech, 2003, Proc. IEEE.
[2] Tomoki Toda, et al. Recent developments of the HMM-based speech synthesis system (HTS), 2007.
[3] Steve Young, et al. The HTK book version 3.4, 2006.
[4] Roland Göcke, et al. The audio-video Australian English speech data corpus AVOZES, 2012, INTERSPEECH.
[5] K. Brown, et al. Spacecraft hazard avoidance utilizing structured light, 2006, 2006 IEEE Aerospace Conference.
[6] Fillia Makedon, et al. Bilingual corpus for AVASR using multiple sensors and depth information, 2011, AVSP.
[7] Alejandro F. Frangi, et al. AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition, 2004, LREC.
[8] Chalapathy Neti, et al. Improved ROI and within frame discriminant features for lipreading, 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).
[9] Dorothea Kolossa, et al. WAPUSK20 - A Database for Robust Audiovisual Speech Recognition, 2010, LREC.
[10] Satoshi Nakamura, et al. Stream weight optimization of speech and lip image sequence for audio-visual speech recognition, 2000, INTERSPEECH.
[11] Gerasimos Potamianos, et al. Linear discriminant analysis for speechreading, 1998, 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175).
[12] Luis Javier Rodríguez-Fuentes, et al. Comparing genetic algorithms to principal component analysis and linear discriminant analysis in reducing feature dimensionality for speaker recognition, 2008, GECCO '08.
[13] Alex Zelinsky, et al. Learning OpenCV: Computer Vision with the OpenCV Library (Bradski, G.R. et al.; 2008) [On the Shelf], 2009, IEEE Robotics & Automation Magazine.