Lip segmentation using localized active contour model with automatic initial contour

Lip-reading is one of important approaches for human–computer interaction (HCI). Its development would have a large range of applications, especially in augmented reality. Lip segmentation is the first and foremost step in the lip-reading system. Conventional method of region-based active contour model adopts the global information of image and is unable to perform well. In this paper, from a localized perspective, we introduce the methodology of localized active contour model (LACM) and, meanwhile, propose the method that using LACM to perform the lip segmentation with the initial contour automatically generated. The scope for active contour model is reduced to the local region that reduces the disturbances of unrelated factors. The experimental results demonstrate the method adopts this model would dramatically improve the robustness for lip segmentation. On this basis, we analyze the influence of initial contours and local radiuses, study the efficiencies under different initial contours and compare it with the conventional active contour model which adopts the global information.

[1]  M. Lie UNSUPERVISED LIP SEGMENTATION UNDER NATURAL CONDITIONS , 1999 .

[2]  Allen R. Tannenbaum,et al.  Localizing Region-Based Active Contours , 2008, IEEE Transactions on Image Processing.

[3]  Alan Wee-Chung Liew,et al.  Lip segmentation with the presence of beards , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Trent W. Lewis,et al.  Lip Feature Extraction Using Red Exclusion , 2000, VIP.

[5]  Alice Caplier,et al.  Accurate and quasi-automatic lip tracking , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Manassanan Srikham,et al.  Active contours segmentation with edge based and local region based , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[7]  Timothy F. Cootes,et al.  Extraction of Visual Features for Lipreading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Alice Caplier,et al.  New color transformation for lips segmentation , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[9]  Shu Hung Leung,et al.  Lip image segmentation using fuzzy clustering incorporating an elliptic shape function , 2004, IEEE Transactions on Image Processing.

[10]  Franck Luthon,et al.  Nonlinear color space and spatiotemporal MRF for hierarchical segmentation of face features in video , 2004, IEEE Transactions on Image Processing.

[11]  Gaurav Agrawal,et al.  Automatic Lip Contour Tracking and Visual Character Recognition for Computerized Lip Reading , 2009 .

[12]  Patrice Delmas,et al.  Automatic lip tracking: Bayesian segmentation and active contours in a cooperative scheme , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[13]  Juergen Luettin,et al.  Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..

[14]  Montse Pardàs,et al.  Motion estimation based tracking of active contours , 2001, Pattern Recognit. Lett..

[15]  J.N. Gowdy,et al.  CUAVE: A new audio-visual database for multimodal human-computer interface research , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.