Improved lip contour extraction using K-means clustering and ellipse fitting

A visual speech recognition system depends critically on correct extraction of lip contours: the extracted contour is used to track lip movements, so the accuracy of the system improves with the accuracy of the contour. In this paper, a computer vision approach is proposed to automatically extract the lip contour from a face image. The proposed technique uses the Viola-Jones algorithm to localize the face and mouth in the image. Unlike existing methods, the merge thresholds used in Viola-Jones are made iterative and adaptive, which makes detection invariant to the quality of the input image. Colour space conversion followed by clustering separates lip pixels from non-lip pixels, which are then subjected to morphological operations and ellipse fitting for efficient lip contour extraction. The proposed method has been tested on 4000 images from the VidTIMIT database, and the results show a significant improvement in lip segmentation over some existing methods for lip contour extraction. The proposed method is computationally efficient and robust, and can be used for real-time applications.
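The clustering and ellipse-fitting stages can be illustrated with a minimal sketch. The abstract does not state the colour space, the number of clusters, or the fitting procedure, so everything below is an assumption made for illustration: a synthetic mouth region with a single "redness" feature per pixel, two clusters, the redder cluster taken as lip, and an ellipse fitted from image moments. The helper names `kmeans_1d` and `fit_ellipse_moments` are hypothetical, and the Viola-Jones localization and morphological clean-up steps are omitted.

```python
import numpy as np

def kmeans_1d(values, k=2, iters=20):
    """Plain Lloyd's k-means on a 1-D feature; returns labels and centers."""
    # Deterministic initialisation: spread the centers across the value range.
    centers = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        # Assign every sample to its nearest center.
        labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
        # Move each center to the mean of its assigned samples.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = values[labels == j].mean()
    return labels, centers

def fit_ellipse_moments(mask):
    """Fit an ellipse to a binary mask from its first and second moments."""
    ys, xs = np.nonzero(mask)
    cx, cy = xs.mean(), ys.mean()
    cov = np.cov(np.stack([xs - cx, ys - cy]))
    evals, evecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    # For a uniformly filled ellipse, the variance along a semi-axis a is a^2 / 4.
    minor, major = 2.0 * np.sqrt(evals)
    angle = np.arctan2(evecs[1, 1], evecs[0, 1])  # direction of the major axis
    return (cx, cy), (major, minor), angle

# Synthetic mouth region (assumption): a redder elliptical "lip" on "skin".
rng = np.random.default_rng(0)
h, w = 60, 100
yy, xx = np.mgrid[0:h, 0:w]
true_lip = ((xx - 50) / 30.0) ** 2 + ((yy - 30) / 10.0) ** 2 <= 1.0
redness = np.where(true_lip, 0.8, 0.3) + rng.normal(0.0, 0.03, size=(h, w))

# Cluster the pixels, then treat the cluster with the higher mean redness as lip.
labels, centers = kmeans_1d(redness.ravel(), k=2)
mask = (labels == np.argmax(centers)).reshape(h, w)

center, axes, angle = fit_ellipse_moments(mask)
print("center:", center, "semi-axes:", axes, "angle (rad):", angle)
```

On real images the per-pixel feature would come from the chosen colour space rather than a synthetic map, and the binary mask would normally be cleaned with morphological opening and closing before the ellipse is fitted.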
