A Cooperative Learning-Based Clustering Approach to Lip Segmentation Without Knowing Segment Number

It is usually hard to predetermine the true number of segments in lip segmentation. This paper, therefore, presents a clustering-based approach to lip segmentation without knowing the true segment number. The objective function in the proposed approach is a variant of the partition entropy (PE) and features that the coincident cluster centroids in pattern space can be equivalently substituted by one centroid with the function value unchanged. It is shown that the minimum of the proposed objective function can be reached provided that: 1) the number of positions occupied by cluster centroids in pattern space is equal to the true number of clusters and 2) these positions are coincident with the optimal cluster centroids obtained under PE criterion. In implementation, we first randomly initialize the clusters provided that the number of clusters is greater than or equal to the ground truth. Then, an iterative algorithm is utilized to minimize the proposed objective function. For each iterative step, not only is the winner, i.e., the centroid with the maximum membership degree, updated to adapt to the corresponding input data, but also the other centroids are adjusted with a specific cooperation strength, so that they are each close to the winner. Subsequently, the initial overpartition will be gradually faded out with the redundant centroids superposed over the convergence of the algorithm. Based upon the proposed algorithm, we present a lip segmentation scheme. Empirical studies have shown its efficacy in comparison with the existing methods.

[1]  Max K. Agoston,et al.  Computer graphics and geometric modelling - implementation and algorithms , 2005 .

[2]  Kannan,et al.  ON IMAGE SEGMENTATION TECHNIQUES , 2022 .

[3]  Rama Chellappa,et al.  Growing Regression Forests by Classification: Applications to Object Pose Estimation , 2013, ECCV.

[4]  Alice Caplier,et al.  Accurate and quasi-automatic lip tracking , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Montse Pardàs,et al.  Motion estimation based tracking of active contours , 2001, Pattern Recognit. Lett..

[6]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[7]  Y. J. Zhang,et al.  A survey on evaluation methods for image segmentation , 1996, Pattern Recognit..

[8]  Meng Li,et al.  Automatic Segmentation of Color Lip Images Based on Morphological Filter , 2010, ICANN.

[9]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[10]  Timothy F. Cootes,et al.  Extraction of Visual Features for Lipreading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  P. Sivakumar,et al.  A REVIEW ON IMAGE SEGMENTATION TECHNIQUES , 2016 .

[12]  Xavier Maldague,et al.  Unsupervised Lips Segmentation Based on ROI Optimisation and Parametric Model , 2007, 2007 IEEE International Conference on Image Processing.

[13]  A. Boudraa Dynamic estimation of number of clusters in data sets , 1999 .

[14]  Shu Hung Leung,et al.  Lip image segmentation using fuzzy clustering incorporating an elliptic shape function , 2004, IEEE Transactions on Image Processing.

[15]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[16]  Tao Jiang,et al.  Minimum entropy clustering and applications to gene expression analysis , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[17]  Michael Spann,et al.  A new approach to clustering , 1990, Pattern Recognit..

[18]  Gérard Bailly,et al.  Non-linear active model for mouth inner and outer contours detection , 2005, 2005 13th European Signal Processing Conference.

[19]  Bülent Sankur,et al.  Robust classification of face and head gestures in video , 2011, Image Vis. Comput..

[20]  Miin-Shen Yang,et al.  A cluster validity index for fuzzy clustering , 2005, Pattern Recognit. Lett..

[21]  Nikos Fakotakis,et al.  An unconstrained method for lip detection in color images , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Xie Yuan-dan,et al.  Survey on Image Segmentation , 2002 .

[23]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[24]  Yiu-ming Cheung,et al.  Maximum weighted likelihood via rival penalized EM for density mixture clustering with automatic model selection , 2005, IEEE Transactions on Knowledge and Data Engineering.

[25]  Alan Wee-Chung Liew,et al.  Lip Region Segmentation with complex background , 2009 .

[26]  A. Murat Tekalp,et al.  Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading , 2006, IEEE Transactions on Image Processing.

[27]  Sridha Sridharan,et al.  An approach to statistical lip modelling for speaker identification via chromatic feature extraction , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[28]  Yiu-ming Cheung A competitive and cooperative learning approach to robust data clustering , 2004, Neural Networks and Computational Intelligence.

[29]  Russell M. Mersereau,et al.  Lip feature extraction towards an automatic speechreading system , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[30]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[31]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[32]  Alice Caplier,et al.  Jumping snakes and parametric model for lip segmentation , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[33]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[34]  E. Mayoraz,et al.  Fusion of face and speech data for person identity verification , 1999, IEEE Trans. Neural Networks.

[35]  Jean-Philippe Thiran,et al.  Modelling human perception of static facial expressions , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[36]  Max K. Agoston Computer Graphics And Geometric Modelling: Implementation & Algorithms , 2005 .

[37]  Franck Luthon,et al.  Real Time Tracking for 3D Realistic Lip Animation , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[38]  Franck Luthon,et al.  Nonlinear color space and spatiotemporal MRF for hierarchical segmentation of face features in video , 2004, IEEE Transactions on Image Processing.

[39]  Doheon Lee,et al.  On cluster validity index for estimation of the optimal number of fuzzy clusters , 2004, Pattern Recognit..

[40]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Boudewijn P. F. Lelieveldt,et al.  A new cluster validity index for the fuzzy c-mean , 1998, Pattern Recognit. Lett..

[42]  Michael Wagner,et al.  Robust face-voice based speaker identity verification using multilevel fusion , 2008, Image Vis. Comput..

[43]  R. Boostani,et al.  Lip segmentation in color images , 2008, 2008 International Conference on Innovations in Information Technology.

[44]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[45]  A. Martínez,et al.  The AR face databasae , 1998 .

[46]  Franc Solina,et al.  COLOR-BASED FACE DETECTION IN THE "15 SECONDS OF FAME" ART INSTALLATION , 2003 .

[47]  Alan Wee-Chung Liew,et al.  Segmentation of color lip images by spatial fuzzy clustering , 2003, IEEE Trans. Fuzzy Syst..

[48]  Patrice Delmas,et al.  Towards robust lip tracking , 2002, Object recognition supported by user interaction for service robots.

[49]  Jan Havrda,et al.  Quantification method of classification processes. Concept of structural a-entropy , 1967, Kybernetika.

[50]  Dong-Jo Park,et al.  A Novel Validity Index for Determination of the Optimal Number of Clusters , 2001 .

[51]  Alan Wee-Chung Liew,et al.  Robust lip region segmentation for lip images with complex background , 2007, Pattern Recognit..

[52]  Hong Yan,et al.  An adaptive spatial fuzzy clustering algorithm for 3-D MR image segmentation , 2003, IEEE Transactions on Medical Imaging.

[53]  Hadi Seyedarabi,et al.  Automatic Lip Tracking and Action Units Classification using Two-Step Active Contours and Probabilistic Neural Networks , 2006, 2006 Canadian Conference on Electrical and Computer Engineering.