Saliency model-based face segmentation and tracking in head-and-shoulder video sequences

In this paper, a novel face segmentation algorithm is proposed based on facial saliency map (FSM) for head-and-shoulder type video application. This method consists of three stages. The first stage is to generate the saliency map of input video image by our proposed facial attention model. In the second stage, a geometric model and an eye-map built from chrominance components are employed to localize the face region according to the saliency map. The third stage involves the adaptive boundary correction and the final face contour extraction. Based on the segmented result, an effective boundary saliency map (BSM) is then constructed, and applied for the tracking based segmentation of the successive frames. Experimental evaluation on test sequences shows that the proposed method is capable of segmenting the face area quite effectively.

[1]  Son Lam Phung,et al.  Skin colour based face detection , 2001, The Seventh Australian and New Zealand Intelligent Information Systems Conference, 2001.

[2]  Hong Yan,et al.  An Analytic-to-Holistic Approach for Face Recognition Based on a Single Frontal View , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Dante Augusto Couto Barone,et al.  A probabilistic model for the human skin color , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[4]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[5]  Chengjun Liu,et al.  A Bayesian Discriminating Features Method for Face Detection , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Cordelia Schmid,et al.  Face Detection and Tracking in a Video by Propagating Detection Probabilities , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  King Ngi Ngan,et al.  Towards unsupervised attention object extraction by integrating visual attention and object growing , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[8]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Ilya Levner,et al.  Classification-Driven Watershed Segmentation , 2007, IEEE Transactions on Image Processing.

[10]  Ioannis Pitas,et al.  A Fully Automatic Approach to Facial Feature Detection and Tracking , 1997, AVBPA.

[11]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  M. Meribout Video Segmentation for Content-based Coding , 2004 .

[13]  Shigeru Akamatsu,et al.  Detection of human faces in complex scene images by use of a skin color model and of invariant Fourier-Mellin moments , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[14]  Georgios Tziritas,et al.  Face Detection Using Quantized Skin Color Regions Merging and Wavelet Packet Analysis , 1999, IEEE Trans. Multim..

[15]  Yuan Li,et al.  Tracking in Low Frame Rate Video: A Cascade Particle Filter with Discriminative Observers of Different Lifespans , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Sebastian Lang,et al.  Improving adaptive skin color segmentation by incorporating results from face detection , 2002, Proceedings. 11th IEEE International Workshop on Robot and Human Interactive Communication.

[17]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[18]  C. Chen,et al.  Detection of human faces in colour images , 1997 .

[19]  Stan Z. Li,et al.  Learning multiview face subspaces and facial pose estimation using independent component analysis , 2005, IEEE Transactions on Image Processing.

[20]  Weisi Lin,et al.  Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation , 2005, IEEE Transactions on Image Processing.

[21]  Touradj Ebrahimi,et al.  Tracking video objects in cluttered background , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  J. Ohya,et al.  Automatic skin-color distribution extraction for face detection and tracking , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[23]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[24]  Matti Pietikäinen,et al.  Detection of skin color under changing illumination: a comparative study , 2003, 12th International Conference on Image Analysis and Processing, 2003.Proceedings..

[25]  Yuxiao Hu,et al.  Bayesian shape localization for face recognition using global and local textures , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Hayit Greenspan,et al.  Mixture model for face-color modeling and segmentation , 2001, Pattern Recognit. Lett..

[27]  Franck Luthon,et al.  Nonlinear color space and spatiotemporal MRF for hierarchical segmentation of face features in video , 2004, IEEE Transactions on Image Processing.

[28]  Rama Chellappa,et al.  Background learning for robust face recognition with PCA in the presence of clutter , 2005, IEEE Transactions on Image Processing.

[29]  Yuan Li,et al.  Tracking in Low Frame Rate Video: A Cascade Particle Filter with Discriminative Observers of Different Lifespans , 2007, CVPR.

[30]  Huitao Luo,et al.  Model-based segmentation and tracking of head-and-shoulder video objects for real time multimedia services , 2003, IEEE Trans. Multim..

[31]  Irfan A. Essa,et al.  Tree-based Classifiers for Bilayer Video Segmentation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[33]  Abdesselam Bouzerdoum,et al.  Skin segmentation using color pixel classification: analysis and comparison , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Cheng-Chew Lim,et al.  Segmentation of the face and hands in sign language video sequences using color and motion cues , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Shih-Fu Chang,et al.  A highly efficient system for automatic face region detection in MPEG video , 1997, IEEE Trans. Circuits Syst. Video Technol..

[36]  Shih-Chang Hsia,et al.  Efficient light balancing techniques for text images in video presentation systems , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[37]  Dante Augusto Couto Barone,et al.  Performance evaluation of single and multiple-Gaussian models for skin color modeling , 2002, Proceedings. XV Brazilian Symposium on Computer Graphics and Image Processing.

[38]  Mohamed A. Deriche,et al.  Scale-Space Properties of the Multiscale Morphological Dilation-Erosion , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Bernd Menser,et al.  Segmentation and tracking of facial regions in color image sequences , 2000, Visual Communications and Image Processing.

[40]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Tieniu Tan,et al.  Skin color detection using multiple cues , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[42]  Bernhard Fröba,et al.  Face Tracking by Means of Continuous Detection , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[43]  Ting Liu,et al.  Video Segmentation via Temporal Pattern Classification , 2007, IEEE Transactions on Multimedia.

[44]  Jungwon Seo,et al.  Detection of human faces using skin color and eyes , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[45]  King Ngi Ngan,et al.  Face segmentation using skin-color map in videophone applications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[46]  A. Ardeshir Goshtasby,et al.  Detecting human faces in color images , 1998, Image Vis. Comput..

[47]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[48]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[49]  HongJiang Zhang,et al.  A model of motion attention for video skimming , 2002, Proceedings. International Conference on Image Processing.

[50]  Lijun Yin,et al.  Multiple-View Face Tracking For Modeling and Analysis Based On Non-Cooperative Video Imagery , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Anil K. Jain,et al.  Face Detection in Color Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..