A comparative study on face detection and tracking algorithms

Face localization is the first stage in many vision based applications and in human-computer interaction. The problem is to define the face location of a person in a color image. The four boosted classifiers embbeded in OpenCV, based on Haar-like features, are compared in terms of speed and efficiency. Skin color distribution is estimated using a non parametric approach. To avoid drifting in color estimate, this model is not updated during the sequence but renewed whenever the face is detected again, that gives the ability to our system to cope with different lighting conditions in a more robust way. Skin color model is then used to localize the face represented by an ellipse: connected component segmentation and a statistical approach, namely the coupled Camshift of Bradsky, are compared in terms of efficiency and speed. The pursuit algorithms are tested on various video sequences, corresponding to various scenarios in terms of illumination, face pose, face size and background complexity (distractor effects).

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  Ahmet M. Kondoz,et al.  Automatic scalable face model design for 2D model-based video coding , 2004, Signal Process. Image Commun..

[3]  Luhong Liang,et al.  A detector tree of boosted classifiers for real-time object detection and tracking , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[4]  Xiaokang Yang,et al.  Camshift Guided Particle Filter for Visual Tracking , 2007, SiPS.

[5]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Stan Z. Li,et al.  FloatBoost learning and statistical face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[8]  A. Ardeshir Goshtasby,et al.  Detecting and tracking human faces in videos , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[9]  Ji-Yeon Kim,et al.  Fast Detection of Multi-View Face and Eye Based on Cascaded Classifier , 2005, MVA.

[10]  Paul A. Viola,et al.  Fast Multiview Face Detection , 2003 .

[11]  James L. Crowley,et al.  Robust face tracking using color , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[12]  Irfan A. Essa,et al.  Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Vladimir Vezhnevets,et al.  A Survey on Pixel-Based Skin Color Detection Techniques , 2003 .

[14]  Alexander H. Waibel,et al.  Segmenting hands of arbitrary color , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[15]  Tae-Woong Yoo,et al.  A fast algorithm for tracking human faces based on chromatic histograms , 1999, Pattern Recognit. Lett..

[16]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[17]  Andrew W. Fitzgibbon,et al.  Direct least squares fitting of ellipses , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[18]  Narendra Ahuja,et al.  Gaussian mixture model for human skin color and its applications in image and video databases , 1998, Electronic Imaging.

[19]  Abdesselam Bouzerdoum,et al.  Skin segmentation using color pixel classification: analysis and comparison , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shaogang Gong,et al.  Modelling facial colour and identity with Gaussian mixtures , 1998, Pattern Recognit..

[21]  Paul A. Viola,et al.  Fast Multi-view Face Detection , 2003 .

[22]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[25]  Alexander H. Waibel,et al.  A real-time face tracker , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[26]  Kenneth E. Barner,et al.  Color-based classifier for region identification in video , 1998, Electronic Imaging.

[27]  Rainer Lienhart,et al.  Empirical Analysis of Detection Cascades of Boosted Classifiers for Rapid Object Detection , 2003, DAGM-Symposium.

[28]  Gary Bradski,et al.  Computer Vision Face Tracking For Use in a Perceptual User Interface , 1998 .

[29]  Jin Zhang,et al.  Multi-cue-based CamShift guided particle filter tracking , 2011, Expert Syst. Appl..

[30]  Mika Laaksonen,et al.  Adaptive skin color modeling using the skin locus for selecting training pixels , 2003, Pattern Recognit..

[31]  Andrew W. Fitzgibbon,et al.  Direct Least Square Fitting of Ellipses , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Lionel Prevost,et al.  Combining model-based classifiers for face localization , 2005, MVA.

[33]  Renaud Seguier A very fast adaptative face detection system , 2004 .

[34]  Bo Wu,et al.  Fast rotation invariant multi-view face detection based on real Adaboost , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..