Multi-view face detection in videos with online adaptation

Most learning-based approaches to face detection suffer from the problem of performance degradation on faces that are not covered by training data. However, including all variations of faces in training is practically infeasible due to the scalability restriction of machine learning algorithms and expensive manual labeling. In this work, we focus on face detection in videos, and alleviate this problem by exploiting strong correlation among video frames. We augment a pre-trained multiview face detection with an incrementally derived Gaussian process regressor. The regressor can extract and propagate visual knowledge across frames, and adapts the detector to handle unseen faces. Testing on two datasets, the promising results manifest the effectiveness of the proposed approach.

[1]  Jan Peters,et al.  Real-Time Local GP Model Learning , 2010, From Motor Learning to Interaction Learning in Robots.

[2]  Harry Shum,et al.  Statistical Learning of Multi-view Face Detection , 2002, ECCV.

[3]  Hwann-Tzong Chen,et al.  Real-time tracking using trust-region methods , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Francois Bremond,et al.  Combining face detection and people tracking in video sequences , 2009, ICDP.

[5]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6]  Tyng-Luh Liu,et al.  Robust face detection with multi-class boosting , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Sang Ryong Kim,et al.  Integrated approach of multiple face detection for video surveillance , 2002, Object recognition supported by user interaction for service robots.

[8]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[9]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[11]  Trevor Darrell,et al.  Gaussian Processes for Object Categorization , 2010, International Journal of Computer Vision.

[12]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13]  Stanley T. Birchfield,et al.  Elliptical head tracking using intensity gradients and color histograms , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[14]  Antonio Torralba,et al.  Sharing Visual Features for Multiclass and Multiview Object Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Duy Nguyen-Tuong,et al.  Local Gaussian Process Regression for Real Time Online Model Learning , 2008, NIPS.

[17]  Gian Luca Foresti,et al.  Face detection for visual surveillance , 2003, 12th International Conference on Image Analysis and Processing, 2003.Proceedings..

[18]  Erik G. Learned-Miller,et al.  Online domain adaptation of a pre-trained cascade of classifiers , 2011, CVPR 2011.

[19]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[20]  Takeo Kanade,et al.  Rotation Invariant Neural Network-Based Face Detection , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[21]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[23]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Zhengyou Zhang,et al.  A Survey of Recent Advances in Face Detection , 2010 .

[25]  Yuan Li,et al.  Vector boosting for rotation invariant multi-view face detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[26]  Paul A. Viola,et al.  Fast Multi-view Face Detection , 2003 .

[27]  Chiou-Shann Fuh,et al.  Fast Object Detection with Occlusions , 2004, ECCV.

[28]  D.O. Gorodnichy,et al.  Associative neural networks as means for low-resolution video-based recognition , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[29]  PoggioTomaso,et al.  Example-Based Learning for View-Based Human Face Detection , 1998 .

[30]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.