How Large-Scale Training Samples Effect Face Detector? An Empirical Analysis

Recent development in the field of face detection highlights the benefits from large scale training samples, which can be cheaply collected through internet. However, these large training sets are usually constructed in a rather arbitrary manner. In this paper, we empirically investigate the fundamental question of how the training set effects the performance of a given state of the art face detector. In particular, we construct a very large training set containing over 340K face images and study the effect of five common factors of variations (i.e., lighting, expression, blurring, contrast change and noise) which may change face appearance largely. Our results show that noise factor has the most significant influence on the performance of the detector while others (e.g., lighting, expression) are of much less importance. Based on these, we propose a new method to construct an effective training set with much small size for face detection, without significantly reducing the performance.

[1]  Wen Gao,et al.  Matrix-Structural Learning (MSL) of Cascaded Classifier from Enormous Training Set , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Shihong Lao,et al.  Boosting nested cascade detector for multi-view face detection , 2004, ICPR 2004.

[4]  Patrick J. Flynn,et al.  Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  ZhouZhi-Hua,et al.  Face recognition from a single image per person , 2006 .

[6]  Wen Gao,et al.  Locally Assembled Binary (LAB) feature with feature-centric cascade for fast and accurate face detection , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Christophe Garcia,et al.  Convolutional face finder: a neural architecture for fast and robust face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Terence Sim,et al.  The CMU Pose, Illumination, and Expression (PIE) database , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[9]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Zhi-Hua Zhou,et al.  Face recognition from a single image per person: A survey , 2006, Pattern Recognit..