Compact Convolutional Neural Network Cascade for Face Detection

The problem of faces detection in images or video streams is a classical problem of computer vision. The multiple solutions of this problem have been proposed, but the question of their optimality is still open. Many algorithms achieve a high quality face detection, but at the cost of high computational complexity. This restricts their application in the real-time systems. This paper presents a new solution of the frontal face detection problem based on compact convolutional neural networks cascade. The test results on FDDB dataset show that it is competitive with state-of-the-art algorithms. This proposed detector is implemented using three technologies: SSE/AVX/AVX2 instruction sets for Intel CPUs, Nvidia CUDA, OpenCL. The detection speed of our approach considerably exceeds all the existing CPU-based and GPU-based algorithms. Because of high computational efficiency, our detector can processing 4K Ultra HD video stream in real time (up to 27 fps) on mobile platforms (Intel Ivy Bridge CPUs and Nvidia Kepler GPUs) in searching objects with the dimension 60x60 pixels or higher. At the same time its performance weakly dependent on the background and number of objects in scene. This is achieved by the asynchronous computation of stages in the cascade.

[1]  Bin Yang,et al.  Aggregate channel features for multi-view face detection , 2014, IEEE International Joint Conference on Biometrics.

[2]  Andreas Ernst,et al.  Face detection with the modified census transform , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[3]  Bernhard Schölkopf,et al.  Face Detection - Efficient and Rank Deficient , 2004, NIPS.

[4]  Sébastien Marcel,et al.  Fast Bounding Box Estimation based Face Detection , 2010 .

[5]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Erik G. Learned-Miller,et al.  Online domain adaptation of a pre-trained cascade of classifiers , 2011, CVPR 2011.

[7]  Adrian Barbu,et al.  Face Detection with a 3D Model , 2016 .

[8]  Forrest N. Iandola,et al.  DenseNet: Implementing Efficient ConvNet Descriptor Pyramids , 2014, ArXiv.

[9]  Pod Hyb Extended Set of Local Binary Patterns for Rapid Object Detection , 2010 .

[10]  Jian Sun,et al.  Joint Cascade Face Detection and Alignment , 2014, ECCV.

[11]  Song Han,et al.  Learning both Weights and Connections for Efficient Neural Network , 2015, NIPS.

[12]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[13]  Jian Zhang,et al.  Face Detection with Effective Feature Extraction , 2010, ACCV.

[14]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[15]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[16]  Zhuowen Tu,et al.  Deeply-Supervised Nets , 2014, AISTATS.

[17]  Forrest N. Iandola,et al.  Communication-minimizing 2D convolution in GPU registers , 2013, 2013 IEEE International Conference on Image Processing.

[18]  Nuno Vasconcelos,et al.  Boosting Classifier Cascades , 2010, NIPS.

[19]  Muhittin Gökmen,et al.  GPU Accelerated Real-time Object Detection on High Resolution Videos using Modified Census Transform , 2012, VISAPP.

[20]  Jiri Matas,et al.  Extended Set of Local Binary Patterns for Rapid Object Detection , 2010 .

[21]  Erik Learned-Miller,et al.  FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[22]  Zhengyou Zhang,et al.  Improving multiview face detection with multi-task deep convolutional neural networks , 2014, IEEE Winter Conference on Applications of Computer Vision.

[23]  Gang Hua,et al.  Efficient Boosted Exemplar-Based Face Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[25]  Patrick J. Flynn,et al.  The effectiveness of face detection algorithms in unconstrained crowd scenes , 2014, IEEE Winter Conference on Applications of Computer Vision.

[26]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[27]  Yann LeCun,et al.  Synergistic Face Detection and Pose Estimation with Energy-Based Models , 2004, J. Mach. Learn. Res..

[28]  Li-Jia Li,et al.  Multi-view Face Detection Using Deep Convolutional Neural Networks , 2015, ICMR.

[29]  Quoc V. Le,et al.  On optimization methods for deep learning , 2011, ICML.

[30]  Bin Yang,et al.  Fine-grained evaluation on face detection in the wild , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[31]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Ключникова Татьяна Николаевна Социальные индикаторы преобразований в системах управления , 2010 .

[33]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[34]  Daniel Roviras,et al.  Levenberg-Marquardt learning neural network for adaptive predistortion for time-varying HPA with memory in OFDM systems , 2008, 2008 16th European Signal Processing Conference.

[35]  Xavier Martorell,et al.  Real-time GPU-based face detection in HD video sequences , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[36]  Igor S. Pandzic,et al.  A method for object detection based on pixel intensity comparisons , 2013, ArXiv.

[37]  Christophe Garcia,et al.  Convolutional face finder: a neural architecture for fast and robust face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Junjie Yan,et al.  The Fastest Deformable Part Model for Object Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Deepak Ghimire,et al.  A Robust Face Detection Method Based on Skin Color and Edges , 2013, J. Inf. Process. Syst..

[40]  Jan H. van Vuuren,et al.  A survey and comparison of guillotine heuristics for the 2D oriented offline strip packing problem , 2009, Discret. Optim..

[41]  Youngmin Yi,et al.  Real-time face detection in Full HD images exploiting both embedded CPU and GPU , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[42]  Jeff Johnson,et al.  Fast Convolutional Nets With fbfft: A GPU Performance Evaluation , 2014, ICLR.

[43]  Anil K. Jain,et al.  Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Tal Hassner,et al.  Face recognition in unconstrained videos with matched background similarity , 2011, CVPR 2011.

[45]  Jianguo Li,et al.  Learning SURF Cascade for Fast and Accurate Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Inho Choi,et al.  Local Transform Features and Hybridization for Accurate Face and Human Detection , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Gang Hua,et al.  Probabilistic Elastic Part Model for Unsupervised Face Detector Adaptation , 2013, 2013 IEEE International Conference on Computer Vision.

[48]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[49]  Tat-Jen Cham,et al.  Fast training and selection of Haar features using statistics in boosting-based face detection , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[50]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[51]  Luc Van Gool,et al.  Face Detection without Bells and Whistles , 2014, ECCV.

[52]  Shengcai Liao,et al.  A Fast and Accurate Unconstrained Face Detector , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Bo Wu,et al.  Fast rotation invariant multi-view face detection based on real Adaboost , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[54]  Horst Bischof,et al.  Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark localization , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[55]  Shuo Yang,et al.  From Facial Parts Responses to Face Detection: A Deep Learning Approach , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[56]  Martin Köstinger,et al.  Efficient Metric Learning for Real-World Face Recognition , 2013 .

[57]  Scott B. Baden,et al.  A software-based dynamic-warp scheduling approach for load-balancing the Viola-Jones face detection algorithm on GPUs , 2013, J. Parallel Distributed Comput..

[58]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[59]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Karin Strauss,et al.  Accelerating Deep Convolutional Neural Networks Using Specialized Hardware , 2015 .