Fusion of local and global features for efficient object detection

In this paper, we propose a two-stage method for efficient object detection that combines full advantages of AdaBoost and SVM to achieve a reasonable balance in both training and classification speed. In the first stage, we use Haar wavelet features and AdaBoost to train a cascade of classifiers for quick and efficient rejection. This cascade of classifiers consists of simple-to-complex classifiers that allow adapting to complexity of input patterns, rejects almost 90%-99% non-object patterns rapidly. Hard patterns, object-like patterns, which are passed through the first stage, will be classified by the second stage which uses a non linear SVM-based classifier with pixel-based global features. The nonlinear SVM classifier is robust enough in order to reach high performance. We have investigated our proposed method to detect different kinds of objects such as face and facial features like eye and mouth regions. In training, our system is roughly 25 times faster than the system trained by AdaBoost. In running, the experimental results show 1,000 times faster than SVM based method and slightly slower than AdaBoost based method with a comparable accuracy.

[1]  Thomas Serre,et al.  Hierarchical classification and feature reduction for fast face detection with support vector machines , 2003, Pattern Recognit..

[2]  Harry Shum,et al.  Kullback-Leibler boosting , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[3]  Henry Schneiderman,et al.  Feature-centric evaluation for efficient cascaded object detection , 2004, CVPR 2004.

[4]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[5]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Andrew Blake,et al.  Computationally efficient face detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7]  Chiou-Shann Fuh,et al.  Fast Object Detection with Occlusions , 2004, ECCV.

[8]  Takeo Kanade,et al.  Name-It: association of face and name in video , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Stan Z. Li,et al.  FloatBoost learning and statistical face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Mingjing Li,et al.  Robust multipose face detection in images , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[13]  Constantine Papageorgiou,et al.  A trainable system for object detection in images and video sequences , 2000 .

[14]  S. Li,et al.  Real-time face detection using boosting in hierarchical feature spaces , 2004, ICPR 2004.

[15]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[17]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[18]  James M. Rehg,et al.  Learning a Rare Event Detection Cascade by Direct Feature Selection , 2003, NIPS.

[19]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..