Fast discrimination by early judgment using linear classifier

Object detection involves classification of a huge number of detection windows obtained by raster scanning of the input image. For each detection window, a classifier trained with local features and a statistical learning method outputs a value for the target class. In this paper, we investigated the introduction of linear SVM approximate computation to object detection to increase the speed of raster scanning. We propose a method of fast discrimination by early judgment using linear classifier based approximation calculation. Doing so enables high-speed linear SVM classification by adaptively determining the number of bases required in the approximation calculations for the input detection window. Also, higher accuracy is attained in the object detection by representing the co-occurrence of binary-coded (B-HOG) forms of the HOG features that are used when doing the linear SVM approximating calculations. Evaluation experiments on human detection show that the proposed method is faster than using HOG features and linear SVM by a factor of 17 and improves the classification accuracy by about 6.1%.

[1]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[3]  Ian Reid,et al.  fastHOG – a real-time GPU implementation of HOG , 2011 .

[4]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[5]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[6]  Philip H. S. Torr,et al.  Efficient online structured output learning for keypoint-based object tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Hironobu Fujiyoshi,et al.  Relational HOG feature with wild-card for object detection , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Ichiro Masaki,et al.  Fast human detection with cascaded ensembles on the GPU , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Piotr Dollár,et al.  Crosstalk Cascades for Frame-Rate Pedestrian Detection , 2012, ECCV.

[12]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[13]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[14]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[16]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17]  Bernt Schiele,et al.  Monocular 3D scene understanding with explicit occlusion reasoning , 2011, CVPR 2011.