Recursive Coarse-to-Fine Localization for Fast Object Detection

Sliding window (SW) technique is one of common paradigms employed for object detection. However, the computational cost of this approach is so expensive because the detection window is scanned at all possible positions and scales. To overcome this problem, we propose a compact feature together with fast recursive coarse-to-fine object localization strategy. To build a compact feature, we project the Histograms of Oriented Gradient (HOG) features to linear subspace by Principal Component Analysis (PCA). We call this feature as PCA-HOG feature. The exploitation of the PCA-HOG feature not only helps the classifiers run faster but also still maintains the accuracy. In order to further speeding up the localization, we propose a recursive coarse-to-fine refinement to scan image. We scan image in both scale space and multi-resolution space from coarsest to finest resolutions. Only the best obtained hypothesis from the coarser resolution could be passed to finer resolution. Each resolution has its own linear Support Vector Machine (SVM) classifier and PCA-HOG features. Evaluation with INRIA dataset shows that our method achieves a significant speedup compared to standard sliding window and original HOG feature, while even get higher detection accuracy.

[1]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Wei Zhang,et al.  Real-time Accurate Object Detection using Multiple Resolutions , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[6]  Jordi Gonzàlez,et al.  Recursive Coarse-to-Fine Localization for Fast Object Detection , 2010, ECCV.

[7]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.