GPU deformable part model for object recognition

We consider the problem of rapidly detecting objects in static images or videos. The task consists in locating and identifying objects of interest. With the progress of affordable high computing hardware, we propose to analyse and evaluate the deformable part model on the Graphics Processing Unit. We do not take any prior assumptions on the scene and location of the objects. We provide a fast implementation and analyse the different modules of the state-of-the-art detector. Our implementation allows to accelerate both training and testing. While maintaining comparable classification performance, we report a speed-up of $$\times$$×10.6 using a standard GPU card compared to a baseline implemented in C++ on a single core and $$\times$$×5 compared to a multi-core OpenMP (8 threads) implementation.

[1]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Olivier Buisson,et al.  Scalable mining of small visual objects , 2012, ACM Multimedia.

[3]  Luc Van Gool,et al.  A mobile vision system for robust multi-person tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  George J. Grevera,et al.  Distance Transform Algorithms And Their Implementation And Evaluation , 2007 .

[5]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[6]  Ian Reid,et al.  fastHOG – a real-time GPU implementation of HOG , 2011 .

[7]  Trevor Darrell,et al.  Sparselet Models for Efficient Multiclass Object Detection , 2012, ECCV.

[8]  Bastian Leibe,et al.  Efficient Use of Geometric Constraints for Sliding-Window Object Detection in Video , 2011, ICVS.

[9]  Dima Damen,et al.  Detecting Carried Objects in Short Video Sequences , 2008, ECCV.

[10]  P. Danielsson Euclidean distance mapping , 1980 .

[11]  ZissermanAndrew,et al.  The Pascal Visual Object Classes Challenge , 2015 .

[12]  Luc Van Gool,et al.  Pedestrian detection at 100 frames per second , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  David A. McAllester,et al.  Cascade object detection with deformable part models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Dariu Gavrila,et al.  Multi-cue Pedestrian Detection and Tracking from a Moving Vehicle , 2007, International Journal of Computer Vision.

[17]  Paul E. Rybski,et al.  Real-time pedestrian detection with deformable part models , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[18]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[19]  Christoph H. Lampert,et al.  Efficient Subwindow Search: A Branch and Bound Framework for Object Localization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Jiri Matas,et al.  Geometric min-Hashing: Finding a (thick) needle in a haystack , 2009, CVPR.

[21]  Thomas Deselaers,et al.  What is an object? , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.