Using Depth to Extend Randomised Hough Forests for Object Detection and Localisation

Implicit Shape Models (ISM) have been developed for object detection and localisation in 2-D (RGB) imagery and, to a lesser extent, full 3-D point clouds. Research is ongoing to extend the approach to 2-D imagery having co-registered depth (RGB- D) e.g. from stereoscopy, laser scanning, time-of-flight cameras etc.A popular implementation of the ISM is as a Randomised Forest of classifier trees representing codebooks for use in a Hough Transform voting framework. We present three extensions to the Class-Specific Hough Forest (CSHF) that utilises RGB and co- registered depth imagery acquired via stereoscopic mobile imaging. We demonstrate how depth and RGB information can be combined during training and at detection time. Rather than encoding depth as a new dimension of Hough space (which can increase vote sparsity), depth is used to modify the resulting placement and strength of votes in the original 2-D Hough space. We compare the effect of these depth-based extensions to the unmodified CSHF detection framework evaluated against a challenging new real- world dataset of urban street scenes.

[1]  Jitendra Malik,et al.  Object detection using a max-margin Hough transform , 2009, CVPR.

[2]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Nick Barnes,et al.  Learning Hough Forest with Depth-Encoded Context for Object Detection , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[5]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[6]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[7]  Luc Van Gool,et al.  Hough Transform and 3D SURF for Robust Three Dimensional Classification , 2010, ECCV.

[8]  Sanja Fidler,et al.  Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[10]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[11]  Sven Behnke,et al.  Depth-Enhanced Hough Forests for Object-Class Detection and Continuous Pose Estimation , 2013 .

[12]  Silvio Savarese,et al.  Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery , 2010, ECCV.

[13]  Mark Everingham,et al.  Shared parts for deformable part-based models , 2011, CVPR 2011.

[14]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Vijay Kumar A Discriminative Voting Scheme for Object Detection using Hough Forests , 2010 .

[16]  Bastian Leibe,et al.  Efficient object detection and segmentation with a cascaded Hough Forest ISM , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[17]  Richard Bowden,et al.  Static Pose Estimation from Depth Images using Random Regression Forests and Hough Voting , 2012, VISAPP.

[18]  Björn Stenger,et al.  A new distance for scale-invariant 3D shape recognition and registration , 2011, 2011 International Conference on Computer Vision.

[19]  Geoff A. W. West,et al.  Scale Proportionate Histograms of Oriented Gradients for Object Detection in Co-Registered Visual and Range Data , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[20]  Federico Tombari,et al.  On the Use of Implicit Shape Models for Recognition of Object Categories in 3D Data , 2010, ACCV.

[21]  Frédéric Jurie,et al.  Randomized Clustering Forests for Image Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[23]  Bernt Schiele,et al.  Multiple Object Class Detection with a Generative Model , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24]  Pushmeet Kohli,et al.  On Detection of Multiple Object Instances Using Hough Transforms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.