论文信息 - Using Depth to Extend Randomised Hough Forests for Object Detection and Localisation

Using Depth to Extend Randomised Hough Forests for Object Detection and Localisation

Implicit Shape Models (ISM) have been developed for object detection and localisation in 2-D (RGB) imagery and, to a lesser extent, full 3-D point clouds. Research is ongoing to extend the approach to 2-D imagery having co-registered depth (RGB- D) e.g. from stereoscopy, laser scanning, time-of-flight cameras etc.A popular implementation of the ISM is as a Randomised Forest of classifier trees representing codebooks for use in a Hough Transform voting framework. We present three extensions to the Class-Specific Hough Forest (CSHF) that utilises RGB and co- registered depth imagery acquired via stereoscopic mobile imaging. We demonstrate how depth and RGB information can be combined during training and at detection time. Rather than encoding depth as a new dimension of Hough space (which can increase vote sparsity), depth is used to modify the resulting placement and strength of votes in the original 2-D Hough space. We compare the effect of these depth-based extensions to the unmodified CSHF detection framework evaluated against a challenging new real- world dataset of urban street scenes.

Geoff A. W. West | Tele Tan | Richard Palmer

[1] Jitendra Malik,et al. Object detection using a max-margin Hough transform , 2009, CVPR.

[2] Juergen Gall,et al. Class-specific Hough forests for object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4] Nick Barnes,et al. Learning Hough Forest with Depth-Encoded Context for Object Detection , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[5] B. Schiele,et al. Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[6] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[7] Luc Van Gool,et al. Hough Transform and 3D SURF for Robust Three Dimensional Classification , 2010, ECCV.

[8] Sanja Fidler,et al. Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Dana H. Ballard,et al. Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[10] Daniel P. Huttenlocher,et al. Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[11] Sven Behnke,et al. Depth-Enhanced Hough Forests for Object-Class Detection and Continuous Pose Estimation , 2013 .

[12] Silvio Savarese,et al. Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery , 2010, ECCV.

[13] Mark Everingham,et al. Shared parts for deformable part-based models , 2011, CVPR 2011.

[14] Dima Damen,et al. Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15] Vijay Kumar. A Discriminative Voting Scheme for Object Detection using Hough Forests , 2010 .

[16] Bastian Leibe,et al. Efficient object detection and segmentation with a cascaded Hough Forest ISM , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[17] Richard Bowden,et al. Static Pose Estimation from Depth Images using Random Regression Forests and Hough Voting , 2012, VISAPP.

[18] Björn Stenger,et al. A new distance for scale-invariant 3D shape recognition and registration , 2011, 2011 International Conference on Computer Vision.

[19] Geoff A. W. West,et al. Scale Proportionate Histograms of Oriented Gradients for Object Detection in Co-Registered Visual and Range Data , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[20] Federico Tombari,et al. On the Use of Implicit Shape Models for Recognition of Object Categories in 3D Data , 2010, ACCV.

[21] Frédéric Jurie,et al. Randomized Clustering Forests for Image Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[23] Bernt Schiele,et al. Multiple Object Class Detection with a Generative Model , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24] Pushmeet Kohli,et al. On Detection of Multiple Object Instances Using Hough Transforms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.