An Evaluation of Local Shape Descriptors in Probabilistic Volumetric Scenes

This paper presents the first performance evaluation of local shape descriptors in probabilistic volumetric models (PVM) that are learned from multi-view aerial imagery of large scale urban scenes. The PVM offers a dense solution to the multi-view stereo problem, handling in a probabilistic manner the ambiguities caused by highly reflective surfaces, varying illumination conditions, registration errors, and sensor noise. A GPUbased octree implementation guarantees scalability of the PVM to large urban models, encouraging its application to higher level 3-d computer vision tasks. Furthermore, local descriptors form the basis of many shape-based applications and their performance has been studied extensively for images-based applications. Local descriptors have also been popular for 3-d shape understanding, but most 3-d descriptors have been used within the context of point clouds data collected with range sensors or polygonal meshes of CAD models. This work presents an investigation of the performance of several local shape descriptors in the PVM, which is learned from image data and where surfaces ambiguities are common and explicitly modeled. Descriptors are evaluated on accuracy for object classification using Bag-of-Words models. This evaluation forms a step towards the characterization of a new 3-d probabilistic representation for 3-d scene understanding.

[1]  Leonidas J. Guibas,et al.  Shape google: Geometric words and expressions for invariant shape retrieval , 2011, TOGS.

[2]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[3]  Changchang Wu,et al.  SiftGPU : A GPU Implementation of Scale Invariant Feature Transform (SIFT) , 2007 .

[4]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Luc Van Gool,et al.  Hough Transform and 3D SURF for Robust Three Dimensional Classification , 2010, ECCV.

[7]  Najla Megherbi Bouallagu,et al.  Object Recognition using 3D SIFT in Complex CT Volumes , 2010, BMVC.

[8]  Mohammed Bennamoun,et al.  Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Steven M. Seitz,et al.  Multicore bundle adjustment , 2011, CVPR 2011.

[10]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[11]  D. Sculley,et al.  Web-scale k-means clustering , 2010, WWW '10.

[12]  Joseph L. Mundy,et al.  Characterization of 3-D Volumetric Probabilistic Scenes for Object Recognition , 2012, IEEE Journal of Selected Topics in Signal Processing.

[13]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[14]  Afzal Godil,et al.  An evaluation of local shape descriptors for 3D shape retrieval , 2012, Electronic Imaging.

[15]  Vladimir G. Kim,et al.  Shape-based recognition of 3D point clouds in urban environments , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[16]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Joseph L. Mundy,et al.  Real-time rendering and dynamic updating of 3-d volumetric data , 2011, GPGPU-4.

[18]  Joseph L. Mundy,et al.  Change Detection in a 3-d World , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Chin Seng Chua,et al.  Point Signatures: A New Representation for 3D Object Recognition , 1997, International Journal of Computer Vision.

[20]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[21]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[22]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[23]  Ghassan Hamarneh,et al.  n -SIFT: n -Dimensional Scale Invariant Feature Transform , 2009, IEEE Trans. Image Process..

[24]  Andrew Zisserman,et al.  A Statistical Approach to Material Classification Using Image Patch Exemplars , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Remco C. Veltkamp,et al.  A survey of content based 3D shape retrieval methods , 2004, Proceedings Shape Modeling Applications, 2004..

[26]  Alexander M. Bronstein,et al.  Volumetric heat kernel signatures , 2010, 3DOR '10.

[27]  Marcin Novotni,et al.  3D zernike descriptors for content based shape retrieval , 2003, SM '03.

[28]  Remco C. Veltkamp,et al.  A Survey of Content Based 3D Shape Retrieval Methods , 2004, SMI.

[29]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[30]  Joseph L. Mundy,et al.  Vehicle Recognition as Changes in Satellite Imagery , 2010, 2010 20th International Conference on Pattern Recognition.

[31]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[32]  Bernard Chazelle,et al.  A Reflective Symmetry Descriptor for 3D Models , 2003, Algorithmica.

[33]  Gabriel Taubin,et al.  A Variable-Resolution Probabilistic Three-Dimensional Model for Change Detection , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[34]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[35]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[36]  Mongi A. Abidi,et al.  Surface matching by 3D point's fingerprint , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[37]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[38]  Cindy Grimm,et al.  Local Shape Descriptors, a Survey and Evaluation , 2011, 3DOR@Eurographics.

[39]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Afzal Godil,et al.  Exploring the Bag-of-Words method for 3D shape retrieval , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[41]  Marcel Körtgen,et al.  3D Shape Matching with 3D Shape Contexts , 2003 .

[42]  Nico Blodow,et al.  Towards 3D Point cloud based object maps for household environments , 2008, Robotics Auton. Syst..

[43]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[44]  Leonidas J. Guibas,et al.  Probabilistic fingerprints for shapes , 2006, SGP '06.

[45]  Tsuhan Chen,et al.  Efficient feature extraction for 2D/3D objects in mesh representation , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[46]  Thomas A. Funkhouser,et al.  Selecting Distinctive 3D Shape Descriptors for Similarity Retrieval , 2006, IEEE International Conference on Shape Modeling and Applications 2006 (SMI'06).