Visual Feature Combination and Distance Metric Learning for 3D Shape Retrieval

<Summary> Most existing algorithms for shape-based 3D model retrieval assume a specific shape class (e.g., rigid-body CAD models of mechanical parts) defined using a limited set of shape representations (e.g., singly-connected, closed meshes). Recently, however, a need has arisen for a more versatile algorithm capable of handling a wider class of shapes (e.g., articulated models) defined using diverse shape representations. We have previously proposed an appearance-based algorithm for 3D model retrieval that is invariant to articulation (global deformation) and can handle diverse shape representations. That algorithm extracts local image descriptors at interest points of 2D depth images rendered from multiple viewpoints around a 3D model. It achieved good retrieval accuracy for articulated, simpler 3D shapes, but its accuracy was not satisfactory for some other classes of shapes, e.g., complex and rigid models. In this paper, we propose a 3D model retrieval algorithm that can handle a wider class of shapes. The proposed algorithm builds on our earlier one, but employs randomly and densely sampled local visual features as well as a global visual feature. Distances among 3D models are computed by using distance metric learning. Experimental evaluations using multiple standard benchmarks as well as international 3D model retrieval contests have shown that the proposed algorithm outperforms many existing methods.
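The summary does not spell out how local and global features are turned into a single distance, so the following is only a rough sketch under stated assumptions, not the authors' implementation. It assumes local descriptors have already been extracted from the rendered depth images, quantizes them against a precomputed codebook into a bag-of-features histogram, and blends the histogram distance with a global-feature distance using a fixed weight. The fixed-weight blend is a stand-in for the learned distance mentioned above; all function names, the L1 distance, and the toy data are hypothetical.

```python
import random


def nearest_word(descriptor, codebook):
    """Index of the codebook entry closest (squared L2) to one descriptor."""
    def sq_dist(word):
        return sum((d - w) ** 2 for d, w in zip(descriptor, word))
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i]))


def bag_of_features(descriptors, codebook):
    """L1-normalized histogram of visual-word counts for one 3D model."""
    hist = [0.0] * len(codebook)
    for desc in descriptors:
        hist[nearest_word(desc, codebook)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def l1_distance(a, b):
    """Sum of absolute differences between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def combined_distance(local_a, local_b, global_a, global_b, weight=0.5):
    """Fixed-weight blend of local-histogram and global-feature distances.

    Assumption: this blend replaces the learned distance, which the
    summary does not describe in detail.
    """
    d_local = l1_distance(local_a, local_b)
    d_global = l1_distance(global_a, global_b)
    return weight * d_local + (1.0 - weight) * d_global


# Toy data: a hypothetical 3-word codebook over 2D descriptors.
rng = random.Random(0)
codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]


def toy_descriptors(center, n=50):
    """Hypothetical local descriptors scattered tightly around a center."""
    return [[c + rng.gauss(0.0, 0.05) for c in center] for _ in range(n)]


model_a = bag_of_features(toy_descriptors([0.0, 0.0]), codebook)
model_b = bag_of_features(toy_descriptors([0.0, 0.0]), codebook)
model_c = bag_of_features(toy_descriptors([1.0, 1.0]), codebook)

# Hypothetical global feature vectors, one per model.
global_a, global_b, global_c = [0.2, 0.8], [0.25, 0.75], [0.9, 0.1]

d_ab = combined_distance(model_a, model_b, global_a, global_b)
d_ac = combined_distance(model_a, model_c, global_a, global_c)
```

In this toy setup, models A and B draw their descriptors from the same region and have similar global vectors, so `d_ab` comes out smaller than `d_ac`. In a retrieval setting, the database models would be ranked by this combined distance to the query.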
