Deep Nonlinear Metric Learning for 3-D Shape Retrieval

Effective 3-D shape retrieval is an important problem in 3-D shape analysis. Recently, feature learning-based shape retrieval methods have been widely studied, where the distance metrics between 3-D shape descriptors are usually hand-crafted. In this paper, motivated by the fact that deep neural network has the good ability to model nonlinearity, we propose to learn an effective nonlinear distance metric between 3-D shape descriptors for retrieval. First, the locality-constrained linear coding method is employed to encode each vertex on the shape and the encoding coefficient histogram is formed as the global 3-D shape descriptor to represent the shape. Then, a novel deep metric network is proposed to learn a nonlinear transformation to map the 3-D shape descriptors to a nonlinear feature space. The proposed deep metric network minimizes a discriminative loss function that can enforce the similarity between a pair of samples from the same class to be small and the similarity between a pair of samples from different classes to be large. Finally, the distance between the outputs of the metric network is used as the similarity for shape retrieval. The proposed method is evaluated on the McGill, SHREC’10 ShapeGoogle, and SHREC’14 Human shape datasets. Experimental results on the three datasets validate the effectiveness of the proposed method.

[1]  Ralph R. Martin,et al.  Non-rigid 3D Shape Retrieval , 2015, 3DOR@Eurographics.

[2]  Gang Wang,et al.  Multi-manifold deep metric learning for image set classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Chunheng Wang,et al.  Deep nonlinear metric learning with independent subspace analysis for face verification , 2012, ACM Multimedia.

[4]  Andrea Giachetti,et al.  Radial Symmetry Detection and Shape Characterization with the Multiscale Area Projection Transform , 2012, Comput. Graph. Forum.

[5]  Iasonas Kokkinos,et al.  Intrinsic shape context descriptors for deformable shapes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[7]  Stephen Tyree,et al.  Non-linear Metric Learning , 2012, NIPS.

[8]  Jiwen Lu,et al.  Discriminative Deep Metric Learning for Face Verification in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Leonidas J. Guibas,et al.  Global Intrinsic Symmetries of Shapes , 2008, Comput. Graph. Forum.

[10]  Mohamed Daoudi,et al.  A Bayesian 3-D Search Engine Using Adaptive Views Clustering , 2007, IEEE Transactions on Multimedia.

[11]  Horst Bischof,et al.  Joint Learning of Discriminative Prototypes and Large Margin Nearest Neighbor Classifiers , 2013, 2013 IEEE International Conference on Computer Vision.

[12]  Ioannis Pratikakis,et al.  3D Object Retrieval using an Efficient and Compact Hybrid Shape Descriptor , 2008, 3DOR@Eurographics.

[13]  Longin Jan Latecki,et al.  3D Shape Matching via Two Layer Coding , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Daniel Cremers,et al.  The wave kernel signature: A quantum mechanical approach to shape analysis , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[15]  Ryutarou Ohbuchi,et al.  Distance metric learning and feature combination for shape-based 3D model retrieval , 2010, 3DOR '10.

[16]  Yizhou Yu,et al.  Fast nonrigid 3D retrieval using modal space transform , 2013, ICMR.

[17]  Leonidas J. Guibas,et al.  A concise and provably informative multi-scale signature based on heat diffusion , 2009 .

[18]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Angelos Barmpoutis,et al.  Tensor Body: Real-Time Reconstruction of the Human Body and Avatar Synthesis From RGB-D , 2013, IEEE Transactions on Cybernetics.

[20]  Iasonas Kokkinos,et al.  Scale-invariant heat kernel signatures for non-rigid shape recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Leonidas J. Guibas,et al.  Shape google: Geometric words and expressions for invariant shape retrieval , 2011, TOGS.

[22]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[23]  R. Horaud,et al.  Surface feature detection and description with applications to mesh matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Ali Shokoufandeh,et al.  Retrieving articulated 3-D models using medial surfaces , 2008, Machine Vision and Applications.

[25]  A. Ben Hamza,et al.  A multiresolution descriptor for deformable 3D shape retrieval , 2013, The Visual Computer.

[26]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[27]  Hubert P. H. Shum,et al.  Real-Time Posture Reconstruction for Microsoft Kinect , 2013, IEEE Transactions on Cybernetics.

[28]  Hamid Laga,et al.  Covariance Descriptors for 3D Shape Matching and Retrieval , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Guillaume Lavoué,et al.  Combination of bag-of-words descriptors for robust partial shape retrieval , 2012, The Visual Computer.

[30]  Ryutarou Ohbuchi,et al.  Dense sampling and fast encoding for 3D model retrieval using bag-of-visual features , 2009, CIVR '09.

[31]  Hassen Drira,et al.  4-D Facial Expression Recognition by Learning Geometric Deformations , 2014, IEEE Transactions on Cybernetics.

[32]  Andrea Giachetti,et al.  SHREC ’ 15 Track : Non-rigid 3 D Shape Retrieval † , 2016 .

[33]  Edward K. Wong,et al.  Deepshape: Deep learned shape descriptor for 3D shape matching and retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Sergio Escalera,et al.  Spherical Blurred Shape Model for 3-D Object and Pose Recognition: Quantitative Analysis and HCI Applications in Smart Environments , 2014, IEEE Transactions on Cybernetics.

[35]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[36]  Ioannis Pratikakis,et al.  Retrieval of 3D Articulated Objects Using a Graph-based Representation , 2009, 3DOR@Eurographics.

[37]  Alexander M. Bronstein,et al.  Supervised learning of bag‐of‐features shape descriptors using sparse coding , 2014, Comput. Graph. Forum.

[38]  Raif M. Rustamov,et al.  Laplace-Beltrami eigenfunctions for deformation invariant shape representation , 2007 .

[39]  Bo Li,et al.  Shape Retrieval of Non-rigid 3D Human Models , 2014, International Journal of Computer Vision.

[40]  Yosi Keller,et al.  Scale-Invariant Features for 3-D Mesh Models , 2012, IEEE Transactions on Image Processing.

[41]  Hamid Laga,et al.  Compact Vectors of Locally Aggregated Tensors for 3D Shape Retrieval , 2013, 3DOR@Eurographics.