Paper Information - A Unified Deep Metric Representation for Mesh Saliency Detection and Non-Rigid Shape Matching

In this paper, we propose a deep metric for unifying the representation of mesh saliency detection and non-rigid shape matching. While saliency detection and shape matching are two closely related and fundamental tasks in shape analysis, previous methods approach them separately and independently, failing to exploit their mutually beneficial underlying relationship. In view of the existing gap between saliency and matching, we propose to solve them together using a unified metric representation of surface meshes. We show that saliency and matching can be rigorously derived from our representation as the principal eigenvector and the smoothed Laplacian eigenvectors, respectively. Learning the representation jointly allows matching to improve the deformation-invariance of saliency while allowing saliency to improve the feature localization of matching. To parameterize the representation from a mesh, we also propose a deep recurrent neural network (RNN) for effectively integrating multi-scale shape features and a soft-thresholding operator for adaptively enhancing the sparsity of saliency. Results show that by jointly learning from a pair of saliency and matching datasets, matching improves the accuracy of detected salient regions on meshes, an effect that is especially pronounced for small-scale saliency datasets, such as those having one to two meshes. At the same time, saliency improves the accuracy of shape matching between meshes, reducing matching errors on surfaces.
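
To make the spectral quantities named in the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the metric representation can be stood in for by a symmetric positive affinity matrix `W` built from a toy point set; the function names, the Gaussian affinity, and the threshold value `tau` are all illustrative assumptions. It shows a principal eigenvector computed by power iteration, a soft-thresholding step that sparsifies it, and plain (unsmoothed) graph Laplacian eigenvectors. The paper's smoothing step and learned RNN parameterization are not reproduced here.

```python
import numpy as np

def soft_threshold(x, tau):
    """Shrink each entry toward zero by tau; entries with |x| <= tau become 0."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def principal_eigenvector(M, iters=500, tol=1e-10):
    """Power iteration for the dominant eigenvector of a symmetric positive matrix."""
    n = M.shape[0]
    v = np.full(n, 1.0 / np.sqrt(n))  # positive, unit-norm starting vector
    for _ in range(iters):
        w = M @ v
        w /= np.linalg.norm(w)
        if np.linalg.norm(w - v) < tol:
            break
        v = w
    return w

def laplacian_eigenvectors(M, k):
    """First k nontrivial eigenvectors of the unnormalized Laplacian L = D - M."""
    L = np.diag(M.sum(axis=1)) - M
    _, vecs = np.linalg.eigh(L)  # eigenvalues are returned in ascending order
    return vecs[:, 1:k + 1]      # skip the constant eigenvector (eigenvalue 0)

# Toy stand-in for a mesh: six points on a line with a Gaussian affinity (assumption).
x = np.arange(6, dtype=float)
W = np.exp(-0.5 * (x[:, None] - x[None, :]) ** 2)

saliency = principal_eigenvector(W)              # dense, strictly positive scores
sparse_saliency = soft_threshold(saliency, 0.3)  # tau = 0.3 is an arbitrary choice
embedding = laplacian_eigenvectors(W, k=2)       # per-point spectral coordinates
```

In this sketch the soft-thresholding step only illustrates how small scores can be zeroed out to sharpen a saliency map; in the paper the operator is described as adaptive, which a fixed `tau` does not capture.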
