3D Keypoint Detection Based on Deep Neural Network with Sparse Autoencoder

Researchers have proposed various methods to extract 3D keypoints from the surface of 3D mesh models over the last decades, but most of them are based on geometric methods, which lack enough flexibility to meet the requirements for various applications. In this paper, we propose a new method on the basis of deep learning by formulating the 3D keypoint detection as a regression problem using deep neural network (DNN) with sparse autoencoder (SAE) as our regression model. Both local information and global information of a 3D mesh model in multi-scale space are fully utilized to detect whether a vertex is a keypoint or not. SAE can effectively extract the internal structure of these two kinds of information and formulate high-level features for them, which is beneficial to the regression model. Three SAEs are used to formulate the hidden layers of the DNN and then a logistic regression layer is trained to process the high-level features produced in the third SAE. Numerical experiments show that the proposed DNN based 3D keypoint detection algorithm outperforms current five state-of-the-art methods for various 3D mesh models.

[1]  Afzal Godil,et al.  Evaluation of 3D interest point detection techniques via human-generated ground truth , 2012, The Visual Computer.

[2]  Quoc V. Le,et al.  Measuring Invariances in Deep Networks , 2009, NIPS.

[3]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[4]  Yosi Keller,et al.  Scale-Invariant Features for 3-D Mesh Models , 2012, IEEE Transactions on Image Processing.

[5]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[6]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[7]  David W. Jacobs,et al.  Mesh saliency , 2005, ACM Trans. Graph..

[8]  Leonidas J. Guibas,et al.  Robust global registration , 2005, SGP '05.

[9]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Federico Tombari,et al.  Learning a Descriptor-Specific 3D Keypoint Detector , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[11]  Thomas A. Funkhouser,et al.  Shape-based retrieval and analysis of 3d models , 2005, CACM.

[12]  Ralph R. Martin,et al.  Mesh saliency via spectral processing , 2014, ACM Trans. Graph..

[13]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[14]  Jim Austin,et al.  A Machine-Learning Approach to Keypoint Detection and Landmarking on 3D Meshes , 2012, International Journal of Computer Vision.

[15]  Anil K. Jain,et al.  On reliable curvature estimation , 1989, CVPR.

[16]  Hui Chen,et al.  3D free-form object recognition in range images using local surface patches , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[17]  Benjamin Bustos,et al.  Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes , 2011, The Visual Computer.

[18]  Ko Nishino,et al.  Scale-Dependent 3D Geometric Features , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[19]  Leonidas J. Guibas,et al.  A concise and provably informative multi-scale signature based on heat diffusion , 2009 .

[20]  W. F. Harris,et al.  The second fundamental form of a surface and its relation to the dioptric power matrix, sagitta and lens thickness , 1989, Ophthalmic & physiological optics : the journal of the British College of Ophthalmic Opticians.

[21]  Martial Hebert,et al.  Multi-scale interest regions from unorganized point clouds , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[22]  Afzal Godil,et al.  Salient local 3D features for 3D shape retrieval , 2011, Electronic Imaging.

[23]  Mohammed Bennamoun,et al.  On the Repeatability and Quality of Keypoints for Local Feature-based 3D Object Retrieval from Cluttered Scenes , 2009, International Journal of Computer Vision.

[24]  Ayellet Tal,et al.  Mesh segmentation using feature point and core extraction , 2005, The Visual Computer.

[25]  D. Cox The Regression Analysis of Binary Sequences , 2017 .

[26]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[27]  Jing Hua,et al.  Salient spectral geometric features for shape matching and retrieval , 2009, The Visual Computer.

[28]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[29]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[30]  Ramsay Dyer,et al.  Spectral Mesh Processing , 2010, Comput. Graph. Forum.

[31]  Michael B. Holte,et al.  3D interest point detection using local surface characteristics with application in action recognition , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[32]  Markus H. Gross,et al.  Point-based multiscale surface representation , 2006, TOGS.

[33]  Umberto Castellani,et al.  Sparse points matching by combining 3D mesh saliency with statistical descriptors , 2008, Comput. Graph. Forum.

[34]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[35]  Seon-Min Rhee,et al.  SDTP: a robust method for interest point detection on 3D range images , 2014, Electronic Imaging.

[36]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[37]  Erdem Akagündüz,et al.  Scale and orientation invariant 3D interest point extraction using HK curvatures , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[38]  Yu Zhong,et al.  Intrinsic shape signatures: A shape descriptor for 3D object recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[39]  Adrian Hilton,et al.  Shape Similarity for 3D Video Sequences of People , 2010, International Journal of Computer Vision.

[40]  Federico Tombari,et al.  Performance Evaluation of 3D Keypoint Detectors , 2012, International Journal of Computer Vision.

[41]  Philippos Mordohai,et al.  3D Interest Point Detection via Discriminative Learning , 2013, ECCV.

[42]  R. Horaud,et al.  Surface feature detection and description with applications to mesh matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Hui Chen,et al.  3D free-form object recognition in range images using local surface patches , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[44]  Ralph R. Martin,et al.  3D point of interest detection via spectral irregularity diffusion , 2013, The Visual Computer.

[45]  Jianfei Cai,et al.  Variational mesh decomposition , 2012, TOGS.