Deep semantic hashing of 3D geometric features for efficient 3D model retrieval

As the scale of 3D model databases increase, speed, in addition to accuracy, of its search becomes very important. One way to achieve fast search is to use a compact 3D shape feature whose cost of comparison is very small. Binarization of a real-valued feature vector, via hashing, is a way to obtain such compact feature vector. Previous algorithms for producing binarized 3D model features via hashing consisted of two disconnected stages; handcrafted real-valued 3D shape feature extraction followed by hashing into binary code. This compartmentalized approach, however, leads to less-than optimal binary codes, as these two stages are optimized independently. This paper proposes a deep semantic hashing algorithm called Binarized Deep Local feature Aggregation Network (BDLAN) which jointly optimizes real-valued feature extraction per 3D model and its banarization via hashing. BDLAN training minimizes quantization error caused by binarization. However, this constraint alone often maps real-valued features to their nearest binary codes in Hamming space, which are nonoptimal local minima. To alleviate the issue, we add a simple regularization called Probabilistic Bit Inversion (PBI) of binary codes. Experimental evaluation of the proposed algorithms demonstrates superior efficiency and competitive accuracy to the existing 3D model retrieval algorithms employing real-valued features.

[1]  Longin Jan Latecki,et al.  GIFT: A Real-Time and Scalable 3D Shape Search Engine , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Guozhong An,et al.  The Effects of Adding Noise During Backpropagation Training on a Generalization Performance , 1996, Neural Computation.

[3]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[4]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Andrea Giachetti,et al.  SHREC ’ 15 Track : Non-rigid 3 D Shape Retrieval † , 2016 .

[7]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  José García Rodríguez,et al.  PointNet: A 3D Convolutional Neural Network for real-time object class recognition , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[9]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[10]  Sanjiv Kumar,et al.  Angular Quantization-based Binary Codes for Fast Similarity Search , 2012, NIPS.

[11]  Hamid Laga,et al.  Compact Vectors of Locally Aggregated Tensors for 3D Shape Retrieval , 2013, 3DOR@Eurographics.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[14]  Ryutarou Ohbuchi,et al.  Randomized Sub-Volume Partitioning for Part-Based 3D Model Retrieval , 2015, 3DOR@Eurographics.

[15]  Akihiro Yamamoto,et al.  Learning semantic categories for 3D model retrieval , 2007, MIR '07.

[16]  Ralph R. Martin,et al.  Non-rigid 3D Shape Retrieval , 2015, 3DOR@Eurographics.

[17]  Ryutarou Ohbuchi,et al.  Hashing Cross-Modal Manifold for Scalable Sketch-Based 3D Model Retrieval , 2014, 2014 2nd International Conference on 3D Vision.

[18]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[19]  Thomas A. Funkhouser,et al.  Selecting Distinctive 3D Shape Descriptors for Similarity Retrieval , 2006, IEEE International Conference on Shape Modeling and Applications 2006 (SMI'06).

[20]  Pierre Vandergheynst,et al.  Geodesic Convolutional Neural Networks on Riemannian Manifolds , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[21]  Chu-Song Chen,et al.  Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Ryutarou Ohbuchi,et al.  Deep Aggregation of Local 3D Geometric Features for 3D Model Retrieval , 2016, BMVC.

[23]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[24]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[25]  Nikos Paragios,et al.  Data fusion through cross-modality metric learning using similarity-sensitive hashing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Bernard Chazelle,et al.  Shape distributions , 2002, TOGS.

[27]  Yichen Wei,et al.  Sketch-Based Retrieval Using Content-Aware Hashing , 2014, PCM.

[28]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[29]  Shiguang Shan,et al.  Deep Supervised Hashing for Fast Image Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Ryutarou Ohbuchi,et al.  Fusing Multiple Features for Shape-based 3D Model Retrieval , 2014, BMVC.

[32]  Yi Liu,et al.  Shape Topics: A Compact Representation and New Algorithms for 3D Partial Shape Retrieval , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[33]  Misha Denil,et al.  Noisy Activation Functions , 2016, ICML.

[34]  Ryutarou Ohbuchi,et al.  Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval , 2015, ICMR.

[35]  Quoc V. Le,et al.  Adding Gradient Noise Improves Learning for Very Deep Networks , 2015, ArXiv.

[36]  Bin Fang,et al.  Large Scale Comprehensive 3D Shape Retrieval , 2014, 3DOR@Eurographics.

[37]  Jiwen Lu,et al.  Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Bin Fang,et al.  SHREC'14 Track: Large Scale Comprehensive 3D Shape Retrieval , 2014 .