Aggregating Sparse Binarized Local Features by Summing for Efficient 3D Model Retrieval

An effective and widespread approach for shape-based 3D model retrieval (3DMR) is to use a feature vector per 3D model obtained by aggregating, or pooling, a set of local features extracted from the 3D model. State-of-the-art feature aggregation algorithms, such as Fisher Vector (FV) coding [7] or Super Vector (SV) coding [22], used in the approach is not spatially efficient, however. The FV or SV, for example, typically encodes a local feature into a very high-dimensional (e.g., 300k-dimensional) vector. For a database containing a large number of 3D models, the spatial cost of storing all the aggregated feature vectors for the database becomes very high. In this paper, we propose a novel, spatially efficient yet accurate feature aggregation algorithm called Sum of Sparse Binary codes (SSB) aggregation. The SSB first encodes a local feature into a highly sparse binary code. Then, a set of sparse binary codes are aggregated efficiently by simple summing into a compact feature vector. We also propose fast SSB (fSSB) aggregation, which is a computationally efficient approximation of the SSB. Experiments using a 3DMR scenario show that the proposed algorithms are significantly more efficient than the state-of-the-art feature aggregation algorithms we have compared against. At the same time, retrieval accuracies of the proposed algorithms are equal or better than the state-of-the-art aggregation algorithms.

[1]  Ryutarou Ohbuchi,et al.  Non-rigid 3D Model Retrieval Using Set of Local Statistical Features , 2012, 2012 IEEE International Conference on Multimedia and Expo Workshops.

[2]  Brendan J. Frey,et al.  k-Sparse Autoencoders , 2013, ICLR.

[3]  Tieniu Tan,et al.  Feature Coding in Image Classification: A Comprehensive Study , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Thomas S. Huang,et al.  Image Classification Using Super-Vector Coding of Local Image Descriptors , 2010, ECCV.

[5]  Dorian Gálvez-López,et al.  Real-time loop detection with bags of binary words , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Ryutarou Ohbuchi,et al.  Dense sampling and fast encoding for 3D model retrieval using bag-of-visual features , 2009, CIVR '09.

[7]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[8]  Bin Fang,et al.  Large Scale Comprehensive 3D Shape Retrieval , 2014, 3DOR@Eurographics.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[11]  Lei Wang,et al.  In defense of soft-assignment coding , 2011, 2011 International Conference on Computer Vision.

[12]  Ryutarou Ohbuchi,et al.  Randomized Sub-Volume Partitioning for Part-Based 3D Model Retrieval , 2015, 3DOR@Eurographics.

[13]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[14]  Sanjiv Kumar,et al.  Angular Quantization-based Binary Codes for Fast Similarity Search , 2012, NIPS.

[15]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[16]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Ryutarou Ohbuchi,et al.  Fusing Multiple Features for Shape-based 3D Model Retrieval , 2014, BMVC.

[18]  Mohammed Bennamoun,et al.  Rotational Projection Statistics for 3D Local Surface Description and Object Recognition , 2013, International Journal of Computer Vision.

[19]  Paul Suetens,et al.  SHREC '11 Track: Shape Retrieval on Non-rigid 3D Watertight Meshes , 2011, 3DOR@Eurographics.

[20]  Weisi Lin,et al.  B-SHOT: A binary feature descriptor for fast and efficient keypoint matching on 3D point clouds , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Ryutarou Ohbuchi,et al.  Lightweight Binary Voxel Shape Features for 3D Data Matching and Retrieval , 2015, 2015 IEEE International Conference on Multimedia Big Data.

[22]  Karthik Ramani,et al.  Developing an engineering shape benchmark for CAD models , 2006, Comput. Aided Des..

[23]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Ryutarou Ohbuchi,et al.  Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval , 2015, ICMR.

[25]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[26]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  IV. F ISHER,et al.  Image Retrieval with Fisher Vectors of Binary Features , 2013 .

[28]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..