Toward Compressed 3D Descriptors

Visual search for mobile devices relies on transmitting wirelessly a compact representation of the query image, generally in the form of feature descriptors, to a remote server. Descriptors are therefore compressed, so as to reduce bandwidth occupancy and network latency. Given the impressive pace of growth of 3D video technology, we foresee 3D visual search applications for the mobile and the robotic market to become a reality. Accordingly, our work proposes a study on compressed 3D descriptors, a fundamental building block for such prospective applications. Based on analysis of several compression approaches, we develop and assess different schemes to achieve a compact version of a state-of-the-art 3D descriptor. Through experiments on a vast dataset we demonstrate the ability to achieve compression rates as high as 98% with a negligible loss in 3D visual search performance.

[1]  Chuohao Yeo,et al.  Rate-efficient visual correspondences using random projections , 2008, 2008 15th IEEE International Conference on Image Processing.

[2]  Marc Pollefeys,et al.  Leveraging 3D City Models for Rotation Invariant Place-of-Interest Recognition , 2011, International Journal of Computer Vision.

[3]  Bernd Girod,et al.  Compressed Histogram of Gradients: A Low-Bitrate Descriptor , 2011, International Journal of Computer Vision.

[4]  Matthew Johnson,et al.  Generalized Descriptor Compression for Storage and Matching , 2010, BMVC.

[5]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[7]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[9]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[10]  Sam S. Tsai,et al.  Survey of SIFT Compression Schemes , 2010 .

[11]  Yu Zhong,et al.  Intrinsic shape signatures: A shape descriptor for 3D object recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[12]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[13]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Federico Tombari,et al.  A combined texture-shape descriptor for enhanced 3D feature matching , 2011, 2011 18th IEEE International Conference on Image Processing.

[15]  Tomás Pajdla,et al.  3D with Kinect , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[16]  Vivek K. Goyal,et al.  CoDAC: A compressive depth acquisition camera framework , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Eric Bodden,et al.  Arithmetic Coding revealed A guided tour from theory to praxis. , 2004 .

[18]  Michel Barret,et al.  ICA based algorithms for computing optimal 1-D linear block transforms in variable high-rate source coding , 2008, Signal Process..

[19]  Matthew A. Brown,et al.  Picking the best DAISY , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[21]  E Guizzo,et al.  Robots with their heads in the clouds , 2011 .

[22]  Federico Tombari,et al.  Unique shape context for 3d data description , 2010, 3DOR '10.

[23]  Ioannis Pratikakis,et al.  3D Object Retrieval using an Efficient and Compact Hybrid Shape Descriptor , 2008, 3DOR@Eurographics.

[24]  Mohammed Bennamoun,et al.  On the Repeatability and Quality of Keypoints for Local Feature-based 3D Object Retrieval from Cluttered Scenes , 2009, International Journal of Computer Vision.

[25]  Antonio Torralba,et al.  Small codes and large image databases for recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Lei Li,et al.  On Using Exponential-Golomb Codes and Subexponential Codes for System-on-a-Chip Test Data Compression , 2004, J. Electron. Test..

[27]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[28]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[29]  D. Falie,et al.  Noise Characteristics of 3D Time-of-Flight Cameras , 2007, 2007 International Symposium on Signals, Circuits and Systems.

[30]  R. Horaud,et al.  Surface feature detection and description with applications to mesh matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.