Binary code learning via optimal class representations

Hashing is an attracting technique for fast retrieval due to its low storage and computation costs. By hashing, each high-dimensional vector is mapped into a low-dimensional binary code vector and retrieval is performed in the Hamming space. Recently several hashing methods have been proposed, among which, supervised hashing methods have shown great performance by incorporating the supervision information. However, most previous supervised methods simply focused on the pairwise label information of data, and ignored the structure information and relationship within data. To tackle this problem, we propose to learn binary codes by explicitly taking into account class semantic relatedness. Specifically, a set of binary codes is computed according to the intrinsic class similarities in data and serves as the optimal class representations. We show that, by mapping images onto the optimal representation of their corresponding classes, our proposed method outperforms several other state-of-the-art supervised hashing methods in image retrieval on three large-scale datasets.

[1]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2]  Wu-Jun Li,et al.  Isotropic Hashing , 2012, NIPS.

[3]  Dewen Hu,et al.  "Notice of Violation of IEEE Publication Principles" Multiobjective Reinforcement Learning: A Comprehensive Overview. , 2013, IEEE transactions on cybernetics.

[4]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[6]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[7]  Anthony K. H. Tung,et al.  HashFile: An efficient index structure for multimedia data , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[8]  Beng Chin Ooi,et al.  Effective Multi-Modal Retrieval based on Stacked Auto-Encoders , 2014, Proc. VLDB Endow..

[9]  Jiwen Lu,et al.  Cost-Sensitive Local Binary Feature Learning for Facial Age Estimation , 2015, IEEE Transactions on Image Processing.

[10]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[11]  Zi Huang,et al.  Transfer tagging from image to video , 2011, ACM Multimedia.

[12]  Jian Sun,et al.  K-Means Hashing: An Affinity-Preserving Quantization Method for Learning Binary Compact Codes , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Wei Liu,et al.  Learning Binary Codes for Maximum Inner Product Search , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Xianglong Liu,et al.  Structure Sensitive Hashing With Adaptive Product Quantization , 2016, IEEE Transactions on Cybernetics.

[15]  Jiwen Lu,et al.  Learning Compact Binary Face Descriptor for Face Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  David Suter,et al.  Fast Supervised Hashing with Decision Trees for High-Dimensional Data , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[19]  Heng Tao Shen,et al.  Hashing on Nonlinear Manifolds , 2014, IEEE Transactions on Image Processing.

[20]  Xuelong Li,et al.  Large-Scale Unsupervised Hashing with Shared Structure Learning , 2015, IEEE Transactions on Cybernetics.

[21]  Antonio Torralba,et al.  Multidimensional Spectral Hashing , 2012, ECCV.

[22]  Prateek Jain,et al.  Fast Similarity Search for Learned Metrics , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[24]  Bin Zhao,et al.  Sparse Output Coding for Large-Scale Visual Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Yue Gao,et al.  Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss , 2014, IEEE Transactions on Multimedia.

[26]  David G. Lowe,et al.  Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Trevor Darrell,et al.  Learning to Hash with Binary Reconstructive Embeddings , 2009, NIPS.

[28]  Jingsong Xu,et al.  Locality constrained representation based classification with spatial pyramid patches , 2013, Neurocomputing.

[29]  David J. Fleet,et al.  Minimal Loss Hashing for Compact Binary Codes , 2011, ICML.

[30]  Dimitris N. Metaxas,et al.  Large Scale Medical Image Search via Unsupervised PCA Hashing , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[31]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[32]  Xuelong Li,et al.  Visual Coding in a Semantic Hierarchy , 2015, ACM Multimedia.

[33]  Barnabás Póczos,et al.  Nonparametric Divergence Estimation and its Applications to Machine Learning , 2011 .

[34]  Svetlana Lazebnik,et al.  Locality-sensitive binary codes from shift-invariant kernels , 2009, NIPS.

[35]  K. R. Ramakrishnan,et al.  Kernels on Attributed Pointsets with Applications , 2007, NIPS.

[36]  Wotao Yin,et al.  A feasible method for optimization with orthogonality constraints , 2013, Math. Program..

[37]  Wei Liu,et al.  Discrete Graph Hashing , 2014, NIPS.

[38]  Zi Huang,et al.  Inter-media hashing for large-scale retrieval from heterogeneous data sources , 2013, SIGMOD '13.

[39]  Fumin Shen,et al.  Inductive Hashing on Manifolds , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Beng Chin Ooi,et al.  Effective deep learning-based multi-modal retrieval , 2015, The VLDB Journal.

[41]  Fumin Shen,et al.  Approximate Least Trimmed Sum of Squares Fitting and Applications in Image Analysis , 2013, IEEE Transactions on Image Processing.

[42]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[43]  Zi Huang,et al.  Multiple feature hashing for real-time large scale near-duplicate video retrieval , 2011, ACM Multimedia.

[44]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[45]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[46]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[47]  Jiwen Lu,et al.  Simultaneous Local Binary Feature Learning and Encoding for Face Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[48]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  Jian Sun,et al.  Graph Cuts for Supervised Binary Coding , 2014, ECCV.

[50]  Zi Huang,et al.  Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval , 2013, IEEE Transactions on Multimedia.

[51]  Xianglong Liu,et al.  Multiple feature kernel hashing for large-scale visual search , 2014, Pattern Recognit..