Hadamard Coding for Supervised Discrete Hashing

In this paper, we propose a learning-based supervised discrete hashing (SDH) method. Binary hashing is widely used for large-scale image retrieval as well as video and document searches, because the compact binary code representation is essential for data storage and reasonable for query searches using bit operations. The recently proposed SDH method efficiently solves mixed-integer programming problems by alternating optimization and the discrete cyclic coordinate descent (DCC) method. Based on some preliminary experiments, we show that the SDH method can be simplified without performance degradation. We analyze the simplified model and provide a mathematically exact solution thereof; we reveal that the exact binary code is provided by a “Hadamard matrix.” Therefore, we named our method Hadamard coded-SDH (HC-SDH). In contrast to the SDH, our model does not require an alternating optimization algorithm and does not depend on initial values. The HC-SDH is also easier to implement than the iterative quantization. Experimental results involving a large-scale database show that the Hadamard coding outperforms the conventional SDH in terms of precision, recall, and computational time. On the large data sets SUN-397 and ImageNet, the HC-SDH provides a superior mean average of precision (mAP) and top-accuracy compared with the conventional SDH methods with the same code length and FastHash. The training time of the HC-SDH is 170 times faster than the conventional SDH and the testing time including the encoding time is seven times faster than the FastHash which encodes using a binary-tree.

[1]  Rossano Venturini,et al.  Fast and Compact Hamming Distance Index , 2016, IIR.

[2]  Jungong Han,et al.  Large-scale image retrieval with Sparse Embedded Hashing , 2017, Neurocomputing.

[3]  Yi Fang,et al.  Variational Deep Semantic Hashing for Text Documents , 2017, SIGIR.

[4]  Ling Shao,et al.  Learning to Hash With Optimized Anchor Embedding for Scalable Retrieval , 2017, IEEE Transactions on Image Processing.

[5]  Wei Liu,et al.  Discrete Graph Hashing , 2014, NIPS.

[6]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Matthijs Douze,et al.  How should we evaluate supervised hashing? , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Shih-Fu Chang,et al.  Spherical Hashing: Binary Code Embedding with Hyperspheres , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Jinhui Tang,et al.  Supervised Quantization for Similarity Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Matthias Hein,et al.  Matrix factorization with binary components , 2013, NIPS.

[11]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[12]  Xianglong Liu,et al.  Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search , 2017, IEEE Transactions on Image Processing.

[13]  Cornelia Caragea,et al.  Protein Sequence Classification Using Feature Hashing , 2011, BIBM.

[14]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Zhenan Sun,et al.  Fast Supervised Discrete Hashing , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Lin Yang,et al.  Kernel-Based Supervised Discrete Hashing for Image Retrieval , 2016, ECCV.

[17]  Gou Koutaki,et al.  Binary continuous image decomposition for multi-view display , 2016, ACM Trans. Graph..

[18]  Yue Gao,et al.  Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing , 2016, IEEE Transactions on Image Processing.

[19]  Xuelong Li,et al.  Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval , 2017, IEEE Transactions on Image Processing.

[20]  Xianglong Liu,et al.  Structure Sensitive Hashing With Adaptive Product Quantization , 2016, IEEE Transactions on Cybernetics.

[21]  Patrick Pérez,et al.  SuBiC: A Supervised, Structured Binary Code for Image Search , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22]  Yang Yang,et al.  A Fast Optimization Method for General Binary Code Learning , 2016, IEEE Transactions on Image Processing.

[23]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[25]  Inderjit S. Dhillon,et al.  Generalized Nonnegative Matrix Approximations with Bregman Divergences , 2005, NIPS.

[26]  Jonathan Krause,et al.  Collecting a Large-scale Dataset of Fine-grained Cars , 2013 .

[27]  Zhi-Hua Zhou,et al.  Column Sampling Based Discrete Supervised Hashing , 2016, AAAI.

[28]  Keqiu Li,et al.  Binary Hashing for Approximate Nearest Neighbor Search on Big Data: A Survey , 2018, IEEE Access.

[29]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[30]  David K. Smith Theory of Linear and Integer Programming , 1987 .

[31]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Yi Yang,et al.  Hashing Based Hierarchical Feature Representation for Hyperspectral Imagery Classification , 2017, Remote. Sens..

[33]  Tieniu Tan,et al.  Supervised Discrete Hashing With Relaxation , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[35]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Arturo de la Escalera,et al.  Traffic sign recognition and analysis for intelligent vehicles , 2003, Image Vis. Comput..

[37]  Tieniu Tan,et al.  Representative Vector Machines: A Unified Framework for Classical Classifiers , 2016, IEEE Transactions on Cybernetics.

[38]  Guosheng Lin,et al.  Learning Hash Functions Using Column Generation , 2013, ICML.

[39]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[40]  Jiwen Lu,et al.  Supervised Discriminative Hashing for Compact Binary Codes , 2014, ACM Multimedia.

[41]  Toshihide Ibaraki,et al.  Resource allocation problems - algorithmic approaches , 1988, MIT Press series in the foundations of computing.

[42]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[43]  Jian Sun,et al.  Optimized Product Quantization , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Wei Liu,et al.  Learning Binary Codes for Maximum Inner Product Search , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[45]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[46]  Xuelong Li,et al.  Large-Scale Unsupervised Hashing with Shared Structure Learning , 2015, IEEE Transactions on Cybernetics.

[47]  Trevor Darrell,et al.  Learning to Hash with Binary Reconstructive Embeddings , 2009, NIPS.

[48]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[49]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  J. Sylvester LX. Thoughts on inverse orthogonal matrices, simultaneous signsuccessions, and tessellated pavements in two or more colours, with applications to Newton's rule, ornamental tile-work, and the theory of numbers , 1867 .

[51]  David Suter,et al.  Fast Supervised Hashing with Decision Trees for High-Dimensional Data , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Jiwen Lu,et al.  Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[55]  Ali Cevahir,et al.  Large-scale Multi-class and Hierarchical Product Categorization for an E-commerce Giant , 2016, COLING.

[56]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[57]  Jungong Han,et al.  Robust Quantization for General Similarity Search , 2018, IEEE Transactions on Image Processing.