Unsupervised adaptive hashing based on feature clustering

Abstract An attractive method for image retrieval is binary hashing, which reduces the dimensionality of the data and generates similarity-preserving binary codes. To map high-dimensional data into a low-dimensional subspace, the majority of current unsupervised hashing approaches reduce the dimensionality by principal component analysis (PCA). However, PCA yields unbalanced variances across projection directions, which complicates the quantization step. Moreover, preserving the original similarity in existing unsupervised hashing methods remains an NP-hard problem. To address these problems, we explore a novel hashing method based on feature clustering that simultaneously generates low-dimensional data with balanced variance and preserves similarity in Euclidean space. Furthermore, we propose an adaptive quantization approach to replace fixed-threshold quantization. Our method, dubbed Feature Clustering Hashing (FCH), outperforms state-of-the-art methods on three benchmark datasets.
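To illustrate the variance imbalance that the abstract attributes to PCA, the following is a minimal sketch of the conventional baseline (PCA projection followed by fixed-threshold sign quantization), not of FCH itself. The function name `pca_sign_hash`, the synthetic Gaussian data, and the choice of 8 bits are assumptions made purely for illustration.

```python
import numpy as np

def pca_sign_hash(X, n_bits):
    """Baseline PCA hashing (illustrative, not the FCH method):
    project onto the top principal directions, then threshold at zero."""
    Xc = X - X.mean(axis=0)                      # center the data
    cov = Xc.T @ Xc / (Xc.shape[0] - 1)          # sample covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)       # eigh: covariance is symmetric
    order = np.argsort(eigvals)[::-1][:n_bits]   # indices of the largest eigenvalues
    Z = Xc @ eigvecs[:, order]                   # low-dimensional projection
    codes = (Z > 0).astype(np.uint8)             # fixed threshold at zero
    return codes, Z.var(axis=0, ddof=1)          # per-direction variance

# Hypothetical synthetic data: 32 features with decaying scale.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 32)) * np.linspace(5.0, 0.5, 32)

codes, variances = pca_sign_hash(X, n_bits=8)
print(variances)  # variance decreases from the first direction to the last
```

Each projected direction receives exactly one bit regardless of how much variance it carries, which is the mismatch that motivates balancing the variances before quantization.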
