论文信息 - Embarrassingly Simple Binary Representation Learning

Embarrassingly Simple Binary Representation Learning

Recent binary representation learning models usually require sophisticated binary optimization, similarity measure or even generative models as auxiliaries. However, one may wonder whether these non-trivial components are needed to formulate practical and effective hashing models. In this paper, we answer the above question by proposing an embarrassingly simple approach to binary representation learning. With a simple classification objective, our model only incorporates two additional fully-connected layers onto the top of an arbitrary backbone network, for binary latents and semantic labels respectively, whilst complying with the binary constraints during training. The proposed model lower-bounds the Information Bottleneck (IB) between data samples and their semantics, and can be related to many recent 'learning to hash' paradigms. We show that, when properly designed, even such a simple network can generate effective binary codes, by fully exploring data semantics without any held-out alternating updating steps or auxiliary models. Experiments are conducted on conventional large-scale benchmarks, i.e., CIFAR-10, NUS-WIDE, and ImageNet, where the proposed simple model outperforms the state-of-the-art methods.

[1] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[2] Bingbing Ni,et al. Binary Coding for Partial Action Analysis with Limited Observation Ratios , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Fumin Shen,et al. Deep Sketch-Shape Hashing With Segmented 3D Stochastic Viewing , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[5] Rongrong Ji,et al. Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Jiwen Lu,et al. Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Yang Yang,et al. Graph Convolutional Network Hashing , 2020, IEEE Transactions on Cybernetics.

[8] Ian D. Reid,et al. Fast Training of Triplet-Based Deep Binary Embedding Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Jianmin Wang,et al. Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[10] Svetlana Lazebnik,et al. Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[11] Ling Shao,et al. Zero-Shot Sketch-Image Hashing , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12] Jingkuan Song,et al. Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[13] Hanjiang Lai,et al. Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Philip S. Yu,et al. HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15] Jianmin Wang,et al. HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[17] Ling Shao,et al. Fast Person Re-identification via Cross-Camera Semantic Binary Transformation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[19] Tomasz Trzcinski,et al. BinGAN: Learning Compact Binary Descriptors with a Regularized GAN , 2018, NeurIPS.

[20] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[21] Ling Shao,et al. Unsupervised Binary Representation Learning with Deep Variational Networks , 2019, International Journal of Computer Vision.

[22] Cheng Deng,et al. Unsupervised Deep Generative Adversarial Hashing Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23] Ngai-Man Cheung,et al. Learning to Hash with Binary Deep Neural Network , 2016, ECCV.

[24] Jiwen Lu,et al. Relaxation-Free Deep Hashing via Policy Gradient , 2018, ECCV.

[25] Kun He,et al. MIHash: Online Hashing with Mutual Information , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26] Wei Liu,et al. Hashing with Graphs , 2011, ICML.

[27] Wei Liu,et al. Discrete Graph Hashing , 2014, NIPS.

[28] Ling Shao,et al. Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[29] Jianmin Wang,et al. Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30] Le Song,et al. Stochastic Generative Hashing , 2017, ICML.

[31] Victor S. Lempitsky,et al. Learning Deep Embeddings with Histogram Loss , 2016, NIPS.

[32] Kai Han,et al. Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN , 2018, NeurIPS.

[33] Patrick Pérez,et al. SuBiC: A Supervised, Structured Binary Code for Image Search , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.

[35] Wei Liu,et al. Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[37] Luc Van Gool,et al. Adversarial Binary Coding for Efficient Person Re-Identification , 2018, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[38] Hanjiang Lai,et al. Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[39] Yoshua Bengio,et al. Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation , 2013, ArXiv.

[40] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[41] Ling Shao,et al. Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[43] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[44] Ling Shao,et al. Fast action retrieval from videos via feature disaggregation , 2017, Computer Vision and Image Understanding.

[45] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[46] Yang Yang,et al. Zero-Shot Hashing via Transferring Supervised Knowledge , 2016, ACM Multimedia.

[47] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.