Hierarchical Hashing Learning for Image Set Classification

With the development of video network, image set classification (ISC) has received a lot of attention and can be used for various practical applications, such as video based recognition, action recognition, and so on. Although the existing ISC methods have obtained promising performance, they often have extreme high complexity. Due to the superiority in storage space and complexity cost, learning to hash becomes a powerful solution scheme. However, existing hashing methods often ignore complex structural information and hierarchical semantics of the original features. They usually adopt a single-layer hashing strategy to transform high-dimensional data into short-length binary codes in one step. This sudden drop of dimension could result in the loss of advantageous discriminative information. In addition, they do not take full advantage of intrinsic semantic knowledge from whole gallery sets. To tackle these problems, in this paper, we propose a novel Hierarchical Hashing Learning (HHL) for ISC. Specifically, a coarse-to-fine hierarchical hashing scheme is proposed that utilizes a two-layer hash function to gradually refine the beneficial discriminative information in a layer-wise fashion. Besides, to alleviate the effects of redundant and corrupted features, we impose the $\ell _{2,1}$ norm on the layer-wise hash function. Moreover, we adopt a bidirectional semantic representation with the orthogonal constraint to keep intrinsic semantic information of all samples in whole image sets adequately. Comprehensive experiments demonstrate HHL acquires significant improvements in accuracy and running time. We will release the demo code on https://github.com/sunyuan-cs.

[1]  Jiwen Dong,et al.  Joint Metric Learning-Based Class-Specific Representation for Image Set Classification. , 2022, IEEE transactions on neural networks and learning systems.

[2]  Xiaobo Shen,et al.  Discrete Metric Learning for Fast Image Set Classification , 2022, IEEE Transactions on Image Processing.

[3]  Dezhong Peng,et al.  Feature and Semantic Views Consensus Hashing for Image Set Classification , 2022, ACM Multimedia.

[4]  Hakan Cevikalp,et al.  Deep Discriminative Feature Models (DDFMs) for Set Based Face Recognition and Distance Metric Learning , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Jun Wang,et al.  Enhanced Discrete Multi-Modal Hashing: More Constraints Yet Less Time to Learn , 2022, IEEE Transactions on Knowledge and Data Engineering.

[6]  Sihang Zhou,et al.  Hierarchical Multiple Kernel Clustering , 2021, AAAI.

[7]  Yiu-Ming Cheung,et al.  FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Xin Luo,et al.  High-Dimensional Sparse Cross-Modal Hashing with Fine-Grained Similarity Embedding , 2021, WWW.

[9]  Guangming Lu,et al.  Probability Ordinal-Preserving Semantic Hashing for Large-Scale Image Retrieval , 2021, ACM Trans. Knowl. Discov. Data.

[10]  Jaime Lloret,et al.  Multiple Kernel Driven Clustering With Locally Consistent and Selfish Graph in Industrial IoT , 2021, IEEE Transactions on Industrial Informatics.

[11]  J. Kittler,et al.  SymNet: A Simple Symmetric Positive Definite Manifold Deep Learning Method for Image Set Classification , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Yuhui Zheng,et al.  Optimal discriminative feature and dictionary learning for image set classification , 2021, Inf. Sci..

[13]  Zhiyang Li,et al.  Hash Learning With Variable Quantization for Large-Scale Retrieval , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Min Meng,et al.  Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval , 2020, IEEE Transactions on Image Processing.

[15]  Jingjing Li,et al.  Fast Discrete Collaborative Multi-Modal Hashing for Large-Scale Multimedia Retrieval , 2020, IEEE Transactions on Knowledge and Data Engineering.

[16]  Yilong Yin,et al.  Joint Multi-View Hashing for Large-Scale Near-Duplicate Video Retrieval , 2020, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yang Yang,et al.  Bidirectional Discrete Matrix Factorization Hashing for Image Search , 2020, IEEE Transactions on Cybernetics.

[18]  Zhenwen Ren,et al.  Consensus Affinity Graph Learning for Multiple Kernel Clustering , 2020, IEEE Transactions on Cybernetics.

[19]  Bo Peng,et al.  Query set centered sparse projection learning for set based image classification , 2020, Applied Intelligence.

[20]  Feiping Nie,et al.  WeGAN: Deep Image Hashing With Weighted Generative Adversarial Networks , 2020, IEEE Transactions on Multimedia.

[21]  Zhenwen Ren,et al.  Simultaneous Global and Local Graph Structure Preserving for Multiple Kernel Clustering , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Yilong Yin,et al.  Reinforced Short-Length Hashing , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Rongrong Ji,et al.  Similarity-Preserving Linkage Hashing for Online Image Retrieval , 2020, IEEE Transactions on Image Processing.

[24]  Josef Kittler,et al.  Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features , 2020, IEEE Transactions on Multimedia.

[25]  Huaxiang Zhang,et al.  Deep Collaborative Multi-View Hashing for Large-Scale Image Search , 2020, IEEE Transactions on Image Processing.

[26]  Jun Wang,et al.  Strongly Constrained Discrete Hashing , 2020, IEEE Transactions on Image Processing.

[27]  Zhenwen Ren,et al.  Multiple kernel dimensionality reduction based on linear regression virtual reconstruction for image set classification , 2019, Neurocomputing.

[28]  Josef Kittler,et al.  Multiple Riemannian Manifold-Valued Descriptors Based Image Set Classification With Multi-Kernel Metric Learning , 2019, IEEE Transactions on Big Data.

[29]  Liqiang Nie,et al.  Fast Scalable Supervised Hashing , 2018, SIGIR.

[30]  Zhenan Sun,et al.  Fast Supervised Discrete Hashing , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Shiguang Shan,et al.  Prototype Discriminative Learning for Face Image Set Classification , 2016, ACCV.

[32]  Luc Van Gool,et al.  A Riemannian Network for SPD Matrix Learning , 2016, AAAI.

[33]  Yicong Zhou,et al.  Pairwise Linear Regression Classification for Image Set Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Shih-Fu Chang,et al.  Deep Image Set Hashing , 2016, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[35]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Tieniu Tan,et al.  Simultaneous Feature and Sample Reduction for Image-Set Classification , 2016, AAAI.

[37]  Zhi-Hua Zhou,et al.  Column Sampling Based Discrete Supervised Hashing , 2016, AAAI.

[38]  Shiguang Shan,et al.  Discriminant analysis on Riemannian manifold of Gaussian distributions for face recognition with image sets , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Liang Chen,et al.  Dual Linear Regression Based Classification for Face Cluster Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Lei Zhang,et al.  Face recognition based on regularized nearest points between image sets , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[42]  Ajmal S. Mian,et al.  Face Recognition Using Sparse Approximated Nearest Points between Image Sets , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[44]  J. Kittler,et al.  Deep Metric Learning on the SPD Manifold for Image Set Classification , 2022, IEEE Transactions on Circuits and Systems for Video Technology.