HOB-net: high-order block network via deep metric learning for person re-identification

Learning effective feature representations with deep convolutional neural network (CNN) and metric learning methods to distinguish pedestrians is the key to the success of recent advances in person re-identification (re-ID) tasks. However, the features provided by the common CNN network are not strong enough to distinguish between similar subjects because these common features describe only the scattered local patterns while neglecting the correlation and combinations between them. In fact, the high-order correlations of common features can be significant to the recognition of identities. In this paper, to tackle this problem, we design a flexible high-order block (HOB) module and a scheme of deep metric learning to produce the high-order representations of deep features for the re-identification of pedestrians. Extensive experiments prove the superiority of our proposed HOB module for person re-ID issue. On three large-scale datasets, including Market-1501, DukeMTMC-ReID, and CUHK03-NP, the HOB-net method achieves the competitive results with the state-of-the-arts, particularly in the mAP.

[1]  Tao Xiang,et al.  Pose-Normalized Image Generation for Person Re-identification , 2017, ECCV.

[2]  Ziyan Wu,et al.  Re-Identification With Consistent Attentive Siamese Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Shiguang Shan,et al.  Interaction-And-Aggregation Network for Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Kaiqi Huang,et al.  Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Tao Xiang,et al.  Multi-level Factorisation Net for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Shaogang Gong,et al.  Harmonious Attention Network for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Muhittin Gokmen,et al.  Human Semantic Parsing for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Yang Hua,et al.  Ranked List Loss for Deep Metric Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Qi Tian,et al.  Beyond Part Models: Person Retrieval with Refined Part Pooling , 2017, ECCV.

[11]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[12]  Jungmin Lee,et al.  Attention-based Ensemble for Deep Metric Learning , 2018, ECCV.

[13]  Robert Pless,et al.  Deep Randomized Ensembles for Metric Learning , 2018, ECCV.

[14]  Mario Sznaier,et al.  moM: Mean of Moments Feature for Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[15]  Tao Mei,et al.  Part-Aligned Bilinear Representations for Person Re-identification , 2018, ECCV.

[16]  Cheng Wang,et al.  Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-Identification , 2018, ECCV.

[17]  Chi Zhang,et al.  Margin Sample Mining Loss: A Deep Learning Based Method for Person Re-identification , 2017, ArXiv.

[18]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[19]  Wei Jiang,et al.  Bag of Tricks and a Strong Baseline for Deep Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20]  Yan Wang,et al.  Resource Aware Person Re-identification Across Multiple Resolutions , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Lei Zhang,et al.  Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22]  Liang Wang,et al.  Mask-Guided Contrastive Attention Model for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Zhedong Zheng,et al.  CamStyle: A Novel Data Augmentation Method for Person Re-Identification , 2019, IEEE Transactions on Image Processing.

[24]  Li Wang,et al.  A Novel Soft Margin Loss Function for Deep Discriminative Embedding Learning , 2020, IEEE Access.

[25]  Zhu Teng,et al.  A divide-and-unite deep network for person re-identification , 2020, Applied Intelligence.

[26]  Xiao Yu,et al.  Infrared Handprint Image Restoration Algorithm Based on Apoptotic Mechanism , 2020, IEEE Access.

[27]  Kim-Hui Yap,et al.  AANet: Attribute Attention Network for Person Re-Identifications , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Nanning Zheng,et al.  Person Re-identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Tong Jia,et al.  Deeply-Learned Spatial Alignment for Person Re-Identification , 2019, IEEE Access.

[30]  Yi Yang,et al.  Person Re-identification: Past, Present and Future , 2016, ArXiv.

[31]  Zuozhuo Dai,et al.  Batch DropBlock Network for Person Re-Identification and Beyond , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[32]  Hao Chen,et al.  Multi-level feature learning with attention for person re-identification , 2020, Multimedia Tools and Applications.

[33]  Qilong Wang,et al.  Is Second-Order Information Helpful for Large-Scale Visual Recognition? , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Hong Liu,et al.  Bi-Directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification , 2020, IEEE Transactions on Image Processing.

[35]  Ying Wang,et al.  A Mask-Pooling Model With Local-Level Triplet Loss for Person Re-Identification , 2020, IEEE Access.

[36]  Junaid Ali Khan,et al.  Human action recognition using fusion of multiview and deep features: an application to video surveillance , 2020, Multimedia Tools and Applications.

[37]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[38]  Krystian Mikolajczyk,et al.  Higher-Order Occurrence Pooling for Bags-of-Words: Visual Concept Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Mang Ye,et al.  Learning Sparse and Identity-Preserved Hidden Attributes for Person Re-Identification , 2020, IEEE Transactions on Image Processing.

[40]  Aymeric Histace,et al.  Metric Learning With HORDE: High-Order Regularizer for Deep Embeddings , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[41]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[42]  Yi Yang,et al.  Generalizing a Person Retrieval Model Hetero- and Homogeneously , 2018, ECCV.

[43]  Zheyi Fan,et al.  In-depth exploration of attribute information for person re-identification , 2020, Applied Intelligence.

[44]  Weihong Deng,et al.  Mixed High-Order Attention Network for Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).