Visual Appearance and Soft Biometrics Fusion for Person Re-Identification Using Deep Learning

Learning descriptions of individual pedestrian is a common goal of both person re-identification (P-ReID) and attribute recognition methods, which are typically differentiated only in terms of their granularity. However, existing P-ReID methods only consider identification labels for individual pedestrian. In this article, we present a multi-scale pyramid attention (MSPA) model for P-ReID that jointly manipulates the complementarity between semantic attributes and visual appearance to address this limitation. The proposed MSPA method mainly comprises three steps. Initially, a backbone model followed by appearance and attribute networks is individually trained to perform P-ReID and pedestrian attribute classification tasks. The attribute network primarily focuses on suppressed image areas associated with soft biometric data while retaining the semantic context among attributes using a convolutional long short-term memory architecture. Additionally, the identification network extracts rich contextual features from an image at varying scales using a residual pyramid module. In the second step, the dual network features are fused, and MSPA is re-trained for the P-ReID task to further improve its complementary capabilities. Finally, we experimentally evaluated the proposed model on the two benchmark datasets Market-1501 and DukeMTMC-reID, and the results show that our approach achieved state-of-the-art performance.

[1]  Zixuan Wang,et al.  Wi-ATCN: Attentional Temporal Convolutional Network for Human Action Prediction Using WiFi Channel State Information , 2022, IEEE Journal of Selected Topics in Signal Processing.

[2]  S. Khan,et al.  Learning to rank: An intelligent system for person reidentification , 2022, Int. J. Intell. Syst..

[3]  Ke Gong,et al.  Feature Refinement and Filter Network for Person Re-Identification , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Bo Du,et al.  Unsupervised Person Re-Identification With Stochastic Training Strategy , 2021, IEEE Transactions on Image Processing.

[5]  Huaxiang Zhang,et al.  A three-stage learning approach to cross-domain person re-identification , 2021, Appl. Soft Comput..

[6]  Barbara Rychalska,et al.  On the Unreasonable Effectiveness of Centroids in Image Retrieval , 2021, ICONIP.

[7]  Jinjun Wang,et al.  Multinetwork Collaborative Feature Learning for Semisupervised Person Reidentification , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Yongdong Zhang,et al.  Explainable Person Re-Identification with Attribute-guided Metric Distillation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9]  Sung Wook Baik,et al.  Deep-ReID: deep features and autoencoder assisted image patching strategy for person re-identification in smart cities surveillance , 2021 .

[10]  Jun Zhou,et al.  Beyond Triplet Loss: Person Re-Identification With Fine-Grained Difference-Aware Pairwise Loss , 2020, IEEE Transactions on Multimedia.

[11]  Zhiyong Gao,et al.  Harmonious attention network for person re-identification via complementarity between groups and individuals , 2020, Neurocomputing.

[12]  Xiuping Liu,et al.  MHSA-Net: Multihead Self-Attention Network for Occluded Person Re-Identification , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Zhen Lei,et al.  An end-to-end exemplar association for unsupervised person Re-identification , 2020, Neural Networks.

[14]  Xiaoqiang Lu,et al.  Semisupervised Consistent Projection Metric Learning for Person Reidentification , 2020, IEEE Transactions on Cybernetics.

[15]  Xiaochun Cao,et al.  Task-Aware Attention Model for Clothing Attribute Prediction , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Bin Fan,et al.  Deep Attention Aware Feature Learning for Person Re-Identification , 2020, Pattern Recognit..

[17]  Ying Tai,et al.  Person Search by Separated Modeling and A Mask-Guided Two-Stream CNN Model , 2020, IEEE Transactions on Image Processing.

[18]  Chao Wang,et al.  Attention Deep Model With Multi-Scale Deep Supervision for Person Re-Identification , 2019, IEEE Transactions on Emerging Topics in Computational Intelligence.

[19]  Yinghuan Shi,et al.  Progressive Cross-Camera Soft-Label Learning for Semi-Supervised Person Re-Identification , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Kim-Hui Yap,et al.  AANet: Attribute Attention Network for Person Re-Identifications , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Nanning Zheng,et al.  Semi-supervised person re-identification using multi-view clustering , 2019, Pattern Recognit..

[22]  Shaogang Gong,et al.  Unsupervised Tracklet Person Re-Identification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Xiaocong Fan,et al.  Unsupervised Joint Subspace and Dictionary Learning for Enhanced Cross-Domain Person Re-Identification , 2018, IEEE Journal of Selected Topics in Signal Processing.

[24]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[25]  Manuel J. Marín-Jiménez,et al.  Multimodal feature fusion for CNN-based gait recognition: an empirical comparison , 2018, Neural Computing and Applications.

[26]  Xiaogang Wang,et al.  Eliminating Background-bias for Robust Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27]  Jing Xu,et al.  Attention-Aware Compositional Network for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Haiqing Li,et al.  Deep Spatial Feature Reconstruction for Partial Person Re-identification: Alignment-free Approach , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Kaiqi Huang,et al.  Learning Deep Context-Aware Features over Body and Latent Parts for Person Re-identification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Shaogang Gong,et al.  Person Re-identification by Deep Learning Multi-scale Representations , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[31]  Xiaogang Wang,et al.  HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[32]  Hantao Yao,et al.  Deep Representation Learning With Part Loss for Person Re-Identification , 2017, IEEE Transactions on Image Processing.

[33]  Yi Yang,et al.  Pedestrian Alignment Network for Large-scale Person Re-Identification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[34]  Rainer Stiefelhagen,et al.  Person Re-identification by Deep Learning Attribute-Complementary Information , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[35]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[36]  Yifan Sun,et al.  SVDNet for Pedestrian Retrieval , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[37]  Huchuan Lu,et al.  Pose-Invariant Embedding for Deep Person Re-Identification , 2017, IEEE Transactions on Image Processing.

[38]  Dong Liu,et al.  Multi-Scale Triplet CNN for Person Re-Identification , 2016, ACM Multimedia.

[39]  Alberto Del Bimbo,et al.  Person Re-Identification by Iterative Re-Weighted Sparse Ranking , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Tao Xiang,et al.  Transferring a semantic representation for person re-identification and search , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Shengcai Liao,et al.  Person re-identification by Local Maximal Occurrence representation and metric learning , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Thomas B. Moeslund,et al.  A Local 3-D Motion Descriptor for Multi-View Human Action Recognition from 4-D Spatio-Temporal Interest Points , 2012, IEEE Journal of Selected Topics in Signal Processing.

[43]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[44]  Y. Duan,et al.  Hybrid Contrastive Learning for Unsupervised Person Re-Identification , 2023, IEEE Transactions on Multimedia.

[45]  W. Shi,et al.  A CAM-Guided Parameter-Free Attention Network for Person Re-Identification , 2022, IEEE Signal Processing Letters.

[46]  Dapeng Tao,et al.  Attribute-Aligned Domain-Invariant Feature Learning for Unsupervised Domain Adaptation Person Re-Identification , 2021, IEEE Transactions on Information Forensics and Security.

[47]  Xiaokang Yang,et al.  Towards multi-scale deep features learning with correlation metric for person re-identification , 2021, Knowl. Based Syst..

[48]  Manuel J. Marín-Jiménez,et al.  UGaitNet: Multimodal Gait Recognition With Missing Input Modalities , 2021, IEEE Transactions on Information Forensics and Security.

[49]  Yuanfang Guo,et al.  Hierarchical Reasoning Network for Pedestrian Attribute Recognition , 2021, IEEE Transactions on Multimedia.

[50]  Mang Ye,et al.  Learning Sparse and Identity-Preserved Hidden Attributes for Person Re-Identification , 2020, IEEE Transactions on Image Processing.