Stochastic attentions and context learning for person re-identification

The discriminative parts of people’s appearance play a significant role in their re-identification across non overlapping camera views. However, just focusing on the discriminative or attention regions without catering the contextual information does not always help. It is more important to learn the attention with reference to their spatial locations in context of the whole image. Current person re-identification (re-id) approaches either use separate modules or classifiers to learn both of these; the attention and its context, resulting in highly expensive person re-id solutions. In this work, instead of handling attentions and the context separately, we employ a unified attention and context mapping (ACM) block within the convolutional layers of network, without any additional computational resources overhead. The ACM block captures the attention regions as well as the relevant contextual information in a stochastic manner and enriches the final person representations for robust person re-identification. We evaluate the proposed method on 04 public benchmarks of person re-identification i.e., Market1501, DukeMTMC-Reid, CUHK03 and MSMT17 and find that the ACM block consistently improves the performance of person re-identification over the baseline networks.

[1]  Longhui Wei,et al.  Person Transfer GAN to Bridge Domain Gap for Person Re-identification , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2]  Jonathan Tompson,et al.  Efficient object localization using Convolutional Networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Xiaogang Wang,et al.  Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  End-to-End Comparative Attention Networks for Person Re-Identification. , 2017 .

[5]  Yi Yang,et al.  Random Erasing Data Augmentation , 2017, AAAI.

[6]  M. Fraz,et al.  Smart Visual Surveillance: Proactive Person Re-identification instead of Impulsive Person Search , 2020, 2020 IEEE 23rd International Multitopic Conference (INMIC).

[7]  Nassir Navab,et al.  Concurrent Spatial and Channel Squeeze & Excitation in Fully Convolutional Networks , 2018, MICCAI.

[8]  Hassane Bouzahir,et al.  Convolutional neural networks approach for multimodal biometric identification system using the fusion of fingerprint, finger-vein and face images , 2020, PeerJ Comput. Sci..

[9]  Zuozhuo Dai,et al.  Batch DropBlock Network for Person Re-Identification and Beyond , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10]  Saadia Batool,et al.  End to End Person Re-Identification for Automated Visual Surveillance , 2018, 2018 IEEE International Conference on Image Processing, Applications and Systems (IPAS).

[11]  Muhammad Moazam Fraz,et al.  Weighted hybrid features for person re-identification , 2017, 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA).

[12]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[13]  Nojun Kwak,et al.  Analysis on the Dropout Effect in Convolutional Neural Networks , 2016, ACCV.

[14]  Yu Wu,et al.  Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Jingdong Wang,et al.  Deeply-Learned Part-Aligned Representations for Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[16]  Shaogang Gong,et al.  Harmonious Attention Network for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Muhammad Moazam Fraz,et al.  Optimization of Person Re-Identification through Visual Descriptors , 2018, VISIGRAPP.

[18]  Yang Li,et al.  Deep Attention Network for RGB-Infrared Cross-Modality Person Re-Identification , 2020 .

[19]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[20]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Abhinav Gupta,et al.  Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Hyunjung Shim,et al.  Attention-Based Dropout Layer for Weakly Supervised Object Localization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Tariq S. Durrani,et al.  Adversarial Erasing Attention for Person Re-Identification in Camera Networks Under Complex Environments , 2020, IEEE Access.

[24]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[25]  Xiaogang Wang,et al.  Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  M. M. Fraz,et al.  Hierarchical Refined Local Associations for Robust Person Re-Identification , 2019, 2019 International Conference on Robotics and Automation in Industry (ICRAI).

[27]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Nurul Amelina Nasharuddin,et al.  3D texture-based face recognition system using fine-tuned deep residual networks , 2019, PeerJ Comput. Sci..

[29]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Gang Wang,et al.  Dual Attention Matching Network for Context-Aware Feature Sequence Based Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Liang Zheng,et al.  Learning Part-based Convolutional Features for Person Re-Identification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Yun Fu,et al.  Image Super-Resolution Using Very Deep Residual Channel Attention Networks , 2018, ECCV.

[33]  Yi Yang,et al.  Adversarial Complementary Learning for Weakly Supervised Object Localization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Soon Ki Jung,et al.  Two Stream Deep CNN-RNN Attentive Pooling Architecture for Video-Based Person Re-identification , 2018, CIARP.

[35]  Muhammad Moazam Fraz,et al.  Person Re-Identification Using Hybrid Representation Reinforced by Metric Learning , 2018, IEEE Access.

[36]  Dahun Kim,et al.  Two-Phase Learning for Weakly Supervised Object Localization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[37]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[38]  Yong Jae Lee,et al.  Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-Supervised Object and Action Localization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[39]  Shuicheng Yan,et al.  End-to-End Comparative Attention Networks for Person Re-Identification , 2016, IEEE Transactions on Image Processing.