Part-Guided Attention Learning for Vehicle Re-Identification

Vehicle re-identification (Re-ID) often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles which is easily affected by the viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and part information for discriminative feature learning. PGAN first detects the locations of different part components and salient regions regardless of the vehicle identity, which serve as the bottom-up attention to narrow down the possible searching regions. To estimate the importance of detected parts, we propose a Part Attention Module (PAM) to adaptively locate the most discriminative regions with high-attention weights and suppress the distraction of irrelevant parts with relatively low weights. The PAM is guided by the Re-ID loss and therefore provides top-down attention that enables attention to be calculated at the level of car parts and other salient regions. Finally, we aggregate the global appearance and part features to improve the feature performance further. The PGAN combines part-guided bottom-up and top-down attention, global and part visual features in an end-to-end framework. Extensive experiments demonstrate that the proposed method achieves new state-of-the-art vehicle Re-ID performance on four large-scale benchmark datasets.

[1]  Yunchao Wei,et al.  VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification , 2020, IEEE Transactions on Multimedia.

[2]  Shengyong Chen,et al.  Structural Analysis of Attributes for Vehicle Re-Identification and Retrieval , 2020, IEEE Transactions on Intelligent Transportation Systems.

[3]  Xiaoyu Wang,et al.  Group-Group Loss-Based Global-Regional Feature Learning for Vehicle Re-Identification , 2019, IEEE Transactions on Image Processing.

[4]  Chunhua Shen,et al.  Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[5]  Hao Chen,et al.  Partition and Reunion: A Two-Branch Neural Network for Vehicle Re-identification , 2019, CVPR Workshops.

[6]  Weiyao Lin,et al.  Amur Tiger Re-identification in the Wild , 2019, ArXiv.

[7]  Björn Ommer,et al.  Divide and Conquer the Embedding Space for Metric Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Bing He,et al.  Part-Regularized Near-Duplicate Vehicle Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Ling-Yu Duan,et al.  VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Rama Chellappa,et al.  A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Haiyun Guo,et al.  Two-Level Attention Network With Multi-Grain Ranking Loss for Vehicle Re-Identification , 2019, IEEE Transactions on Image Processing.

[12]  Wei Wei,et al.  Vehicle Re-Identification in Aerial Imagery: Dataset and Approach , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Jenq-Neng Hwang,et al.  CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Wei Jiang,et al.  Bag of Tricks and a Strong Baseline for Deep Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[15]  Ling-Yu Duan,et al.  Embedding Adversarial Learning for Vehicle Re-Identification , 2019, IEEE Transactions on Image Processing.

[16]  Wu Liu,et al.  PVSS: A Progressive Vehicle Search System for Video Surveillance Networks , 2019, Journal of Computer Science and Technology.

[17]  Farzin Aghdasi,et al.  Vehicle Re-identification: an Efficient Baseline Using Triplet Embedding , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[18]  Xiu-Shen Wei,et al.  Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification , 2018, ACCV.

[19]  Shengcai Liao,et al.  Vehicle Re-Identification Using Quadruple Directional Deep Learning Features , 2018, IEEE Transactions on Intelligent Transportation Systems.

[20]  Shaogang Gong,et al.  Vehicle Re-Identification in Context , 2018, GCPR.

[21]  Shiliang Zhang,et al.  SCAN: Spatial and Channel Attention Network for Vehicle Re-Identification , 2018, PCM.

[22]  Shiliang Zhang,et al.  RAM: A Region-Aware Deep Model for Vehicle Re-Identification , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[23]  Ling Shao,et al.  Viewpoint-Aware Attentive Multi-view Inference for Vehicle Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Xiong Chen,et al.  Learning Discriminative Features with Multiple Granularities for Person Re-Identification , 2018, ACM Multimedia.

[25]  Tao Mei,et al.  PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance , 2018, IEEE Transactions on Multimedia.

[26]  Hanqing Lu,et al.  Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification , 2018, AAAI.

[27]  Xiaogang Wang,et al.  Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28]  Wei Zeng,et al.  Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[29]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30]  Di Wu,et al.  Multi-modal metric learning for vehicle re-identification in traffic surveillance environment , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[31]  Yi Yang,et al.  Random Erasing Data Augmentation , 2017, AAAI.

[32]  Xiaogang Wang,et al.  Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[33]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Lucas Beyer,et al.  In Defense of the Triplet Loss for Person Re-Identification , 2017, ArXiv.

[35]  Junchi Yan,et al.  Learning Correspondence Structures for Person Re-Identification , 2017, IEEE Transactions on Image Processing.

[36]  Chao Zhang,et al.  Hard-Aware Deeply Cascaded Embedding , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[37]  Tao Mei,et al.  A Deep Learning-Based Approach to Progressive Vehicle Re-identification for Urban Surveillance , 2016, ECCV.

[38]  Wu Liu,et al.  Large-scale vehicle re-identification in urban surveillance videos , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[39]  Tiejun Huang,et al.  Deep Relative Distance Learning: Tell the Difference between Similar Vehicles , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Xiaoou Tang,et al.  A large-scale car dataset for fine-grained categorization and verification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[43]  Aaron C. Courville,et al.  Generative adversarial networks , 2014, Commun. ACM.

[44]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[45]  Sharath Pankanti,et al.  Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos , 2012, IEEE Transactions on Multimedia.

[46]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Horst Bischof,et al.  Object Reacquisition and Tracking in Large-Scale Smart Camera Networks , 2007, 2007 First ACM/IEEE International Conference on Distributed Smart Cameras.

[48]  Yingli Tian,et al.  Multi-camera Vehicle Tracking and Re-identification on AI City Challenge 2019 , 2019, CVPR Workshops.

[49]  Rama Chellappa,et al.  Attention Driven Vehicle Re-identification and Unsupervised Anomaly Detection for Traffic Understanding , 2019, CVPR Workshops.

[50]  Shaogang Gong,et al.  Multi-Task Mutual Learning for Vehicle Re-Identification , 2019, CVPR Workshops.

[51]  Ling Shao,et al.  Cross-View GAN Based Vehicle Generation for Re-identification , 2017, BMVC.