PMT-Net: Progressive Multi-Task Network for one-shot Person Re-Identification

Abstract Most existing person re-identification (RE-ID) algorithms require abundant labeled data from paired non-overlapping camera views in the fully supervised scenario. However, the fully supervised RE-ID suffers from the limited availability of labeled training samples due to the sharply increased cost of manual efforts. To tackle this problem, a novel Progressive Multi-Task Network (PMT-Net) for person RE-ID is proposed. PMT-Net initializes a model using only one labeled sample for each identity, and it iteratively optimizes the model by sampling the most reliable pseudo labels dynamically from unlabeled samples. Firstly, pedestrian attributes recognition is incorporated as an auxiliary task to learn discriminative features. Then, based on the discriminative features, the identity label for unlabeled samples is estimated by the distance between the labeled samples and unlabeled samples in feature space. In addition, to enhance the accuracy of label estimation for the unlabeled samples, a semi-supervised clustering method, named Distance Ranked Weight Clustering (DRW-Clustering) is designed. The clustering method weights partial unlabeled samples by the indexed ordinal of distance sorting, so that it can find the real cluster center quickly and effectively. Extensive comparative evaluation experiments are conducted on Market1501 and DukeMTMC-reID datasets, and the experimental results indicate that the proposed method achieves performance competitive or better than that of the state-of-the-art for one-shot person RE-ID.

[1]  Xiaogang Wang,et al.  Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Q. Tian,et al.  GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval , 2017, ACM Multimedia.

[3]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[4]  Qi Tian,et al.  Enhancing Person Re-identification in a Self-Trained Subspace , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[5]  Zhihui Li,et al.  Deep Feature Learning via Structured Graph Laplacian Embedding for Person Re-Identification , 2017, Pattern Recognit..

[6]  Pong C. Yuen,et al.  Dynamic Label Graph Matching for Unsupervised Video Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[8]  Wei Li,et al.  Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Tao Mei,et al.  PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance , 2018, IEEE Transactions on Multimedia.

[10]  Jian-Huang Lai,et al.  Supplementary Material for “Unsupervised Person Re-identification by Soft Multilabel Learning” , 2019 .

[11]  Carlo Tomasi,et al.  Features for Multi-target Multi-camera Tracking and Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  Huchuan Lu,et al.  Stepwise Metric Promotion for Unsupervised Video Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[13]  Nanning Zheng,et al.  Deep self-paced learning for person re-identification , 2017, Pattern Recognit..

[14]  Shaogang Gong,et al.  Person Re-Identification by Unsupervised Video Matching , 2016, Pattern Recognit..

[15]  Quoc V. Le,et al.  Understanding and Simplifying One-Shot Architecture Search , 2018, ICML.

[16]  Xiaojing Chen,et al.  Sparse representation matching for person re-identification , 2016, Inf. Sci..

[17]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Dapeng Tao,et al.  Top distance regularized projection and dictionary learning for person re-identification , 2019, Inf. Sci..

[19]  Jing Xu,et al.  Attention-Aware Compositional Network for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[21]  Tao Mei,et al.  Part-Aligned Bilinear Representations for Person Re-identification , 2018, ECCV.

[22]  Yu Wu,et al.  Exploit the Unknown Gradually: One-Shot Video-Based Person Re-identification by Stepwise Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Yu Wu,et al.  Progressive Learning for Person Re-Identification With One Example , 2019, IEEE Transactions on Image Processing.

[24]  Gang Wang,et al.  Dual Attention Matching Network for Context-Aware Feature Sequence Based Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Liang Zheng,et al.  Unsupervised Person Re-identification: Clustering and Fine-tuning , 2017 .

[26]  Yang Yang,et al.  Unsupervised Learning of Multi-Level Descriptors for Person Re-Identification , 2017, AAAI.

[27]  Deyu Meng,et al.  Co-Saliency Detection via a Self-Paced Multiple-Instance Learning Framework , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Zhedong Zheng,et al.  Joint Discriminative and Generative Learning for Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Yi Yang,et al.  Pedestrian Alignment Network for Large-scale Person Re-Identification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Liang Zheng,et al.  Re-ranking Person Re-identification with k-Reciprocal Encoding , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Muhittin Gokmen,et al.  Human Semantic Parsing for Person Re-identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32]  Wei-Shi Zheng,et al.  Distilled Person Re-Identification: Towards a More Scalable System , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Changxin Gao,et al.  Pose-guided spatiotemporal alignment for video-based person Re-identification , 2020, Inf. Sci..

[34]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Pong C. Yuen,et al.  Semi-supervised Region Metric Learning for Person Re-identification , 2018, International Journal of Computer Vision.