Temporal Weighting Appearance-Aligned Network for Nighttime Video Retrieval

Video-based person re-identification (ReID) aims at re-identifying video sequences of a specified person from videos captured by disjoint cameras. Existing datasets and works on this task all focus on daytime scenarios and cannot adapt well to the nighttime scenarios, which is also of significant importance for practical applications. In this letter, we contribute a new dataset for nighttime video-based ReID, termed NIVIR, which contains 800 identities with over 228,000 images. NIVIR contains video shots under various lighting conditions, different weathers, and complex scenarios, which is consistent with the real nighttime outdoor surveillance. Furthermore, we propose a temporal weighting appearance-aligned network (TWAN) for nighttime video-based ReID, which is composed of a correlation-based appearance-aligned module (CAM) and a temporal weighting module (TWM). Specifically, CAM is proposed to reconstruct the adjacent feature maps to guarantee the appearance alignment between the central frame and its adjacent frames. TWM is designed to evaluate the frame quality of a tracklet and generate temporal weights to enhance the video representation. Extensive experiments conducted on our new NIVIR dataset demonstrate that the proposed TWAN outperforms the state-of-the-art methods. We believe that our NIVIR dataset and the comprehensive attempts for solving the nighttime ReID problem will push forward the development of the ReID research community.

[1]  Weijian Ruan,et al.  Semantic-Guided Pixel Sampling for Cloth-Changing Person Re-Identification , 2021, IEEE Signal Processing Letters.

[2]  Mang Ye,et al.  TICNet: A Target-Insight Correlation Network for Object Tracking , 2021, IEEE Transactions on Cybernetics.

[3]  Yi Yu,et al.  Correlation Discrepancy Insight Network for Video Re-identification , 2020, ACM Trans. Multim. Comput. Commun. Appl..

[4]  Xilin Chen,et al.  Temporal Complementary Learning for Video Person Re-Identification , 2020, ECCV.

[5]  Xilin Chen,et al.  Appearance-Preserving 3D Convolution for Video-based Person Re-identification , 2020, ECCV.

[6]  Simone Calderara,et al.  Robust Re-Identification by Multiple Views Knowledge Distillation , 2020, ECCV.

[7]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[8]  Ruimin Hu,et al.  SIST: Online Scale-Adaptive Object tracking with Stepwise Insight , 2020, Neurocomputing.

[9]  Tao Mei,et al.  POINet: Pose-Guided Ovonic Insight Network for Multi-Person Pose Tracking , 2019, ACM Multimedia.

[10]  Shiguang Shan,et al.  Temporal Knowledge Propagation for Image-to-Video Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Ruimin Hu,et al.  Multi-Correlation Filters With Triangle-Structure Constraints for Object Tracking , 2019, IEEE Transactions on Multimedia.

[12]  Wei Jiang,et al.  Bag of Tricks and a Strong Baseline for Deep Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Tiejun Huang,et al.  Multi-scale 3D Convolution Network for Video Based Person Re-Identification , 2018, AAAI.

[14]  Lin Wu,et al.  Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification , 2018, IEEE Transactions on Multimedia.

[15]  Zheng Wang,et al.  Video-Based Person Re-Identification via Self Paced Weighting , 2018, AAAI.

[16]  Longhui Wei,et al.  Person Transfer GAN to Bridge Domain Gap for Person Re-identification , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Pong C. Yuen,et al.  Dynamic Label Graph Matching for Unsupervised Video Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[18]  Yu Cheng,et al.  Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Ruimin Hu,et al.  Object tracking via online trajectory optimization with multi-feature fusion , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[20]  Michael Felsberg,et al.  ECO: Efficient Convolution Operators for Tracking , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Qi Tian,et al.  MARS: A Video Benchmark for Large-Scale Person Re-Identification , 2016, ECCV.

[22]  Ruimin Hu,et al.  Boosted local classifiers for visual tracking , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[23]  Xiao-Yuan Jing,et al.  Video-Based Person Re-Identification by Simultaneously Learning Intra-Video and Inter-Video Distance Metrics , 2016, IEEE Transactions on Image Processing.

[24]  Jesús Martínez del Rincón,et al.  Recurrent Convolutional Network for Video-Based Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Xiang Li,et al.  Top-Push Video-Based Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Shaogang Gong,et al.  Person Re-Identification by Discriminative Selection in Video Ranking , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[29]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[30]  Shaogang Gong,et al.  Person Re-identification by Video Ranking , 2014, ECCV.

[31]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Horst Bischof,et al.  Person Re-identification by Descriptive and Discriminative Classification , 2011, SCIA.

[34]  Jie Shao,et al.  Hierarchical Pseudo Labeling Based Embranchment Learning for One-Shot Person Re-Identification , 2022, IEEE Signal Processing Letters.

[35]  Haifeng Hu,et al.  Two-Branch Asymmetric Model With Alternately Clustering for Unsupervised Person Re-Identification , 2022, IEEE Signal Processing Letters.