ELECTRICITY: An Efficient Multi-camera Vehicle Tracking System for Intelligent City

City-scale multi-camera vehicle tracking is an important task in the intelligent city and traffic management. It is quite challenging with large scale variance, frequent occlusion and appearance variance caused by viewing perspective difference. In this paper, we propose ELECTRICITY, an efficient multi-camera vehicle tracking system with aggregation loss and fast multi-target cross-camera tracking strategy. The proposed system contains four main modules. Firstly, we extract tracklets under single camera view through object detection and multi-object tracking modules which shared the detection features. After that, we match the generated tracklets through a multicamera re-identification module. Finally, we eliminate isolated tracklets and synchronize tracking ids according to the re-identification results. The proposed system wins the first place in the City-Scale Multi-Camera Vehicle Tracking of AI City 2020 Challenge (Track 3)1 with a score of 0.4585.

[1]  Alexander Hauptmann,et al.  MMVG-INF-Etrol@TRECVID 2019: Activities in Extended Video , 2019, TRECVID.

[2]  Lucas Beyer,et al.  In Defense of the Triplet Loss for Person Re-Identification , 2017, ArXiv.

[3]  Jenq-Neng Hwang,et al.  CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Dietrich Paulus,et al.  Simple online and realtime tracking with a deep association metric , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[5]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Xiao Tan,et al.  Multi-camera vehicle tracking and re-identification based on visual and spatial-temporal features , 2019, CVPR Workshops.

[8]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[9]  Liang Zheng,et al.  Towards Real-Time Multi-Object Tracking , 2020, ECCV.

[10]  Lijun Yu,et al.  Adaptive Feature Aggregation for Video Object Detection , 2020, 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW).

[11]  Liang Zheng,et al.  A Locality Aware City-Scale Multi-Camera Vehicle Tracking System , 2019, CVPR Workshops.

[12]  Alexander G. Hauptmann,et al.  Zero-VIRUS*: Zero-shot Vehicle Route Understanding System for Intelligent Transportation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Peng Chen,et al.  Argus: Efficient Activity Detection System for Extended Video Analysis , 2020, 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW).

[15]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[16]  Wei Wu,et al.  Multi-Camera Vehicle Tracking with Powerful Visual Features and Spatial-Temporal Cue , 2019, CVPR Workshops.

[17]  Tao Mei,et al.  PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance , 2018, IEEE Transactions on Multimedia.

[18]  Pascal Fua,et al.  Eliminating Exposure Bias and Metric Mismatch in Multiple Object Tracking , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Andrea Palazzi,et al.  Unsupervised Vehicle Re-identification Using Triplet Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20]  Adam Herout,et al.  Vehicle Re-Identifiation and Multi-Camera Tracking in Challenging City-Scale Environment , 2019, CVPR Workshops.

[21]  Lijun Yu,et al.  Traffic Danger Recognition With Surveillance Cameras Without Training Data , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[22]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[23]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Peng Chen,et al.  Training-free Monocular 3D Event Detection System for Traffic Surveillance , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[25]  Adam Herout,et al.  BoxCars: Improving Fine-Grained Recognition of Vehicles Using 3-D Bounding Boxes in Traffic Surveillance , 2017, IEEE Transactions on Intelligent Transportation Systems.

[26]  Jenq-Neng Hwang,et al.  The 2019 AI City Challenge , 2019, CVPR Workshops.

[27]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[29]  Fabio Tozeto Ramos,et al.  Simple online and realtime tracking , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[30]  Jenq-Neng Hwang,et al.  Multi-Camera Tracking of Vehicles based on Deep Features Re-ID and Trajectory-Based Camera Link Models , 2019, CVPR Workshops.

[31]  Tiejun Huang,et al.  Deep Relative Distance Learning: Tell the Difference between Similar Vehicles , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).