VisDrone-MOT2021: The Vision Meets Drone Multiple Object Tracking Challenge Results

Vision Meets Drone: Multiple Object Tracking (VisDrone-MOT2021) challenge – the forth annual activity organized by the VisDrone team – focuses on benchmarking UAV MOT algorithms in realistic challenging environments. It is held in conjunction with ICCV 2021. VisDrone-MOT2021 contains 96 video sequences in total, including 56 sequences (~24K frames) for training, 7 sequences (∼3K frames) for validation and 33 sequences (~13K frames) for testing. Bounding-box annotations for novel object categories are provided every frame and temporally consistent instance IDs are also given. Additionally, occlusion ratio and truncation ratio are provided as extra useful annotations. The results of eight state-of-the-art MOT algorithms are reported and discussed. We hope that our VisDrone-MOT2021 challenge will facilitate future research and applications in the field of UAV vision. The website of our challenge can be found at http://www.aiskyeye.com/.

[1]  Xiong Chen,et al.  Learning Discriminative Features with Multiple Granularities for Person Re-Identification , 2018, ACM Multimedia.

[2]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Ming-Hsuan Yang,et al.  UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking , 2015, Comput. Vis. Image Underst..

[4]  Ramakant Nevatia,et al.  Online Learned Discriminative Part-Based Appearance Models for Multi-human Tracking , 2012, ECCV.

[5]  Huanlin Liu,et al.  Appearance Guidance Attention for Multi-Object Tracking , 2021, IEEE Access.

[6]  James M. Rehg,et al.  Multi-object Tracking with Neural Gating Using Bilinear LSTM , 2018, ECCV.

[7]  Ming-Hsuan Yang,et al.  DETRAC: A New Benchmark and Protocol for Multi-Object Tracking , 2015, ArXiv.

[8]  Deva Ramanan,et al.  TAO: A Large-Scale Benchmark for Tracking Any Object , 2020, ECCV.

[9]  Volker Eiselein,et al.  High-Speed tracking-by-detection without using image information , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[10]  Ramakant Nevatia,et al.  Global data association for multi-object tracking using network flows , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Lu Zhang,et al.  Preserving Structure in Model-Free Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Haibin Ling,et al.  Vision Meets Drones: Past, Present and Future , 2020, ArXiv.

[13]  Ling Shao,et al.  Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ruigang Yang,et al.  A Unified Object Motion and Affinity Model for Online Multi-Object Tracking , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Haibin Ling,et al.  GMOT-40: A Benchmark for Generic Multiple Object Tracking , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Nicu Sebe,et al.  The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline , 2019, International Journal of Computer Vision.

[17]  Wenhan Luo,et al.  Multiple object tracking: A literature review , 2014, Artif. Intell..

[18]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[19]  Alan Yuille,et al.  DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution , 2020, ArXiv.

[20]  Wei Jiang,et al.  Bag of Tricks and a Strong Baseline for Deep Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[21]  Tobias Senst,et al.  Extending IOU Based Multi-Object Tracking by Visual Information , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[22]  Fan Yang,et al.  LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Dietrich Paulus,et al.  Simple online and realtime tracking with a deep association metric , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[24]  Hua Yang,et al.  Online Multi-Object Tracking with Dual Matching Attention Networks , 2018, ECCV.

[25]  Bin Dong,et al.  VisDrone-MOT2020: The Vision Meets Drone Multiple Object Tracking Challenge Results , 2020, ECCV Workshops.

[26]  Siwei Lyu,et al.  Learning Non-Uniform Hypergraph for Multi-Object Tracking , 2018, AAAI.

[27]  Vladlen Koltun,et al.  Tracking Objects as Points , 2020, ECCV.

[28]  Andrea Cavallaro,et al.  Omni-Scale Feature Learning for Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[30]  Hui Cheng,et al.  Detection-Free Multiobject Tracking by Reconfigurable Inference With Bundle Representations , 2016, IEEE Transactions on Cybernetics.

[31]  Erol Sahin,et al.  Swarm Robotics: From Sources of Inspiration to Domains of Application , 2004, Swarm Robotics.

[32]  Xinggang Wang,et al.  A Simple Baseline for Multi-Object Tracking , 2020, ArXiv.

[33]  Daniel Cremers,et al.  MOT20: A benchmark for multi object tracking in crowded scenes , 2020, ArXiv.

[34]  Guoqiang Yu,et al.  muSSP: Efficient Min-cost Flow Algorithm for Multi-object Tracking , 2019, NeurIPS.

[35]  Qiang Wang,et al.  Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking , 2021, ArXiv.

[36]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[37]  Qinghua Hu,et al.  Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).