Towards Real-Time Vehicle Detection on Edge Devices with Nvidia Jetson TX2

With the development of deep convolutional networks, significant advances in object detection task have been achieved. However, for applications in autonomous vehicles, it is necessary to have an efficient object detector that can process rapidly while maintaining high accuracy. This study presents our implementation and performance evaluation of two object detectors EfficientDet-Lite and Yolov3-tiny on Nvidia Jetson TX2 mobile embedded platform. Our experimental results on the KITTI dataset demonstrate that it is possible to achieve real-time and highly accurate object detection on edge devices with constrained resources.

[1]  Quoc V. Le,et al.  EfficientDet: Scalable and Efficient Object Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Hao Chen,et al.  FCOS: Fully Convolutional One-Stage Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  Shuxiang Guo,et al.  Quadrotor Vision-based Localization for Amphibious Robots in Amphibious Area , 2019, 2019 IEEE International Conference on Mechatronics and Automation (ICMA).

[5]  Rakesh Mehta,et al.  Object detection at 200 Frames Per Second , 2018, ECCV Workshops.

[6]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Liang-Bi Chen,et al.  A Deep Learning-Based Intelligent Medicine Recognition System for Chronic Patients , 2019, IEEE Access.

[8]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[10]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[11]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[12]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Changhua Yang,et al.  An Embedded Deep Learning Object Detection Model For Traffic In Asian Countries , 2020, 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[14]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[15]  Hei Law,et al.  CornerNet: Detecting Objects as Paired Keypoints , 2018, ECCV.

[16]  Alexander Shustanov,et al.  A Method for Traffic Sign Recognition with CNN using GPU , 2017, SIGMAP.

[17]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Huajun Feng,et al.  Libra R-CNN: Towards Balanced Learning for Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[22]  Qi Tian,et al.  CenterNet: Keypoint Triplets for Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.