YOLO v3-Tiny: Object Detection and Recognition using one stage improved model

Object detection has seen many changes in algorithms to improve performance both on speed and accuracy. By the continuous effort of so many researchers, deep learning algorithms are growing rapidly with an improved object detection performance. Various popular applications like pedestrian detection, medical imaging, robotics, self-driving cars, face detection, etc. reduces the efforts of humans in many areas. Due to the vast field and various state-of-the-art algorithms, it is a tedious task to cover all at once. This paper presents the fundamental overview of object detection methods by including two classes of object detectors. In two stage detector covered algorithms are RCNN, Fast RCNN, and Faster RCNN, whereas in one stage detector YOLO v1, v2, v3, and SSD are covered. Two stage detectors focus more on accuracy, whereas the primary concern of one stage detectors is speed. We will explain an improved YOLO version called YOLO v3-Tiny, and then its comparison with previous methods for detection and recognition of object is described graphically.

[1]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Hairong Qi,et al.  RRPN: Radar Region Proposal Network for Object Detection in Autonomous Vehicles , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[3]  Lei Liu,et al.  A Novel YOLOv3-tiny Network for Unmanned Airship Obstacle Detection , 2019, 2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS).

[4]  Sergio Guadarrama,et al.  Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Wenmin Wang,et al.  Better region proposals for pedestrian detection with R-CNN , 2016, 2016 Visual Communications and Image Processing (VCIP).

[6]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Ba Tuan Le,et al.  A Target Detection Model Based on Improved Tiny-Yolov3 Under the Environment of Mining Truck , 2019, IEEE Access.

[9]  Shawn McCann,et al.  Object Detection using Convolutional Neural Networks , 2013 .

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Rachel Huang,et al.  YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[12]  Jean-Yves Ertaud,et al.  Real Time Object Detection, Tracking, and Distance and Motion Estimation based on Deep Learning: Application to Smart Mobility , 2019, 2019 Eighth International Conference on Emerging Security Technologies (EST).

[13]  Yan Song,et al.  Inception Single Shot MultiBox Detector for object detection , 2017, 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[14]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Rui-Sheng Jia,et al.  Mini-YOLOv3: Real-Time Object Detector for Embedded Applications , 2019, IEEE Access.

[16]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[17]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[18]  Shuyuan Yang,et al.  A Survey of Deep Learning-Based Object Detection , 2019, IEEE Access.

[19]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Lin Wang,et al.  Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments , 2020, IEEE Access.

[21]  Anis Koubaa,et al.  Car Detection using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3 , 2018, 2019 1st International Conference on Unmanned Vehicle Systems-Oman (UVS).

[22]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[23]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[24]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[27]  Surya Nepal,et al.  Daedalus: Breaking Non-Maximum Suppression in Object Detection via Adversarial Examples , 2019, ArXiv.