Multi-Scale Vehicle Detection for Foreground-Background Class Imbalance with Improved YOLOv2

Vehicle detection is a challenging task in computer vision. In recent years, numerous vehicle detection methods have been proposed. Since the vehicles may have varying sizes in a scene, while the vehicles and the background in a scene may be with imbalanced sizes, the performance of vehicle detection is influenced. To obtain better performance on vehicle detection, a multi-scale vehicle detection method was proposed in this paper by improving YOLOv2. The main contributions of this paper include: (1) a new anchor box generation method Rk-means++ was proposed to enhance the adaptation of varying sizes of vehicles and achieve multi-scale detection; (2) Focal Loss was introduced into YOLOv2 for vehicle detection to reduce the negative influence on training resulting from imbalance between vehicles and background. The experimental results upon the Beijing Institute of Technology (BIT)-Vehicle public dataset demonstrated that the proposed method can obtain better performance on vehicle localization and recognition than that of other existing methods.

[1]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[2]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[3]  Moongu Jeon,et al.  Vehicle pose detection using region based convolutional neural network , 2016, 2016 International Conference on Control, Automation and Information Sciences (ICCAIS).

[4]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yang Gao,et al.  Scale optimization for full-image-CNN vehicle detection , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[8]  Bernard Abayowa,et al.  Fast Vehicle Detection in Aerial Imagery , 2017, ArXiv.

[9]  Qian Zhang,et al.  An Improved YOLOv2 for Vehicle Detection , 2018, Sensors.

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[13]  Yongjin Jeong,et al.  Front collision warning based on vehicle detection using CNN , 2016, 2016 International SoC Design Conference (ISOCC).

[14]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[15]  Lin Lei,et al.  Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining , 2017, Sensors.

[16]  A. Jazayeri,et al.  Vehicle Detection and Tracking in Car Video Based on Motion Model , 2011, IEEE Transactions on Intelligent Transportation Systems.

[17]  Lianfa Bai,et al.  Vehicle Detection Based on Superpixel and Improved HOG in Aerial Images , 2017, ICIG.

[18]  Khamron Sunat,et al.  Comparative Study of Computational Time that HOG-Based Features Used for Vehicle Detection , 2017, IC2IT.

[19]  Xuelong Li,et al.  Linear SVM classification using boosting HOG features for vehicle detection in low-altitude airborne videos , 2011, 2011 18th IEEE International Conference on Image Processing.

[20]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Р Ю Чуйков,et al.  Обнаружение транспортных средств на изображениях загородных шоссе на основе метода Single shot multibox Detector , 2017 .

[22]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Kuo-Chin Fan,et al.  Vehicle Detection Using Normalized Color and Edge Map , 2007, IEEE Transactions on Image Processing.

[24]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[25]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Yong Tang,et al.  Vehicle detection and recognition for intelligent traffic surveillance system , 2017, Multimedia Tools and Applications.

[27]  Ke Chen,et al.  Car type recognition with Deep Neural Networks , 2016, 2016 IEEE Intelligent Vehicles Symposium (IV).

[28]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[29]  Huanxin Zou,et al.  Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[30]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Yunde Jia,et al.  Vehicle Type Classification Using a Semisupervised Convolutional Neural Network , 2015, IEEE Transactions on Intelligent Transportation Systems.