FPSSD7: Real-time Object Detection using 7 Layers of Convolution based on SSD

In recent years, attention has been paid to developing object detection methods from images, based on deep learning. In particular, toward self-driving cars, it is essential to make the detection as accurately as possible, as quickly as possible, as economically as possible. Here, we focus on SSD: Single Shot MultiBox Detector (SSD300) and attempt to improve it with less GPU memory. We propose a new method called “FPSSD7” (Feature Pyramid SSD7), which has less convolutional layers, keeping semantic information. For this purpose, we develop a method to interpolate semantic information between lower layers with high resolution feature maps and upper layer with low resolution features maps. We conducted comparative experiments with the conventional methods using Udacity Annotated Driving Dataset. As the result, we demonstrate that our proposed method outperform the conventional methods.

[1]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[2]  Shifeng Zhang,et al.  Single-Shot Refinement Neural Network for Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[5]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[8]  Wei Liu,et al.  DSSD : Deconvolutional Single Shot Detector , 2017, ArXiv.

[9]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[12]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).