论文信息 - Real-Time Object Detection Using Efficient Convolutional Networks

Real-Time Object Detection Using Efficient Convolutional Networks

While recent object detection approaches have greatly improved the accuracy and robustness, the detection speed remains a Challenge for the community. In this paper, we propose an efficient fully convolutional network (EFCN) for real time object detection. EFCN employs the lightweight MobileNet [1] as the base network to significantly reduce the computation cost. Meanwhile, it detects objects in feature maps with multiple scales, and deploys a refining module on the top of each of these feature maps to alleviate the accuracy loss brought by the simple base network. We evaluate EFCN on the challenging KITTI [2] dataset and compare it with the state-of-the-art methods. The results show that EFCN keeps a good balance between speed and accuracy, it has \(25{\times }\) fewer parameters and is up to \(31{\times }\) faster than Faster-RCNN [3] while maintaining similar or better accuracy.

Xian Zhou | Xi Zhou | Youji Feng

[1] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[2] Yu-Wing Tai,et al. Accurate Single Stage Detector Using Recurrent Rolling Convolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Kavita Bala,et al. Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Wei Liu,et al. ParseNet: Looking Wider to See Better , 2015, ArXiv.

[6] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Jitendra Malik,et al. Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[10] Fuchun Sun,et al. HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[13] Rogério Schmidt Feris,et al. A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection , 2016, ECCV.

[14] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15] Andreas Geiger,et al. Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.