Improved SSD-Based Multi-scale Pedestrian Detection Algorithm

This paper proposes an improved pedestrian detection algorithm based on SSD (Single Shot MultiBox Detector) model. The algorithm mainly solves the problem that the detection effect on small-scale pedestrians is not ideal when the SSD pedestrian detection algorithm performs the pedestrian detection task. The algorithm improves the original SSD model by introducing a shallower feature map. The pedestrian detection is carried out by using the characteristics of different output layers in the model, and the multi-layer detection results are combined to improve the detection effect of small-scale pedestrian. The Squeeze-and-Excitation Module is introduced in the additional feature layer of the SSD model. The improved model can automatically acquire the importance of each channel by self-learning, according to which useful features can be enhanced and features that are not useful for current tasks can be suppressed, thereby further improving the detection ability of the algorithm for small-scale pedestrians. Experiments show that the accuracy of the proposed algorithm in the INRIA dataset reaches 93.17%, and the missed detection rate is as low as 8.72%.

[1]  Joon Hee Han,et al.  Local Decorrelation For Improved Pedestrian Detection , 2014, NIPS.

[2]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[5]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[6]  Alexei A. Efros,et al.  An empirical study of context in object detection , 2009, CVPR.

[7]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Shang Gao,et al.  A multi-scale face detection algorithm based on improved SSD model , 2019, ACM TUR-C.

[10]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[11]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[12]  Sudipta Roy,et al.  Video shot boundary detection: A review , 2015, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[13]  Bernt Schiele,et al.  Ten Years of Pedestrian Detection, What Have We Learned? , 2014, ECCV Workshops.

[14]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  A. Torralba,et al.  The role of context in object recognition , 2007, Trends in Cognitive Sciences.