BackgroundNet: Small Dataset-Based Object Detection in Stationary Scenes

Deep learning algorithms have made remarkable progress on the object detection based on huge amount of images. However, it is really difficult to train a model with well generalization for the small-scale images with the limited computation resources. To address the problem, BackgroundNet is proposed to guide the learning process of deep learning-based object detection by the extracted information from the background images. The corresponding network learns not only the features of objects, but also the difference between the object and the non-object area, with the purpose of improve the classification performances for small-scale datasets. Based on YOLO, the background images are employed as the extra input data, and then the input layer of BackgroundNet is not traditionally three RGB but six channels. The experimental results done for coal mine dataset and six public datasets show that the proposed method has better performance when dealing with the object detection with small-scale images and their AP-values are averagely larger than YOLO about 27.8\(\%\) for six public datasets.

[1]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[3]  Huan Yang,et al.  Ensemble prediction-based dynamic robust multi-objective optimization methods , 2019, Swarm Evol. Comput..

[4]  Limin Wang,et al.  Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs , 2016, IEEE Transactions on Image Processing.

[5]  Jian Cheng,et al.  Robust Dynamic Multi-Objective Vehicle Routing Optimization Method , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  Yuanhang Cheng,et al.  A Motion Image Detection Method Based on the Inter-Frame Difference Method , 2014 .

[7]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Lucia Maddalena,et al.  Towards Benchmarking Scene Background Initialization , 2015, ICIAP Workshops.

[9]  Dong Xu,et al.  Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection: A Survey , 2018, IEEE Signal Processing Magazine.

[10]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[13]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[14]  Trevor Darrell,et al.  Simultaneous Deep Transfer Across Domains and Tasks , 2015, ICCV.