Camouflage people detection via strong semantic dilation network

Camouflage greatly reduces the probability of the target being discovered, detecting the camouflage object from background is very hard for former object detection methods because they are designed to detect the object in an ideal environment. It brings difficulties to subsequent applications such as battlefield information acquisition, precision guided weapons and object tracking. To deal with the problem, in this paper, a new complete camouflage people detection dataset is firstly constructed, then we propose strong semantic dilation network (SSDN), which is specially designed to detect camouflage people in an end-to-end architecture. SSDN makes full use of semantic information in CNN and dilated convolutions are also added to enlarge the receptive field to find camouflage people. Experiments demonstrate that SSDN perform well in camouflage people detection dataset compared with other object detection methods.

[1]  Yu-Wing Tai,et al.  Salient Region Detection via High-Dimensional Color Transform , 2014, CVPR.

[2]  Sabine Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  P. Sengottuvelan,et al.  Performance of Decamouflaging Through Exploratory Image Analysis , 2008, 2008 First International Conference on Emerging Trends in Engineering and Technology.

[4]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Meng Sun,et al.  Detection of People With Camouflage Pattern Via Dense Deconvolution Network , 2019, IEEE Signal Processing Letters.

[6]  Bolei Zhou,et al.  Network Dissection: Quantifying Interpretability of Deep Visual Representations , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[8]  S. Kastner,et al.  Attention in the real world: toward understanding its neural basis , 2014, Trends in Cognitive Sciences.

[9]  Hui Li,et al.  A Unified Framework for Salient Structure Detection by Contour-Guided Visual Search , 2015, IEEE Transactions on Image Processing.

[10]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[11]  Zhuowen Tu,et al.  Deeply Supervised Salient Object Detection with Short Connections , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jingdong Wang,et al.  Salient Object Detection: A Discriminative Regional Feature Integration Approach , 2013, International Journal of Computer Vision.

[13]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[14]  P. Nagabhushan,et al.  Camouflage Defect Identification: A Novel Approach , 2006, 9th International Conference on Information Technology (ICIT'06).

[15]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Yehezkel Yeshurun,et al.  Convexity-Based Visual Camouflage Breaking , 2001, Comput. Vis. Image Underst..

[17]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.