Military object detection in defense using multi-level capsule networks

Automatic target detection plays a significant role during war operations. The concept behind automatic target detection in war is military object recognition from the captured images. For object recognition in the given image, convolutional neural network (CNN) architectures are used efficiently. However, CNN suffers from a problem of location invariant and their performance depends mainly on the size of the training set. Generally, the available training data will be in less proportion for military objects due to its operational and security issues. Due to these two issues of CNN, the performance of CNN may degrade abruptly. To address military object recognition, a relatively new neural network architecture based on capsule network (CapsNet) is introduced in this paper. A variant of CapsNet called the multi-level CapsNet framework is projecting in this paper for efficient military object recognition under the case of a small training dataset. The validation of the introduced framework is done on a dataset of military objects collected from the Internet. The dataset contains particularly five different military objects and similar civil ones. Experiments demonstrate that the proposed framework can accomplish a high recognition precision, superior to many other algorithms such as conventional support vector machines and transfer learning-based CNNs.

[1]  Deepu Rajan,et al.  Video Processing From Electro-Optical Sensors for Object Detection and Tracking in a Maritime Environment: A Survey , 2016, IEEE Transactions on Intelligent Transportation Systems.

[2]  Yu Li,et al.  LIME: Low-Light Image Enhancement via Illumination Map Estimation. , 2017, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[3]  Alan L. Yuille,et al.  Semi-Supervised Sparse Representation Based Classification for Face Recognition With Insufficient Labeled Samples , 2016, IEEE Transactions on Image Processing.

[4]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[5]  Gora Chand Nandi,et al.  Robust and accurate feature selection for humanoid push recovery and classification: deep learning approach , 2017, Neural Computing and Applications.

[6]  B. Janakiramaiah,et al.  Deep learning-based detection and classification of adenocarcinoma cell nuclei , 2021 .

[7]  Nuri Yilmazer,et al.  Application of Object Detection and Tracking Techniques for Unmanned Aerial Vehicles , 2015, Complex Adaptive Systems.

[8]  P. N. Druzhkov,et al.  A survey of deep learning methods and software tools for image classification and object detection , 2016, Pattern Recognition and Image Analysis.

[9]  Shi-Min Hu,et al.  Global Contrast Based Salient Region Detection , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Huchuan Lu,et al.  Saliency Detection with Recurrent Fully Convolutional Networks , 2016, ECCV.

[11]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[13]  Nanning Zheng,et al.  Salient Object Detection: A Discriminative Regional Feature Integration Approach , 2013, International Journal of Computer Vision.

[14]  Jiebo Luo,et al.  Multi-modal deep feature learning for RGB-D object detection , 2017, Pattern Recognit..

[15]  Yao Li,et al.  Contextual Hypergraph Modeling for Salient Object Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[16]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[17]  Kasun De Zoysa,et al.  Enhanced bag-of-words model for phrase-level sentiment analysis , 2014, 2014 14th International Conference on Advances in ICT for Emerging Regions (ICTer).

[18]  Naif Alajlan,et al.  Assisting the Visually Impaired in Multi-object Scene Description Using OWA-Based Fusion of CNN Models , 2020 .

[19]  Hüseyin Demirel,et al.  Moving object detection using an adaptive background subtraction method based on block-based structure in dynamic scene , 2018, Optik.

[20]  Peng Zhang,et al.  Military Object Detection Based on Optimal Gabor Filtering and Deep Feature Pyramid Network , 2019 .

[21]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[22]  B. Schölkopf,et al.  Graph-Based Visual Saliency , 2007 .

[23]  Li Wang,et al.  Fast Threshold Image Segmentation Based on 2D Fuzzy Fisher and Random Local Optimized QPSO. , 2017, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[24]  Jiayi Ma,et al.  Infrared and visible image fusion methods and applications: A survey , 2018, Inf. Fusion.

[25]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[26]  L. V. Narasimha Prasad,et al.  Diabetic retinopathy detection and classification using capsule networks , 2021, Complex & Intelligent Systems.

[27]  B. Janakiramaiah,et al.  Automatic alert generation in a surveillance systems for smart city environment using deep learning algorithm , 2020, Evolutionary Intelligence.

[28]  Zhuowen Tu,et al.  Deeply Supervised Salient Object Detection with Short Connections , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Hae-Bum Yun,et al.  Automatic Pavement Object Detection Using Superpixel Segmentation Combined With Conditional Random Field , 2018, IEEE Transactions on Intelligent Transportation Systems.

[30]  Yu Li,et al.  LIME: Low-Light Image Enhancement via Illumination Map Estimation , 2017, IEEE Transactions on Image Processing.

[31]  Hans Christian Palm,et al.  Detection of military objects in LADAR images , 2008 .

[32]  Arya Raj,et al.  A Comparative Study on Target Detection in Military Field Using Various Digital Image Processing Techniques , 2016 .

[33]  Yansheng Li,et al.  Feature guided Gaussian mixture model with semi-supervised EM and local geometric constraint for retinal image registration , 2017, Inf. Sci..

[34]  Zhi Yang,et al.  Deep transfer learning for military object recognition under small training set condition , 2019, Neural Computing and Applications.

[35]  Henry Schneiderman,et al.  Feature-centric evaluation for efficient cascaded object detection , 2004, CVPR 2004.

[36]  Aníbal R. Figueiras-Vidal,et al.  Pattern classification with missing data: a review , 2010, Neural Computing and Applications.

[37]  Qi Tian,et al.  Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[38]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Yongchun Xie,et al.  Fast Threshold Image Segmentation Based on 2D Fuzzy Fisher and Random Local Optimized QPSO , 2017, IEEE Transactions on Image Processing.

[42]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[43]  Jian Sun,et al.  Convolutional neural networks at constrained time cost , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).