SAR target recognition and posture estimation using spatial pyramid pooling within CNN

Many convolution neural networks(CNN) architectures have been proposed to strengthen the performance on synthetic aperture radar automatic target recognition (SAR-ATR) and obtained state-of-art results on targets classification on MSTAR database, but few methods concern about the estimation of depression angle and azimuth angle of targets. To get better effect on learning representation of hierarchies of features on both 10-class target classification task and target posture estimation tasks, we propose a new CNN architecture with spatial pyramid pooling(SPP) which can build high hierarchy of features map by dividing the convolved feature maps from finer to coarser levels to aggregate local features of SAR images. Experimental results on MSTAR database show that the proposed architecture can get high recognition accuracy as 99.57% on 10-class target classification task as the most current state-of-art methods, and also get excellent performance on target posture estimation tasks which pays attention to depression angle variety and azimuth angle variety. What’s more, the results inspire us the application of deep learning on SAR target posture description.

[1]  Haipeng Wang,et al.  Target Classification Using the Deep Convolutional Networks for SAR Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Chris Kreucher,et al.  Modern approaches in deep learning for SAR ATR , 2016, SPIE Defense + Security.

[3]  Hongwei Liu,et al.  Convolutional Neural Network With Data Augmentation for SAR Target Recognition , 2016, IEEE Geoscience and Remote Sensing Letters.

[4]  Shaun Quegan,et al.  Matching map features to synthetic aperture radar (SAR) images using template matching , 1992, IEEE Trans. Geosci. Remote. Sens..

[5]  Li Shi,et al.  Application of Scale Invariant Feature Transformation to SAR Imagery Registration , 2008 .

[6]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[7]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[8]  H. Scott Clouse,et al.  Convolutional neural networks for synthetic aperture radar classification , 2016, SPIE Defense + Security.

[9]  Yue Li,et al.  Multiscale convolutional neural network for the detection of built-up areas in high-resolution SAR images , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[10]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Shiyong Cui,et al.  Convolutional Neural Network for SAR image classification at patch level , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).