DIOD: Fast and Efficient Weakly Semi-Supervised Deep Complex ISAR Object Detection

Inverse synthetic aperture radar (ISAR) object detection is an important and challenging problem in computer vision. This paper proposes deep ISAR object detection (DIOD), a fast and efficient weakly semi-supervised method built on advanced region proposal networks (ARPNs) and weakly semi-supervised deep joint sparse learning. It makes three contributions: 1) To generate high-level region proposals and localize potential ISAR objects robustly, accurately, and quickly, ARPN combines a multiscale fully convolutional region proposal network with a region proposal classification and ranking strategy. ARPN shares its convolutional layers with the Inception-ResNet-based system, so proposals are computed at almost no extra cost while performance remains strong. 2) To address the shortage of annotated training data, which is especially acute in the ISAR field, a weakly semi-supervised training method uses both weakly annotated and unannotated ISAR images: a pairwise-ranking loss handles the weakly annotated images, while a triplet-ranking loss harnesses the unannotated images. 3) To further improve the accuracy and speed of the whole system, a novel sharable-individual mechanism and a relational-regularized joint sparse learning strategy learn the shared and individual features together with their correlations, yielding more discriminative and comprehensive representations. Extensive experiments on two real-world ISAR datasets show that DIOD outperforms existing state-of-the-art methods, achieving higher accuracy with shorter execution time.
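
The abstract names a pairwise-ranking loss and a triplet-ranking loss but does not give their formulas. As a rough illustration only, the sketch below shows the generic margin-based (hinge) forms that these names usually refer to. The function names, the default margin of 1.0, and the use of squared Euclidean distance are assumptions made for this example and are not taken from the paper.

```python
from typing import Sequence


def pairwise_ranking_loss(score_pos: float, score_neg: float,
                          margin: float = 1.0) -> float:
    """Generic hinge-style pairwise ranking loss (assumed form, not the paper's).

    Penalizes the case where the item that should rank higher (score_pos)
    does not beat the other item (score_neg) by at least `margin`.
    """
    return max(0.0, margin - (score_pos - score_neg))


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def triplet_ranking_loss(anchor: Sequence[float],
                         positive: Sequence[float],
                         negative: Sequence[float],
                         margin: float = 1.0) -> float:
    """Generic triplet ranking loss (assumed form, not the paper's).

    Encourages the anchor to be closer to the positive sample than to the
    negative sample by at least `margin` in feature space.
    """
    d_pos = squared_distance(anchor, positive)
    d_neg = squared_distance(anchor, negative)
    return max(0.0, margin + d_pos - d_neg)


if __name__ == "__main__":
    # Correctly ordered pair with enough margin: no penalty.
    print(pairwise_ranking_loss(2.0, 0.5))
    # Triplet where the negative is closer to the anchor than the positive.
    print(triplet_ranking_loss([0.0, 0.0], [2.0, 0.0], [1.0, 0.0]))
```

In both cases the loss is zero once the desired ordering holds with the required margin, so only violating pairs or triplets contribute to training. How DIOD forms those pairs and triplets from weakly annotated and unannotated ISAR images is not described in the abstract.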
