LW-ODF: A Light-Weight Object Detection Framework for Optical Remote Sensing Imagery

In this paper, we propose extracting multi-scale, rotation-insensitive deep features to address the scale variation and rotation of objects in geospatial object detection. To this end, we develop a novel object detection framework in which a rotation-insensitive convolutional neural network extracts a multi-scale, direction-insensitive feature representation, and the learned features are then fed into an ensemble classifier trained with fast feature pyramids. This non-end-to-end learning strategy reduces the computational cost without additional loss in performance, yielding an effective and efficient light-weight object detection framework. Experimental results on the NWPU VHR-10 dataset demonstrate that the proposed framework outperforms several state-of-the-art baselines.
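To make the idea of multi-scale, rotation-insensitive features concrete, the following is a minimal sketch and not the network described in the paper. It assumes hand-written gradient features in place of the CNN, pools only over the four right-angle rotations, and uses plain 2x2 average pooling as a stand-in for the fast feature pyramid. All function names are hypothetical.

```python
import numpy as np

def directional_features(img):
    """Two orientation-dependent responses: mean |d/dx| and mean |d/dy|."""
    gy, gx = np.gradient(img)
    return np.array([np.abs(gx).mean(), np.abs(gy).mean()])

def rotation_pooled(img):
    """Max-pool the responses over the four right-angle rotations of the window.

    Assumption: only 0/90/180/270 degree rotations are considered here.
    """
    responses = [directional_features(np.rot90(img, k)) for k in range(4)]
    return np.max(responses, axis=0)

def downsample2(img):
    """Halve the resolution by 2x2 average pooling (side lengths must be even)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def multiscale_descriptor(img, levels=3):
    """Concatenate rotation-pooled features from each level of a small pyramid.

    Assumption: both side lengths are divisible by 2**levels.
    """
    img = np.asarray(img, dtype=float)
    parts = []
    for _ in range(levels):
        parts.append(rotation_pooled(img))
        img = downsample2(img)
    return np.concatenate(parts)

# Usage: the descriptor of a window and of its 90-degree rotation coincide.
rng = np.random.default_rng(0)
window = rng.random((16, 16))
desc = multiscale_descriptor(window)
desc_rot = multiscale_descriptor(np.rot90(window))
print(np.allclose(desc, desc_rot))
```

In a non-end-to-end pipeline of the kind the abstract describes, a descriptor like this would be computed once per window and handed to a separately trained classifier, so the feature extractor is not updated during classifier training.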
