Vehicle Detection of Multi-source Remote Sensing Data Using Active Fine-tuning Network

Abstract Vehicle detection in remote sensing images has attracted increasing interest in recent years. However, detection performance is limited by the lack of well-annotated samples, especially in densely crowded scenes. Furthermore, although multiple remotely sensed data sources are available, efficiently exploiting the useful information in multi-source data for better vehicle detection remains challenging. To address these issues, a multi-source active fine-tuning vehicle detection (Ms-AFt) framework is proposed, which integrates transfer learning, segmentation, and active classification into a unified framework for auto-labeling and detection. The proposed Ms-AFt first employs a fine-tuning network to generate a vehicle training set from an unlabeled dataset. To cope with the diversity of vehicle categories, a multi-source segmentation branch is then designed to construct additional candidate object sets. High-quality vehicles are then separated out by a specially designed attentive classification network. Finally, all three branches are combined to achieve vehicle detection. Extensive experiments on two open ISPRS benchmark datasets, namely the Vaihingen village and Potsdam city datasets, demonstrate the superiority and effectiveness of the proposed Ms-AFt for vehicle detection. In addition, the generalization ability of Ms-AFt in dense remote sensing scenes is further verified on stereo aerial imagery of a large camping site.
