论文信息 - Object Detection in Aerial Images Using a Multiscale Keypoint Detection Network

Object Detection in Aerial Images Using a Multiscale Keypoint Detection Network

Automatic object detection in aerial imagery is being increasingly adopted in many applications, such as traffic monitoring, smart cities, and disaster assistance. In keypoint-based detectors, the prediction modules are usually generated from a fixed feature map scale. This configuration significantly limits the ability to detect multiscale objects in aerial scenes. The corner selection module in these detectors often ignores that a category in an aerial image is relatively unitary. In this article, a novel network, called the multiscale keypoint detection network (MKD-Net), is proposed to address these challenges. MKD-Net fuses multiscale layers to generate multiple feature maps for objects of different sizes. During the inference phase, both feature maps can be exploited for predicting corners. Moreover, a category attention module is designed to reduce the channel noise for a single-category scene. Experiments on benchmarks PASCAL VOC and DOTA show promising performance of MKD-Net compared with the baseline network. The code is available on https://github.com/jason-su/MKD-NET.

Dujuan Gu | Zongyue Wang | Guorong Cai | Jinhe Su | JiaJia Liao

[1] Yun Teng,et al. CornerNet-Lite: Efficient Keypoint based Object Detection , 2019, BMVC.

[2] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Guanghui He,et al. Scale Adaptive Proposal Network for Object Detection in Remote Sensing Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[4] Hao Chen,et al. FCOS: Fully Convolutional One-Stage Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[5] Jürgen Beyerer,et al. Fast Deep Vehicle Detection in Aerial Images , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[6] Marios Savvides,et al. Feature Selective Anchor-Free Module for Single-Shot Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[8] Junwei Han,et al. Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[9] Jiebo Luo,et al. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Qinghua Hu,et al. Vision Meets Drones: A Challenge , 2018, ArXiv.

[12] Hei Law,et al. CornerNet: Detecting Objects as Paired Keypoints , 2018, ECCV.

[13] Yuning Jiang,et al. FoveaBox: Beyond Anchor-based Object Detector , 2019, ArXiv.

[14] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Jia Deng,et al. Pixels to Graphs by Associative Embedding , 2017, NIPS.

[16] Qi Tian,et al. CenterNet: Keypoint Triplets for Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17] Wei Li,et al. R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection , 2017, ArXiv.

[18] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[19] Kai Chen,et al. Region Proposal by Guided Anchoring , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[21] Junwei Han,et al. Multi-class geospatial object detection and geographic image classification based on collection of part detectors , 2014 .

[22] Huanxin Zou,et al. Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[23] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.

[24] Larry S. Davis,et al. Soft-NMS — Improving Object Detection with One Line of Code , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[25] Yue Zhang,et al. SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[26] Gang Wan,et al. Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark , 2020, ISPRS Journal of Photogrammetry and Remote Sensing.

[27] Gong Cheng,et al. RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Zhiao Huang,et al. Associative Embedding: End-to-End Learning for Joint Detection and Grouping , 2016, NIPS.

[29] Ke Li,et al. Rotation-Insensitive and Context-Augmented Object Detection in Remote Sensing Images , 2018, IEEE Transactions on Geoscience and Remote Sensing.