ISDNet: AI-enabled Instance Segmentation of Aerial Scenes for Smart Cities

Aerial scenes captured by UAVs have immense potential in IoT applications related to urban surveillance, road and building segmentation, land cover classification, and so on, which are necessary for the evolution of smart cities. The advancements in deep learning have greatly enhanced visual understanding, but the domain of aerial vision remains largely unexplored. Aerial images pose many unique challenges for performing proper scene parsing such as high-resolution data, small-scaled objects, a large number of objects in the camera view, dense clustering of objects, background clutter, and so on, which greatly hinder the performance of the existing deep learning methods. In this work, we propose ISDNet (Instance Segmentation and Detection Network), a novel network to perform instance segmentation and object detection on visual data captured by UAVs. This work enables aerial image analytics for various needs in a smart city. In particular, we use dilated convolutions to generate improved spatial context, leading to better discrimination between foreground and background features. The proposed network efficiently reuses the segment-mask features by propagating them from early stages using residual connections. Furthermore, ISDNet makes use of effective anchors to accommodate varying object scales and sizes. The proposed method obtains state-of-the-art results in the aerial context.

[1]  Murari Mandal,et al.  AVDNet: A Small-Sized Vehicle Detection Network for Aerial Visual Data , 2019, IEEE Geoscience and Remote Sensing Letters.

[2]  Murari Mandal,et al.  ANTIC: antithetic isomeric cluster patterns for medical image retrieval and change detection , 2019, IET Comput. Vis..

[3]  Kai Chen,et al.  Gliding vertex on the horizontal bounding box for multi-oriented object detection , 2020, IEEE transactions on pattern analysis and machine intelligence.

[4]  Gui-Song Xia,et al.  A survey on vision-based UAV navigation , 2018, Geo spatial Inf. Sci..

[5]  Tarek Sayed,et al.  Automated Pedestrian Safety Analysis at a Signalized Intersection in New York City: Automated Data Extraction for Safety Diagnosis and Behavioral Study , 2015 .

[6]  Vikas Hassija,et al.  Scheduling drone charging for multi-drone network based on consensus time-stamp and game theory , 2020, Comput. Commun..

[7]  Marcin Woźniak,et al.  Multi-Level Features Extraction for Discontinuous Target Tracking in Remote Sensing Image Monitoring , 2019, Sensors.

[8]  Feng Xiao,et al.  Improved Traffic Sign Detection and Recognition Algorithm for Intelligent Vehicles , 2019, Sensors.

[9]  Neeraj Kumar,et al.  PARTH: A two-stage lightweight mutual authentication protocol for UAV surveillance networks , 2020, Comput. Commun..

[10]  David Lee,et al.  Vision-Based Potential Pedestrian Risk Analysis on Unsignalized Crosswalk Using Data Mining Techniques , 2020 .

[11]  Pascal Bouvry,et al.  Internet of Unmanned Aerial Vehicles—A Multilayer Low-Altitude Airspace Model for Distributed UAV Traffic Management , 2019, Sensors.

[12]  José-Enrique Simó-Ten,et al.  Distributed Architecture to Integrate Sensor Information: Object Recognition for Smart Cities , 2020, Sensors.

[13]  Xiao Xiang Zhu,et al.  Vehicle Instance Segmentation From Aerial Image and Video Using a Multitask Learning Residual Fully Convolutional Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Hei Law,et al.  CornerNet: Detecting Objects as Paired Keypoints , 2018, ECCV.

[15]  Marcin Woźniak,et al.  Road Detection Based on Shearlet for GF-3 Synthetic Aperture Radar Images , 2020, IEEE Access.

[16]  Mohsen Guizani,et al.  Applications of blockchain in unmanned aerial vehicles: A review , 2020, Veh. Commun..

[17]  Wei Wei,et al.  DA-PMHT for Multistatic Passive Radar Multitarget Tracking in Dense Clutter Environment , 2019, IEEE Access.