Looking for change? Roll the Dice and demand Attention

Change detection, i.e. identification per pixel of changes for some classes of interest from a set of bi-temporal co-registered images, is a fundamental task in the field of remote sensing. It remains challenging due to unrelated forms of change that appear at different times in input images. These are changes due to to different environmental conditions or simply changes of objects that are not of interest. Here, we propose a reliable deep learning framework for the task of semantic change detection in very high-resolution aerial images. Our framework consists of a new loss function, new attention modules, new feature extraction building blocks, and a new backbone architecture that is tailored for the task of semantic change detection. Specifically, we define a new form of set similarity, that is based on an iterative evaluation of a variant of the Dice coefficient. We use this similarity metric to define a new loss function as well as a new spatial and channel convolution Attention layer (the FracTAL). The new attention layer, designed specifically for vision tasks, is memory efficient, thus suitable for use in all levels of deep convolutional networks. Based on these, we introduce two new efficient self-contained feature extraction convolution units. We term these units CEECNet and FracTAL ResNet units. We validate the performance of these feature extraction building blocks on the CIFAR10 reference data and compare the results with standard ResNet modules. Further, we introduce a new encoder/decoder scheme, a network macro-topology, that is tailored for the task of change detection. We validate our approach by showing excellent performance and achieving state of the art score (F1 and Intersection over Union - hereafter IoU) on two building change detection datasets, namely, the LEVIRCD (F1: 0.918, IoU: 0.848) and the WHU (F1: 0.938, IoU: 0.882) datasets.

[1]  Alexander M. Rush,et al.  Structured Attention Networks , 2017, ICLR.

[2]  Julianna M Czum,et al.  Dive Into Deep Learning. , 2020, Journal of the American College of Radiology : JACR.

[3]  Yongjun Zhang,et al.  Building Instance Change Detection from Large-Scale Aerial Images using Convolutional Neural Networks and Simulated Samples , 2019, Remote. Sens..

[4]  Han Zhang,et al.  Self-Attention Generative Adversarial Networks , 2018, ICML.

[5]  Wenzhuo Song,et al.  A deep residual learning serial segmentation network for extracting buildings from remote sensing imagery , 2020, International Journal of Remote Sensing.

[6]  Kaiming He,et al.  Group Normalization , 2018, ECCV.

[7]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[8]  Li Chen,et al.  DASNet: Dual Attentive Fully Convolutional Siamese Networks for Change Detection in High-Resolution Satellite Images , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[9]  Nicholas J. Tate,et al.  A critical synthesis of remotely sensed optical image change detection techniques , 2015 .

[10]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[11]  Xiaomeng Zhang,et al.  Building Change Detection for Remote Sensing Images Using a Dual-Task Constrained Deep Siamese Convolutional Network Model , 2019, IEEE Geoscience and Remote Sensing Letters.

[12]  Fa Zhang,et al.  Detection of Small Changed Regions in Remote Sensing Imagery Using Convolutional Neural Network , 2020, IOP Conference Series: Earth and Environmental Science.

[13]  Zhong Lu,et al.  International Journal of Remote Sensing , 2012 .

[14]  D. Lu,et al.  Change detection techniques , 2004 .

[15]  Dongmei Chen,et al.  Change detection from remotely sensed images: From pixel-based to object-based approaches , 2013 .

[16]  Sepand Haghighi,et al.  PyCM: Multiclass confusion matrix library in Python , 2018, J. Open Source Softw..

[17]  Hao Chen,et al.  A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection , 2020, Remote. Sens..

[18]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[19]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[20]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[21]  Tat-Seng Chua,et al.  SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  J. Anitha,et al.  Change detection techniques for remote sensing applications: a survey , 2019, Earth Science Informatics.

[23]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Abhinav Gupta,et al.  Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26]  Akshaya Ramaswamy,et al.  ChangeNet: A Deep Learning Architecture for Visual Change Detection , 2018, ECCV Workshops.

[27]  Ling Shao,et al.  See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Wenzhong Shi,et al.  Change Detection Based on Artificial Intelligence: State-of-the-Art and Challenges , 2020, Remote. Sens..

[29]  Paul D. Bates,et al.  A Change Detection Approach to Flood Mapping in Urban Areas Using TerraSAR-X , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Michael T. M. Emmerich,et al.  A tutorial on multiobjective optimization: fundamentals and evolutionary methods , 2018, Natural Computing.

[31]  Yu Liu,et al.  Learning to Measure Change: Fully Convolutional Siamese Metric Networks for Scene Change Detection , 2018, ArXiv.

[32]  Takayuki Okatani,et al.  Change Detection from a Street Image Pair using CNN Features and Superpixel Segmentation , 2015, BMVC.

[33]  Germán Ros,et al.  Street-view change detection with deconvolutional networks , 2016, Autonomous Robots.

[34]  Meng Lu,et al.  Fully Convolutional Networks for Multisource Building Extraction From an Open Aerial and Satellite Imagery Data Set , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[36]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[37]  Diederik P. Kingma,et al.  An Introduction to Variational Autoencoders , 2019, Found. Trends Mach. Learn..

[38]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[39]  Olena Dubovyk,et al.  Mapping Cropland Abandonment in the Aral Sea Basin with MODIS Time Series , 2018, Remote. Sens..

[40]  Alexander Sergeev,et al.  Horovod: fast and easy distributed deep learning in TensorFlow , 2018, ArXiv.

[41]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[42]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Alexandre Boulch,et al.  Multitask learning for large-scale semantic change detection , 2018, Comput. Vis. Image Underst..

[44]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[45]  Xiangyun Hu,et al.  PGA-SiamNet: Pyramid Feature-Based Attention-Guided Siamese Network for Remote Sensing Orthoimagery Building Change Detection , 2020, Remote. Sens..

[46]  Douglas C. Morton,et al.  Rapid assessment of annual deforestation in the Brazilian Amazon using MODIS data , 2005 .

[47]  Quoc V. Le,et al.  Attention Augmented Convolutional Networks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[48]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[49]  Zheng Zhang,et al.  MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems , 2015, ArXiv.

[50]  Martin Jägersand,et al.  U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection , 2020, Pattern Recognit..

[51]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[52]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[53]  François Waldner,et al.  Deep learning on edge: extracting field boundaries from satellite images with a convolutional neural network , 2019, ArXiv.

[54]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Steven C. H. Hoi,et al.  Deep Learning for Image Super-Resolution: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Pol Coppin,et al.  Review ArticleDigital change detection methods in ecosystem monitoring: a review , 2004 .