Multi-sensor cloud and cloud shadow segmentation with a convolutional neural network

Abstract. Cloud and cloud shadow segmentation is a crucial pre-processing step for any application that uses multi-spectral satellite images. In particular, disaster-related applications (e.g., flood monitoring or rapid damage mapping), which are highly time- and data-critical, require methods that produce accurate cloud and cloud shadow masks in a short time while being able to adapt to large variations in the target domain (induced by atmospheric conditions, different sensors, scene properties, etc.). In this study, we propose a data-driven approach to semantic segmentation of cloud and cloud shadow in single-date images, based on a modified U-Net convolutional neural network, that aims to fulfil these requirements. We train the network on a global database of Landsat OLI images for the segmentation of five classes (“shadow”, “cloud”, “water”, “land” and “snow/ice”). We compare the results to state-of-the-art methods, demonstrate the model's ability to generalize across multiple satellite sensors (Landsat TM, Landsat ETM+, Landsat OLI and Sentinel-2) and show the influence of different training strategies and spectral band combinations on segmentation performance. Our method consistently outperforms Fmask and a traditional Random Forest classifier on a globally distributed multi-sensor test dataset in terms of accuracy, Cohen's Kappa coefficient, Dice coefficient and inference speed. The results indicate that a reduced feature space composed solely of red, green, blue and near-infrared bands already produces good results for all tested sensors. Where available, adding shortwave-infrared bands can increase the accuracy. Contrast and brightness augmentations of the training data further improve the segmentation performance. The best-performing U-Net model achieves an accuracy of 0.89, a Kappa of 0.82 and a Dice coefficient of 0.85, while running inference over 896 test image tiles at 44.8 s/megapixel (2.8 s/megapixel on GPU). The Random Forest classifier reaches an accuracy of 0.79, a Kappa of 0.65 and a Dice coefficient of 0.74 at an inference time of 3.9 s/megapixel (on CPU) with the same training and testing data. The rule-based Fmask method takes significantly longer (277.8 s/megapixel) and produces results with an accuracy of 0.75, a Kappa of 0.60 and a Dice coefficient of 0.72.
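The three quality measures reported in the abstract (accuracy, Cohen's Kappa and the Dice coefficient) can all be derived from a multi-class confusion matrix. The following is a minimal sketch in plain Python, not the authors' evaluation code. The function names, the integer class encoding and the toy label lists are illustrative assumptions, and averaging the per-class Dice scores with equal weight is also an assumption, since the abstract does not state how the Dice coefficient is aggregated over the five classes.

```python
from typing import List, Sequence

# Class order is an assumption made for this sketch; the five class names come from the abstract.
CLASSES = ("shadow", "cloud", "water", "land", "snow/ice")


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int], n_classes: int) -> List[List[int]]:
    """Rows index the reference class, columns index the predicted class."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def accuracy(cm: List[List[int]]) -> float:
    """Fraction of pixels on the diagonal (overall accuracy)."""
    total = sum(sum(row) for row in cm)
    return sum(cm[i][i] for i in range(len(cm))) / total


def cohens_kappa(cm: List[List[int]]) -> float:
    """Observed agreement corrected for the agreement expected by chance."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    p_o = sum(cm[i][i] for i in range(n)) / total
    # Chance agreement: sum over classes of (row total * column total) / N^2.
    p_e = sum(sum(cm[i]) * sum(row[i] for row in cm) for i in range(n)) / total ** 2
    if p_e == 1.0:
        return float("nan")  # undefined when chance agreement is already perfect
    return (p_o - p_e) / (1.0 - p_e)


def dice_per_class(cm: List[List[int]]) -> List[float]:
    """Dice = 2*TP / (2*TP + FP + FN), computed one class against the rest."""
    scores = []
    for i in range(len(cm)):
        tp = cm[i][i]
        fn = sum(cm[i]) - tp
        fp = sum(row[i] for row in cm) - tp
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else float("nan"))
    return scores


# Toy labels (invented for illustration): ten pixels, two per class.
y_true = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
y_pred = [0, 1, 1, 1, 2, 2, 3, 3, 4, 3]

cm = confusion_matrix(y_true, y_pred, len(CLASSES))
dice = dice_per_class(cm)
mean_dice = sum(dice) / len(dice)  # unweighted (macro) mean, an assumption

print("accuracy:", round(accuracy(cm), 3))
print("kappa:", round(cohens_kappa(cm), 3))
print("mean Dice:", round(mean_dice, 3))
```

In practice the label lists would be the flattened reference and predicted masks of the test tiles. Kappa is lower than accuracy whenever part of the agreement can be attributed to chance, which is why the abstract reports both.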
