Cloud detection algorithm for multi-modal satellite imagery using convolutional neural-networks (CNN)

Abstract Cloud detection algorithms are crucial in many remote-sensing applications to allow an optimized processing of the acquired data, without the interference of the cloud fields above the surfaces of interest (e.g., land, coral reefs, etc.). While this is a well-established area of research, replete with a number of cloud detection methodologies, many issues persist for detecting clouds over areas of high albedo surfaces (snow and sand), detecting cloud shadows, and transferring a given algorithm between observational platforms. Particularly for the latter, algorithms are often platform-specific with corresponding rule-based tests and thresholds inherent to instruments and applied corrections. Here, we present a convolutional neural network (CNN) algorithm for the detection of cloud and cloud shadow fields in multi-channel satellite imagery from World-View-2 (WV-2) and Sentinel-2 (S-2), using their Red, Green, Blue, and Near-Infrared (RGB, NIR) channels. This algorithm is developed within the NASA NeMO-Net project, a multi-modal CNN for global coral reef classification which utilizes imagery from multiple remote sensing aircraft and satellites with heterogeneous spatial resolution and spectral coverage. Our cloud detection algorithm is novel in that it attempts to learn deep invariant features for cloud detection utilizing both the spectral and the spatial information inherent in satellite imagery. The first part of our work presents the CNN cloud and cloud shadow algorithm development (trained using WV-2 data) and its application to WV-2 (with a cloud detection accuracy of 89%) and to S-2 imagery (referred to as augmented CNN). The second part presents a new domain adaptation CNN-based approach (domain adversarial NN) that allows for better adaptation between the two satellite platforms during the prediction step, without the need to train for each platform separately. Our augmented CNN algorithm results in better cloud prediction rates as compared to the original S-2 cloud mask (81% versus 48%), but still, clear pixels prediction rate is lower than S-2 (81% versus 91%). Nevertheless, the application of the domain adaptation approach shows promise in better transferring the knowledge gained from one trained domain (WV-2) to another (S-2), increasing the prediction accuracy of both clear and cloudy pixels when compared to a network trained only by WV-2. As such, domain adaptation may offer a novel means of additional augmentation for our CNN-based cloud detection algorithm, increasing robustness towards predictions from multiple remote sensing platforms. The approach presented here may be further developed and optimized for global and multi-modal (multi-channel and multi-platform) satellite cloud detection capability by utilizing a more global dataset.

[1]  John R. Jensen,et al.  Introductory Digital Image Processing: A Remote Sensing Perspective , 1986 .

[2]  Richard R. Irish,et al.  Landsat 7 automatic cloud cover assessment , 2000, SPIE Defense + Commercial Sensing.

[3]  William J. Blackwell,et al.  A neural-network technique for the retrieval of atmospheric temperature and moisture profiles from high spectral resolution sounding data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Christoph U. Keller,et al.  Use of neural networks in ground-based aerosol retrievals from multi-angle spectropolarimetric observations , 2014 .

[5]  Muhammad Bilal,et al.  Multilevel Cloud Detection for High-Resolution Remote Sensing Imagery Using Multiple Convolutional Neural Networks , 2018, ISPRS Int. J. Geo Inf..

[6]  Olivier Hagolle,et al.  MACCS-ATCOR joint algorithm (MAJA) , 2016, Remote Sensing.

[7]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[9]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  M. Joseph Hughes,et al.  Automated Detection of Cloud and Cloud Shadow in Single-Date Landsat Imagery Using Neural Networks and Spatial Post-Processing , 2014, Remote. Sens..

[12]  Paul F. Whelan,et al.  Using filter banks in Convolutional Neural Networks for texture classification , 2016, Pattern Recognit. Lett..

[13]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[15]  S. Casadio,et al.  Neural networks for the dimensionality reduction of GOME measurement vector in the estimation of ozone profiles , 2005 .

[16]  Ved Chirayath,et al.  Next-Generation Optical Sensing Technologies for Exploring Ocean Worlds—NASA FluidCam, MiDAR, and NeMO-Net , 2019, Front. Mar. Sci..

[17]  S. Goward,et al.  Characterization of the Landsat-7 ETM Automated Cloud-Cover Assessment (ACCA) Algorithm , 2006 .

[18]  Darrel L. Williams,et al.  Landsat-7 Long-Term Acquisition Plan: Development and Validation , 2006 .

[19]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[20]  Ved Chirayath,et al.  Drones that see through waves – preliminary results from airborne fluid lensing for centimetre-scale aquatic conservation , 2016 .

[21]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[22]  Ved Chirayath,et al.  NeMO-Net & Fluid Lensing: The Neural Multi-Modal Observation & Training Network for Global Coral Reef Assessment Using Fluid Lensing Augmentation of NASA EOS Data , 2018 .

[23]  Subhransu Maji,et al.  Deep filter banks for texture recognition and segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Yann LeCun,et al.  A theoretical framework for back-propagation , 1988 .

[25]  Zhe Zhu,et al.  Object-based cloud and cloud shadow detection in Landsat imagery , 2012 .

[26]  Ronald Kemker,et al.  Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[27]  Zhe Zhu,et al.  Cloud detection algorithm comparison and validation for operational Landsat data products , 2017 .

[28]  D. Sagi,et al.  Gabor filters as texture discriminator , 1989, Biological Cybernetics.

[29]  Hiroyasu Sakamoto,et al.  A Multi-Label Convolutional Neural Network for Automatic Image Annotation , 2015, J. Inf. Process..

[30]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[31]  Yu Li,et al.  Multi-sensor cloud and cloud shadow segmentation with a convolutional neural network , 2019, Remote Sensing of Environment.

[32]  Zhiguo Jiang,et al.  A Cloud Detection Method for Landsat 8 Images Based on PCANet , 2018, Remote. Sens..