Improved mask R-CNN-based cloud masking method for remote sensing images

ABSTRACT Clouds lead to missing or distorted land-related information in impacted areas in optical remote sensing images. Cloud masking, which labels cloud-contaminated pixels, forms the basis for subsequent image utilization, such as excluding the distorted pixel or filling in the missing area. However, due to the diverse spectral, textural, and shape characteristics of different clouds and complicated combinations with the underlying land surfaces, cloud masking has become a challenge in remote sensing image processing. In recent years, the Mask region-based convolutional neural network (R-CNN) method, which performs instance segmentation from a complex background and generates a pixelwise mask for the object of interest, has been used widely in object segmentation tasks. When the Mask R-CNN method is used for cloud masking, the mask result has certain problems, such as failing to extract uncommon clouds and outputting inaccurate mask boundaries for large clouds. To address these problems, we introduce two strategies, group training and boundary optimization, to improve the Mask R-CNN. For group training, samples are divided into several groups. The samples in the first group are used for the initial training, and the samples in the next group are used for evaluation. Only samples with missing or falsely detected clouds are used for tuning the classifier; then, these processes are repeated until all groups have been used or the detection precision becomes stable. For boundary optimization, a block-by-block mask strategy is adopted to guarantee that clouds with diverse sizes have similar performances. Finally, two open data sets and one data set labelled by ourselves are selected to test the proposed method, and the results demonstrate that our method can produce cloud masks for different cloud types and diverse underlying land surfaces and can achieve high accuracies, thereby providing an effective alternative for cloud masking. Compared with the original Mask R-CNN method, our method improves the average recall, average precision, and intersection over union by 5.88%, 2.4%, and 0.071 in pixel level, respectively, demonstrating the effectiveness of our improvement.

[1]  S. Goward,et al.  Characterization of the Landsat-7 ETM Automated Cloud-Cover Assessment (ACCA) Algorithm , 2006 .

[2]  Pengfei Li,et al.  A cloud image detection method based on SVM vector machine , 2015, Neurocomputing.

[3]  D. Roy,et al.  The availability of cloud-free Landsat ETM+ data over the conterminous United States and globally , 2008 .

[4]  Kaiming He,et al.  Mask R-CNN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Zhe Zhu,et al.  Object-based cloud and cloud shadow detection in Landsat imagery , 2012 .

[6]  Qing Zhang,et al.  Cloud Detection of RGB Color Aerial Photographs by Progressive Refinement Scheme , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Jiancheng Luo,et al.  Long-short-term-memory-based crop classification using high-resolution optical images and multi-temporal SAR data , 2019, GIScience & Remote Sensing.

[8]  Quan Wang,et al.  A cloud shadow detection method combined with cloud height iteration and spectral analysis for Landsat 8 OLI data , 2018 .

[9]  Rui Zhang,et al.  A stepwise cloud shadow detection approach combining geometry determination and SVM classification for MODIS data , 2013 .

[10]  Andrew K. C. Wong,et al.  Classification of Imbalanced Data: a Review , 2009, Int. J. Pattern Recognit. Artif. Intell..

[11]  Jun Yang,et al.  Cloud Type Classification of Total-Sky Images Using Duplex Norm-Bounded Sparse Coding , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Larry Di Girolamo,et al.  Selection of the automated thresholding algorithm for the Multi-angle Imaging SpectroRadiometer Radiometric Camera-by-Camera Cloud Mask over land , 2007 .

[13]  John L. Dwyer,et al.  Development of the Landsat Data Continuity Mission Cloud-Cover Assessment Algorithms , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Izak Van Zyl Marais,et al.  An optimal image transform for threshold-based cloud detection using heteroscedastic discriminant analysis , 2011 .

[15]  J. Cihlar,et al.  An image transform to characterize and compensate for spatial variations in thin cloud contamination of Landsat images , 2002 .

[16]  Zhiguo Jiang,et al.  A Cloud Detection Method for Landsat 8 Images Based on PCANet , 2018, Remote. Sens..

[17]  Yu Oishi,et al.  A new Landsat 8 cloud discrimination algorithm using thresholding tests , 2018, International Journal of Remote Sensing.

[18]  Panagiotis Sidiropoulos,et al.  CloudFCN: Accurate and Robust Cloud Detection for Satellite Imagery with Deep Learning , 2019, Remote. Sens..

[19]  Wei Wu,et al.  A Spectral-Temporal Patch-Based Missing Area Reconstruction for Time-Series Images , 2018, Remote. Sens..

[20]  Rune Hylsberg Jacobsen,et al.  A cloud detection algorithm for satellite imagery based on deep learning , 2019, Remote Sensing of Environment.

[21]  Zhe Zhu,et al.  Cloud detection algorithm comparison and validation for operational Landsat data products , 2017 .

[22]  Zhenwei Shi,et al.  Multilevel Cloud Detection in Remote Sensing Images Based on Deep Learning , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[23]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Zhiwei Li,et al.  Deep learning based cloud detection for remote sensing images by the fusion of multi-scale convolutional features , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[25]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Lei Wang,et al.  Object-Based Convolutional Neural Networks for Cloud and Snow Detection in High-Resolution Multispectral Imagers , 2018, Water.

[27]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[28]  Dengfeng Chai,et al.  Cloud and cloud shadow detection in Landsat imagery based on deep convolutional neural networks , 2019, Remote Sensing of Environment.

[29]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[30]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  C. Woodcock,et al.  Improvement and expansion of the Fmask algorithm: cloud, cloud shadow, and snow detection for Landsats 4–7, 8, and Sentinel 2 images , 2015 .

[32]  Feng Zhang,et al.  CloudNet: Ground‐Based Cloud Classification With Deep Convolutional Neural Network , 2018, Geophysical Research Letters.

[33]  Xiao Xiang Zhu,et al.  Vehicle Instance Segmentation From Aerial Image and Video Using a Multitask Learning Residual Fully Convolutional Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.