Semi-supervised semantic segmentation in Earth Observation: the MiniFrance suite, dataset analysis and multi-task network study

The development of semi-supervised learning techniques is essential to enhance the generalization capacities of machine learning algorithms. Raw image data are abundant while labels are scarce, so it is crucial to leverage unlabeled inputs to build better models. The availability of large databases has been key to the development of high-performance learning algorithms. Despite the major role of machine learning in Earth Observation for deriving products such as land cover maps, datasets in the field are still limited, whether because of modest surface coverage, a lack of scene variety, or a restricted set of classes to identify. We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation, the MiniFrance suite. MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images that account for more than 200 billion samples (pixels); it is varied, covering 16 conurbations in France with various climates, different landscapes, and both urban and countryside scenes; and it is challenging, considering land use classes with high-level semantics. The most distinctive quality of MiniFrance, however, is that it is the only dataset in the field specifically designed for semi-supervised learning: its training partition contains both labeled and unlabeled images, which reproduces a realistic scenario. Along with this dataset, we present tools for analyzing data representativeness in terms of appearance similarity, and a thorough study of the MiniFrance data demonstrating that it is suitable for learning and generalizes well in a semi-supervised setting. Finally, we present semi-supervised deep architectures based on multi-task learning and the first experiments on MiniFrance.
