Machine Learning-Based Processing Proof-of-Concept Pipeline for Semi-Automatic Sentinel-2 Imagery Download, Cloudiness Filtering, Classifications, and Updates of Open Land Use/Land Cover Datasets

Land use and land cover are continuously changing in today’s world. Both domains, therefore, have to rely on updates of external information sources from which the relevant land use/land cover (classification) is extracted. Satellite images are frequent candidates due to their temporal and spatial resolution. On the contrary, the extraction of relevant land use/land cover information is demanding in terms of knowledge base and time. The presented approach offers a proof-of-concept machine-learning pipeline that takes care of the entire complex process in the following manner. The relevant Sentinel-2 images are obtained through the pipeline. Later, cloud masking is performed, including the linear interpolation of merged-feature time frames. Subsequently, four-dimensional arrays are created with all potential training data to become a basis for estimators from the scikit-learn library; the LightGBM estimator is then used. Finally, the classified content is applied to the open land use and open land cover databases. The verification of the provided experiment was conducted against detailed cadastral data, to which Shannon’s entropy was applied since the number of cadaster information classes was naturally consistent. The experiment showed a good overall accuracy (OA) of 85.9%. It yielded a classified land use/land cover map of the study area consisting of 7188 km2 in the southern part of the South Moravian Region in the Czech Republic. The developed proof-of-concept machine-learning pipeline is replicable to any other area of interest so far as the requirements for input data are met.

[1]  Hannes Taubenböck,et al.  Spatial and semantic effects of LUCAS samples on fully automated land use/land cover classification in high-resolution Sentinel-2 data , 2020, Int. J. Appl. Earth Obs. Geoinformation.

[2]  Lewis Dijkstra,et al.  The EU-OECD definition of a functional urban area , 2019 .

[3]  Yady Tatiana Solano Correa,et al.  Analysis of multitemporal Sentinel-2 images in the framework of the ESA Scientific Exploitation of Operational Missions , 2017, 2017 9th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp).

[4]  Luis Guanter,et al.  Ready-to-Use Methods for the Detection of Clouds, Cirrus, Snow, Shadow, Water and Clear Sky Pixels in Sentinel-2 MSI Images , 2016, Remote. Sens..

[5]  Ferran Gascon,et al.  Sen2Cor for Sentinel-2 , 2017, Remote Sensing.

[6]  J. Chormański,et al.  SENTINEL-2 IMAGERY FOR MAPPING AND MONITORING IMPERVIOUSNESS IN URBAN AREAS , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[7]  Manjula Ranagalage,et al.  Sentinel-2 Data for Land Cover/Use Mapping: A Review , 2020, Remote. Sens..

[8]  Chien-Chih Lai,et al.  Clouds Classification from Sentinel-2 Imagery with Deep Residual Learning and Semantic Image Segmentation , 2019, Remote. Sens..

[9]  Pierre Soille,et al.  Assessment of the Added-Value of Sentinel-2 for Detecting Built-up Areas , 2016, Remote. Sens..

[10]  Madhavi Jain,et al.  Monitoring land use change and its drivers in Delhi, India using multi-temporal satellite data , 2016, Modeling Earth Systems and Environment.

[11]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[12]  Stephen V. Stehman,et al.  Sampling designs for accuracy assessment of land cover , 2009 .

[13]  Cezary Mazurek,et al.  An INSPIRE-Based Vocabulary for the Publication of Agricultural Linked Data , 2015, OWLED.

[14]  Ronald E. McRoberts,et al.  Land Use/Land Cover Mapping Using Multitemporal Sentinel-2 Imagery and Four Classification Methods - A Case Study from Dak Nong, Vietnam , 2020, Remote. Sens..

[15]  Nataliia Kussul,et al.  Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[16]  Dino Ienco,et al.  Combining Sentinel-1 and Sentinel-2 Satellite Image Time Series for land cover mapping via a multi-source deep learning architecture , 2019 .

[17]  Yuei-An Liou,et al.  Land-Use Land-Cover Classification by Machine Learning Classifiers for Satellite Observations - A Review , 2020, Remote. Sens..

[18]  C. Woodcock,et al.  Improvement and expansion of the Fmask algorithm: cloud, cloud shadow, and snow detection for Landsats 4–7, 8, and Sentinel 2 images , 2015 .

[19]  Lukás Herman,et al.  Prediction of Yield Productivity Zones from Landsat 8 and Sentinel-2A/B and Their Evaluation Using Farm Machinery Measurements , 2020, Remote. Sens..

[20]  Yanfei Zhong,et al.  A spectral–structural bag-of-features scene classifier for very high spatial resolution remote sensing imagery , 2016 .

[21]  Gérard Dedieu,et al.  A Multi-Temporal and Multi-Spectral Method to Estimate Aerosol Optical Thickness over Land, for the Atmospheric Correction of FormoSat-2, LandSat, VENμS and Sentinel-2 Images , 2015, Remote. Sens..

[22]  Olivier Hagolle,et al.  Validation of Copernicus Sentinel-2 Cloud Masks Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure , 2019, Remote. Sens..

[23]  Giles M. Foody,et al.  Good practices for estimating area and assessing accuracy of land change , 2014 .

[24]  Ewa Gromny,et al.  Automated Production of a Land Cover/Use Map of Europe Based on Sentinel-2 Imagery , 2020, Remote. Sens..

[25]  Peijun Du,et al.  A review of supervised object-based land-cover image classification , 2017 .

[26]  Abdulhakim M. Abdi,et al.  Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data , 2019, GIScience & Remote Sensing.

[27]  Einar Eberhardt,et al.  Best Practice Network GS SOIL Promoting Access to European, Interoperable and INSPIRE Compliant Soil Information , 2011 .

[28]  Luisa Verdoliva,et al.  Land Use Classification in Remote Sensing Images by Convolutional Neural Networks , 2015, ArXiv.

[29]  Kathleen Neumann,et al.  Challenges in using land use and land cover data for global change studies , 2011 .

[30]  B. He,et al.  Fmask 4.0: Improved cloud and cloud shadow detection in Landsats 4–8 and Sentinel-2 imagery , 2019, Remote Sensing of Environment.

[31]  O. Mutanga,et al.  Examining the potential of Sentinel-2 MSI spectral resolution in quantifying above ground biomass across different fertilizer treatments , 2015 .

[32]  M. Rautiainen,et al.  Comparison of Sentinel-2 and Landsat 8 in the estimation of boreal forest canopy cover and leaf area index , 2017 .

[33]  Giorgos Mountrakis,et al.  A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research , 2016 .

[34]  Naoto Yokoya,et al.  Performance Evaluation of Downscaling Sentinel-2 Imagery for Land Use and Land Cover Classification by Spectral-Spatial Features , 2017, Remote. Sens..

[35]  Zhe Zhu,et al.  Object-based cloud and cloud shadow detection in Landsat imagery , 2012 .

[36]  Sérgio Freire,et al.  Increasing the detail of European land use/cover data by combining heterogeneous data sets , 2018, Int. J. Digit. Earth.

[37]  Gabriel Navarro,et al.  Sentinel-2 Satellites Provide Near-Real Time Evaluation of Catastrophic Floods in the West Mediterranean , 2019, Water.

[38]  Michael Schultz,et al.  Open land cover from OpenStreetMap and remote sensing , 2017, Int. J. Appl. Earth Obs. Geoinformation.

[39]  Mahmut Cavur,et al.  LAND USE AND LAND COVER CLASSIFICATION OF SENTINEL 2-A: ST PETERSBURG CASE STUDY , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[40]  Giles M. Foody,et al.  Sample size determination for image classification accuracy assessment and comparison , 2009 .

[41]  Clement Atzberger,et al.  How much does multi-temporal Sentinel-2 data improve crop type classification? , 2018, Int. J. Appl. Earth Obs. Geoinformation.

[42]  Damir Medak,et al.  Impact of Various Atmospheric Corrections on Sentinel-2 Land Cover Classification Accuracy Using Machine Learning Classifiers , 2020, ISPRS Int. J. Geo Inf..

[43]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..