Assessing the robustness of Random Forests to map land cover with high resolution satellite image time series over large areas

Abstract New remote sensing sensors will acquire High spectral, spatial and temporal Resolution Satellite Image Time Series (HR-SITS). These new data are of great interest to map land cover thanks to the combination of the three high resolutions that will allow a depiction of scene dynamics. However, their efficient exploitation involves new challenges, especially for adapting traditional classification schemes to data complexity. More specifically, it requires: (1) to determine which classifier algorithms can handle the amount and the variability of data; (2) to evaluate the stability of classifier parameters; (3) to select the best feature set used as input data in order to find the good trade-off between classification accuracy and computational time; and (4) to establish the classifier accuracy over large areas. This work aims at studying these different issues, and more especially at demonstrating the ability of state-of-the-art classifiers, such as Random Forests (RF) or Support Vector Machines (SVM), to classify HR-SITS. For this purpose, several studies are carried out by using SPOT-4 and Landsat-8 HR-SITS in the south of France. Firstly, the choice of the classifier is discussed by comparing RF and SVM algorithms on HR-SITS. Both classifiers show their ability to tackle the classification problem with an Overall Accuracy (OA) of 83.3 % for RF and 77.1 % for SVM. But RF have some advantages such as a small training time, and an easy parameterization. Secondly, the stability of RF parameters is appraised. RF parameters appear to cause little influence on the classification accuracy, about 1% OA difference between the worst and the best parameter configuration. Thirdly, different input data – composed of spectral bands with or without spectral and/or temporal features – are proposed in order to enhance the characterization of land cover. The addition of features improves the classification accuracy, but the gain in OA is weak compared with the increase in the computational cost. Eventually, the classifier accuracy is assessed on a larger area where the landscape variabilities affect the classification performances.

[1]  Le Yu,et al.  Mapping global land cover in 2001 and 2010 with spatial-temporal consistency at 250 m resolution , 2015 .

[2]  Gérard Dedieu,et al.  A multi-temporal method for cloud detection, applied to FORMOSAT-2, VENµS, LANDSAT and SENTINEL-2 images , 2010 .

[3]  Aniruddha Ghosh,et al.  A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[4]  Hugo Carrão,et al.  Contribution of multispectral and multitemporal information from MODIS images to land cover classification , 2008 .

[5]  Jon Atli Benediktsson,et al.  Morphological Attribute Profiles for the Analysis of Very High Resolution Images , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[7]  Hanqiu Xu,et al.  A new index for delineating built‐up land features in satellite imagery , 2008 .

[8]  A. Strahler,et al.  Monitoring vegetation phenology using MODIS , 2003 .

[9]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[10]  P. Beck,et al.  Improved monitoring of vegetation dynamics at very high latitudes: A new method using MODIS NDVI , 2006 .

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Hanqiu Xu Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery , 2006 .

[13]  Changsheng Li,et al.  Mapping paddy rice agriculture in southern China using multi-temporal MODIS images , 2005 .

[14]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[15]  Qi Chen,et al.  A comparison of classification techniques to support land cover and land use analysis in tropical coastal zones , 2011 .

[16]  D. Roberts,et al.  A comparison of methods for monitoring multitemporal vegetation change using Thematic Mapper imagery , 2002 .

[17]  Gérard Dedieu,et al.  Production of a Dynamic Cropland Mask by Processing Remote Sensing Image Series at High Temporal and Spatial Resolutions , 2016, Remote. Sens..

[18]  Limin Yang,et al.  Derivation of a tasselled cap transformation based on Landsat 7 at-satellite reflectance , 2002 .

[19]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[20]  Pratibha P. Shingare,et al.  Fusion classification of multispectral and panchromatic image using improved decision tree algorithm , 2014, 2014 International Conference on Signal Propagation and Computer Technology (ICSPCT 2014).

[21]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[22]  Liangpei Zhang,et al.  An SVM Ensemble Approach Combining Spectral, Structural, and Semantic Features for the Classification of High-Resolution Remotely Sensed Imagery , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[23]  Clément Mallet,et al.  Multi-temporal optical VHR image fusion for Land-Cover mapping , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[24]  V. Radeloff,et al.  Author's Personal Copy Mapping Abandoned Agriculture with Multi-temporal Modis Satellite Data , 2022 .

[25]  Steven E. Franklin,et al.  A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using SPOT-5 HRG imagery , 2012 .

[26]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[27]  J. A. Schell,et al.  Monitoring vegetation systems in the great plains with ERTS , 1973 .

[28]  Per Jönsson,et al.  TIMESAT - a program for analyzing time-series of satellite sensor data , 2004, Comput. Geosci..

[29]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[30]  M. Batistella,et al.  COMPARISON OF LAND-COVER CLASSIFICATION METHODS IN THE BRAZILIAN AMAZON BASIN , 2004 .

[31]  Christopher Conrad,et al.  Impact of feature selection on the accuracy and spatial uncertainty of per-field crop classification using Support Vector Machines , 2013 .

[32]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[33]  Anne-Laure Boulesteix,et al.  Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics , 2012, WIREs Data Mining Knowl. Discov..

[34]  Mohammad Khalilia,et al.  Predicting disease risks from highly imbalanced data using random forest , 2011, BMC Medical Informatics Decis. Mak..

[35]  Björn Waske,et al.  Classifying Multilevel Imagery From SAR and Optical Sensors by Decision Fusion , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[36]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[37]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[38]  James J. Chen,et al.  Class-imbalanced classifiers for high-dimensional data , 2013, Briefings Bioinform..

[39]  Tim Appelhans,et al.  Comparison of four machine learning algorithms for their applicability in satellite-based optical rainfall retrievals , 2015 .

[40]  Roger Trias-Sanz,et al.  Texture Orientation and Period Estimator for Discriminating Between Forests, Orchards, Vineyards, and Tilled Fields , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[42]  Jon Atli Benediktsson,et al.  Morphological Profiles Based on Differently Shaped Structuring Elements for Classification of Images With Very High Spatial Resolution , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[43]  N. Silleos,et al.  Vegetation Indices: Advances Made in Biomass Estimation and Vegetation Monitoring in the Last 30 Years , 2006 .

[44]  Eric P. Crist,et al.  A Physically-Based Transformation of Thematic Mapper Data---The TM Tasseled Cap , 1984, IEEE Transactions on Geoscience and Remote Sensing.

[45]  S. K. McFeeters The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features , 1996 .

[46]  M. Mróz,et al.  Comparison of several vegetation indices calculated on the basis of a seasonal SPOT XS time series, and their suitability for land cover and agricultural crop identification , 2004 .

[47]  Per Jönsson,et al.  Seasonality extraction by function fitting to time-series of satellite sensor data , 2002, IEEE Trans. Geosci. Remote. Sens..

[48]  Gérard Dedieu,et al.  Assessment of an Operational System for Crop Type Map Production Using High Temporal and Spatial Resolution Satellite Optical Imagery , 2015, Remote. Sens..

[49]  Jay Gao,et al.  Use of normalized difference built-up index in automatically mapping urban areas from TM imagery , 2003 .

[50]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[51]  Russell G. Congalton,et al.  A review of assessing the accuracy of classifications of remotely sensed data , 1991 .

[52]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[53]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[54]  Qingxi Tong,et al.  Derivation of a tasselled cap transformation based on Landsat 8 at-satellite reflectance , 2014 .

[55]  Nicole Vincent,et al.  Updating land cover databases using a single very high resolution satellite image , 2013 .

[56]  Michele Dalponte,et al.  Tree Species Classification in Boreal Forests With Hyperspectral Data , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[57]  X. H,et al.  A new index for delineating built-up land features in satellite imagery , 2008 .

[58]  R. DeFries,et al.  Classification trees: an alternative to traditional land cover classifiers , 1996 .

[59]  B. Wardlow,et al.  Large-area crop mapping using time-series MODIS 250 m NDVI data: An assessment for the U.S. Central Great Plains , 2008 .

[60]  Jordi Inglada PhenOTB, Phenological analysis for image time series , 2016 .

[61]  R. Kauth,et al.  The tasselled cap - A graphic description of the spectral-temporal development of agricultural crops as seen by Landsat , 1976 .

[62]  Huajun Tang,et al.  Land Cover Classification Based on Multi-temporal MODIS NDVI and LST in Northeastern China , 2006, 2006 IEEE International Symposium on Geoscience and Remote Sensing.

[63]  Antoine Masse,et al.  Tools for multitemporal analysis and classification of multisource satellite imagery , 2011, 2011 6th International Workshop on the Analysis of Multi-temporal Remote Sensing Images (Multi-Temp).

[64]  Hankui K. Zhang,et al.  Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data , 2013 .

[65]  Olivier Hagolle,et al.  SPOT-4 (Take 5): Simulation of Sentinel-2 Time Series on 45 Large Sites , 2015, Remote. Sens..

[66]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[67]  J. Pereira,et al.  Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest , 2012 .

[68]  Xiaoxia Wang,et al.  Land Cover Classification of Landsat Data with Phenological Features Extracted from Time Series MODIS NDVI Data , 2014, Remote. Sens..

[69]  P. Atkinson,et al.  Introduction Neural networks in remote sensing , 1997 .

[70]  Kenichi Tatsumi,et al.  Crop classification of upland fields using Random forest of time-series Landsat 7 ETM+ data , 2015, Comput. Electron. Agric..

[71]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[72]  Daniel Ierodiaconou,et al.  Evaluation of Four Supervised Learning Methods for Benthic Habitat Mapping Using Backscatter from Multi-Beam Sonar , 2012, Remote. Sens..

[73]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[74]  Jordi Inglada OTB Gapfilling, a temporal gapfilling for image time series library , 2016 .

[75]  Giorgos Mountrakis,et al.  A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research , 2016 .