Direct, ECOC, ND and END Frameworks - Which One Is the Best? An Empirical Study of Sentinel-2A MSIL1C Image Classification for Arid-Land Vegetation Mapping in the Ili River Delta, Kazakhstan

To facilitate the advances in Sentinel-2A products for land cover from Moderate Resolution Imaging Spectroradiometer (MODIS) and Landsat imagery, Sentinel-2A MultiSpectral Instrument Level-1C (MSIL1C) images are investigated for large-scale vegetation mapping in an arid land environment that is located in the Ili River delta, Kazakhstan. For accurate classification purposes, multi-resolution segmentation (MRS) based extended object-guided morphological profiles (EOMPs) are proposed and then compared with conventional morphological profiles (MPs), MPs with partial reconstruction (MPPR), object-guided MPs (OMPs), OMPs with mean values (OMPsM), and object-oriented (OO)-based image classification techniques. Popular classifiers, such as C4.5, an extremely randomized decision tree (ERDT), random forest (RaF), rotation forest (RoF), classification via random forest regression (CVRFR), ExtraTrees, and radial basis function (RBF) kernel-based support vector machines (SVMs) are adopted to answer the question of whether nested dichotomies (ND) and ensembles of ND (END) are truly superior to direct and error-correcting output code (ECOC) multiclass classification frameworks. Finally, based on the results, the following conclusions are drawn: 1) the superior performance of OO-based techniques over MPs, MPPR, OMPs, and OMPsM is clear for Sentinel-2A MSIL1C image classification, while the best results are achieved by the proposed EOMPs; 2) the superior performance of ND, ND with class balancing (NDCB), ND with data balancing (NDDB), ND with random-pair selection (NDRPS), and ND with further centroid (NDFC) over direct and ECOC frameworks is not confirmed, especially in the cases of using weak classifiers for low-dimensional datasets; 3) from computationally efficient, high accuracy, redundant to data dimensionality and easy of implementations points of view, END, ENDCB, ENDDB, and ENDRPS are alternative choices to direct and ECOC frameworks; 4) surprisingly, because in the ensemble learning (EL) theorem, “weaker” classifiers (ERDT here) always have a better chance of reaching the trade-off between diversity and accuracy than “stronger” classifies (RaF, ExtraTrees, and SVM here), END with ERDT (END-ERDT) achieves the best performance with less than a 0.5% difference in the overall accuracy (OA) values, but is 100 to 10000 times faster than END with RaF and ExtraTrees, and ECOC with SVM while using different datasets with various dimensions; and, 5) Sentinel-2A Remote Sens. 2019, 11, 1953; doi:10.3390/rs11161953 www.mdpi.com/journal/remotesensing Remote Sens. 2019, 11, 1953 2 of 33 MSIL1C is better choice than the land cover products from MODIS and Landsat imagery for vegetation species mapping in an arid land environment, where the vegetation species are critically important, but sparsely distributed.

[1]  Mao Ning Tuanmu,et al.  A global 1‐km consensus land‐cover product for biodiversity and ecosystem modelling , 2014 .

[2]  Juan José Rodríguez Diez,et al.  Forests of nested dichotomies , 2010, Pattern Recognit. Lett..

[3]  A. D. McGuire,et al.  Circumpolar distribution and carbon storage of thermokarst landscapes , 2016, Nature Communications.

[4]  M. Hansen,et al.  A comparison of the IGBP DISCover and University of Maryland 1 km global land cover products , 2000 .

[5]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[6]  Wei Zhang,et al.  Multiple Classifier System for Remote Sensing Image Classification: A Review , 2012, Sensors.

[7]  James S. Clark,et al.  Competition‐interaction landscapes for the joint response of forests to climate change , 2014, Global change biology.

[8]  Stefan Kramer,et al.  Ensembles of Balanced Nested Dichotomies for Multi-class Problems , 2005, PKDD.

[9]  N. Hölzel,et al.  Carbon exchange fluxes over peatlands in Western Siberia: Possible feedback between land-use change and climate change. , 2016, The Science of the total environment.

[10]  Peijun Du,et al.  Active extreme learning machines for quad-polarimetric SAR imagery classification , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[11]  Paolo Frasconi,et al.  New results on error correcting output codes of kernel machines , 2004, IEEE Transactions on Neural Networks.

[12]  Geoff Holmes,et al.  Ensembles of Nested Dichotomies with Multiple Subset Evaluation , 2018, PAKDD.

[13]  Molly E. Brown,et al.  Evaluation of the consistency of long-term NDVI time series derived from AVHRR,SPOT-vegetation, SeaWiFS, MODIS, and Landsat ETM+ sensors , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[15]  Abderrahim Nemmaoui,et al.  Performance evaluation of object based greenhouse detection from Sentinel-2 MSI and Landsat 8 OLI data: A case study from Almería (Spain) , 2016, Int. J. Appl. Earth Obs. Geoinformation.

[16]  Tobias Landmann,et al.  Monitoring vegetation dynamics with medium resolution MODIS-EVI time series at sub-regional scale in southern Africa , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[17]  Hurtado Abril,et al.  Generación de un índice espectro-temporal para la identificación de zonas afectadas por deforestación usando imágenes Landsat. , 2020 .

[18]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[19]  Jun Li,et al.  Improved hyperspectral image classification by active learning using pre-designed mixed pixels , 2016, Pattern Recognit..

[20]  M. Pal,et al.  Class Decomposition Approaches for Land Cover Classification: A Comparative Study , 2006, 2006 IEEE International Symposium on Geoscience and Remote Sensing.

[21]  Eyke Hüllermeier,et al.  Combining predictions in pairwise classification: An optimal adaptive voting strategy and its relation to weighted voting , 2010, Pattern Recognit..

[22]  M. Cho,et al.  Remote sensing of species diversity using Landsat 8 spectral variables , 2017 .

[23]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[24]  Antonio J. Plaza,et al.  A new approach to mixed pixel classification of hyperspectral imagery based on extended morphological profiles , 2004, Pattern Recognit..

[25]  Wenbin Wu,et al.  GlobeLand30 shows little cropland area loss but greater fragmentation in China , 2018, Int. J. Appl. Earth Obs. Geoinformation.

[26]  Jun Li,et al.  ${{\rm E}^{2}}{\rm LMs}$ : Ensemble Extreme Learning Machines for Hyperspectral Image Classification , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[27]  Mark E. Jakubauskas,et al.  A comparison of satellite data and landscape variables in predicting bird species occurrences in the Greater Yellowstone Ecosystem, USA , 2004, Landscape Ecology.

[28]  U. Gessner,et al.  Regional land cover mapping and change detection in Central Asia using MODIS time-series , 2012 .

[29]  Damien Sulla-Menashe,et al.  MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets , 2010 .

[30]  Ranga B. Myneni,et al.  Response of vegetation activity dynamic to climatic change and ecological restoration programs in Inner Mongolia from 2000 to 2012 , 2015 .

[31]  Nathalie Pettorelli,et al.  The application of remote sensing for marine protected area management , 2014 .

[32]  G. Hill,et al.  Vegetation mapping of a tropical freshwater swamp in the Northern Territory, Australia: A comparison of aerial photography, Landsat TM and SPOT satellite imagery , 2001 .

[33]  Rajendra Prasad,et al.  Comparison of support vector machine, artificial neural network, and spectral angle mapper algorithms for crop classification using LISS IV data , 2015 .

[34]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[35]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[36]  M. Cho,et al.  Classification of savanna tree species, in the Greater Kruger National Park region, by integrating hyperspectral and LiDAR data in a Random Forest data mining environment , 2012 .

[37]  Jason S. Sibold,et al.  Spatial and temporal trends of drought effects in a heterogeneous semi-arid forest ecosystem. , 2016 .

[38]  Peijun Du,et al.  A review of supervised object-based land-cover image classification , 2017 .

[39]  T. Warner,et al.  Multi-scale GEOBIA with very high spatial resolution digital aerial imagery: scale, texture and image objects , 2011 .

[40]  Hankui K. Zhang,et al.  Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data , 2013 .

[41]  Patricia Gober,et al.  Per-pixel vs. object-based classification of urban land cover extraction using high spatial resolution imagery , 2011, Remote Sensing of Environment.

[42]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[43]  Bernhard Pfahringer,et al.  Building Ensembles of Adaptive Nested Dichotomies with Random-Pair Selection , 2016, ECML/PKDD.

[44]  Ruiliang Pu,et al.  An efficient unsupervised index based approach for mapping urban vegetation from IKONOS imagery , 2016, Int. J. Appl. Earth Obs. Geoinformation.

[45]  Shunlin Liang,et al.  Time‐lag effects of global vegetation responses to climate change , 2015, Global change biology.

[46]  Maggi Kelly,et al.  A Vegetation Mapping Strategy for Conifer Forests by Combining Airborne LiDAR Data and Aerial Imagery , 2016 .

[47]  Juan J. Flores,et al.  The application of artificial neural networks to the analysis of remotely sensed data , 2008 .

[48]  Senén Barro,et al.  Comparison of a massive and diverse collection of ensembles and other classifiers for oil spill detection in SAR satellite images , 2017, Neural Computing and Applications.

[49]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[50]  Geping Luo,et al.  Changes in land use/land cover and ecosystem services in Central Asia during 1990–2009 , 2013 .

[51]  Hankui K. Zhang,et al.  Using the 500 m MODIS Land Cover Product to Derive a Consistent Continental Scale 30 m Landsat Land Cover Classification , 2017 .

[52]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[53]  Claudio Persello,et al.  Evaluation of ForestPA for VHR RS image classification using spectral and superpixel-guided morphological profiles , 2019 .

[54]  J. Blackard,et al.  Journal of Applied , 2006 .

[55]  Clement Atzberger,et al.  Object Based Image Analysis and Data Mining applied to a remotely sensed Landsat time-series to map sugarcane over large areas , 2012 .

[56]  Gunnar Rätsch,et al.  Soft Margins for AdaBoost , 2001, Machine Learning.

[57]  Hui Fang,et al.  Changes in the area of inland lakes in arid regions of central Asia during the past 30 years , 2011, Environmental monitoring and assessment.

[58]  Ryutaro Tateishi,et al.  Production of global land cover data – GLCNMO , 2011, Int. J. Digit. Earth.

[59]  Christopher D. Elvidge,et al.  Monitoring Seasonal Dynamics of Arid Land Vegetation Using AVIRIS Data , 1998 .

[60]  Patrick Hostert,et al.  Intra-annual reflectance composites from Sentinel-2 and Landsat for national-scale crop and land cover mapping , 2019, Remote Sensing of Environment.

[61]  Mariana Belgiu,et al.  Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis , 2018 .

[62]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[63]  Johannes R. Sveinsson,et al.  Spectral and spatial classification of hyperspectral data using SVMs and morphological profiles , 2008, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[64]  Sébastien Lefèvre,et al.  A comparative study on multivariate mathematical morphology , 2007, Pattern Recognit..

[65]  Jonathan Cheung-Wai Chan,et al.  Evaluation of random forest and adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery , 2008 .

[66]  T. Petr,et al.  The impact on fish stocks of river regulation in Central Asia and Kazakhstan , 1998 .

[67]  Yichun Xie,et al.  Remote sensing imagery in vegetation mapping: a review , 2008 .

[68]  Yang Shao,et al.  Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points , 2012 .

[69]  Ujjwal Maulik,et al.  Remote Sensing Image Classification: A survey of support-vector-machine-based advanced techniques , 2017, IEEE Geoscience and Remote Sensing Magazine.

[70]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[71]  Olivier Arino,et al.  Global Land Cover Map for 2009 (GlobCover 2009) , 2012 .

[72]  S. Running,et al.  Global products of vegetation leaf area and fraction absorbed PAR from year one of MODIS data , 2002 .

[73]  A. Skidmore,et al.  Vegetation phenology from Sentinel-2 and field cameras for a Dutch barrier island , 2018, Remote Sensing of Environment.

[74]  Wilfried Philips,et al.  Taking Optimal Advantage of Fine Spatial Resolution: Promoting partial image reconstruction for the morphological analysis of very-high-resolution images , 2017, IEEE Geoscience and Remote Sensing Magazine.

[75]  Paolo Gamba,et al.  Quad-PolSAR data classification using modified random forest algorithms to map halophytic plants in arid areas , 2018, Int. J. Appl. Earth Obs. Geoinformation.

[76]  Galal Omer,et al.  Performance of Support Vector Machines and Artificial Neural Network for Mapping Endangered Tree Species Using WorldView-2 Data in Dukuduku Forest, South Africa , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[77]  U. Benz,et al.  Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information , 2004 .

[78]  A. Belward,et al.  GLC2000: a new approach to global land cover mapping from Earth observation data , 2005 .

[79]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[80]  M. Friedl,et al.  Land Surface Phenology from MODIS: Characterization of the Collection 5 Global Land Cover Dynamics Product , 2010 .

[81]  Pramod K. Varshney,et al.  Decision tree regression for soft classification of remote sensing data , 2005 .

[82]  Bing Zhang,et al.  A Review of Remote Sensing Image Classification Techniques: the Role of Spatio-contextual Information , 2014 .

[83]  Dirk Tiede,et al.  ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data , 2010, Int. J. Geogr. Inf. Sci..

[84]  Jon Atli Benediktsson,et al.  Recent Advances in Techniques for Hyperspectral Image Processing , 2009 .

[85]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[86]  H. Matsuyama,et al.  Decrease of river runoff in the Lake Balkhash basin in Central Asia , 2006 .

[87]  Andreas Kääb,et al.  Glacier Remote Sensing Using Sentinel-2. Part I: Radiometric and Geometric Performance, and Application to Ice Velocity , 2016, Remote. Sens..

[88]  Clemens Eisank,et al.  Automated object-based classification of topography from SRTM data , 2012, Geomorphology.

[89]  Jon Atli Benediktsson,et al.  Classification of Hyperspectral Images by Using Extended Morphological Attribute Profiles and Independent Component Analysis , 2011, IEEE Geoscience and Remote Sensing Letters.

[90]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[91]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[92]  Eyke Hüllermeier,et al.  Ensembles of evolved nested dichotomies for classification , 2018, GECCO.

[93]  Johannes R. Sveinsson,et al.  Classification of hyperspectral data from urban areas based on extended morphological profiles , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[94]  Ashbindu Singh,et al.  Status and distribution of mangrove forests of the world using earth observation satellite data , 2011 .

[95]  Jungho Im,et al.  Support vector machines in remote sensing: A review , 2011 .

[96]  P. Propastin Problems of Water Resources Management in the Drainage Basin of Lake Balkhash with Respect to Political Development , 2012 .

[97]  Giles M. Foody,et al.  A relative evaluation of multiclass image classification by support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[98]  William Stafford Noble,et al.  Support vector machine , 2013 .

[99]  J. Gamon,et al.  Remote sensing of biodiversity: Soil correction and data dimension reduction methods improve assessment of α-diversity (species richness) in prairie ecosystems , 2018 .

[100]  J. Kerr,et al.  From space to species: ecological applications for remote sensing , 2003 .

[101]  Gary R. Watmough,et al.  Evaluating the capabilities of Sentinel-2 for quantitative estimation of biophysical variables in vegetation , 2013 .

[102]  Peijun Du,et al.  Random Forest and Rotation Forest for fully polarized SAR image classification using polarimetric and spatial features , 2015 .

[103]  W. Gould REMOTE SENSING OF VEGETATION, PLANT SPECIES RICHNESS, AND REGIONAL BIODIVERSITY HOTSPOTS , 2000 .

[104]  Sicong Liu,et al.  Classification of VHR Multispectral Images Using ExtraTrees and Maximally Stable Extremal Region-Guided Morphological Profile , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[105]  Sergio Escalera,et al.  On the Decoding Process in Ternary Error-Correcting Output Codes , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[106]  Philip A. Townsend,et al.  Detecting prairie biodiversity with airborne remote sensing , 2019, Remote Sensing of Environment.

[107]  E. Jobbágy,et al.  Vegetation structure is as important as climate for explaining ecosystem function across Patagonian rangelands , 2014 .

[108]  Nicolas Barbier,et al.  Self‐organized vegetation patterning as a fingerprint of climate and human impact on semi‐arid ecosystems , 2006 .

[109]  Damien Sulla-Menashe,et al.  Detecting forest disturbance in the Pacific Northwest from MODIS time series using temporal segmentation , 2014 .

[110]  Giles M. Foody,et al.  Good practices for estimating area and assessing accuracy of land change , 2014 .

[111]  R. An,et al.  Vegetation dynamics and its driving forces from climate change and human activities in the Three-River Source Region, China from 1982 to 2012. , 2016, The Science of the total environment.

[112]  A. Bao,et al.  Vegetation dynamics and responses to climate change and human activities in Central Asia. , 2017, The Science of the total environment.

[113]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[114]  J. Townshend,et al.  Global land cover classi(cid:142) cation at 1 km spatial resolution using a classi(cid:142) cation tree approach , 2004 .

[115]  J. Townshend,et al.  Global Percent Tree Cover at a Spatial Resolution of 500 Meters: First Results of the MODIS Vegetation Continuous Fields Algorithm , 2003 .

[116]  Jun Li,et al.  Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[117]  Robert P. Anderson,et al.  Bioclimatic variables derived from remote sensing: assessment and application for species distribution modelling , 2014 .

[118]  Chi Zhang,et al.  The spatiotemporal patterns of vegetation coverage and biomass of the temperate deserts in Central Asia and their relationships with climate controls , 2016 .

[119]  G. Henebry,et al.  Using multiple remote sensing perspectives to identify and attribute land surface dynamics in Central Asia 2001–2013 , 2015 .

[120]  K. Meissner,et al.  How Essential Biodiversity Variables and remote sensing can help national biodiversity monitoring , 2017 .

[121]  Pavel Propastin,et al.  Simple model for monitoring Balkhash Lake water levels and Ili River discharges: Application of remote sensing , 2008 .

[122]  Cardona Alzate,et al.  Predicción y selección de variables con bosques aleatorios en presencia de variables correlacionadas , 2020 .

[123]  Jordi Vitrià,et al.  Discriminant ECOC: a heuristic method for application dependent design of error correcting output codes , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[124]  O. Csillik,et al.  Automated parameterisation for multi-scale image segmentation on multiple layers , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[125]  M. Fladeland,et al.  Remote sensing for biodiversity science and conservation , 2003 .

[126]  José Francisco Martínez Trinidad,et al.  Nested Dichotomies Based on Clustering , 2012, CIARP.

[127]  Lisa Norton,et al.  The relative value of field survey and remote sensing for biodiversity assessment , 2015 .

[128]  Luis Alonso,et al.  Optimizing LUT-Based RTM Inversion for Semiautomatic Mapping of Crop Biophysical Parameters from Sentinel-2 and -3 Data: Role of Cost Functions , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[129]  Stefan Kramer,et al.  Ensembles of nested dichotomies for multi-class problems , 2004, ICML.

[130]  Lênio Soares Galvão,et al.  Use of MSI/Sentinel-2 and airborne LiDAR data for mapping vegetation and studying the relationships with soil attributes in the Brazilian semi-arid region , 2018, Int. J. Appl. Earth Obs. Geoinformation.

[131]  Steven W. Running,et al.  Testing a MODIS Global Disturbance Index across North America , 2009 .

[132]  Amanda M. Schwantes,et al.  Global satellite monitoring of climate-induced vegetation disturbances. , 2015, Trends in plant science.

[133]  N. Pettorelli,et al.  Satellite remote sensing for applied ecologists: opportunities and challenges , 2014 .

[134]  Martin Brandt,et al.  Remote sensing of vegetation dynamics in drylands: Evaluating vegetation optical depth (VOD) using AVHRR NDVI and in situ green biomass data over West African Sahel , 2016 .