Evaluation of different machine learning methods for land cover mapping of a Mediterranean area using multi-seasonal Landsat images and Digital Terrain Models

Land cover monitoring using digital Earth data requires robust classification methods that allow the accurate mapping of complex land cover categories. This paper discusses the crucial issues related to the application of different up-to-date machine learning classifiers: classification trees (CT), artificial neural networks (ANN), support vector machines (SVM) and random forest (RF). The analysis of the statistical significance of the differences between the performance of these algorithms, as well as sensitivity to data set size reduction and noise were also analysed. Landsat-5 Thematic Mapper data captured in European spring and summer were used with auxiliary variables derived from a digital terrain model to classify 14 different land cover categories in south Spain. Overall, statistically similar accuracies of over 91% were obtained for ANN, SVM and RF. However, the findings of this study show differences in the accuracy of the classifiers, being RF the most accurate classifier with a very simple parameterization. SVM, followed by RF, was the most robust classifier to noise and data reduction. Significant differences in their performances were only reached for thresholds of noise and data reduction greater than 20% (noise, SVM) and 25% (noise, RF), and 80% (reduction, SVM) and 50% (reduction, RF), respectively.

[1]  Steven E. Franklin,et al.  A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using SPOT-5 HRG imagery , 2012 .

[2]  Qihao Weng,et al.  A survey of image classification methods and techniques for improving classification performance , 2007 .

[3]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[4]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[5]  Janet Franklin,et al.  Mapping land-cover modifications over large areas: A comparison of machine learning algorithms , 2008 .

[6]  Jungho Im,et al.  Support vector machines in remote sensing: A review , 2011 .

[7]  Jennifer A. Miller,et al.  Mapping landcover modifications over large areas : A comparison of machine learning algorithms , 2008 .

[8]  Gregory D. Williams,et al.  Assessment and monitoring , 2000 .

[9]  Manfred Ehlers,et al.  Photogrammetric Engineering and Remote Sensing , 2007 .

[10]  Jonathan Cheung-Wai Chan,et al.  Multiple Criteria for Evaluating Machine Learning Algorithms for Land Cover Classification from Satellite Data , 2000 .

[11]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[12]  Xiaofeng Li,et al.  Improved land cover mapping using random forests combined with landsat thematic mapper imagery and ancillary geographic data. , 2010 .

[13]  P. Atkinson,et al.  Introduction Neural networks in remote sensing , 1997 .

[14]  George P. Petropoulos,et al.  Support vector machines and object-based classification for obtaining land-use/cover cartography from Hyperion hyperspectral imagery , 2012, Comput. Geosci..

[15]  J. R. Sveinsson,et al.  Mapping of hyperspectral AVIRIS data using machine-learning algorithms , 2009 .

[16]  Data points. , 1992, Science.

[17]  Giles M. Foody,et al.  Sample size determination for image classification accuracy assessment and comparison , 2009 .

[18]  Samia Boukir,et al.  Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests , 2011 .

[19]  Graeme G. Wilkinson,et al.  Results and implications of a study of fifteen years of satellite image classification experiments , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[21]  Yang Shao,et al.  Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points , 2012 .

[22]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[23]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[24]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[25]  A. Berk,et al.  Exploiting MODTRAN radiation transport for atmospheric correction: The FLAASH algorithm , 2002, Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997).

[26]  Sassan Saatchi,et al.  The use of decision tree and multiscale texture for classification of JERS-1 SAR data over tropical forest , 2000, IEEE Trans. Geosci. Remote. Sens..

[27]  Xiaojun Yang,et al.  Parameterizing Support Vector Machines for Land Cover Classification , 2011 .

[28]  Jonathan Cheung-Wai Chan,et al.  Evaluation of random forest and adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery , 2008 .

[29]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[30]  Paul E. Gessler,et al.  Integrating Landsat TM and SRTM-DEM derived variables with decision trees for habitat classification and change detection in complex neotropical environments , 2008 .

[31]  Greg Easson,et al.  Proposed workflow for improved Kauth–Thomas transform derivations , 2012 .

[32]  P. Mather,et al.  Classification Methods for Remotely Sensed Data , 2001 .

[33]  Marion Stellmes,et al.  Assessment and monitoring of land condition in the Iberian Peninsula, 1989–2000 , 2010 .

[34]  George P. Petropoulos,et al.  Evaluation of diverse classification approaches for land use/cover mapping in a Mediterranean region utilizing Hyperion data , 2014, Int. J. Digit. Earth.

[35]  C. Pipper,et al.  [''R"--project for statistical computing]. , 2008, Ugeskrift for laeger.

[36]  Sandra Lowe,et al.  Classification Methods For Remotely Sensed Data , 2016 .

[37]  Laura Hoch,et al.  Introductory Digital Image Processing , 2016 .

[38]  Ni-Bin Chang,et al.  Urban land use and land cover classification using the neural-fuzzy inference approach with Formosat-2 data , 2009 .

[39]  Juan J. Flores,et al.  The application of artificial neural networks to the analysis of remotely sensed data , 2008 .

[40]  George P. Petropoulos,et al.  Hyperion hyperspectral imagery analysis combined with machine learning classifiers for land use/cover mapping , 2012, Expert Syst. Appl..

[41]  Marina Fruehauf Classification Methods For Remotely Sensed Data , 2016 .

[42]  Walter Krämer,et al.  Review of Modern applied statistics with S, 4th ed. by W.N. Venables and B.D. Ripley. Springer-Verlag 2002 , 2003 .

[43]  R. Congalton,et al.  Evaluating seasonal variability as an aid to cover-type mapping from Landsat Thematic Mapper data in the Northeast , 1995 .

[44]  P. Atkinson,et al.  Random Forest classification of Mediterranean land cover using multi-seasonal imagery and multi-seasonal texture , 2012 .

[45]  Paul M. Mather,et al.  An assessment of the effectiveness of decision tree methods for land cover classification , 2003 .

[46]  L. S. Davis,et al.  An assessment of support vector machines for land cover classi(cid:142) cation , 2002 .

[47]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[48]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[49]  V. Rodriguez-Galiano,et al.  Land cover change analysis of a Mediterranean area in Spain using different sources of data: Multi-seasonal Landsat images, land surface temperature, digital terrain models and texture , 2012 .

[50]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[51]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[52]  Jon Atli Benediktsson,et al.  Ensemble methods for spectral-spatial classification of urban hyperspectral data , 2009, 2009 IEEE International Geoscience and Remote Sensing Symposium.

[53]  Thomas Blaschke,et al.  Land cover change assessment using decision trees, support vector machines and maximum likelihood classification algorithms , 2010, Int. J. Appl. Earth Obs. Geoinformation.

[54]  Rasmus Fensholt,et al.  Remote Sensing , 2008, Encyclopedia of GIS.

[55]  Giles M. Foody,et al.  An evaluation of some factors affecting the accuracy of classification by an artificial neural network , 1997 .

[56]  Mario Chica-Olmo,et al.  Incorporating the downscaled landsat TM thermal band in land-cover classification using random forest , 2012 .