Land Cover Classification Using Extremely Randomized Trees: A Kernel Perspective

The classification of the ever-increasing collections of remotely sensed images is a key but challenging task. In this letter, we introduce the use of extremely randomized trees known as Extra-Trees (ET) to create a similarity kernel [ET kernel (ETK)] that is subsequently used in a support vector machine (SVM) to create a novel classifier. The performance of this classifier is benchmarked against that of a standard ET, an SVM with both conventional radial basis function (RBF) kernel, and a recently introduced random forest-based kernel (RFK). A time series of Worldview-2 images over smallholder farms is used to illustrate our study. Four sets of features were obtained from these images by extending their original spectral bands with vegetation indices and textures derived from gray-level co-occurrence matrices. This allows testing the performance of the classifiers in low- and high-dimensional problems. Our results for the high-dimensional experiments show that the SVM with tree-based kernels provide better overall accuracies than with the RBF kernel. For problems with lower dimensionality, SVM-ETK slightly outperforms SVM-RFK and SVM-RBF. Moreover, SVM-ETK outperforms ET in most of the experiments. Besides an improved overall accuracy, the main advantage of ETK is its relatively low computational cost compared to the parameterization of the RBF and RFK. Thus, the proposed SVM-ETK classifier is an efficient alternative to common classifiers, especially in problems involving high-dimensional data sets.

[1]  Emma Izquierdo-Verdiguier,et al.  Evaluating the Performance of a Random Forest Kernel for Land Cover Classification , 2017, Remote. Sens..

[2]  Zoubin Ghahramani,et al.  The Random Forest Kernel and other kernels for big data from random partitions , 2014, ArXiv.

[3]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[4]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  Johannes R. Sveinsson,et al.  Random Forests for land cover classification , 2006, Pattern Recognit. Lett..

[7]  Rick Archibald,et al.  Feature Selection and Classification of Hyperspectral Images With Support Vector Machines , 2007, IEEE Geoscience and Remote Sensing Letters.

[8]  Sicong Liu,et al.  Classification of VHR Multispectral Images Using ExtraTrees and Maximally Stable Extremal Region-Guided Morphological Profile , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[11]  Bing Zhang,et al.  A Review of Remote Sensing Image Classification Techniques: the Role of Spatio-contextual Information , 2014 .

[12]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[13]  Jungho Im,et al.  Support vector machines in remote sensing: A review , 2011 .

[14]  Mahesh Pal,et al.  KERNEL METHODS IN REMOTE SENSING: A REVIEW , 2009 .

[15]  Paolo Napoletano,et al.  Remote Sensing Image Classification Exploiting Multiple Kernel Learning , 2015, IEEE Geoscience and Remote Sensing Letters.

[16]  Ingmar Nitze,et al.  Assessment of multi-temporal, multi-sensor radar and ancillary spatial data for grasslands monitoring in Ireland using machine learning approaches , 2014 .

[17]  Emma Izquierdo-Verdiguier,et al.  A Cloud-Based Multi-Temporal Ensemble Classifier to Map Smallholder Farming Systems , 2018, Remote. Sens..

[18]  Valentyn Tolpekin,et al.  A Workflow for Automated Satellite Image Processing: from Raw VHSR Data to Object-Based Spectral Information for Smallholder Agriculture , 2017, Remote. Sens..

[19]  Erwan Scornet,et al.  Random Forests and Kernel Methods , 2015, IEEE Transactions on Information Theory.