Land cover and land use mapping of the iSimangaliso Wetland Park, South Africa: comparison of oblique and orthogonal random forest algorithms

Abstract. In recent years, the popularity of tree-based ensemble methods for land cover classification has increased significantly. Using WorldView-2 image data, we evaluate the potential of the oblique random forest algorithm (oRF) to classify a highly heterogeneous protected area. In contrast to the random forest (RF) algorithm, the oRF algorithm builds multivariate trees by learning the optimal split using a supervised model. The oRF binary algorithm is adapted to a multiclass land cover and land use application using both the “one-against-one” and “one-against-all” combination approaches. Results show that the oRF algorithms are capable of achieving high classification accuracies (>80%). However, there was no statistical difference in classification accuracies obtained by the oRF algorithms and the more popular RF algorithm. For all the algorithms, user accuracies (UAs) and producer accuracies (PAs) >80% were recorded for most of the classes. Both the RF and oRF algorithms poorly classified the indigenous forest class as indicated by the low UAs and PAs. Finally, the results from this study advocate and support the utility of the oRF algorithm for land cover and land use mapping of protected areas using WorldView-2 image data.

[1]  Tim Appelhans,et al.  Improving the accuracy of rainfall rates from optical satellite sensors with machine learning — A random forests-based approach applied to MSG SEVIRI , 2014 .

[2]  W. Newmark Isolation of African protected areas , 2008 .

[3]  Cory R. Davis,et al.  Trajectories in land use change around U.S. National Parks and challenges and opportunities for management , 2011 .

[4]  Grégoire Dubois,et al.  Monitoring land cover changes in African protected areas in the 21st century , 2013, Ecol. Informatics.

[5]  Mark G. Anderson,et al.  Selecting and conserving lands for biodiversity: The role of remote sensing , 2009 .

[6]  Onisimo Mutanga,et al.  Land-use/cover classification in a heterogeneous coastal landscape using RapidEye imagery: evaluating the performance of random forest and support vector machines classifiers , 2014 .

[7]  L. Rebelo,et al.  A study of wetland hydrology and ecosystem service provision: GaMampa wetland, South Africa , 2011 .

[8]  Andrew K. Skidmore,et al.  Earth observation for biodiversity monitoring : a review of current approaches and future opportunities for tracking progress towards the Aichi Biodiversity Targets : e-book , 2014 .

[9]  S. Horvath,et al.  Unsupervised Learning With Random Forest Predictors , 2006 .

[10]  François Poulet,et al.  Classifying one billion data with a new distributed svm algorithm , 2006, 2006 International Conference onResearch, Innovation and Vision for the Future.

[11]  H. Nagendra,et al.  Remote sensing for conservation monitoring: Assessing protected areas, habitat extent, habitat condition, species diversity, and threats , 2013 .

[12]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[13]  Caspar A. Mücher,et al.  Satellite Earth observation data to identify anthropogenic pressures in selected protected areas , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[14]  Joanna Adamczyk,et al.  Red-edge vegetation indices for detecting and assessing disturbances in Norway spruce dominated mountain forests , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[15]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[16]  R. Lawrence,et al.  Mapping wetlands and riparian areas using Landsat ETM+ imagery and decision-tree-based models , 2006, Wetlands.

[17]  K. Chomitz,et al.  Effectiveness of Strict vs. Multiple Use Protected Areas in Reducing Tropical Forest Fires: A Global Analysis Using Matching Methods , 2011, PloS one.

[18]  J. Kerr,et al.  Just passing through: Global change and the conservation of biodiversity in protected areas , 2010 .

[19]  Lucas N Joppa,et al.  On the protection of “protected areas” , 2008, Proceedings of the National Academy of Sciences.

[20]  Yeqiao Wang,et al.  Remote sensing change detection tools for natural resource managers: Understanding concepts and tradeoffs in the design of landscape monitoring projects , 2009 .

[21]  J. Nel,et al.  National Biodiversity Assessment 2011: An assessment of South Africa’s biodiversity and ecosystems. , 2012 .

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  Giles M. Foody,et al.  A relative evaluation of multiclass image classification by support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Madan Gopal,et al.  Reduced one-against-all method for multiclass SVM classification , 2011, Expert Syst. Appl..

[25]  Gretchen C. Daily,et al.  Integrity and isolation of Costa Rica's national parks and biological reserves: examining the dynamics of land-cover change , 2003 .

[26]  Chandra P. Giri,et al.  Next generation of global land cover characterization, mapping, and monitoring , 2013, Int. J. Appl. Earth Obs. Geoinformation.

[27]  Cory R. Davis,et al.  Delineating the Ecosystems Containing Protected Areas for Monitoring and Management , 2011 .

[28]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Miguel Delibes,et al.  Ecosystem functioning of protected and altered Mediterranean environments: A remote sensing classification in Doñana, Spain , 2010 .

[30]  Juan José Rodríguez Diez,et al.  An Experimental Study on Rotation Forest Ensembles , 2007, MCS.

[31]  R. DeFries,et al.  INCREASING ISOLATION OF PROTECTED AREAS IN TROPICAL FORESTS OVER THE PAST TWENTY YEARS , 2005 .

[32]  G. Mitchell,et al.  The Niger Delta wetlands: threats to ecosystem services, their importance to dependent communities and possible management measures , 2011 .

[33]  Taha B. M. J. Ouarda,et al.  Comparison of ice-affected streamflow estimates computed using artificial neural networks and multiple regression techniques , 2008 .

[34]  Yuyu Zhou,et al.  Remote sensing of land-cover change and landscape context of the National Parks: A case study of the Northeast Temperate Network , 2009 .

[35]  Jean-Marie Gregoire,et al.  Increased isolation of two Biosphere Reserves and surrounding protected areas (WAP ecological complex, West Africa) , 2007 .

[36]  Charalambos Kontoes,et al.  A transferability study of the kernel-based reclassification algorithm for habitat delineation , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[37]  Ullrich Köthe,et al.  On Oblique Random Forests , 2011, ECML/PKDD.

[38]  Robert I. McDonald,et al.  The implications of current and future urbanization for global protected areas and biodiversity conservation , 2008 .

[39]  Shawn J. Leroux,et al.  Global protected areas and IUCN designations: Do the categories match the conditions? , 2010 .

[40]  Thanh-Nghi Do,et al.  Classifying Very-High-Dimensional Data with Random Forests of Oblique Decision Trees , 2009, EGC.

[41]  Jonathan Cheung-Wai Chan,et al.  Evaluation of random forest and adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery , 2008 .

[42]  Taskin Kavzoglu,et al.  An assessment of the effectiveness of a rotation forest ensemble for land-use and land-cover mapping , 2013 .

[43]  Scott N. Miller,et al.  High-resolution landcover classification using Random Forest , 2014 .

[44]  M. Spalding,et al.  Measuring the extent and effectiveness of protected areas as an indicator for meeting global biodiversity targets , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[45]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Jane Southworth,et al.  Monitoring Parks Through Remote Sensing: Studies in Nepal and Honduras , 2004, Environmental management.

[47]  H. Nagendra,et al.  Assessing the impact of Celaque National Park on forest fragmentation in western Honduras , 2004 .

[48]  Riyad Ismail,et al.  A comparison of selected machine learning classifiers in mapping a South African heterogeneous coastal zone: Testing the utility of an object-based classification with WorldView-2 imagery , 2012, Remote Sensing.

[49]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[50]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[51]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[52]  C. Braak,et al.  Complex contexts and dynamic drivers: understanding four decades of forest loss and recovery in an East African protected area , 2013 .

[53]  K. Brandon,et al.  The role of protected areas in conserving biodiversity and sustaining local livelihoods , 2005 .

[54]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[55]  J. Gaudart,et al.  Oblique decision trees for spatial pattern detection: optimal algorithm and application to malaria risk. , 2005 .

[56]  Clement Atzberger,et al.  Tree Species Classification with Random Forest Using Very High Spatial Resolution 8-Band WorldView-2 Satellite Data , 2012, Remote. Sens..

[57]  Barbara Cafarelli,et al.  Very high resolution Earth observation features for monitoring plant and animal community structure across multiple spatial scales in protected areas , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[58]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[59]  Chun-Xia Zhang,et al.  An empirical study of using Rotation Forest to improve regressors , 2008, Appl. Math. Comput..

[60]  Onisimo Mutanga,et al.  Potential utility of the spectral red-edge region of SumbandilaSat imagery for assessing indigenous forest structure and health , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[61]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[62]  Paul M. Mather,et al.  Support vector machines for classification in remote sensing , 2005 .

[63]  Thomas R. Loveland,et al.  Assessing the landscape context and conversion risk of protected areas using satellite data products , 2009 .

[64]  Vasiliki Kosmidou,et al.  Using landscape structure to develop quantitative baselines for protected area monitoring , 2013 .

[65]  Thong Ngee Goh,et al.  Adaptive ridge regression system for software cost estimating on multi-collinear datasets , 2010, J. Syst. Softw..

[66]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[67]  Russell G. Congalton,et al.  A review of assessing the accuracy of classifications of remotely sensed data , 1991 .

[68]  G. Foody Thematic map comparison: Evaluating the statistical significance of differences in classification accuracy , 2004 .

[69]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[70]  De-Shuang Huang,et al.  Cancer classification using Rotation Forest , 2008, Comput. Biol. Medicine.

[71]  Ruth DeFries,et al.  Ecological mechanisms linking protected areas to surrounding lands. , 2007, Ecological applications : a publication of the Ecological Society of America.

[72]  K. Redford,et al.  MEETING AICHI TARGET 11: WHAT DOES SUCCESS LOOK LIKE FOR PROTECTED AREA SYSTEMS? , 2012 .

[73]  K. McGarigal,et al.  Cape Cod National Seashore using Random Forests , 2012 .

[74]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[75]  L. S. Davis,et al.  An assessment of support vector machines for land cover classi(cid:142) cation , 2002 .