New machine learning tools for predictive vegetation mapping after climate change: Bagging and Random Forest perform better than Regression Tree Analysis.

More and better machine learning tools are becoming available for landscape ecologists to aid in understanding species-environment relationships and to map probable species occurrence now and potentially into the future. To thal end, we evaluated three statistical models: Regression Tree Analybib (RTA), Bagging Trees (BT) and Random Forest (RF) for their utility in predicting the distributions of four tree species under current and future climate. RTA's single tree was the easicst to interpret but is less accurate compared to BT and RF which use multiple regession trees with resampling and resampling-randomisation respectively. Future estimates of suitable habitat following climate change were also improved with BT and RF, with a slight edge to RF because it better smoothes the outputs in a logical gradient fashion. We recommend widespread use of these tools for CISbased vegetation mapping.

[1]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[2]  Cesare Furlanello,et al.  GIS and the Random Forest Predictor: Integration in R for Tick-Borne Disease Risk Assessment , 2003 .

[3]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[4]  A. Prasad,et al.  Potential Changes in Tree Species Richness and Forest Community Types following Climate Change , 2001, Ecosystems.

[5]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[6]  A. Prasad,et al.  PREDICTING ABUNDANCE OF 80 TREE SPECIES FOLLOWING CLIMATE CHANGE IN THE EASTERN UNITED STATES , 1998 .

[7]  Kurt Hornik,et al.  The support vector machine under test , 2003, Neurocomputing.

[8]  B. Lees,et al.  A new method for predicting vegetation distributions using decision tree analysis in a geographic information system , 1991 .

[9]  Connie Page,et al.  Computing Science and Statistics , 1992 .

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  A. Prasad,et al.  Atlas of current and potential future distributions of common trees of the eastern United States , 1999 .

[12]  Sanford Weisberg,et al.  Computing science and statistics : proceedings of the 30th Symposium on the Interface, Minneapolis, Minnesota, May 13-16, 1998 : dimension reduction, computational complexity and information , 1998 .

[13]  G. Boer,et al.  A transient climate change simulation with greenhouse gas and aerosol forcing: projected climate to the twenty-first century , 2000 .

[14]  J. Melillo Warm, Warm on the Range , 1999, Science.

[15]  Alex Hagen,et al.  Fuzzy set approach to assessing similarity of categorical maps , 2003, Int. J. Geogr. Inf. Sci..