How well does random forest analysis model deforestation and forest fragmentation in the Brazilian Atlantic forest?

We assessed the value of applying random forest analysis (RF) to relating metrics of deforestation (DF) and forest fragmentation (FF) to socioeconomic (SE) and biogeophysical (BGP) factors, in the Brazilian Atlantic Forest of Minas Gerais, Brazil. A vegetation-monitoring project provided land cover maps, from which we derived DF and FF metrics. An ecologic-economical zoning project provided more than 300 SE and BGP factors. We used RF to identify relationships between these sets of variables and compared its performance in this task to that of a more traditional multiple linear regression approach. We found that RF modelled relatively-well variance in all metrics used (the rate of deforestation, the amount of forest and the density and isolation of forest patches), presenting a better performance when compared to the classical approach. RF also identified geographical location and topographic factors as being most closely associated with patterns of DF and FF metrics. Both analyses found factors associated with economic productivity, social institutions, accessibility and exploration to have little relationship with forest metrics. RF was better at explaining variations in rates of deforestation, remaining forest and patch patterns, than the multiple linear regression approach. We conclude that RF provides a promising methodology for elucidating the relationships between land use and cover changes and potential drivers.

[1]  M. Samaniego,et al.  Research, part of a Special Feature on The influence of human demography and agriculture on natural systems in the Neotropics Historical, Demographic, and Economic Correlates of Land-Use Change in the Republic of Panama , 2008 .

[2]  N. Lazar,et al.  Methods and Criteria for Model Selection , 2004 .

[3]  M. Rajadhyaksha,et al.  Confocal imaging-guided laser ablation of basal cell carcinomas: an ex vivo study. , 2015, The Journal of investigative dermatology.

[4]  J. Sendra,et al.  Exploring the driving forces behind deforestation in the state of Mexico (Mexico) using geographically weighted regression , 2010 .

[5]  Stephen G. Perz Are Agricultural Production and Forest Conservation Compatible? Agricultural Diversity, Agricultural Incomes and Primary Forest Cover Among Small Farm Colonists in the Amazon , 2004 .

[6]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[7]  Jiang-bo Gao,et al.  Detecting spatially non-stationary and scale-dependent relationships between urban landscape fragmentation and related factors using Geographically Weighted Regression , 2011 .

[8]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[9]  Ary T. Oliveira-Filho,et al.  Patterns of Floristic Differentiation among Atlantic Forests in Southeastern Brazil and the Influence of Climate1 , 2000 .

[10]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[11]  川野 秀一 An Introduction to Statistical Learning (with Applications in R), Gareth James,Daniela Witten,Trevor Hastie and Robert Tibshirani著, Springer, 2013年8月, 430pp., 価格 59.99〓, ISBN 978-1-4614-7137-0 , 2014 .

[12]  D. Goodin,et al.  Land-cover change in the Paraguayan Chaco: 2000–2011 , 2015 .

[13]  Stephen J. Smith,et al.  Conditional variable importance in R package extendedForest , 2011 .

[14]  P. Fearnside The roles and movements of actors in the deforestation of Brazilian Amazonia , 2008 .

[15]  Brian D. Ripley,et al.  Modern Applied Statistics with S Fourth edition , 2002 .

[16]  T. Mitchell Aide,et al.  Research, part of a Special Feature on The influence of human demography and agriculture on natural systems in the Neotropics Agricultural Abandonment, Suburban Growth, and Forest Expansion in Puerto Rico between 1991 and 2000 , 2008 .

[17]  Jennifer J. Swenson,et al.  Forest fragmentation in the Pacific Northwest: quantification and correlations , 2004 .

[18]  Jean Paul Metzger,et al.  Do diagnóstico à conservação da biodiversidade: o estado da arte do programa BIOTA/FAPESP , 2006 .

[19]  E. Lambin,et al.  Proximate Causes and Underlying Driving Forces of Tropical Deforestation , 2002 .

[20]  Walter Krämer,et al.  Review of Modern applied statistics with S, 4th ed. by W.N. Venables and B.D. Ripley. Springer-Verlag 2002 , 2003 .

[21]  Eric F. Lambin,et al.  Land-Use and Land-Cover Change , 2006 .

[22]  S. Dong,et al.  Characterizing the "fragmentation-barrier" effect of road networks on landscape connectivity: a case study in Xishuangbanna, Southwest China. , 2010 .

[23]  A. Prasad,et al.  Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction , 2006, Ecosystems.

[24]  S. Hecht,et al.  Globalization, Forest Resurgence, and Environmental Politics in El Salvador , 2006 .

[25]  T. R. Leinbach,et al.  Transport and Third World development: Review, issues, and prescription , 1995 .

[26]  Noel J. Purcell,et al.  A Biometrics Invited Paper. Estimation for Small Domains , 1979 .

[27]  Y. Shimabukuro,et al.  Forest dynamics and land-use transitions in the Brazilian Atlantic Forest: the case of sugarcane expansion , 2015, Regional Environmental Change.

[28]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[29]  Johannes R. Sveinsson,et al.  Random Forests for land cover classification , 2006, Pattern Recognit. Lett..

[30]  A. Apan,et al.  Probing tropical deforestation: The use of GIS and statistical analysis of georeferenced data , 1998 .

[31]  E. Barbier,et al.  The forest transition: Towards a more comprehensive theoretical framework☆ , 2010 .

[32]  John R. Stevens,et al.  Tree-Based Methods , 2009 .

[33]  M. Marques,et al.  LANDSCAPE STRUCTURE IN THE NORTHERN COAST OF PARANÁ STATE, A HOTSPOT FOR THE BRAZILIAN ATLANTIC FOREST CONSERVATION¹ , 2012 .

[34]  J. Metzger,et al.  Evaluating the legacy of landscape history: extinction debt and species credit in bird and small mammal assemblages in the Brazilian Atlantic Forest , 2012 .

[35]  M. Caley,et al.  Global Patterns and Predictions of Seafloor Biomass Using Random Forests , 2010, PloS one.

[36]  Eric F. Lambin,et al.  Land-use and land-cover change : local processes and global impacts , 2010 .

[37]  Stephen J. Smith,et al.  Gradient forests: calculating importance gradients on physical predictors. , 2012, Ecology.

[38]  J. Metzger,et al.  Relief influence on the spatial distribution of the Atlantic Forest cover on the Ibiúna Plateau, SP. , 2007, Brazilian journal of biology = Revista brasleira de biologia.

[39]  Alexander Strickland Pfaff Talikoff What Drives Deforestation in the Brazilian Amazon? Evidence from Satellite and Socioeconomic Data , 1996 .

[40]  Ping Liu,et al.  A comparison of random forest regression and multiple linear regression for prediction in neuroscience , 2013, Journal of Neuroscience Methods.

[41]  S. Carpenter,et al.  Global Consequences of Land Use , 2005, Science.

[42]  M. Clark,et al.  The Relative Importance of Socioeconomic and Environmental Variables in Explaining Land Change in Bolivia, 2001–2010 , 2012 .

[43]  João Roberto dos Santos,et al.  Land-use and land-cover change processes in the Upper Uruguay Basin: linking environmental and socioeconomic variables , 2013, Landscape Ecology.

[44]  Joan Marull,et al.  Land use changes, landscape ecology and their socioeconomic driving forces in the Spanish Mediterranean coast (El Maresme County, 1850–2005) , 2012 .

[45]  Adele Cutler,et al.  Random forests for microarrays. , 2006, Methods in enzymology.

[46]  Jane Southworth,et al.  Tourism, forest conversion, and land transformations in the Angkor basin, Cambodia , 2009 .

[47]  Víctor Arroyo-Rodríguez,et al.  Land cover changes in the Lachuá region, Guatemala: patterns, proximate causes, and underlying driving forces over the last 50 years , 2014, Regional Environmental Change.

[48]  D. Kleinbaum,et al.  Applied Regression Analysis and Other Multivariate Methods , 1978 .

[49]  Y. Malhi,et al.  Tropical Forests in the Anthropocene , 2014 .

[50]  M. Batistella,et al.  Variação na cobertura vegetal nativa em São Paulo: um panorama do conhecimento atual , 2012 .

[51]  F. M. Borém,et al.  Atlantic Forest Fragmentation Analysis and Landscape Restoration Management Scenarios , 2012 .

[52]  O. Masera,et al.  Assessing implications of land-use and land-cover change dynamics for conservation of a highly diverse tropical rain forest , 2007 .

[53]  Stephen G. Perz,et al.  Unofficial Road Building in the Amazon: Socioeconomic and Biophysical Explanations , 2007 .

[54]  R. O'Neill,et al.  A factor analysis of landscape pattern and structure metrics , 1995, Landscape Ecology.

[55]  T. Aide,et al.  Ecology. Globalization, migration, and Latin American ecosystems. , 2004, Science.

[56]  Warren Dean With Broadax and Firebrand: The Destruction of the Brazilian Atlantic Forest , 1996 .

[57]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[58]  Ruth Beilin,et al.  Analysing how drivers of agricultural land abandonment affect biodiversity and cultural landscapes using case studies from Scandinavia, Iberia and Oceania , 2014 .

[59]  M. Lewis-Beck The Relative Importance of Socioeconomic and Political Variables For Public Policy , 1977 .

[60]  Jiquan Chen,et al.  Determining Socioeconomic Drivers of Urban Forest Fragmentation with Historical Remote Sensing Images , 2013 .

[61]  J. Barlow,et al.  Are we selecting appropriate metrics to assess human impacts on biodiversity , 2017 .

[62]  Mette Termansen,et al.  Underlying and proximate driving causes of land use change in district Swat, Pakistan , 2013 .

[63]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[64]  M. Clark,et al.  The influence of socioeconomic, environmental, and demographic factors on municipality-scale land-cover change in Mexico , 2012, Regional Environmental Change.

[65]  E. Salvador,et al.  Globalization , Forest Resurgence and Environmental Politics in , 2005 .

[66]  R. Bhadra,et al.  NIH Public Access , 2014 .

[67]  R. Hughes,et al.  Multi-scale assessment of human-induced changes to Amazonian instream habitats , 2016, Landscape Ecology.

[68]  Compton J. Tucker,et al.  Thirty Years of Land-cover Change in Bolivia , 2007, Ambio.

[69]  Jean Paul Metzger,et al.  Modeling landscape dynamics in an Atlantic Rainforest region: Implications for conservation , 2009 .

[70]  Christina Gloeckner,et al.  Modern Applied Statistics With S , 2003 .

[71]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[72]  A. Pfaff What drives deforestation in the Brazilian Amazon? Evidence from satellite and socioeconomic data , 1997 .

[73]  R. DeFries,et al.  Land‐use choices: balancing human needs and ecosystem function , 2004 .

[74]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[75]  Xiaochun Li,et al.  High-Dimensional Data Analysis in Cancer Research , 2009 .

[76]  Jean Paul Metzger,et al.  The Brazilian Atlantic Forest: How much is left, and how is the remaining forest distributed? Implications for conservation , 2009 .

[77]  William F. Laurance,et al.  The Future of the Brazilian Amazon , 2001, Science.

[78]  T. Hawbaker,et al.  Effects of roads, topography, and land use on forest cover dynamics in the Brazilian Atlantic Forest , 2010 .

[79]  Miguel B. Araújo,et al.  Systematic Conservation Planning Comes of Age , 2009 .

[80]  Jayajit Chakraborty,et al.  Using geographically weighted regression for environmental justice analysis: Cumulative cancer risks from air toxics in Florida , 2011 .

[81]  Venkat Reddy Konasani,et al.  Multiple Regression Analysis , 2015 .

[82]  Drasko Furundzic,et al.  Application example of neural networks for time series analysis: : Rainfall-runoff modeling , 1998, Signal Process..

[83]  Robert P Freckleton,et al.  Why do we still use stepwise modelling in ecology and behaviour? , 2006, The Journal of animal ecology.

[84]  R. Forman,et al.  ROADS AND THEIR MAJOR ECOLOGICAL EFFECTS , 1998 .