A Comparison between Spatial Econometric Models and Random Forest for Modeling Fire Occurrence

Fire occurrence, which is examined in terms of fire density (number of fire/km2) in this paper, has a close correlation with multiple spatiotemporal factors that include environmental, physical, and other socioeconomic predictors. Spatial autocorrelation exists widely and should be considered seriously for modeling the occurrence of fire in urban areas. Therefore, spatial econometric models (SE) were employed for modeling fire occurrence accordingly. Moreover, Random Forest (RF), which can manage the nonlinear correlation between predictors and shows steady predictive ability, was adopted. The performance of RF and SE models is discussed. Based on historical fire records of Hefei City as a case study in China, the results indicate that SE models have better predictive ability and among which the spatial autocorrelation model (SAC) is the best. Road density influences fire occurrence the most for SAC, while network distance to fire stations is the most important predictor for RF; they are selected in both models. Semivariograms are employed to explore their abilities to explain the spatial structure of fire occurrence, and the result shows that SAC works much better than RF. We give a further explanation for the generation of residuals between fire density and the common predictors in both models. Therefore, decision makers can make use of our conclusions to manage fire safety at the city scale.

[1]  Jiping Zhu,et al.  Modeling Fire Occurrence at the City Scale: A Comparison between Geographically Weighted Regression and Global Linear Regression , 2017, International journal of environmental research and public health.

[2]  Marcos Rodrigues,et al.  Analysis of recent spatial–temporal evolution of human driving factors of wildfires in Spain , 2016, Natural Hazards.

[3]  Lu Lu,et al.  Study on correlation between fire fighting time and fire loss in urban building based on statistical data , 2016 .

[4]  Zohre Sadat Pourtaghi,et al.  Investigation of general indicators influencing on forest fire and its susceptibility modeling using different data mining techniques , 2016 .

[5]  Heiko Balzter,et al.  Mapping regional patterns of large forest fires in Wildland-Urban Interface areas in Europe. , 2016, Journal of environmental management.

[6]  Gustau Camps-Valls,et al.  Uncertainty analysis of gross primary production upscaling using Random Forests, remote sensing and eddy covariance data , 2015 .

[7]  A. Stewart Fotheringham,et al.  Geographical and Temporal Weighted Regression (GTWR) , 2015 .

[8]  M. Loureiro,et al.  Modelling spatial patterns and temporal trends of wildfires in Galicia (NW Spain) , 2015 .

[9]  Ponnuthurai N. Suganthan,et al.  Modeling of steelmaking process with effective machine learning techniques , 2015, Expert Syst. Appl..

[10]  G. Pfister,et al.  Spatiotemporal prediction of fine particulate matter during the 2008 northern California wildfires using machine learning. , 2015, Environmental science & technology.

[11]  Seyed Amir Naghibi,et al.  GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran , 2015, Environmental Monitoring and Assessment.

[12]  Jeffrey P. Prestemon,et al.  Human-ignited wildfire patterns and responses to policy shifts , 2015 .

[13]  Pengcheng Qi,et al.  Improvement of fire danger modelling with geographically weighted logistic model , 2014 .

[14]  K. Satoh,et al.  Correlation Between Fire Attendance Time and Burned Area Based on Fire Statistical Data of Japan and China , 2014 .

[15]  Juan de la Riva,et al.  An insight into machine-learning algorithms to model human-caused wildfire occurrence , 2014, Environ. Model. Softw..

[16]  P. Mourão,et al.  The choices of the fire — Debating socioeconomic determinants of the fires observed at Portuguese municipalities ☆ , 2014 .

[17]  S. Fotheringham,et al.  Modeling the spatial variation of the explanatory factors of human-caused wildfires in Spain using geographically weighted logistic regression , 2014 .

[18]  Charles Jennings,et al.  Social and economic characteristics as determinants of residential fire risk in urban neighborhoods: A review of the literature , 2013 .

[19]  Kirsi Virrantaus,et al.  Discovering spatio-temporal relationships in the distribution of building fires , 2013 .

[20]  Nikos Koutsias,et al.  Modelling long-term fire occurrence factors in Spain by accounting for local variations with geographically weighted regression , 2013 .

[21]  Jorge Mateu,et al.  Spatial pattern modelling of wildfires in Catalonia, Spain 2004-2008 , 2013, Environ. Model. Softw..

[22]  Martin Jung,et al.  Estimating Basal Area of Spruce and Fir in Post-Fire Residual Stands in Central Siberia Using Quickbird, Feature Selection, and Random Forests , 2013, ICCS.

[23]  Alexander Brenning,et al.  Spatial cross-validation and bootstrap for the assessment of prediction rules in remote sensing: The R package sperrorest , 2012, 2012 IEEE International Geoscience and Remote Sensing Symposium.

[24]  J. Pereira,et al.  Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest , 2012 .

[25]  Xianjin Huang,et al.  Spatial Autocorrelation Analysis of Chinese Inter-Provincial Industrial Chemical Oxygen Demand Discharge , 2012, International journal of environmental research and public health.

[26]  Gary Higgs,et al.  Fire incidence in metropolitan areas: A comparative study of Brisbane (Australia) and Cardiff (United Kingdom) , 2011 .

[27]  Jean-Michel Poggi,et al.  Variable selection using random forests , 2010, Pattern Recognit. Lett..

[28]  Douglas G. Woolford,et al.  A model for predicting human-caused wildfire occurrence in the region of Madrid, Spain , 2010 .

[29]  Carlos J. Novillo,et al.  Spatial modelling of socioeconomic data to understand patterns of human-caused wildfire ignition risk in the SW of Madrid (central Spain) , 2010 .

[30]  Matthew G. Falk,et al.  Spatially stratified sampling using auxiliary information for geostatistical mapping , 2009, Environmental and Ecological Statistics.

[31]  Bjoern H. Menze,et al.  A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data , 2009, BMC Bioinformatics.

[32]  David T. Butry,et al.  Economic Performance of Residential Fire Sprinkler Systems , 2009 .

[33]  E. Chuvieco,et al.  Human-caused wildfire risk rating for prevention planning in Spain. , 2009, Journal of environmental management.

[34]  J. LeSage Introduction to spatial econometrics , 2009 .

[35]  A. Sebastián-López,et al.  Integration of socio-economic and environmental variables for modelling long-term fire danger in Southern Europe , 2008, European Journal of Forest Research.

[36]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[37]  W. D. Kissling,et al.  Spatial autocorrelation and the selection of simultaneous autoregressive models , 2007 .

[38]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[39]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[40]  J. Gómez,et al.  CONSEQUENCES OF SPATIAL AUTOCORRELATION FOR THE ANALYSIS OF METAPOPULATION DYNAMICS , 2005 .

[41]  Richard J. Telford,et al.  The secret assumption of transfer functions: problems with spatial autocorrelation in evaluating model performance , 2005 .

[42]  L. Breiman CONSISTENCY FOR A SIMPLE MODEL OF RANDOM FORESTS , 2004 .

[43]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[44]  J. LeSage Spatial Econometrics , 1998 .

[45]  Ams Ferreira de Almeida,et al.  The Relationship of Forest Fires to Agro-Forestry and Socio-Economic Parameters in Portugal , 1992 .