Using Spatial Reinforcement Learning to Build Forest Wildfire Dynamics Models From Satellite Images

Machine learning algorithms have increased tremendously in power in recent years but have yet to be fully utilized in many ecology and sustainable resource management domains such as wildlife reserve design, forest fire management and invasive species spread. One thing these domains have in common is that they contain dynamics that can be characterized as a Spatially Spreading Process (SSP) which requires many parameters to be set precisely to model the dynamics, spread rates and directional biases of the elements which are spreading. We present related work in Artificial Intelligence and Machine Learning for SSP sustainability domains including forest wildfire prediction. We then introduce a novel approach for learning in SSP domains using Reinforcement Learning (RL) where fire is the agent at any cell in the landscape and the set of actions the fire can take from a location at any point in time includes spreading North, South, East, West or not spreading. This approach inverts the usual RL setup since the dynamics of the corresponding Markov Decision Process (MDP) is a known function for immediate wildfire spread. Meanwhile, we learn an agent policy for a predictive model of the dynamics of a complex spatially-spreading process. Rewards are provided for correctly classifying which cells are on fire or not compared to satellite and other related data. We examine the behaviour of five RL algorithms on this problem: Value Iteration, Policy Iteration, Q-Learning, Monte Carlo Tree Search and Asynchronous Advantage Actor-Critic (A3C). We compare to a Gaussian process based supervised learning approach and discuss the relation of our approach to manually constructed, state-of-the-art methods from forest wildfire modelling. We also discuss the relation of our approach to manually constructed, state-of-the-art methods from forest wildfire modelling. We validate our approach with satellite image data of two massive wildfire events in Northern Alberta, Canada; the Fort McMurray fire of 2016 and the Richardson fire of 2011. The results show that we can learn predictive, agent-based policies as models of spatial dynamics using RL on readily available satellite images that other methods and have many additional advantages in terms of generalizability and interpretability.

[1]  Thomas G. Dietterich,et al.  Fast Simulation for Computational Sustainability Sequential Decision Making Problems , 2016 .

[2]  A. Brenning,et al.  Predictive mapping of reef fish species richness, diversity and biomass in Zanzibar using IKONOS imagery and machine-learning techniques. , 2010 .

[3]  K. Malarz,et al.  Are Forest Fires Predictable , 2002 .

[4]  Aarti Singh,et al.  PREDICTING BURNED AREAS OF FOREST FIRES : AN ARTIFICIAL INTELLIGENCE APPROACH , 2017 .

[5]  Imas Sukaesih Sitanggang,et al.  Classification model for hotspot occurrences using a decision tree method , 2011 .

[6]  Régis Sabbadin,et al.  Reinforcement learning for spatial processes , 2009 .

[7]  P. Cortez,et al.  A data mining approach to predict forest fires using meteorological data , 2007 .

[8]  Sang Michael Xie,et al.  Combining satellite imagery and machine learning to predict poverty , 2016, Science.

[9]  K. Angayarkkani,et al.  Efficient Forest Fire Detection System: A Spatial Data Mining and Image Processing Based Approach , 2009 .

[10]  Mark A. Finney,et al.  On the need for a theory of wildland fire spread , 2013 .

[11]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[12]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  David L. Martell,et al.  A Review of Recent Forest and Wildland Fire Management Decision Support Systems Research , 2015, Current Forestry Reports.

[15]  M. Finney FARSITE : Fire Area Simulator : model development and evaluation , 1998 .

[16]  Ahmad A. A. Alkhatib A Review on Forest Fire Detection Techniques , 2014, Int. J. Distributed Sens. Networks.

[17]  Friedrich Recknagel,et al.  Applications of machine learning to ecological modelling , 2001 .

[18]  M. E. Alexander,et al.  Canadian Forest Fire Danger Rating System: An Overview , 1989 .

[19]  Cheng Liu,et al.  Detection, Emission Estimation and Risk Prediction of Forest Fires in China Using Satellite Sensors and Simulation Models in the Past Three Decades—An Overview , 2011, International journal of environmental research and public health.

[20]  Mikhail Kanevski,et al.  Machine Learning Algorithms for GeoSpatial Data. Applications and Software Tools , 2008 .

[21]  C. Furlanello,et al.  Predicting habitat suitability with machine learning models: The potential area of Pinus sylvestris L. in the Iberian Peninsula , 2006 .

[22]  Tao Han,et al.  Simulating wildfire spreading processes in a spatially heterogeneous landscapes using an improved cellular automaton model , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[23]  Thomas G. Dietterich,et al.  Allowing a wildfire to burn: estimating the effect on future fire suppression costs , 2013 .

[24]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[25]  Jing Li,et al.  High-resolution surface relative humidity computation using MODIS image in Peninsular Malaysia , 2006 .

[26]  M. Hemalatha,et al.  Integration of machine learning algorithm using spatial semi supervised classification in FWI data , 2012, IEEE-International Conference On Advances In Engineering, Science And Management (ICAESM -2012).

[27]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[28]  Csaba Szepesvári,et al.  Bandit Based Monte-Carlo Planning , 2006, ECML.

[29]  Simon M. Lucas,et al.  A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[30]  Lise Getoor,et al.  Entity resolution in geospatial data integration , 2006, GIS '06.

[31]  Jin Li,et al.  Application of machine learning methods to spatial interpolation of environmental variables , 2011, Environ. Model. Softw..

[32]  Matthew J. Cracknell,et al.  Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information , 2014, Comput. Geosci..