A data mining approach to predict forest fires using meteorological data

Forest fires are a major environmental issue, creating economical and ecological damage while endangering human lives. Fast detection is a key element for controlling such phenomenon. To achieve this, one alternative is to use automatic tools based on local sensors, such as provided by meteorological stations. In effect, meteorological conditions (e.g. temperature, wind) are known to influence forest fires and several fire indexes, such as the forest Fire Weather Index (FWI), use such data. In this work, we explore a Data Mining (DM) approach to predict the burned area of forest fires. Five different DM techniques, e.g. Support Vector Machines (SVM) and Random Forests, and four distinct feature selection setups (using spatial, temporal, FWI components and weather attributes), were tested on recent real-world data collected from the northeast region of Portugal. The best configuration uses a SVM and four meteorological inputs (i.e. temperature, relative humidity, rain and wind) and it is capable of predicting the burned area of small fires, which are more frequent. Such knowledge is particularly useful for improving firefighting resource management (e.g. prioritizing targets for air tankers and ground crews).

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[3]  Giovanni Bovio,et al.  Comparative study of various methods of fire danger evaluation in southern Europe , 1994 .

[4]  Arthur Flexer,et al.  Statistical evaluation of neural networks experiments: Minimum requirements and current practice , 1994 .

[5]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[6]  Hans C. Jessen,et al.  Applied Logistic Regression Analysis , 1996 .

[7]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[8]  Josep Piñol,et al.  Climate Warming, Wildfire Hazard, and Wildfire Occurrence in Coastal Eastern Spain , 1998 .

[9]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[10]  Robert H. Kewley,et al.  Data strip mining for the virtual design of pharmaceuticals with neural networks , 2000, IEEE Trans. Neural Networks Learn. Syst..

[11]  Aníbal Ollero,et al.  An Intelligent System for False Alarm Reduction in Infrared Forest-Fire Detection , 2000, IEEE Intell. Syst..

[12]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[13]  K. Malarz,et al.  Are Forest Fires Predictable , 2002 .

[14]  Jinbo Bi,et al.  Regression Error Characteristic Curves , 2003, ICML.

[15]  Chang-Tien Lu,et al.  Detecting region outliers in meteorological data , 2003, GIS '03.

[16]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[17]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[18]  Mong-Li Lee,et al.  Image Mining: Trends and Developments , 2002, Journal of Intelligent Information Systems.

[19]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[20]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[21]  David J. Diner,et al.  Using MISR and MODIS data for detection and analysis of smoke plume injection heights over North American during Summer 2004 , 2005 .

[22]  S. Džeroski,et al.  LEARNING TO PREDICT FOREST FIRES WITH DIFFERENT DATA MINING TECHNIQUES , 2006 .

[23]  S. W. Taylor,et al.  Science, technology, and human factors in fire danger rating: the Canadian experience , 2006 .

[24]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[25]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[26]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.