Statistics Based Predictive Geo-spatial Data Mining: Forest Fire Hazardous Area Mapping Application

In this paper, we propose two statistics-based predictive geo-spatial data mining methods and apply them to predicting forest fire hazardous areas. The two prediction models are the likelihood ratio method and the conditional probability method. In both approaches, the prediction model and its estimation procedure depend on the basic quantitative relationships between the geo-spatial data sets relevant to forest fire and the selected areas of previous forest fire ignition. To build the forest fire hazardous area prediction map with each method, we applied an FHR (Forest Fire Hazard Rate); to evaluate the prediction power of each method, we applied a PRC (Prediction Rate Curve). When the two models are compared, the likelihood ratio method shows greater prediction power than the conditional probability method. The proposed models for predicting forest fire hazardous areas should help increase the efficiency of forest fire management, for example in preventing forest fire occurrences and in placing forest fire monitoring equipment and manpower effectively.
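
As an illustration of the two quantities named above, the following minimal sketch computes a per-class conditional probability and likelihood ratio for a single categorical map layer. It uses the standard textbook definitions, not necessarily the exact estimators of the paper; the function name `class_scores`, the list-based input format, and the toy data are all assumptions made for this example.

```python
from collections import Counter


def class_scores(layer, fire):
    """Score each class of one categorical layer (hypothetical helper).

    layer: class label of each grid cell, e.g. forest type
    fire:  0/1 flag per cell, 1 where a past ignition was recorded
    Assumes at least one fire cell and at least one non-fire cell.
    """
    total = Counter(layer)
    burned = Counter(c for c, f in zip(layer, fire) if f)
    n_fire = sum(fire)
    n_safe = len(fire) - n_fire

    scores = {}
    for c, n in total.items():
        b = burned.get(c, 0)
        cond_prob = b / n            # P(fire | class)
        p_c_fire = b / n_fire        # P(class | fire)
        p_c_safe = (n - b) / n_safe  # P(class | no fire)
        # Likelihood ratio: values above 1 mean the class is
        # over-represented among past ignition cells.
        ratio = p_c_fire / p_c_safe if p_c_safe > 0 else float("inf")
        scores[c] = (cond_prob, ratio)
    return scores


# Toy data, invented for illustration only.
layer = ["pine", "pine", "pine", "oak", "oak", "oak", "oak", "oak"]
fire = [1, 1, 0, 0, 0, 1, 0, 0]
scores = class_scores(layer, fire)
for name, (cp, lr) in sorted(scores.items()):
    print(f"{name}: conditional probability={cp:.3f}, likelihood ratio={lr:.3f}")
```

In a multi-layer setting, per-class scores of this kind would be combined across layers to rank cells by hazard; how the paper combines them is not stated in the abstract, so that step is left out here.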
