Two data mining technologies have been employed in this paper to build a prediction model for dwelling fire incidents. The first method uses a feedforward neural network to describe the relationship between the number of fires and factors that influence the fire occurrence. Eight factors are initially selected as the inputs of the neural network. Principle Component Analysis (PCA) is employed for pre-processing the input data set to reduce the number of the inputs. The model can give a prediction with an acceptable accuracy for the number of fires in dwelling areas. Genetic algorithms (GAs) are the second approach discussed in this article. The first three principle components of the raw data set chosen from outputs of the PCA are classified into the different groups according to their number of fires. An iterative GA is proposed and applied to extract the features for each data group. Once the features for all the groups have been identified the test data set can be easily clustered into one of the groups based on the group features. The number of fires for the group, which the test data belongs to, is the prediction of the fire occurrence for the test data. The two approaches have been compared in the conclusion.
[1]
David L. Martell,et al.
A logistic model for predicting daily people-caused forest fire occurrence in Ontario
,
1987
.
[2]
Q. Henry Wu,et al.
A Faster Genetic Clustering Algorithm
,
2000,
EvoWorkshops.
[3]
I. Jolliffe.
Principal Component Analysis
,
2002
.
[4]
J. Faraway,et al.
Time series forecasting with neural networks: a comparative study using the air line data
,
2008
.
[5]
James C. Bezdek,et al.
Clustering with a genetically optimized approach
,
1999,
IEEE Trans. Evol. Comput..