Flood Prediction and Mining Influential Spatial Features on Future Flood with Causal Discovery

The development of accurate flood prediction model could reduce number of fatalities. In this paper, water level time series, spatio-temporal precipitation and hydrological data are used for flood prediction. Since our data is high dimensional and not all features are correlated to flood, our proposed algorithm is designed to find influential spatial features, or features at locations which are highly correlated to flood. With the idea that true causes, or highly correlated features to flood, should give accurate information about flood, our proposed flood prediction algorithm is based on Bayesian based causal discovery. The purpose of this paper is twofold. Firstly, we propose a new causal discovery algorithm which is Bayesian-based approach with an optimization function for maximizing mutual information. Secondly, the proposed algorithm is applied to real-world precipitation and hydrological data to find influential spatial features on future flood in North Texas area. Flood prediction models can then be learned from selected features. Experiments on synthetic data confirm that our proposed algorithm is more accurate in finding true causal relationships than two competitors, Group Lasso and Markov2P. Experiments on flood predictions show that our approach has the best accuracy in almost all six lead time predictions. The accuracy and result visualizations also suggest that our proposed algorithm can find influential features to flood.

[1]  R. Horst,et al.  DC Programming: Overview , 1999 .

[2]  Ramli Adnan,et al.  Flood water level modeling and prediction using NARX neural network: Case study at Kelang river , 2014, 2014 IEEE 10th International Colloquium on Signal Processing and its Applications.

[3]  P. Spirtes,et al.  An Algorithm for Fast Recovery of Sparse Causal Graphs , 1991 .

[4]  Constantin F. Aliferis,et al.  Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[5]  T. P. Dinh,et al.  Convex analysis approach to d.c. programming: Theory, Algorithm and Applications , 1997 .

[6]  Aapo Hyvärinen,et al.  Distinguishing causes from effects using nonlinear acyclic causal models , 2008, NIPS 2010.

[7]  Qin Huang,et al.  Long-term prediction for autumn flood season in Danjiangkou Reservoir basin based on OSR-BP neural network , 2010, 2010 Sixth International Conference on Natural Computation.

[8]  Sansanee Auephanwiriyakul,et al.  River basin flood prediction using support vector machines , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[9]  Tom Heskes,et al.  A Bayesian Approach to Constraint Based Causal Inference , 2012, UAI.

[10]  Yan Liu,et al.  Spatial-temporal causal modeling for climate change attribution , 2009, KDD.

[11]  Lai-Wan Chan,et al.  Using Bayesian Network Learning Algorithm to Discover Causal Relations in Multivariate Time Series , 2011, 2011 IEEE 11th International Conference on Data Mining.

[12]  Patrik O. Hoyer,et al.  Discovering Cyclic Causal Models by Independent Components Analysis , 2008, UAI.

[13]  Yan Liu,et al.  Learning Temporal Causal Graphs for Relational Time-Series Analysis , 2010, ICML.

[14]  Gwo-Fong Lin,et al.  Typhoon flood forecasting using integrated two-stage Support Vector Machine approach , 2013 .