Data mining process for modeling hydrological time series

The main purpose of this study was to develop an optimum flow prediction model, based on data mining process. The data mining process was applied to predict river flow of Seyhan Stream in the southern part of Turkey. Hydrological time series modeling was applied using monthly historical flow records to predict Seyhan Stream flows. Seyhan Stream flows were modeled by Markov models and it was seen that it adapted AR(2). Hence, F t –2 and F t –1 flows in ( t –2) and ( t –1) months were the taken inputs. For monthly streamflow predictions, data were taken from the General Directorate of Electrical Power Resources Survey and Development Administration. Used data covered 35 years between 1969 and 2003 for monthly streamflows. Furthermore, for the effect of monthly periodicity in hydrological time series cos (2 π i /12), sin (2 π i /12) ( I = 1, 2,…, 12) were included as inputs. Then, F t flows in ( t ) months were modeled by data mining process. It was concluded that with using data mining process for streamflow prediction, it was possible to estimate missing or unmeasured data.

[1]  Surjya K. Pal,et al.  Drill wear monitoring using back propagation neural network , 2006 .

[2]  S. C. Kaushik,et al.  Assessment of diffuse solar energy under general sky condition using artificial neural network , 2009 .

[3]  Junwen Wang,et al.  Predictive models for protein crystallization. , 2004, Methods.

[4]  Joannis Apostolakis,et al.  Crystal structure prediction by data mining , 2003 .

[5]  Steven Walczak,et al.  Knowledge discovery techniques for predicting country investment risk , 2002 .

[6]  W. L. Lane,et al.  Applied Modeling of Hydrologic Time Series , 1997 .

[7]  Xia Zhao,et al.  Cold rolling process data analysis based on SVM , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[8]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[9]  Olivier Teytaud,et al.  Learning for stochastic dynamic programming , 2006, ESANN.

[10]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[11]  Keith W. Hipel,et al.  TIME SERIES ANALYSIS IN PERSPECTIVE , 1985 .

[12]  Zhi-Hua Zhou,et al.  Three perspectives of data mining , 2003, Artif. Intell..

[13]  Zhaohui Tang,et al.  Data Mining with SQL Server 2005 , 2005 .

[14]  S. Delurgio Forecasting Principles and Applications , 1998 .

[15]  Thomas Reinartz,et al.  CRISP-DM 1.0: Step-by-step data mining guide , 2000 .

[16]  Lise Getoor,et al.  Predicting Protein-Protein Interactions Using Relational Features , 2007 .

[17]  Li-Yen Shue,et al.  Data mining to aid policy making in air pollution management , 2004, Expert Syst. Appl..

[18]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[19]  Xue Z. Wang,et al.  Software analyser design using data mining technology for toxicity prediction of aqueous effluents , 2000 .

[20]  Thy-Hou Lin,et al.  Using support vector regression to model the correlation between the clinical metastases time and gene expression profile for breast cancer , 2008, Artif. Intell. Medicine.