Pattern Discovery in Hydrological Time Series Data Mining during the Monsoon Period of the High Flood Years in Brahmaputra River Basin

paper, based on data mining techniques, the analysis is carried out in hydrological daily discharge time series of the Panchratna station in the river Brahmaputra under Brahmaputra and Barak Basin Organization in India. The data has been selected for the high flood years 1988, 1991,1998, 2004, and 2007. The whole year is divided into three periods known as Pre-monsoon, Monsoon and Post Monsoon. In this paper, only monsoon period data have been used. For standardization of data, statistical analysis such as mean monthly discharge, monthly Maximum Discharge, monthly amplitude and monthly standard deviation have been carried out. K-means clustering is segmented for the monsoon period process of daily discharge. Dynamic Time Warping (DTW) is used to look for similarities in the discharge process under the same climatic condition. Similarity matrix helped in the mining of discharge process in similar time period in the different years. The agglomerative hierarchical clustering is used to cluster and discover the discharge patterns in terms of the autoregressive model. A forecast model has been predicted on the discharge process. In the field of hydrological forecasting, with the help of data mining techniques, earlier various researches have been carried out. Some of them are : Similarity search and pattern discovery in hydrological time series data mining (14), Flood pattern detection using sliding window technique (17), Applications of Data Mining in Hydrology (11), Runoff forecasting using fuzzy support vector regression (23), Forecasting monthly runoff using wavelet neural network model (2), Neural network model for hydrological forecasting based on multivariate phase space reconstruction (22), Mid-short term daily runoff forecasting by ANNs and multiple process based hydrological models (7), Research and application of data mining for runoff forecasting (10), River flow time series using least squares support vector machines(18), a novel approach to the similarity analysis of multivariate time series and its application in hydrological data mining (25), Computational methods for temporal pattern discovery in biomedical genomic databases (16), an efficient k-Means clustering algorithm: analysis and implementation(9), The prediction algorithm based on fuzzy logic using time series data mining methods (3) , A forecast Model of Hydrologic Single, Element Medium and Long-Period Based on Rough Set Theory (19), Application of ANN in Forecast of surface runoff (4) are important contributions in the knowledge discovery from hydrological databases using time series data mining. In this paper, Time Series Data Mining has been used for hydrological study (14). Time series data mining combines the fields of time series and data mining techniques (3). Here, the object is to develop a data mining application using modern information technology to discover the hidden information or patterns behind the historical hydrological data of the river Brahmaputra under the hydrological process, and also to look for a new way to meet the requirements of hydrological time series analysis using TSDM algorithms.

[1]  Ku Ruhana Ku-Mahamud,et al.  Flood Pattern Detection Using Sliding Window Technique , 2009, 2009 Third Asia International Conference on Modelling & Simulation.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  Weilin Liu,et al.  Neural network model for hydrological forecasting based on multivariate phase space reconstruction , 2011, 2011 Seventh International Conference on Natural Computation.

[4]  Toni Giorgino,et al.  Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package , 2009 .

[5]  Dong Sihui,et al.  A Forecast Model of Hydrologic Single Element Medium and Long-Period Based on Rough Set Theory , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[6]  Hai-Jun Xu,et al.  A Forecast Model of Hydrologic Single Element Medium and Long-Period Based on Rough Set Theory , 2004 .

[7]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[9]  K.A. Peker Subsequence time series (STS) clustering techniques for meaningful pattern discovery , 2005, International Conference on Integration of Knowledge Intensive Multi-Agent Systems, 2005..

[10]  Chenghu Zhou,et al.  Similarity search and pattern discovery in hydrological time series data mining , 2010 .

[11]  Guoquan Jiang,et al.  The Research of Data Mining Based on Neural Networks , .

[12]  Mehmet Karakose,et al.  The Prediction Algorithm Based on Fuzzy Logic Using Time Series Data Mining Method , 2009 .

[13]  Yao Liang,et al.  Applications of data mining in hydrology , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[14]  Martin J. O'Connor,et al.  Computational method for temporal pattern discovery in biomedical genomic databases , 2005, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).

[15]  Lu Jiahai,et al.  Forecasting monthly runoff using wavelet neural network model , 2011, 2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC).

[16]  Puteh Saad,et al.  River flow time series using least squares support vector machines , 2011 .

[17]  Jörg Rech,et al.  Knowledge Discovery in Databases , 2001, Künstliche Intell..

[18]  Wanchang Zhang,et al.  Mid-short-term daily runoff forecasting by ANNs and multiple process-based hydrological models , 2009, 2009 IEEE Youth Conference on Information, Computing and Telecommunication.

[19]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[20]  Kwok-Wing Chau,et al.  Data-driven models for monthly streamflow time series prediction , 2010, Eng. Appl. Artif. Intell..

[21]  Vladan Babovic,et al.  Data mining in hydrology , 2005 .

[22]  Zhu Yuelong,et al.  A Novel Approach to the Similarity Analysis of Multivariate Time Series and Its Application in Hydrological Data Mining , 2008, 2008 International Conference on Computer Science and Software Engineering.

[23]  Xiaohui Yuan,et al.  Research and Application of Data Mining for Runoff Forecasting , 2008, 2008 International Conference on Intelligent Computation Technology and Automation (ICICTA).

[24]  L. H. Feng,et al.  Application of ANN in forecast of surface runoff , 2010, INC2010: 6th International Conference on Networked Computing.

[25]  Sansanee Auephanwiriyakul,et al.  Runoff forecasting using fuzzy support vector regression , 2009, 2008 International Symposium on Intelligent Signal Processing and Communications Systems.