A data warehousing and data mining approach for analysis and forecast of cloudburst events using OLAP-based data hypercube

The multidimensional data model can be effectively utilised for analysing huge and detailed meteorological datasets forecasted by numerical weather prediction (NWP) model. The model cannot predict any weather event directly. The output products of model are interpreted by man-machine mix to infer the idiosyncratic behaviour of weather events. The mathematical tools for analysis and forecasting are able to provide forecast of weather variables only at grid-points. In this paper, the technology of dimension modelling has been adapted for analysing NWP model output datasets corresponding to sub-grid scale events viz. cloudburst, using OLAP technique. The huge datasets of weather variables available directly and derived indirectly, are mined so as to locate the patterns of cloudburst formation. K-means clustering technique has been used to generate clusters of convergence and divergence, for four real-life cases of cloudburst. It has been observed that clustering technique can help in identification of patterns conducive to formation of cloudburst.

[1]  A. G. Ivakhnenko,et al.  Polynomial Theory of Complex Systems , 1971, IEEE Trans. Syst. Man Cybern..

[2]  F. Lemke,et al.  Self-Organising Data Mining , 2003 .

[3]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[4]  Ajith Abraham,et al.  SELF-ORGANIZING DATA MINING FOR WEATHER FORECASTING , 2007 .

[5]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[6]  Mitchell W. Moncrieff,et al.  Simulation of a Himalayan cloudburst event , 2006 .

[7]  K. Pabreja,et al.  Data Mining - A Tool in Support of Interpretation of Low Pressure System Movement Over Indian Region , 2009 .

[8]  Anthony K. H. Tung,et al.  Spatial clustering methods in data mining : A survey , 2001 .

[9]  Kavita Pabreja Application of Multidimensional Databases of Rainfall and Low Pressure Systems on OLAP-Based Model , 2010, 2010 Second International Conference on Computer Modeling and Simulation.

[10]  Teodora Vătuiu,et al.  Overview of Oracle Olap and Using SQL for Manipulate Multidimensional Data , 2007 .

[11]  Someshwar Das Mountain weather forecasting using MM5 modelling system , 2005 .

[12]  Ian Witten,et al.  Data Mining , 2000 .

[13]  Jayanta Basak,et al.  Weather Data Mining Using Independent Component Analysis , 2004, J. Mach. Learn. Res..

[14]  Jiawei Han,et al.  OLAP Mining: Integration of OLAP with Data Mining , 1997, DS-7.

[15]  Mick J. Ridley,et al.  Data modelling for effective data warehouse architecture and design , 2009, Int. J. Inf. Decis. Sci..

[16]  Amreek Singh,et al.  Weather prediction using nearest-neighbour model , 2005 .

[17]  Ronnie Alves,et al.  Effective OLAP Mining of Evolving Data Marts , 2007, 11th International Database Engineering and Applications Symposium (IDEAS 2007).