Extracting storm-centric characteristics from raw rainfall data for storm analysis and mining

Most rainfall data is stored in formats that are not easy to analyze and mine. In these formats, the amount of data is enormous. In this paper, we propose techniques to summarize the raw rainfall data into a model that facilitates storm analysis and mining, and reduces the data size. The result is to convert raw rainfall data into meaningful storm-centric data, which is then stored in a relational database for easy analysis and mining. The size of the storm data is less than 1% of the size of the raw data. We can determine the spatio-temporal characteristics of a storm, such as how big a storm is, how many sites are covered, and what is its overall depth (precipitation) and duration. We present formal definitions for the storm-related concepts that are needed in our data conversion. Then we describe storm identification algorithms based on these concepts. Our storm identification algorithms analyze precipitation values of adjacent sites within the period of time that covers the whole storm and combines them together to identify the overall storm characteristics.

[1]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[2]  Raymond M. Slade,et al.  Extreme precipitation depths for Texas, excluding the Trans-Pecos region , 1998 .

[3]  P. E. O'Connell,et al.  The influence of storm characteristics and catchment conditions on extreme flood response : A case study based on the brue river basin, U.K. , 1995 .

[4]  Jeffery S. Horsburgh,et al.  A relational model for environmental and water resources data , 2008 .

[5]  Theodore G. Cleveland,et al.  Statistical characteristics of storm interevent time, depth, and duration for eastern New Mexico, Oklahoma, and Texas , 2006 .

[6]  Adam Freeman Pro .NET 4 Parallel Programming in C , 2010 .

[7]  W. Asquith Depth-duration frequency of precipitation for Texas , 1998 .

[8]  Theodore G. Cleveland,et al.  Synthesis of rainfall and runoff data used for Texas Department of Transportation Research Projects 0-4193 and 0-4194 , 2004 .

[9]  黄贻彬,et al.  Microsoft SQL Server中的星形连接查询优化 , 2011 .

[10]  William H. Asquith,et al.  Atlas of depth-duration frequency of precipitation annual maxima for Texas , 2004 .

[11]  Технология,et al.  National Climatic Data Center , 2011 .

[12]  Ramez Elmasri,et al.  Fundamentals of Database Systems, 5th Edition , 2006 .

[13]  Bill Franks,et al.  Taming The Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics , 2012 .

[14]  Aart Overeem,et al.  Rainfall depth-duration-frequency curves and their uncertainties , 2008 .

[15]  Reed Jacobson,et al.  Microsoft SQL Server(TM) 2005 Analysis Services Step by Step , 2006 .

[16]  Gordon S. Linoff,et al.  Data Analysis Using SQL and Excel , 2007 .