Clustering spatial-temporal precipitation data using wavelet transform and self-organizing map neural network

Abstract A data analysis method is proposed to cluster and explore spatio-temporal characteristics of the 22 years of precipitation data (1982–2003) for Taiwan. The wavelet transform self-organizing map (WTSOM) framework combines the wavelet transform (WT) and a self-organizing map (SOM) neural network. WT is used to extract dynamic and multiscale features of the non-stationary precipitation time-series, and SOM is applied to objectively identify spatially homogeneous clusters on the high-dimensional wavelet-transformed feature space. Haar and Morlet wavelets are applied in the data preprocessing stage to preserve the desired characteristics of the precipitation data. A two-level SOM neural network is applied to identify clusters in the wavelet space in the clustering stage. The performance of clustering is evaluated using silhouette coefficients. The results indicate that singularities or sharp transitions are more significant than changes in the periodicity or data structure in the spatial–temporal precipitation data. The WTSOM results show that six clusters are optimal for both Haar and Morlet wavelet functions, but their corresponding geographic locations are different. The geographic locations of clusters based on the Haar wavelet, which captures the occurrence of extreme hydrological events, appear in blocks while those classified by the Morlet wavelet, which indicates periodicity changes and describes fine structures, appear in strips that cross the island of Taiwan. Principal component analysis is applied to the precipitation data of each cluster. The first principal components explain 62–90% of the total variation of data. Characteristics of precipitation data for each cluster are explored using scalogram analysis. The results show that both extreme hydrological events and periodicity changes appear in the spatial and temporal precipitation data but with different characteristics for each cluster. Recognizing homogeneous hydrologic regions and identifying the associated precipitation characteristics improves the efficiency of water resources management in adapting to climate change, preventing the degradation of the water environment, and reducing the impact of climate-induced disasters. Measures for countering the stress of precipitation variation for water resources management are provided.

[1]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[2]  Zekai Şen,et al.  Global warming threat on water resources and environment: a review , 2009 .

[3]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[4]  Tommy W. S. Chow,et al.  Clustering of the self-organizing map using a clustering validity index based on inter-cluster and intra-cluster density , 2004, Pattern Recognit..

[5]  Gwo-Fong Lin,et al.  Identification of homogeneous regions for regional frequency analysis using the self-organizing map , 2006 .

[6]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[7]  N. Lauzon,et al.  Clustering of heterogeneous precipitation fields for the assessment and possible improvement of lumped neural network models for streamflow forecasts , 2006 .

[8]  Shaw K. Chen,et al.  The comparative ability of self-organizing neural networks to define cluster structure , 1995 .

[9]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[10]  Nicolás J. Medrano-Marqués,et al.  Feature Map Architectures for Pattern Recognition: Techniques for Automatic Region Selection , 1995, ICANNGA.

[11]  Marika M. Holland,et al.  Future abrupt reductions in the summer Arctic sea ice , 2006 .

[12]  Y. H. Song,et al.  Wavelet transform and neural networks for short-term electrical load forecasting , 2000 .

[13]  Chung-Ho Wang,et al.  Climate-induced hydrological impacts on the groundwater system of the Pingtung Plain, Taiwan , 2007 .

[14]  Louis J. Bajuk,et al.  Interpretation of Interdecadal Trends in Northern Hemisphere Surface Air Temperature , 1996 .

[15]  D. F. Morrison,et al.  Multivariate Statistical Methods , 1968 .

[16]  S. Mallat A wavelet tour of signal processing , 1998 .

[17]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[19]  C. Torrence,et al.  A Practical Guide to Wavelet Analysis. , 1998 .

[20]  Sheng-Tun Li,et al.  Knowledge discovery in financial investment for forecasting and trading strategy through wavelet-based SOM networks , 2008, Expert Syst. Appl..

[21]  Alain Arneodo,et al.  Revisiting multifractality of high‐resolution temporal rainfall using a wavelet‐based formalism , 2005 .

[22]  Sheng-Tun Li,et al.  A web-aware interoperable data mining system , 2002, Expert Syst. Appl..

[23]  Holger R. Maier,et al.  Optimal division of data for neural network models in water resources applications , 2002 .

[24]  Li-Yen Shue,et al.  Data mining to aid policy making in air pollution management , 2004, Expert Syst. Appl..

[25]  A. W. Minns,et al.  The classification of hydrologically homogeneous regions , 1999 .

[26]  Renu Joseph,et al.  Multiple-Scale Spatio–Temporal Variability of Precipitation over the Coterminous United States , 2000 .

[27]  H. Hsu,et al.  Observed and projected climate change in Taiwan , 2002 .

[28]  P. Claps,et al.  A comparison of homogeneity tests for regional frequency analysis , 2007 .

[29]  Aboelmagd Noureldin,et al.  Wavelet Transform for Structural Health Monitoring: A Compendium of Uses and Features , 2006 .

[30]  Shu-Hsien Liao,et al.  Knowledge management technologies and applications - literature review from 1995 to 2002 , 2003, Expert Syst. Appl..

[31]  Shie-Yui Liong,et al.  Advance flood forecasting for flood stricken Bangladesh with a fuzzy reasoning method (Copies of English Papers by the Center Staff Published in the Fiscal Year of 1999) , 2000 .

[32]  Vincent R. Gray Climate Change 2007: The Physical Science Basis Summary for Policymakers , 2007 .

[33]  F. Murtagh,et al.  The Kohonen self-organizing map method: An assessment , 1995 .

[34]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[35]  E. Foufoula‐Georgiou,et al.  Wavelet analysis for geophysical applications , 1997 .

[36]  A. Haar Zur Theorie der orthogonalen Funktionensysteme , 1910 .

[37]  Melody Y. Kiang,et al.  Extending the Kohonen self-organizing map networks for clustering analysis , 2002 .

[38]  S. Sievers,et al.  Dioxin, dioxin-like PCBS and organotin compounds in the river Elbe and the Hamburg harbour: identification of sources , 1998 .

[39]  David E. Booth,et al.  The use of fuzzy clustering algorithm and self-organizing neural networks for identifying potentially failing banks: an experimental study , 2000 .