A Fuzzy C-Means approach for regionalization using a bivariate homogeneity and discordancy approach

In stochastic analysis for droughts, such as frequency or trend analysis, the absence of lengthy records typically limits the reliability of statistical estimates. To address this issue, “regional” or “pooled” analysis approach is often used. The main contribution of this study is to create regions based on bivariate criteria rather than univariate ones; the two variables are severity and duration. The methodology is applied to hydrological records of 36 unregulated flow monitoring sites in the Canadian “prairie” provinces of Alberta, Saskatchewan and Manitoba. Our criteria for a hydrological “region” to be suitable are that it should be homogeneous, that it should not be discordant, and that it should not be too small. Tests for homogeneity and non-discordancy are traditionally based on univariate L-moment statistics; for example there have been several applications of univariate L-moments to bivariate drought analysis by simply ignoring one of the variables. Instead, we use multivariate L-moments, also known as L-comoments. The approach uses site characteristics and a fuzzy clustering approach, called Fuzzy C-Means (FCM), to form the initial regions (clusters) and adjusts initial clusters based on partial or fuzzy membership of each site to other clusters to form final clusters that meet the criteria of homogeneity, lack of discordancy, and sufficient size. We also estimate return periods using a bivariate copula method.

[1]  Mustafa M. Aral,et al.  Aquifer parameter and zone structure estimation using kernel-based fuzzy c-means clustering and genetic algorithm , 2007 .

[2]  N. Tase Area-deficit-intensity characteristics of droughts , 2007 .

[3]  C. De Michele,et al.  A Generalized Pareto intensity‐duration model of storm rainfall exploiting 2‐Copulas , 2003 .

[4]  Regional analysis of annual maximum and partial duration flood data by nonparametric and L-moment methods , 2000 .

[5]  D. Burn Evaluation of regional flood frequency analysis with a region of influence approach , 1990 .

[6]  D. Burn,et al.  The formation of groups for regional flood frequency analysis , 2000 .

[7]  J. Sickman,et al.  Effects of Urbanization on Organic Carbon Loads in the Sacramento River, California , 2007 .

[8]  T. Ouarda,et al.  Multivariate L‐moment homogeneity test , 2007 .

[9]  D. Burn,et al.  Detection of hydrologic trends and variability , 2002 .

[10]  Richard M. Vogel,et al.  Trends in floods and low flows in the United States: impact of spatial correlation , 2000 .

[11]  M. Sklar Fonctions de repartition a n dimensions et leurs marges , 1959 .

[12]  T. Kjeldsen,et al.  Regional partial duration series modelling of hydrological droughts in Zimbabwean rivers using a two-component exponential distribution , 1999 .

[13]  R. Serfling,et al.  A contribution to multivariate L-moments: L-comoment matrices , 2007 .

[14]  Donald H. Burn,et al.  Non-stationary pooled flood frequency analysis , 2003 .

[15]  V. Yevjevich,et al.  Analysis of drought characteristics by the theory of runs , 2007 .

[16]  J. R. Wallis,et al.  Some statistics useful in regional frequency analysis , 1993 .

[17]  Vijay P. Singh,et al.  IDF Curves Using the Frank Archimedean Copula , 2007 .

[18]  J. R. Wallis,et al.  Regional Frequency Analysis: An Approach Based on L-Moments , 1997 .