Indices for calibration data selection of the rainfall‐runoff model

[1] The identification of rainfall-runoff models requires selection of appropriate data for model calibration. Traditionally, hydrologists use rules of thumb to select a certain period of hydrological data to calibrate the models (i.e., 6 year data). There are no numerical indices to help hydrologists to quantitatively select the calibration data. There are two questions: how long should the calibration data be (e.g., 6 months), and from which period should the data be selected (e.g., which 6 month data should be selected)? In this study, some indices for the selection of calibration data with adequate lengths and appropriate durations are proposed by examining the spectral properties of data sequences before the calibration work. With the validation data determined beforehand, we assume that the more similarity the calibration data set bears to the validation set, the better should the performance of the rainfall-runoff model be after calibration. Three approaches are applied to reveal the similarity between the validation and calibration data sets: flow-duration curve, Fourier transform, and wavelet analysis. Data sets used for calibration are generated by designing three scenario groups with fixed lengths of 6, 12, and 24 months, respectively, from 8 year continuous observations in the Brue catchment of the United Kingdom. Scenarios in each group have different starting times and thus various durations with specific hydrological characteristics. With a predetermined 18 month validation set and the rainfall-runoff model chosen to be the probability distributed model, useful indices are produced for certain scenario groups by all three approaches. The information cost function, an entropy-like function based on the decomposition results of the discrete wavelet transform, is found to be the most effective index for the calibration data selection. The study demonstrates that the information content of the calibration data is more important than the data length; thus 6 month data may provide more useful information than longer data series. This is important for hydrological modelers since shorter and more useful data help hydrologists to build models more efficiently and effectively. The idea presented in this paper has also shown potential in enhancing the efficiency of calibration data utilization, especially for data-limited catchments.

[1]  Yuhui Shi,et al.  Particle swarm optimization: developments, applications and resources , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[2]  Kwok-wing Chau A split-step particle swarm optimization algorithm in river stage forecasting , 2007 .

[3]  Soroosh Sorooshian,et al.  Optimal use of the SCE-UA global optimization method for calibrating watershed models , 1994 .

[4]  A. Figliola,et al.  Analysis of physiological time series using wavelet transforms. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[5]  Soroosh Sorooshian,et al.  Toward improved calibration of hydrologic models: Multiple and noncommensurable measures of information , 1998 .

[6]  Michael E. Mann,et al.  Interannual Temperature Events and Shifts in Global Temperature: A ''Multiwavelet'' Correlation Approach , 2000 .

[7]  Kwok-wing Chau,et al.  Particle Swarm Optimization Training Algorithm for ANNs in Stage Prediction of Shing Mun River , 2006 .

[8]  Y. P. Li,et al.  Identification of relationship between sunspots and natural runoff in the Yellow River based on discrete wavelet analysis , 2009, Expert Syst. Appl..

[9]  L. Druyan,et al.  Relationships between rainfall and West African wave disturbances in station observations , 2003 .

[10]  Fuquan Wang,et al.  A remark on the mallat pyramidal algorithm of wavelet analysis wavelet analysis , 1997 .

[11]  Bryson C. Bates,et al.  Calibration of a modified SFB model for twenty-five Australian catchments using simulated annealing , 1997 .

[12]  Aldo Drago,et al.  Use of the wavelet transform on hydro-meteorological data , 2002 .

[13]  Margriet Nakken,et al.  Wavelet analysis of rainfall-runoff variability isolating climatic from anthropogenic patterns , 1999, Environ. Model. Softw..

[14]  T. Partal,et al.  Trend analysis in Turkish precipitation data , 2006 .

[15]  Richard M. Vogel,et al.  Flow‐Duration Curves. I: New Interpretation and Confidence Intervals , 1994 .

[16]  O. Penalba,et al.  Interdecadal and interannual variations of annual and extreme precipitation over central‐northeastern Argentina , 2004 .

[17]  Soroosh Sorooshian,et al.  The relationship between data and the precision of parameter estimates of hydrologic models , 1985 .

[18]  Walter Boughton,et al.  Calibrations of a daily rainfall-runoff model with poor quality data , 2006, Environ. Model. Softw..

[19]  S. Sorooshian,et al.  Evaluation of Maximum Likelihood Parameter estimation techniques for conceptual rainfall‐runoff models: Influence of calibration data variability and length on model credibility , 1983 .

[20]  François Anctil,et al.  Impact of the length of observed records on the performance of ANN and of conceptual parsimonious rainfall-runoff forecasting models , 2004, Environ. Model. Softw..

[21]  C. Perrin,et al.  Impact of limited streamflow data on the efficiency and the parameters of rainfall—runoff models , 2007 .

[22]  M. J. D. Powell,et al.  THE CONVERGENCE OF VARIABLE METRIC METHODS FOR NONLINEARLY CONSTRAINED OPTIMIZATION CALCULATIONS , 1978 .

[23]  Henrik Madsen,et al.  An evaluation of the impact of model structure on hydrological modelling uncertainty for streamflow simulation , 2004 .

[24]  R. Moore The PDM rainfall-runoff model , 2007 .

[25]  Alejandra Figliola,et al.  Time-frequency analysis of electroencephalogram series. III. Wavelet packets and information cost function , 1998 .

[26]  Allan L. Gutjahr,et al.  Cross‐correlated random field generation with the direct Fourier Transform Method , 1993 .

[27]  R. Moore The probability-distributed principle and runoff production at point and basin scales , 1985 .

[28]  Armando Brath,et al.  Analysis of the effects of different scenarios of historical data availability on the calibration of a spatially-distributed hydrological model , 2004 .

[29]  Zong-Liang Yang,et al.  Impacts of data length on optimal parameter and uncertainty estimation of a land surface model , 2004 .

[30]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[31]  S. Sorooshian,et al.  Effective and efficient global optimization for conceptual rainfall‐runoff models , 1992 .

[32]  N. C. Nigam Introduction to Random Vibrations , 1983 .

[33]  Quan J. Wang,et al.  Using genetic algorithms to optimise model parameters , 1997 .

[34]  Enoch M. Dlamini,et al.  Effects of model complexity and structure, data quality, and objective functions on hydrologic modeling , 1997 .

[35]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[36]  Q. J. Wang The Genetic Algorithm and Its Application to Calibrating Conceptual Rainfall-Runoff Models , 1991 .

[37]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[38]  Kieran M. O'Connor,et al.  Derivation of discretely coincident forms of continuous linear time-invariant models using the transfer function approach , 1982 .

[39]  Luis A. Bastidas,et al.  Multiobjective particle swarm optimization for parameter estimation in hydrology , 2006 .

[40]  D. Nagesh Kumar,et al.  Multi‐objective particle swarm optimization for generating optimal trade‐offs in reservoir operation , 2007 .

[41]  Shih-Ping Han A globally convergent method for nonlinear programming , 1975 .

[42]  James C. I. Dooge,et al.  Linear Theory of Hydrologic Systems , 1973 .

[43]  Bo Liu,et al.  Improved particle swarm optimization combined with chaos , 2005 .

[44]  Kieran M. O'Connor,et al.  Comparative assessment of six automatic optimization techniques for calibration of a conceptual rainfall—runoff model , 2007 .

[45]  M. J. D. Powell,et al.  A fast algorithm for nonlinearly constrained optimization calculations , 1978 .

[46]  S. Sorooshian,et al.  The Automatic Calibration of Conceptual Catchment Models Using Derivative‐Based Optimization Algorithms , 1985 .

[47]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[48]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  Laurence C. Smith,et al.  Stream flow characterization and feature detection using a discrete wavelet transform , 1998 .

[50]  J. Kulkarni,et al.  Wavelet analysis of the association between the Southern Oscillation and the Indian summer monsoon , 2000 .

[51]  C. Torrence,et al.  A Practical Guide to Wavelet Analysis. , 1998 .

[52]  J. Harlin Development of a Process Oriented Calibration Scheme for the HBV Hydrological Model , 1991 .

[53]  C. Lucas,et al.  Automatic Calibration of Lumped Conceptual Rainfall-Runoff Model Using Particle Swarm Optimization , 2008 .

[54]  S. Sorooshian,et al.  Automatic calibration of conceptual rainfall-runoff models: sensitivity to calibration data , 1996 .

[55]  T. Gan,et al.  Automatic Calibration of Conceptual Rainfall-Runoff Models: Optimization Algorithms, Catchment Conditions, and Model Structure , 1996 .

[56]  Peter R. Waylen,et al.  A stochastic model of flow duration curves , 1993 .