Robust informational entropy-based descriptors of flow in catchment hydrology

Abstract This paper explores the use of entropy-based measures in catchment hydrology, and provides an importance-weighted numerical descriptor of the flow–duration curve. Although entropy theory is being applied in a wide spectrum of areas (including environmental and water resources), artefacts arising from the discrete, under-sampled and uncertain nature of hydrological data are rarely acknowledged, and have not been adequately explored. Here, we examine challenges to extracting hydrologically meaningful entropy measures from a flow signal; the effect of binning resolution on calculation of entropy is investigated, along with artefacts caused by (1) emphasis of information theoretic measures towards flow ranges having more data (statistically dominant information), and (2) effects of discharge measurement truncation errors. We introduce an importance-weighted entropy-based measure to counter the tendency of common binning approaches to over-emphasise information contained in the low flows which dominate the record. The measure uses a novel binning method, and overcomes artefacts due to data resolution and under-sampling. Our analysis reveals a fundamental problem with the extraction of information at high flows, due to the lack of statistically significant samples in this range. By separating the flow–duration curve into segments, our approach constrains the computed entropy to better respect distributional properties over the data range. When used as an objective function for model calibration, this approach constrains high flow predictions, as well as the commonly used Nash-Sutcliffe efficiency, but provides much better predictions of low flow behaviour. Editor Z.W. Kundzewicz Associate editor Not assigned

[1]  R. Woods,et al.  Catchment Classification and Hydrologic Similarity , 2006, Geography Compass.

[2]  R. K. Kachroo,et al.  Nonparametric kernel estimation of low flow quantiles , 1996 .

[3]  Harold T. Johnston,et al.  The use of flow duration curves as a data quality tool , 2003 .

[4]  Kevin H. Knuth,et al.  Lattice duality: The origin of probability and entropy , 2013, Neurocomputing.

[5]  Kevin R. Wheeler,et al.  Identifying Excessively Rounded or Truncated Data , 2006 .

[6]  J. McDonnell,et al.  Constraining dynamic TOPMODEL responses for imprecise water table information using fuzzy rule based performance measures , 2004 .

[7]  Vijay P. Singh,et al.  An entropy-based investigation into the variability of precipitation , 2009 .

[8]  B. Martin PARAMETER ESTIMATION , 2012, Statistical Methods for Biomedical Research.

[9]  F. Serinaldi Analytical confidence intervals for index flow flow duration curves , 2011 .

[10]  Tom G. Chapman,et al.  Entropy as a measure of hydrologic data uncertainty and model performance , 1986 .

[11]  Praveen Kumar,et al.  Ecohydrologic process networks: 1. Identification , 2009 .

[12]  Marc Parlange,et al.  HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data , 2013, Entropy.

[13]  Martyn P. Clark,et al.  Hydrological field data from a modeller's perspective: Part 1. Diagnostic tests for model structure , 2011 .

[14]  Demetris Koutsoyiannis A random walk on water , 2009 .

[15]  V. Singh,et al.  The entropy theory as a tool for modelling and decision-making in environmental and water resources. , 2000 .

[16]  Kevin R. Wheeler,et al.  Revealing Relationships among Relevant Climate Variables with Information Theory , 2013, 1311.4632.

[17]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[18]  Demetris Koutsoyiannis,et al.  Uncertainty, entropy, scaling and hydrological stochastics. 2. Time dependence of hydrological processes and time scaling / Incertitude, entropie, effet d'échelle et propriétés stochastiques hydrologiques. 2. Dépendance temporelle des processus hydrologiques et échelle temporelle , 2005 .

[19]  N. Brunsell A multiscale information theory approach to assess spatial-temporal variability of daily precipitation , 2010 .

[20]  Sevinç Özkul Entropy-Based Assessment of Water Quality Monitoring Networks , 2001 .

[21]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[22]  Osama Z. Al-Hamdan,et al.  Soil Moisture Profile Development from Surface Observations by Principle of Maximum Entropy , 2010 .

[23]  R. Moore The probability-distributed principle and runoff production at point and basin scales , 1985 .

[24]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[25]  Dirk P. Kroese,et al.  Kernel density estimation via diffusion , 2010, 1011.2602.

[26]  Max A. Viergever,et al.  Mutual information matching and interpolation artifacts , 1999, Medical Imaging.

[27]  Hoshin Vijai Gupta,et al.  Using an informational entropy-based metric as a diagnostic of flow duration to drive model parameter identification , 2012 .

[28]  Vijay P. Singh,et al.  Entropy theory for movement of moisture in soils , 2010 .

[29]  Jim Freer,et al.  Towards a limits of acceptability approach to the calibration of hydrological models : Extending observation error , 2009 .

[30]  A. Brath,et al.  Assessing the effectiveness of hydrological similarity measures for flood frequency analysis , 2001 .

[31]  Abubakr Gafar Abdalla,et al.  Probability Theory , 2017, Encyclopedia of GIS.

[32]  Marc B. Parlange,et al.  Data compression to define information content of hydrological time series , 2013 .

[33]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[34]  Murugesu Sivapalan,et al.  Climate, soil, and vegetation controls upon the variability of water balance in temperate and semiarid landscapes: Downward approach to water balance analysis , 2003 .

[35]  T. McMahon,et al.  Experimental Design and Initial Results from the Mahurangi River Variability Experiment: MARVEX , 2013 .

[36]  Donald H. Burn,et al.  An entropy approach to data collection network design , 1994 .

[37]  H. Lange,et al.  Classification of Runoff in Headwater Catchments: A Physical Problem? , 2008 .

[38]  V. Singh,et al.  THE USE OF ENTROPY IN HYDROLOGY AND WATER RESOURCES , 1997 .

[39]  Attilio Castellarin,et al.  Regional flow-duration curves: reliability for ungauged basins , 2004 .

[40]  Demetris Koutsoyiannis,et al.  Uncertainty, entropy, scaling and hydrological stochastics. 1. Marginal distributional properties of hydrological processes and state scaling / Incertitude, entropie, effet d'échelle et propriétés stochastiques hydrologiques. 1. Propriétés distributionnelles marginales des processus hydrologiques et , 2005 .

[41]  W. Bastiaan Kleijn,et al.  On the Estimation of Differential Entropy From Data Located on Embedded Manifolds , 2007, IEEE Transactions on Information Theory.

[42]  Luca G. Lanza,et al.  Correlation patterns and information flows in rainfall fields , 2006 .

[43]  A multivariate non‐parametric model for synthetic generation of daily streamflow , 2007 .

[44]  Vijay P. Singh,et al.  Parameter estimation for 3-parameter generalized pareto distribution by the principle of maximum entropy (POME) , 1995 .

[45]  Yuqiong Liu,et al.  Reconciling theory with observations: elements of a diagnostic approach to model evaluation , 2008 .

[46]  R. Moore The PDM rainfall-runoff model , 2007 .

[47]  H. McMillanb,et al.  The Use of Entropy as a Model Diagnostic in Rainfall-Runoff Modelling , 2010 .

[48]  Ioannis A. Niadas,et al.  Regional flow duration curve estimation in small ungauged catchments using instantaneous flow measurements and a censored data approach , 2005 .

[49]  Michel Verleysen,et al.  Information-theoretic feature selection for functional data classification , 2009, Neurocomputing.

[50]  V. Singh,et al.  Entropy theory for derivation of infiltration equations , 2010 .

[51]  Maya R. Gupta,et al.  Bayesian estimation of the entropy of the multivariate Gaussian , 2008, 2008 IEEE International Symposium on Information Theory.

[52]  Keith Beven,et al.  Calibration of hydrological models using flow-duration curves , 2010 .

[53]  A. H. Murphy,et al.  Skill Scores Based on the Mean Square Error and Their Relationships to the Correlation Coefficient , 1988 .

[54]  R. Woods The impact of spatial scale on spatial variability in hydrologic response: experiments and ideas. , 2004 .

[55]  S. Weijs,et al.  Why hydrological predictions should be evaluated using information theory , 2010 .

[56]  Vijay P. Singh,et al.  Application of Information Theory to Groundwater Quality Monitoring Networks , 2002 .

[57]  Pao-Shan Yu,et al.  Using synthetic flow duration curves for rainfall–runoff model calibration at ungauged sites , 2000 .

[58]  K. Beven Rainfall-Runoff Modelling: The Primer , 2012 .

[59]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[60]  Yakov A. Pachepsky,et al.  Information content and complexity of simulated soil water fluxes , 2006 .

[61]  J. N. Kapur Maximum-entropy models in science and engineering , 1992 .

[62]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[63]  J. Larson Can we define climate using information theory , 2010 .

[64]  Gordon Blower,et al.  Nonlinear kernel density estimation for binned data: convergence in entropy , 2002 .

[65]  N. McIntyre,et al.  Catchment scale hydrological modelling: a review of model types, calibration approaches and uncertainty analysis methods in the context of recent developments in technology and applications , 2011 .

[66]  Hoshin Vijai Gupta,et al.  On the ability to infer spatial catchment variability using streamflow hydrographs , 2011 .

[67]  Regional analysis of annual maximum and partial duration flood data by nonparametric and L-moment methods , 2000 .

[68]  J. Amorocho,et al.  Entropy in the assessment of uncertainty in hydrologic systems and models , 1973 .

[69]  M. Sivapalan,et al.  Improving model structure and reducing parameter uncertainty in conceptual water balance models through the use of auxiliary data , 2007 .

[70]  Florian Pappenberger,et al.  Impacts of uncertain river flow data on rainfall‐runoff model calibration and discharge predictions , 2010 .

[71]  H. S. Wheater,et al.  Calibration of the semi-distributed PDM rainfall-runoff model in the Upper Lee catchment, UK , 2010 .

[72]  Vijay P. Singh,et al.  A real-time flood forecasting model based on maximum-entropy spectral analysis: II. Application , 1993 .

[73]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[74]  Demetris Koutsoyiannis,et al.  HESS Opinions "A random walk on water" , 2009 .