Benchmarking and Process Diagnostics of Land Models

We propose a conceptual and theoretical foundation for information-based model benchmarking and process diagnostics that provides diagnostic insight into model performance and model realism. We benchmark against a bounded estimate of the information contained in model inputs to obtain a bounded estimate of information lost due to model error, and we perform process-level diagnostics by taking differences between modeled versus observed transfer entropy networks. We use this methodology to reanalyze the recent Protocol for the Analysis of Land Surface Models (PALS) Land Surface Model Benchmarking Evaluation Project (PLUMBER) land model intercomparison project that includes the following models: CABLE, CH-TESSEL, COLA-SSiB, ISBA-SURFEX, JULES, Mosaic, Noah, and ORCHIDEE. We report that these models (i) use only roughly half of the information available from meteorological inputs about observed surface energy fluxes, (ii) do not use all information from meteorological inputs about long-term Budyko-type water balances, (iii) do not capture spatial heterogeneities in surface processes, and (iv) all suffer from similar patterns of process-level structural error. Because the PLUMBER intercomparison project did not report model parameter values, it is impossible to know whether process-level error patterns are due to model structural error or parameter error, although our proposed information-theoretic methodology could distinguish between these two issues if parameter values were reported. We conclude that there is room for significant improvement to the current generation of land models and their parameters. We also suggest two simple guidelines to make future community-wide model evaluation and intercomparison experiments more informative.

[1]  Hoshin Vijai Gupta,et al.  Debates—the future of hydrological sciences: A (common) path forward? Using models and data to learn: A systems theoretic perspective on the future of hydrological science , 2014 .

[2]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[3]  Alfred O. Hero,et al.  Estimating epistemic and aleatory uncertainties during hydrologic modeling: An information theoretic approach , 2013 .

[4]  Daniel L. Childers,et al.  Seasonally varied controls of climate and phenophase on terrestrial carbon dynamics: modeling eco-climate system state using Dynamical Process Networks , 2015, Landscape Ecology.

[5]  Hoshin Vijai Gupta,et al.  Systematic Bias in Land Surface Models , 2007 .

[6]  Keith Beven,et al.  Facets of uncertainty: epistemic uncertainty, non-stationarity, likelihood, hypothesis testing, and communication , 2016 .

[7]  Milan Palus,et al.  Cross-Scale Interactions and Information Transfer , 2014, Entropy.

[8]  A. Pitman,et al.  Evaluating the Performance of Land Surface Models , 2008 .

[9]  P. Dirmeyer,et al.  The Plumbing of Land Surface Models: Benchmarking Model Performance , 2015 .

[10]  Ii.,et al.  CLIMATE AND THE OCEAN CIRCULATION , 1969 .

[11]  Hubert H. G. Savenije,et al.  Analytical derivation of the Budyko curve based on rainfall characteristics and a simple evaporation model , 2009 .

[12]  S. Weijs,et al.  Why hydrological predictions should be evaluated using information theory , 2010 .

[13]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[14]  J. Kirchner Getting the right answers for the right reasons: Linking measurements, analyses, and models to advance the science of hydrology , 2006 .

[15]  Sujay V. Kumar,et al.  Benchmarking NLDAS-2 Soil Moisture and Evapotranspiration to Separate Uncertainty Contributions. , 2016, Journal of hydrometeorology.

[16]  Lu Zhang,et al.  Predicting the effect of vegetation changes on catchment average water balance , 1999 .

[17]  Dmitri Kavetski,et al.  The structure for unifying multiple modeling alternatives (SUMMA), Version 1.0: Technical Description , 2015 .

[18]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[19]  Jim Freer,et al.  Towards a limits of acceptability approach to the calibration of hydrological models : Extending observation error , 2009 .

[20]  R. Stouffer,et al.  Stationarity Is Dead: Whither Water Management? , 2008, Science.

[21]  Kevin H. Knuth,et al.  Lattice duality: The origin of probability and entropy , 2013, Neurocomputing.

[22]  H. Gupta,et al.  Correcting the mathematical structure of a hydrological model via Bayesian data assimilation , 2011 .

[23]  Praveen Kumar,et al.  Information Driven Ecohydrologic Self-Organization , 2010, Entropy.

[24]  Liam Paninski,et al.  Estimation of Entropy and Mutual Information , 2003, Neural Computation.

[25]  Ming Ye,et al.  Towards a comprehensive assessment of model structural adequacy , 2012 .

[26]  Hoshin Vijai Gupta,et al.  Large-sample hydrology: a need to balance depth with breadth , 2013 .

[27]  Gabriel Abramowitz,et al.  Towards a benchmark for land surface models , 2005 .

[28]  Praveen Kumar,et al.  Ecohydrologic process networks: 1. Identification , 2009 .

[29]  Francesco Paesani,et al.  Getting the Right Answers for the Right Reasons: Toward Predictive Molecular Simulations of Water with Many-Body Potential Energy Functions. , 2016, Accounts of chemical research.

[30]  Hoshin Vijai Gupta,et al.  Ensembles vs. information theory: supporting science under uncertainty , 2018, Frontiers of Earth Science.

[31]  Hoshin Vijai Gupta,et al.  The quantity and quality of information in hydrologic models , 2015 .

[32]  S. Manabe CLIMATE AND THE OCEAN CIRCULATION1 , 1969 .

[33]  J. Kinney,et al.  Equitability, mutual information, and the maximal information coefficient , 2013, Proceedings of the National Academy of Sciences.

[34]  P. Milly Climate, soil water storage, and the average annual water balance , 1994 .

[35]  Peter S. Eagleson,et al.  Climate, soil, and vegetation: 1. Introduction to water balance dynamics , 1978 .

[36]  Yudong Tian,et al.  Performance Metrics, Error Modeling, and Uncertainty Quantification , 2016 .

[37]  Gab Abramowitz,et al.  Towards a public, standardized, diagnostic benchmarking system for land surface models , 2012 .

[38]  Philippe Ciais,et al.  A framework for benchmarking land models , 2012 .