What do we do with model simulation crashes? Recommendations for global sensitivity analysis of earth and environmental systems models

Abstract. Complex, software-intensive, technically advanced, and computationally demanding models, presumably with ever-growing realism and fidelity, have been widely used to simulate and predict the dynamics of the Earth and environmental systems. The parameter-induced simulation crash (failure) problem is typical across most of these models, despite considerable efforts that modellers have directed at model development and implementation over the last few decades. A simulation failure mainly occurs due to the violation of the numerical stability conditions, non-robust numerical implementations, or errors in programming. However, the existing sampling-based analysis techniques such as global sensitivity analysis (GSA) methods, which require running these models under many configurations of parameter values, are ill-equipped to effectively deal with model failures. To tackle this problem, we propose a novel approach that allows users to cope with failed designs (samples) during the GSA, without knowing where they took place and without re-running the entire experiment. This approach deems model crashes as missing data and uses strategies such as median substitution, single nearest neighbour, or response surface modelling to fill in for model crashes. We test the proposed approach on a 10-paramter HBV-SASK rainfall-runoff model and a 111-parameter MESH land surface-hydrology model. Our results show that response surface modelling is a superior strategy, out of the data filling strategies tested, and can scale well to the dimensionality of the model, sample size, and the ratio of number of failures to the sample size. Further, we conduct a "failure analysis" and discuss some possible causes of the MESH model failure.

[1]  Saman Razavi,et al.  Multicriteria sensitivity analysis as a diagnostic tool for understanding model behaviour and characterizing model uncertainty , 2017 .

[2]  D. Charles-Edwards,et al.  An Analysis of the Effects of Repeated Short-Term Soil Water Deficits on Stomatal Conductance to Carbon Dioxide and Leaf Photosynthesis by the Legume Macroptilium atropurpureum Cv. Siratro , 1981 .

[3]  D. Verseghy,et al.  CLASS-A Canadian Land Surface Scheme for GCMs , 1993 .

[4]  Thorsten Wagener,et al.  Identifiability of transient storage model parameters along a mountain stream , 2013 .

[5]  Ginés Rubio,et al.  Global and local modelling in RBF networks , 2011, Neurocomputing.

[6]  H. Gupta,et al.  A new framework for comprehensive, robust, and efficient global sensitivity analysis: 2. Application , 2016 .

[7]  Paola Annoni,et al.  Sixth International Conference on Sensitivity Analysis of Model Output How to avoid a perfunctory sensitivity analysis , 2010 .

[8]  Saman Razavi,et al.  Global sensitivity analysis for high-dimensional problems: How to objectively group factors and measure robustness and convergence while reducing computational cost , 2019, Environ. Model. Softw..

[9]  V. Singh,et al.  Mathematical models of small watershed hydrology and applications. , 2002 .

[10]  Saman Razavi,et al.  Progressive Latin Hypercube Sampling: An efficient approach for robust sampling-based analysis of environmental models , 2017, Environ. Model. Softw..

[11]  Saman Razavi,et al.  Revisiting the Basis of Sensitivity Analysis for Dynamical Earth System Models , 2018, Water Resources Research.

[12]  Bryan A. Tolson,et al.  Reducing the computational cost of automatic calibration through model preemption , 2010 .

[13]  Andrei P. Sokolov,et al.  Estimating Probability Distributions from Complex Models with Bifurcations: The Case of Ocean Circulation Collapse , 2007 .

[14]  Alain Pietroniro,et al.  Grouped Response Units for Distributed Hydrologic Modeling , 1993 .

[15]  Marc B. Neumann,et al.  Global sensitivity analysis for urban water quality modelling: terminology, convergence and comparison of different methods. , 2015 .

[16]  Saman Razavi,et al.  What do we mean by sensitivity analysis? The need for comprehensive characterization of “global” sensitivity in Earth and Environmental systems models , 2015 .

[17]  Ronald E. McRoberts,et al.  Diagnostic tools for nearest neighbors techniques when used with satellite imagery , 2009 .

[18]  Achille Messac,et al.  Metamodeling using extended radial basis functions: a comparative approach , 2006, Engineering with Computers.

[19]  Robert Marsh,et al.  Uncertainties due to transport-parameter sensitivity in an efficient 3-D ocean-climate model , 2005 .

[20]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[21]  Alfred Stein,et al.  Bayesian integration of flux tower data into a process-based simulator for quantifying uncertainty in simulated output , 2016 .

[22]  Saman Razavi,et al.  Enhanced identification of a hydrologic model using streamflow and satellite water storage data: A multicriteria sensitivity analysis and optimization approach , 2017 .

[23]  Dmitri Kavetski,et al.  Ancient numerical daemons of conceptual hydrological modeling: 1. Fidelity and efficiency of time stepping schemes , 2010 .

[24]  Florian Pappenberger,et al.  Multi-method global sensitivity analysis of flood inundation models. , 2008 .

[25]  Lorenzo Beretta,et al.  Nearest neighbor imputation algorithms: a critical evaluation , 2016, BMC Medical Informatics and Decision Making.

[26]  Philip Marsh,et al.  Diagnosis of the hydrology of a small Arctic basin at the tundra-taiga transition using a physically based hydrological model , 2017 .

[27]  Donald R. Jones,et al.  A Taxonomy of Global Optimization Methods Based on Response Surfaces , 2001, J. Glob. Optim..

[28]  Bryan A. Tolson,et al.  Review of surrogate modeling in water resources , 2012 .