An adaptive sparse‐grid high‐order stochastic collocation method for Bayesian inference in groundwater reactive transport modeling

[1] Bayesian analysis has become vital to uncertainty quantification in groundwater modeling, but its application has been hindered by the computational cost associated with numerous model executions required by exploring the posterior probability density function (PPDF) of model parameters. This is particularly the case when the PPDF is estimated using Markov Chain Monte Carlo (MCMC) sampling. In this study, a new approach is developed to improve the computational efficiency of Bayesian inference by constructing a surrogate of the PPDF, using an adaptive sparse-grid high-order stochastic collocation (aSG-hSC) method. Unlike previous works using first-order hierarchical basis, this paper utilizes a compactly supported higher-order hierarchical basis to construct the surrogate system, resulting in a significant reduction in the number of required model executions. In addition, using the hierarchical surplus as an error indicator allows locally adaptive refinement of sparse grids in the parameter space, which further improves computational efficiency. To efficiently build the surrogate system for the PPDF with multiple significant modes, optimization techniques are used to identify the modes, for which high-probability regions are defined and components of the aSG-hSC approximation are constructed. After the surrogate is determined, the PPDF can be evaluated by sampling the surrogate system directly without model execution, resulting in improved efficiency of the surrogate-based MCMC compared with conventional MCMC. The developed method is evaluated using two synthetic groundwater reactive transport models. The first example involves coupled linear reactions and demonstrates the accuracy of our high-order hierarchical basis approach in approximating high-dimensional posteriori distribution. The second example is highly nonlinear because of the reactions of uranium surface complexation, and demonstrates how the iterative aSG-hSC method is able to capture multimodal and non-Gaussian features of PPDF caused by model nonlinearity. Both experiments show that aSG-hSC is an effective and efficient tool for Bayesian inference.

[1]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[3]  Y. Rubin,et al.  A Bayesian approach for inverse modeling, data assimilation, and conditional simulation of spatial random fields , 2010 .

[4]  Liangsheng Shi,et al.  Qualification of Uncertainty for Simulating Solute Transport in the Heterogeneous Media with Sparse Grid Collocation Method , 2009 .

[5]  Karen Willcox,et al.  Surrogate Modeling for Uncertainty Assessment with Application to Aviation Environmental System Models , 2010 .

[6]  P. Kitanidis,et al.  Parameter estimation in nonlinear environmental problems , 2010 .

[7]  L. Shawn Matott,et al.  Evaluating uncertainty in integrated environmental models: A review of concepts and tools , 2009 .

[8]  Ashish Sharma,et al.  Hydrological model selection: A Bayesian alternative , 2005 .

[9]  Alexandre M. Tartakovsky,et al.  Numerical Studies of Three-dimensional Stochastic Darcy’s Equation and Stochastic Advection-Diffusion-Dispersion Equation , 2010, J. Sci. Comput..

[10]  G. Ferguson,et al.  Ground surface paleotemperature reconstruction using information measures and empirical Bayes , 2006 .

[11]  Erich Novak,et al.  High dimensional polynomial interpolation on sparse grids , 2000, Adv. Comput. Math..

[12]  Y. Rubin,et al.  Bayesian Method for hydrogeological site characterization using borehole and geophysical survey data: Theory and application to the Lawrence Livermore National Laboratory Superfund Site , 1999 .

[13]  Liangsheng Shi,et al.  Probabilistic collocation method for unconfined flow in heterogeneous media. , 2009 .

[14]  D. Higdon,et al.  Accelerating Markov Chain Monte Carlo Simulation by Differential Evolution with Self-Adaptive Randomized Subspace Sampling , 2009 .

[15]  Ming Ye,et al.  Analysis of regression confidence intervals and Bayesian credible intervals for uncertainty quantification , 2012 .

[16]  Nicholas Zabaras,et al.  An efficient Bayesian inference approach to inverse problems based on an adaptive sparse grid collocation method , 2009 .

[17]  Yoram Rubin,et al.  On minimum relative entropy concepts and prior compatibility issues in vadose zone inverse and forward modeling , 2005 .

[18]  Fabio Nobile,et al.  A Sparse Grid Stochastic Collocation Method for Partial Differential Equations with Random Input Data , 2008, SIAM J. Numer. Anal..

[19]  Keith Beven,et al.  Informal likelihood measures in model assessment: Theoretic development and investigation , 2008 .

[20]  Andreas Kemna,et al.  Estimating the spatiotemporal distribution of geochemical parameters associated with biostimulation using spectral induced polarization data and hierarchical Bayesian models , 2012 .

[21]  S. P. Neuman,et al.  Multimodel Bayesian analysis of data-worth applied to unsaturated fractured tuffs , 2012 .

[22]  David J. Nott,et al.  Generalized likelihood uncertainty estimation (GLUE) and approximate Bayesian computation: What's the connection? , 2012 .

[23]  Dongxiao Zhang,et al.  A sparse grid based Bayesian method for contaminant source identification , 2012 .

[24]  Michael Griebel,et al.  Adaptive sparse grid multilevel methods for elliptic PDEs based on finite differences , 1998, Computing.

[25]  Heikki Haario,et al.  DRAM: Efficient adaptive MCMC , 2006, Stat. Comput..

[26]  S. P. Neuman,et al.  On model selection criteria in multimodel analysis , 2007 .

[27]  S. P. Neuman,et al.  Maximum likelihood Bayesian averaging of uncertain model predictions , 2003 .

[28]  J. Vrugt,et al.  A formal likelihood function for parameter and predictive inference of hydrologic models with correlated, heteroscedastic, and non‐Gaussian errors , 2010 .

[29]  Jens Christian Refsgaard,et al.  Review of strategies for handling geological uncertainty in groundwater flow and transport modeling , 2012 .

[30]  Åke Björck,et al.  The calculation of linear least squares problems , 2004, Acta Numerica.

[31]  Daniel M. Tartakovsky,et al.  Assessment and management of risk in subsurface hydrology: A review and perspective , 2013 .

[32]  Daniel M. Tartakovsky,et al.  Uncertainty quantification via random domain decomposition and probabilistic collocation on sparse grids , 2010, J. Comput. Phys..

[33]  Bryan A. Tolson,et al.  Review of surrogate modeling in water resources , 2012 .

[34]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[35]  Chunmiao Zheng,et al.  MMA: A Computer Code for Multimodel Analysis , 2010 .

[36]  Dirk Pflüger,et al.  Spatially Adaptive Sparse Grids for High-Dimensional Problems , 2010 .

[37]  James N. Petersen,et al.  Development of analytical solutions for multispecies transport with serial and parallel reactions , 1999 .

[38]  Ming Ye,et al.  Maximum likelihood Bayesian averaging of spatial variability models in unsaturated fractured tuff , 2003 .

[39]  S. P. Neuman,et al.  Bayesian analysis of data-worth considering model and parameter uncertainties , 2012 .

[40]  Raul Tempone,et al.  An anisotropic sparse grid stochastic collocation method for elliptic partial differential equations with random input data , 2007 .

[41]  Ming Ye,et al.  Comparing Nonlinear Regression and Markov Chain Monte Carlo Methods for Assessment of Prediction Uncertainty in Vadose Zone Modeling , 2012 .

[42]  Ming Ye,et al.  Comment on “Inverse groundwater modeling for hydraulic conductivity estimation using Bayesian model averaging and variance window” by Frank T.‐C. Tsai and Xiaobao Li , 2010 .

[43]  Ming Ye,et al.  MMA: A Computer Code for Multimodel Analysis , 2010 .

[44]  Mary C. Hill,et al.  MMA, A Computer Code for Multi-Model Analysis , 2014 .

[45]  Fabio Nobile,et al.  An Anisotropic Sparse Grid Stochastic Collocation Method for Partial Differential Equations with Random Input Data , 2008, SIAM J. Numer. Anal..

[46]  Barbara I. Wohlmuth,et al.  Algorithm 847: Spinterp: piecewise multilinear hierarchical sparse grid interpolation in MATLAB , 2005, TOMS.

[47]  C. Tiedeman,et al.  Effective Groundwater Model Calibration , 2007 .

[48]  Guang Lin,et al.  An efficient, high-order probabilistic collocation method on sparse grids for three-dimensional flow and solute transport in randomly heterogeneous porous media , 2009 .

[49]  Habib N. Najm,et al.  Stochastic spectral methods for efficient Bayesian solution of inverse problems , 2005, J. Comput. Phys..

[50]  L. Shawn Matott,et al.  Calibration of complex subsurface reaction models using a surrogate-model approach , 2008 .

[51]  Matthias Kohler,et al.  Experimental Investigation and Modeling of Uranium (VI) Transport Under Variable Chemical Conditions , 1996 .

[52]  C. D. Perttunen,et al.  Lipschitzian optimization without the Lipschitz constant , 1993 .

[53]  Q. Kang,et al.  Optimization and uncertainty assessment of strongly nonlinear groundwater models with high parameter dimensionality , 2010 .

[54]  Keith Beven,et al.  The future of distributed models: model calibration and uncertainty prediction. , 1992 .

[55]  Mary C. Hill,et al.  UCODE_2005 and six other computer codes for universal sensitivity analysis, calibration, and uncertainty evaluation constructed using the JUPITER API , 2006 .

[56]  Y. Marzouk,et al.  A stochastic collocation approach to Bayesian inference in inverse problems , 2009 .

[57]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[58]  T. Ulrych,et al.  A full‐Bayesian approach to the groundwater inverse problem for steady state flow , 2000 .

[59]  Peter K. Kitanidis,et al.  Generalized priors in Bayesian inversion problems , 2012 .

[60]  S. E. Ahmed,et al.  Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference , 2008, Technometrics.

[61]  B. Renard,et al.  A Bayesian hierarchical approach to regional frequency analysis , 2011 .

[62]  Allan D. Woodbury,et al.  Minimum relative entropy, Bayes and Kapur , 2011 .

[63]  Tyler Smith,et al.  Development of a formal likelihood function for improved Bayesian inference of ephemeral catchments , 2010 .

[64]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[65]  Guannan Zhang,et al.  An Adaptive Wavelet Stochastic Collocation Method for Irregular Solutions of Partial Differential Equations with Random Input Data , 2014 .

[66]  Cajo J. F. ter Braak,et al.  Treatment of input uncertainty in hydrologic modeling: Doing hydrology backward with Markov chain Monte Carlo simulation , 2008 .

[67]  Haibin Chang,et al.  A comparative study of numerical approaches to risk assessment of contaminant transport , 2010 .

[68]  George E. P. Box,et al.  Bayesian Inference in Statistical Analysis: Box/Bayesian , 1992 .

[69]  L. Matott,et al.  Calibration of subsurface batch and reactive-transport models involving complex biogeochemical processes , 2008 .

[70]  P. Kitanidis Parameter Uncertainty in Estimation of Spatial Functions: Bayesian Analysis , 1986 .

[71]  C. Appelo,et al.  PHT3D: A Reactive Multicomponent Transport Model for Saturated Porous Media , 2010, Ground water.

[72]  H. Bungartz,et al.  Sparse grids , 2004, Acta Numerica.

[73]  J. Beck,et al.  Bayesian Updating of Structural Models and Reliability using Markov Chain Monte Carlo Simulation , 2002 .

[74]  Dongxiao Zhang,et al.  Probabilistic collocation method for flow in porous media: Comparisons with other stochastic methods , 2007 .