ANOVA-based transformed probabilistic collocation method for Bayesian data-worth analysis

Abstract Bayesian theory provides a coherent framework in quantifying the data worth of measurements and estimating unknown parameters. Nevertheless, one common problem in Bayesian methods is the considerably high computational cost since a large number of model evaluations is required in the likelihood evaluation. To address this issue, a new surrogate modeling method, i.e., ANOVA (analysis of variance)-based transformed probabilistic collocation method (ATPCM), is developed in this work. To cope with the strong nonlinearity, the model responses are transformed to the arrival times, which are then approximated with a set of low-order ANOVA components. The validity of the proposed method is demonstrated by synthetic numerical cases involving water and heat transport in the vadose zone. It is shown that, the ATPCM is more efficient than the existing surrogate modeling methods (e.g., PCM, ANOVA-based PCM and TPCM). At a very low computational cost, the ATPCM-based Bayesian data-worth analysis provides a quantitative metric in comparing different monitoring plans, and helps to improve the parameter estimation. Although the flow and heat transport in vadose zone is considered in this work, the proposed method can be equally applied in any other hydrologic problems.

[1]  D. Xiu,et al.  Modeling uncertainty in flow simulations via generalized polynomial chaos , 2003 .

[2]  Robert Horton,et al.  Soil Heat and Water Flow With a Partial Surface Mulch , 1987 .

[3]  Cajo J. F. ter Braak,et al.  Treatment of input uncertainty in hydrologic modeling: Doing hydrology backward with Markov chain Monte Carlo simulation , 2008 .

[4]  Y. Mualem A New Model for Predicting the Hydraulic Conductivity , 1976 .

[5]  G. Rau,et al.  Heat as a tracer to quantify processes and properties in the vadose zone: A review , 2016 .

[6]  Van Genuchten,et al.  A closed-form equation for predicting the hydraulic conductivity of unsaturated soils , 1980 .

[7]  Cheng Chen,et al.  Efficient Bayesian experimental design for contaminant source identification , 2015, Water Resources Research.

[8]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[9]  Jasper A. Vrugt,et al.  High‐dimensional posterior exploration of hydrologic models using multiple‐try DREAM(ZS) and high‐performance computing , 2012 .

[10]  Marios Sophocleous,et al.  Analysis of water and heat flow in unsaturated‐saturated porous media , 1979 .

[11]  S. P. Neuman,et al.  Bayesian analysis of data-worth considering model and parameter uncertainties , 2012 .

[12]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[13]  J. R. Philip,et al.  Moisture movement in porous materials under temperature gradients , 1957 .

[14]  Dongxiao Zhang,et al.  Stochastic analysis of unsaturated flow with probabilistic collocation method , 2009 .

[15]  Yoram Rubin,et al.  Bayesian inversion of Mualem‐van Genuchten parameters in a multilayer soil profile: A data‐driven, assumption‐free likelihood function , 2015 .

[16]  Xiang Ma,et al.  An adaptive high-dimensional stochastic model representation technique for the solution of stochastic partial differential equations , 2010, J. Comput. Phys..

[17]  Cajo J. F. ter Braak,et al.  Differential Evolution Markov Chain with snooker updater and fewer chains , 2008, Stat. Comput..

[18]  Qiang Zheng,et al.  A probabilistic collocation based iterative Kalman filter for landfill data assimilation , 2017 .

[19]  Miroslav Šejna,et al.  Development and Applications of the HYDRUS and STANMOD Software Packages and Related Codes , 2008 .

[20]  Menner A. Tatang,et al.  An efficient method for parametric uncertainty analysis of numerical geophysical models , 1997 .

[21]  R. M. Cotta,et al.  Bayesian estimation of the hydraulic and solute transport properties of a small-scale unsaturated soil column , 2016 .

[22]  Dongxiao Zhang,et al.  Probabilistic collocation method for strongly nonlinear problems: 3. Transform by time , 2016 .

[23]  Xiu Yang,et al.  Adaptive ANOVA decomposition of stochastic incompressible and compressible flows , 2012, J. Comput. Phys..

[24]  Alexander Y. Sun,et al.  Model Calibration and Parameter Estimation: For Environmental and Water Resource Systems , 2015 .

[25]  Dongxiao Zhang,et al.  Probabilistic collocation method for strongly nonlinear problems: 1. Transform by location , 2013 .

[26]  J. Vrugt,et al.  Inverse Modeling of Subsurface Flow and Transport Properties: A Review with New Developments , 2008 .

[27]  H. Rabitz,et al.  High Dimensional Model Representations , 2001 .

[28]  J. Simunek,et al.  UNSATCHEM-2D code for simulating two-dimensional variably saturated water flow, heat transport, carbon dioxide production and transport, and multicomponent solute transport with major ion equilibrium and kinetic chemistry , 1993 .

[29]  S. Gorelick,et al.  When enough is enough: The worth of monitoring data in aquifer remediation design , 1994 .

[30]  Wolfgang Nowak,et al.  Bayesian assessment of the expected data impact on prediction confidence in optimal sampling design , 2012 .

[31]  D. Xiu Numerical Methods for Stochastic Computations: A Spectral Method Approach , 2010 .

[32]  Wolfgang Nowak,et al.  Uncertainty and data worth analysis for the hydraulic design of funnel‐and‐gate systems in heterogeneous aquifers , 2004 .

[33]  J. Šimůnek,et al.  Parameter estimation of soil hydraulic and thermal property functions for unsaturated porous media using the HYDRUS-2D code , 2014 .

[34]  Dongxiao Zhang,et al.  Probabilistic collocation method for strongly nonlinear problems: 2. Transform by displacement , 2014 .

[35]  J. Vrugt,et al.  On the value of soil moisture measurements in vadose zone hydrology: A review , 2008 .

[36]  Xun Huan,et al.  Simulation-based optimal Bayesian experimental design for nonlinear systems , 2011, J. Comput. Phys..

[37]  Marianthi G. Ierapetritou,et al.  A comparative assessment of efficient uncertainty analysis techniques for environmental fate and transport models: application to the FACT model , 2005 .

[38]  Lingzao Zeng,et al.  Data assimilation for unsaturated flow models with restart adaptive probabilistic collocation based Kalman filter , 2016 .

[39]  Herschel Rabitz,et al.  Global uncertainty assessments by high dimensional model representations (HDMR) , 2002 .

[40]  Dongxiao Zhang,et al.  An efficient, high-order perturbation approach for flow in random porous media via Karhunen-Loève and polynomial expansions , 2004 .

[41]  Wolfgang Nowak,et al.  Entropy-Based Experimental Design for Optimal Model Discrimination in the Geosciences , 2016, Entropy.

[42]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[43]  Y. Rubin,et al.  Bayesian geostatistical design: Task‐driven optimal site investigation when the geostatistical model is uncertain , 2010 .

[44]  Guang Lin,et al.  An adaptive ANOVA-based PCKF for high-dimensional nonlinear inverse modeling , 2014, J. Comput. Phys..

[45]  Jim Constantz,et al.  Temperature Dependence of Unsaturated Hydraulic Conductivity of Two Soils , 1982 .

[46]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[47]  Dongxiao Zhang,et al.  Probabilistic collocation method for flow in porous media: Comparisons with other stochastic methods , 2007 .