A concept for data-driven uncertainty quantification and its application to carbon dioxide storage in geological formations

Abstract Model-based uncertainty analysis can help to judge the potentials and hazard in many engineering applications better. This requires to specify the probability distributions of all model parameters, posing a huge demand on data availability or requiring highly subjective assumptions on distribution shapes to compensate for missing data. We present a minimally subjective approach for uncertainty quantification in data-sparse situations, based on a new and purely data-driven version of polynomial chaos expansion (PCE). It avoids the subjectivity that is otherwise introduced when choosing among a small limited number of theoretical distribution shapes to represent natural phenomena: we only demand the existence of a finite number of statistical moments, and do not require knowledge or even the existence of probability density functions for input parameters. In a small fictitious example with independent experts, otherwise, we demonstrate that this subjectivity can easily lead to substantial prediction bias, and that the subjective choice of distribution shapes has a similar relevance as uncertainties due to physical conceptualization, numerical codes and parameter uncertainty. With our approach we can directly and most flexibly use raw data sets available from global databases or soft information from experts in the form of arbitrary distributions or statistical moments. We illustrate and validate our proposed approach by a comparison with a Monte Carlo simulation using a common 3D benchmark problem for CO 2 injection, which is a low-parametric homogeneous system. We obtain a significant computational speed-up compared with Monte Carlo as well as high accuracy even for small orders of expansion, and show how our novel approach helps overcome subjectivity.

[1]  T. Ulrych,et al.  Minimum relative entropy: Forward probabilistic modeling , 1993 .

[2]  R. Askey,et al.  Some basic hypergeometric orthogonal polynomials that generalize Jacobi polynomials , 1985 .

[3]  Rainer Helmig,et al.  An integrative approach to robust design and probabilistic risk assessment for CO2 storage in geological formations , 2011 .

[4]  V. Sambucini A reference prior for the analysis of a response surface , 2007 .

[5]  J. Lancaster,et al.  Spatial heterogeneity of near‐bed hydraulics above a patch of river gravel , 2006 .

[6]  S. Isukapalli,et al.  Stochastic Response Surface Methods (SRSMs) for Uncertainty Propagation: Application to Environmental and Biological Systems , 1998, Risk analysis : an official publication of the Society for Risk Analysis.

[7]  P. Rentrop,et al.  Polynomial chaos for the approximation of uncertainties: Chances and limits , 2008, European Journal of Applied Mathematics.

[8]  F. Pappenberger,et al.  Ignorance is bliss: Or seven reasons not to use uncertainty analysis , 2006 .

[9]  Pol D. Spanos,et al.  A stochastic Galerkin expansion for nonlinear random vibration analysis , 1993 .

[10]  W. T. Martin,et al.  The Orthogonal Development of Non-Linear Functionals in Series of Fourier-Hermite Functionals , 1947 .

[11]  Anders Hansson,et al.  Expert opinions on carbon dioxide capture and storage—A framing of uncertainties and possibilities , 2009 .

[12]  Dmitri Kavetski,et al.  Model for CO2 leakage including multiple geological layers and multiple leaky wells. , 2009, Environmental science & technology.

[13]  Roger G. Ghanem,et al.  On the construction and analysis of stochastic models: Characterization and propagation of the errors associated with limited data , 2006, J. Comput. Phys..

[14]  Rainer Helmig,et al.  CO2 leakage through an abandoned well: problem-oriented benchmarks , 2007 .

[15]  Jan M. Nordbotten,et al.  Injection and Storage of CO2 in Deep Saline Aquifers: Analytical Solution for CO2 Plume Evolution During Injection , 2005 .

[16]  W. Siebert Circuits, Signals and Systems , 1985 .

[17]  Rainer Helmig,et al.  Investigations on CO2 storage capacity in saline aquifers: Part 1. Dimensional analysis of flow processes and reservoir characteristics , 2009 .

[18]  N. Wiener The Homogeneous Chaos , 1938 .

[19]  D. Xiu,et al.  Modeling uncertainty in flow simulations via generalized polynomial chaos , 2003 .

[20]  Jeroen A. S. Witteveen,et al.  Modeling Arbitrary Uncertainties Using Gram-Schmidt Polynomial Chaos , 2006 .

[21]  E. Jaynes On the rationale of maximum-entropy methods , 1982, Proceedings of the IEEE.

[22]  Dongbin Xiu,et al.  The Wiener-Askey Polynomial Chaos for Stochastic Differential Equations , 2002, SIAM J. Sci. Comput..

[23]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[24]  Jeroen A. S. Witteveen,et al.  Modeling physical uncertainties in dynamic stall induced fluid-structure interaction of turbine blades using arbitrary polynomial chaos , 2007 .

[25]  Stephen Roberts,et al.  Stochastic galerkin and collocation methods for quantifying uncertainty in differential equations: a review , 2009 .

[26]  J. Villadsen,et al.  Solution of differential equation models by polynomial approximation , 1978 .

[27]  Mark A. Herkommer,et al.  Data-volume reduction of data gathered along lines using the correlation coefficient to determine breakpoints , 1985 .

[28]  Curtis M. Oldenburg,et al.  The role of optimality in characterizing CO2 seepage from geological carbon sequestration sites , 2007 .

[29]  Heng Li,et al.  A Comparative Study of the Probabilistic-Collocation and Experimental-Design Methods for Petroleum-Reservoir Uncertainty Quantification , 2011 .

[30]  Hermann G. Matthies,et al.  Sparse Quadrature as an Alternative to Monte Carlo for Stochastic Finite Element Techniques , 2003 .

[31]  Dongxiao Zhang,et al.  Probabilistic collocation method for flow in porous media: Comparisons with other stochastic methods , 2007 .

[32]  Amos Gilat,et al.  Matlab, An Introduction With Applications , 2003 .

[33]  Hans Wackernagel,et al.  Multivariate Geostatistics: An Introduction with Applications , 1996 .

[34]  Roger G. Ghanem,et al.  Physical Systems with Random Uncertainties: Chaos Representations with Arbitrary Probability Measure , 2005, SIAM J. Sci. Comput..

[35]  Hermann G. Matthies,et al.  Galerkin methods for linear and nonlinear elliptic stochastic partial differential equations , 2005 .

[36]  F. Maltz,et al.  Variance reduction in Monte Carlo computations using multi-dimensional hermite polynomials , 1979 .

[37]  Laurent Trenty,et al.  A benchmark study on problems related to CO2 storage in geologic formations , 2009 .

[38]  G. Karniadakis,et al.  Multi-Element Generalized Polynomial Chaos for Arbitrary Probability Measures , 2006, SIAM J. Sci. Comput..

[39]  C. Tsang,et al.  Large-scale impact of CO2 storage in deep saline aquifers: A sensitivity study on pressure response in stratified systems , 2009 .