Predictive analytics with an advanced Bayesian modeling framework

One of the main limitations in predictive analytics is the acquisition cost of engineering data due to slow-running computer code or expensive experiments. Also, data is often multi-dimensional and highly non-linear in nature, causing problems for standard statistical predictive models. Once data is collected and models are built, many applications require accurate and scalable uncertainty quantification (UQ) solutions to enable robust design and reliable decision making. A modeling framework that addresses these common problems is presented here. The advanced Bayesian modeling framework called “GE Bayesian Hybrid Modeling” (GEBHM) combines simulation and experimental data sources using machine learning techniques and Bayesian statistics to perform UQ, provides detailed sensitivity reports for design engineers, visualizes the problem and its solution succinctly, and aids in developing nextstep plans for decision makers. GEBHM works well for cases with sparse data situations but also handles large-data problems. Fundamentally, GEBHM relies on Markov Chain Monte Carlo techniques for accurately learning the Bayesian model parameters and employs state-of-the-art techniques for self-validation. The GEBHM framework applied to an example engineering design problem requiring FE and/or CFD data is presented. It is demonstrated how the surrogate model is built, how uncertainty sources are quantified and propagated, how the impact of design decisions on performance is quantified, and how the engineering design is optimized. It is shown that the framework is capable of handling problems with limited data, accurately capture the uncertainty, and provide a high level of detail and accuracy for the designers and decision makers.

[1]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[2]  A. O'Hagan,et al.  Bayesian calibration of computer models , 2001 .

[3]  James Kennedy,et al.  Defining a Standard for Particle Swarm Optimization , 2007, 2007 IEEE Swarm Intelligence Symposium.

[4]  Liping Wang,et al.  A Comparative Study on Accuracy and Efficiency of Metamodels for Large Industrial Datasets , 2011 .

[5]  Bruno Sudret,et al.  Global sensitivity analysis using polynomial chaos expansions , 2008, Reliab. Eng. Syst. Saf..

[6]  D. Higdon,et al.  Computer Model Calibration Using High-Dimensional Output , 2008 .

[7]  Arun K. Subramaniyan,et al.  Probabilistic Optimization of Two-Phase Flow Using Bayesian Models , 2014 .

[8]  G. Venter,et al.  An algorithm for fast optimal Latin hypercube design of experiments , 2010 .

[9]  James O. Berger,et al.  Markov chain Monte Carlo-based approaches for inference in computationally intensive inverse problems , 2003 .

[10]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[11]  Liping Wang,et al.  Improving High-Dimensional Physics Models Through Bayesian Calibration With Uncertain Data , 2012 .

[12]  Brian Williams,et al.  A Bayesian calibration approach to the thermal problem , 2008 .

[13]  Dave Higdon,et al.  Combining Field Data and Computer Simulations for Calibration and Prediction , 2005, SIAM J. Sci. Comput..