Tuning machine learning dropout for subsurface uncertainty model accuracy

Abstract Machine learning hyperparameter tuning generally relies on minimizing the prediction error and maximizing the accuracy at withheld testing data locations. However, for many subsurface applications, prediction accuracy is not sufficient, and we must consider the goodness, accuracy, and precision, of the entire uncertainty model. Bayesian deep neural networks provide a robust framework to model uncertainty. But at a high computational cost and difficulty to scale in high-dimensional parameter spaces. Dropout offers a potential alternative for calculating prediction model realizations to represent the uncertainty model. We propose a method to compare the entire uncertainty model based on the prediction realizations and the addition of a single objective function, known as uncertainty model goodness, to the loss function to integrate model uncertainty for improved tuning of the dropout frequency in deep learning models. We test our method with a subsurface flow surrogate model based on an image-to-image regression problem. Implementing the proposed method with the uncertainty model goodness results in accurate and precise uncertainty models using the tuned dropout hyperparameter. We test the robustness and generalization of the proposed method with out-of-distribution testing data.

[1]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[2]  K. Aziz,et al.  Petroleum Reservoir Simulation , 1979 .

[3]  Tara N. Sainath,et al.  Improving deep neural networks for LVCSR using rectified linear units and dropout , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  László Pásztor,et al.  Comparison of various uncertainty modelling approaches based on geostatistics and machine learning algorithms , 2019, Geoderma.

[5]  Benjamin Van Roy,et al.  Deep Exploration via Bootstrapped DQN , 2016, NIPS.

[6]  M. Pyrcz,et al.  Stochastic Pix2pix: A New Machine Learning Method for Geophysical and Well Conditioning of Rule-Based Channel Reservoir Models , 2020, Natural Resources Research.

[7]  Zhanxing Zhu,et al.  ST-UNet: A Spatio-Temporal U-Network for Graph-structured Time Series Modeling , 2019, ArXiv.

[8]  Duo Xu,et al.  PoreFlow-Net: A 3D convolutional neural network to predict fluid flow through porous media , 2020, Advances in Water Resources.

[9]  Hang Lei,et al.  Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimization , 2019 .

[10]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[11]  Xiqun Chen,et al.  Short-Term Forecasting of Passenger Demand under On-Demand Ride Services: A Spatio-Temporal Deep Learning Approach , 2017, ArXiv.

[12]  Chun Wang,et al.  A metamodel-assisted evolutionary algorithm for expensive optimization , 2011, J. Comput. Appl. Math..

[13]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[14]  Thomas Nauss,et al.  Automatic classification of Google Earth images for a larger scale monitoring of bush encroachment in South Africa , 2016, Int. J. Appl. Earth Obs. Geoinformation.

[15]  C. C. Pain,et al.  Rapid spatio-temporal flood prediction and uncertainty quantification using a deep learning method , 2019, Journal of Hydrology.

[16]  B. Dindoruk,et al.  Analytical Solution of Nonisothermal Buckley-Leverett Flow Including Tracers , 2008 .

[17]  Indranil Pan,et al.  Performance comparison of several response surface surrogate models and ensemble methods for water injection optimization under uncertainty , 2016, Comput. Geosci..

[18]  Lars Kotthoff,et al.  Automated Machine Learning: Methods, Systems, Challenges , 2019, The Springer Series on Challenges in Machine Learning.

[19]  Albert C. Reynolds,et al.  Robust Life-Cycle Production Optimization With a Support-Vector-Regression Proxy , 2018, SPE Journal.

[20]  Michèle Sebag,et al.  Comparison-Based Optimizers Need Comparison-Based Surrogates , 2010, PPSN.

[21]  Geoffrey E. Hinton,et al.  Keeping the neural networks simple by minimizing the description length of the weights , 1993, COLT '93.

[22]  José Ranilla,et al.  Particle swarm optimization for hyper-parameter selection in deep neural networks , 2017, GECCO.

[23]  Nicholas Zabaras,et al.  Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate Modeling and Uncertainty Quantification , 2018, J. Comput. Phys..

[24]  Pierre Baldi,et al.  The dropout learning algorithm , 2014, Artif. Intell..

[25]  Hayaru Shouno,et al.  Analysis of Dropout Learning Regarded as Ensemble Learning , 2016, ICANN.

[26]  K. Aziz,et al.  Prediction Of Uncertainty In Reservoir Performance Forecast , 1992 .

[27]  L. Durlofsky,et al.  Deep-learning-based surrogate model for reservoir simulation with time-varying well controls , 2020, Journal of Petroleum Science and Engineering.

[28]  Oge Marques,et al.  Dropout vs. batch normalization: an empirical study of their impact to deep learning , 2020, Multimedia Tools and Applications.

[29]  Yaping Yang,et al.  A Comparison of Different Regression Algorithms for Downscaling Monthly Satellite-Based Precipitation over North China , 2016, Remote. Sens..

[30]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[31]  Amir Hossein Alavi,et al.  Machine learning in geosciences and remote sensing , 2016 .

[32]  Larry W. Lake,et al.  Fundamentals of Enhanced Oil Recovery , 2014 .

[33]  B. H. Caudle,et al.  Oil Production After Breakthrough as Influenced by Mobility Ratio , 1954 .

[34]  Lina Yao,et al.  Deep Neural Network Hyperparameter Optimization with Orthogonal Array Tuning , 2019, ICONIP.

[35]  P. Goovaerts,et al.  Geostatistical Modeling of the Spaces of Local, Spatial, and Response Uncertainty for Continuous Petrophysical Properties , 2006 .

[36]  Clayton V. Deutsch,et al.  Geostatistical Reservoir Modeling , 2002 .

[37]  David J. C. MacKay,et al.  A Practical Bayesian Framework for Backpropagation Networks , 1992, Neural Computation.

[38]  Tianrui Li,et al.  Spatio-temporal feature fusion for dynamic taxi route recommendation via deep reinforcement learning , 2020, Knowl. Based Syst..

[39]  Pierre Roudier,et al.  Mapping Daily Air Temperature for Antarctica Based on MODIS LST , 2016, Remote. Sens..

[40]  Honggeun Jo,et al.  Conditioning well data to rule-based lobe model by machine learning with a generative adversarial network , 2020, Energy Exploration & Exploitation.

[41]  Kevin Smith,et al.  Bayesian Uncertainty Estimation for Batch Normalized Deep Networks , 2018, ICML.

[42]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[43]  Ridha Gharbi,et al.  An introduction to artificial intelligence applications in petroleum exploration and production , 2005 .

[44]  Morteza Haghighat Sefat,et al.  Development of an adaptive surrogate model for production optimization , 2015 .

[45]  Alex Kendall,et al.  What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[46]  Huayi Wu,et al.  A novel cellular automata model integrated with deep learning for dynamic spatio-temporal land use change simulation , 2020, Comput. Geosci..

[47]  Richard Dikau,et al.  Regional-scale controls on the spatial activity of rockfalls (Turtmann Valley, Swiss Alps) — A multivariate modeling approach , 2017 .

[48]  Dmitry P. Vetrov,et al.  Uncertainty Estimation via Stochastic Batch Normalization , 2018, ICLR.

[49]  Yuewei Pan,et al.  Application of physics-informed neural networks for self-similar and transient solutions of spontaneous imbibition , 2021, Journal of Petroleum Science and Engineering.

[50]  Stefano Ermon,et al.  Accurate Uncertainties for Deep Learning Using Calibrated Regression , 2018, ICML.

[51]  Shuhua Wang,et al.  Applicability of deep neural networks on production forecasting in Bakken shale reservoirs , 2019, Journal of Petroleum Science and Engineering.