Chapter 8 – Data-Driven Modeling

Abstract Big data analytics and data-driven modeling have become quite the buzzwords in recent years in the context of analyzing the performance of oil and gas reservoirs. Their growing application has been predicated on the potential to usher in exciting new developments related to (1) acquiring and managing data in large volumes, of different varieties, and at high velocities (the 3V problem) and (2) using statistical techniques to “mine” the data and discover hidden patterns of association and relationships in large, complex, multivariate datasets. The terms data mining, statistical learning, knowledge discovery, and data analytics have all been used interchangeably in this context. Essentially, the goal of such an exercise is to extract important patterns and trends and understand “what the data says,” using supervised and/or unsupervised learning.

[1]  Eduardo A. B. da Silva,et al.  Big Data Analytics for Predictive Maintenance Modeling: Challenges and Opportunities , 2015 .

[2]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[3]  S. P. Neuman,et al.  Maximum likelihood Bayesian averaging of uncertain model predictions , 2003 .

[4]  Timothy R. Carr,et al.  Comparison of supervised and unsupervised approaches for mudstone lithofacies classification: Case studies from the Bakken and Mahantango-Marcellus Shale, USA , 2016 .

[5]  Luigi Saputelli Technology Focus: Petroleum Data Analytics , 2016 .

[6]  Zaid Rawi Machinery Predictive Analytics , 2010 .

[7]  Shahab D. Mohaghegh,et al.  Data Driven Analytics in Powder River Basin, WY , 2013 .

[8]  Akhil Datta-Gupta,et al.  The Role of Electrofacies, Lithofacies, and Hydraulic Flow Units in Permeability Predictions from Well Logs: A Comparative Analysis Using Classification Trees , 2005 .

[9]  Leen Weijers,et al.  Evaluating the Relationship Between Well Parameters and Production Using Multivariate Statistical Models: A Middle Bakken and Three Forks Case History , 2016 .

[10]  Srikanta Mishra,et al.  Global Sensitivity Analysis Techniques for Probabilistic Ground Water Modeling , 2009, Ground water.

[11]  Ming Zhong,et al.  Do Data Mining Methods Matter?: A Wolfcamp Shale Case Study , 2015 .

[12]  Srikanta Mishra,et al.  A New Approach to Reserves Estimation in Shale Gas Reservoirs Using Multiple Decline Curve Analysis Models , 2012 .

[13]  Keith Beven,et al.  The future of distributed models: model calibration and uncertainty prediction. , 1992 .

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Genbao Shi,et al.  Geology-guided Quantification of Production-Forecast Uncertainty in Dynamic Model Inversion , 2011 .

[16]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[17]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[18]  Srikanta Mishra,et al.  Model Averaging Techniques for Quantifying Conceptual Model Uncertainty , 2010, Ground water.

[19]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[20]  Randy Lafollette,et al.  Data Analytics for Production Optimization in Unconventional Reservoirs , 2015 .