A new method of identification of complex lithologies and reservoirs: task-driven data mining

Abstract In the traditional data-driven data mining process, there are huge gaps between the efficient algorithms and intelligent tools as well as the invalidity of knowledge which is obtained by traditional data-driven data mining. Meanwhile, each data in the earth science field contains a solid physical meaning. If there is no corresponding domain knowledge involved in the mining process, the information explored by data-driven data mining will be lack of practicability and not able to effectively solve problems in the earth science area. Therefore, the task-driven data mining is proposed. Additionally, task-driven data mining concepts and principles are elaborated with the help of data mining concepts and techniques. It is divided into seven elements such as data warehousing, data preprocessing, feature subset selection, modeling, model evaluation, model updating and model release. Those constitute a cyclic and iterative process until a predictive model which is capable of effectively achieving the objectives. In order to accurately identify complex lithologies, this paper puts forward a self-organizing feature map neural network based on the task-driven data mining. With the attempt to solve the problem of complex reservoir identification, the decision tree and support vector machine are used to build the fluid predictive model. Meanwhile, the optimization algorithms inclusive of genetic, grid and quadratic are adopted to optimize the important parameters of C-SVC and υ-SVC, such as C, υ and γ, so as to improve the classification performance and generalization ability of the predictive model of support vector machine. The conclusions of fine interpretation are compared with the core analysis data and well testing data. As a result, the accuracy of the complex lithology and reservoir identification is more than 90%. Finally, the paper puts forward the understandings, development prospects and key challenges of task-driven data mining facing.

[1]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[2]  Huang Yan The Application of Box-counting Dimension of Well Logs on Lithology Identification of Volcanic in Formation of Xujiawei , 2009 .

[3]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[4]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[5]  Gioia Falcone,et al.  Knowledge Discovery in Databases and Multiphase Flow Metering: The Integration of Statistics, Data Mining, Neural Networks, Fuzzy Logic, and Ad Hoc Flow Measurements Towards Well Monitoring and Diagnosis , 2002 .

[6]  Zhou Bo A Study on Lithology Identification Methods for Volcanic Rocks , 2005 .

[7]  Xia Li-xian The application of self-organizing feature map neural network to logging lithological identification , 2005 .

[8]  Guo Rui-hua Application of PSO-LSSVM classification model in logging lithology recognition , 2010 .

[9]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  Anuj Gupta,et al.  Permeability Modeling Using Neural Network Approach for Complex Mauddud-Burgan Carbonate Reservoir , 2007 .

[12]  Liaqat Ali,et al.  Modeling Permeability in Tight Gas Sands Using Intelligent and Innovative Data Mining Techniques , 2008 .

[13]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[14]  W. W. Weiss,et al.  Data Mining at a Regulatory Agency to Forecast Waterflood Recovery , 2001 .

[15]  David W. Aha,et al.  Feature Selection for Case-Based Classification of Cloud Types: An Empirical Comparison , 1994 .

[16]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[17]  W. Chambers San Antonio, Texas , 1940 .

[18]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[20]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[21]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[22]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[23]  Michael Stundner,et al.  How Routine Reservoir Surveillance with Neural Networks and Simplified Reservoir Models can Convert Data into Information , 2002 .

[24]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[25]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[26]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[27]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.