An efficient data mining method for learning Bayesian networks using an evolutionary algorithm-based hybrid approach

Given the explosive growth of data collected from current business environment, data mining can potentially discover new knowledge to improve managerial decision making. This paper proposes a novel data mining approach that employs an evolutionary algorithm to discover knowledge represented in Bayesian networks. The approach is applied successfully to handle the business problem of finding response models from direct marketing data. Learning Bayesian networks from data is a difficult problem. There are two different approaches to the network learning problem. The first one uses dependency analysis, while the second one searches good network structures according to a metric. Unfortunately, both approaches have their own drawbacks. Thus, we propose a novel hybrid algorithm of the two approaches, which consists of two phases, namely, the conditional independence (CI) test and the search phases. In the CI test phase, dependency analysis is conducted to reduce the size of the search space. In the search phase, good Bayesian network models are generated by using an evolutionary algorithm. A new operator is introduced to further enhance the search effectiveness and efficiency. In a number of experiments and comparisons, the hybrid algorithm outperforms MDLEP, our previous algorithm which uses evolutionary programming (EP) for network learning, and other network learning algorithms. We then apply the approach to two data sets of direct marketing and compare the performance of the evolved Bayesian networks obtained by the new algorithm with those by MDLEP, the logistic regression models, the na/spl inodot//spl uml/ve Bayesian classifiers, and the tree-augmented na/spl inodot//spl uml/ve Bayesian network classifiers (TAN). In the comparison, the new algorithm outperforms the others.

[1]  Nissan Levin,et al.  Issues and problems in applying neural computing to target marketing , 1997 .

[2]  Finn Verner Jensen,et al.  Introduction to Bayesian Networks , 2008, Innovations in Bayesian Networks.

[3]  Karim K. Hirji,et al.  Discovering data mining: from concept to implementation , 1999, SKDD.

[4]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[5]  Eamonn J. Keogh,et al.  Learning augmented Bayesian classifiers: A comparison of distribution-based and classification-based approaches , 1999, AISTATS.

[6]  David A. Bell,et al.  Learning Bayesian networks from data: An information-theory based approach , 2002, Artif. Intell..

[7]  Juan Julián Merelo Guervós,et al.  Parallel Problem Solving from Nature - PPSN IX: 9th International Conference, Reykjavik, Iceland, September 9-13, 2006, Proceedings , 2006, PPSN.

[8]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[9]  Kathryn B. Laskey,et al.  Learning Bayesian networks from incomplete data using evolutionary algorithms , 1999 .

[10]  Chris Mellish,et al.  A semantically guided and domain-independent evolutionary model for knowledge discovery from texts , 2003, IEEE Trans. Evol. Comput..

[11]  Ron Kohavi,et al.  The Case against Accuracy Estimation for Comparing Induction Algorithms , 1998, ICML.

[12]  J. Suzuki Learning Bayesian Belief Networks Based on the Minimum Description Length Principle: Basic Properties , 1999 .

[13]  Olivia Parr Rud,et al.  Data Mining Cookbook: Modeling Data for Marketing, Risk, and Customer Relationship Management , 2000 .

[14]  Nissan Levin,et al.  Applying neural computing to target marketing , 1997 .

[15]  Charles X. Ling,et al.  Data Mining for Direct Marketing: Problems and Solutions , 1998, KDD.

[16]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[17]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[18]  Gregory F. Cooper,et al.  A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships , 1997, Data Mining and Knowledge Discovery.

[19]  Moninder Singh,et al.  An Algorithm for the Construction of Bayesian Network Structures from Data , 1993, UAI.

[20]  Xin Yao,et al.  A new evolutionary system for evolving artificial neural networks , 1997, IEEE Trans. Neural Networks.

[21]  Weimin Xiao,et al.  Evolving accurate and compact classification rules with gene expression programming , 2003, IEEE Trans. Evol. Comput..

[22]  Thomas Bck,et al.  Evolutionary computation: Toward a new philosophy of machine intelligence , 1997, Complex..

[23]  Pedro Larrañaga,et al.  Structure Learning of Bayesian Networks by Genetic Algorithms: A Performance Analysis of Control Parameters , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Jin Tian,et al.  A Branch-and-Bound Algorithm for MDL Learning Bayesian Networks , 2000, UAI.

[25]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[26]  Stuart L. Crawford,et al.  Constructor: A System for the Induction of Probabilistic Models , 1990, AAAI.

[27]  Klaus-Uwe Höffgen,et al.  Learning and robust learning of product distributions , 1993, COLT '93.

[28]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[29]  David B. Fogel,et al.  Evolution-ary Computation 1: Basic Algorithms and Operators , 2000 .

[30]  Kathryn B. Laskey,et al.  Population Markov Chain Monte Carlo , 2004, Machine Learning.

[31]  Wai Lam,et al.  LEARNING BAYESIAN BELIEF NETWORKS: AN APPROACH BASED ON THE MDL PRINCIPLE , 1994, Comput. Intell..

[32]  Kwong-Sak Leung,et al.  A Hybrid Data Mining Approach To Discover Bayesian Networks Using Evolutionary Programming , 2002, GECCO.

[33]  Carlos Cotta,et al.  Analyzing Directed Acyclic Graph Recombination , 2001, Fuzzy Days.

[34]  Gregory F. Cooper,et al.  The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[35]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[36]  Chris Spatz,et al.  Basic Statistics: Tales of Distributions , 1981 .

[37]  Zbigniew Michalewicz,et al.  Evolutionary Computation 2 : Advanced Algorithms and Operators , 2000 .

[38]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[39]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[40]  R. Blattberg,et al.  Database marketing , 1997 .

[41]  Zbigniew Michalewicz,et al.  Evolutionary Computation 2 , 2000 .

[42]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[43]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[44]  Yang Xiang,et al.  Parallel Learning of Belief Networks in Large and Difficult Domains , 2004, Data Mining and Knowledge Discovery.

[45]  Carlos Cotta,et al.  Towards a More Efficient Evolutionary Induction of Bayesian Networks , 2002, PPSN.

[46]  Xiaohui Liu,et al.  Evolutionary learning of dynamic probabilistic models with large time lags , 2001, Int. J. Intell. Syst..

[47]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[48]  Wolfgang Banzhaf,et al.  A comparison of linear genetic programming and neural networks in medical data mining , 2001, IEEE Trans. Evol. Comput..

[49]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[50]  Lawrence J. Fogel,et al.  Artificial Intelligence through Simulated Evolution , 1966 .

[51]  Siddhartha Bhattacharyya,et al.  Knowledge-intensive genetic discovery in foreign exchange markets , 2002, IEEE Trans. Evol. Comput..

[52]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[53]  Pedro Larrañaga,et al.  Learning Bayesian network structures by searching for the best ordering with genetic algorithms , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[54]  Mitchell P. Marcus,et al.  Learning bayesian networks for solving real-world problems , 1998 .

[55]  Eric Horvitz,et al.  Inferring Informational Goals from Free-Text Queries: A Bayesian Approach , 1998, UAI.

[56]  David B. Fogel,et al.  Evolutionary Computation: Towards a New Philosophy of Machine Intelligence , 1995 .

[57]  Zbigniew Michalewicz,et al.  Evolutionary Computation 1 , 2018 .

[58]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[59]  Francisco Herrera,et al.  Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study , 2003, IEEE Trans. Evol. Comput..

[60]  Dirk Thierens,et al.  Building a GA from Design Principles for Learning Bayesian Networks , 2003, GECCO.

[61]  Moninder Singh,et al.  Construction of Bayesian network structures from data: A brief survey and an efficient algorithm , 1995, Int. J. Approx. Reason..

[62]  Kwong-Sak Leung,et al.  Using Evolutionary Programming and Minimum Description Length Principle for Data Mining of Bayesian Networks , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[63]  Kwong-Sak Leung,et al.  A hybrid approach to learn Bayesian networks using evolutionary programming , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[64]  Siddhartha Bhattacharyya,et al.  Evolutionary algorithms in data mining: multi-objective performance modeling for direct marketing , 2000, KDD '00.

[65]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[66]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[67]  J.C.Y. Cheng,et al.  Discovering knowledge from medical databases using evolutionory algorithms , 2000, IEEE Engineering in Medicine and Biology Magazine.

[68]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[69]  Marek J. Druzdzel,et al.  A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data , 1999, UAI.

[70]  Siddhartha Bhattacharyya,et al.  Direct Marketing Response Models Using Genetic Algorithms , 1998, KDD.

[71]  Kwong-Sak Leung,et al.  A hybrid approach to discover Bayesian networks from databases using evolutionary programming , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[72]  G. P. Beaumont,et al.  Statistical tests : an introduction with Minitab commentary , 1997 .