Two novel Ant Colony Optimization approaches for Bayesian network structure learning

Learning Bayesian networks from data is an NP-hard problem with important practical applications. Several researchers have designed algorithms to overcome the computational complexity of this task. Difficult challenges remain however in reducing computation time for structure learning in networks of medium to large size and in understanding problem-dependent aspects of performance. In this paper, we present two novel algorithms (ChainACO and K2ACO) that use Ant Colony Optimization (ACO). Both algorithms search through the space of orderings of data variables. The ChainACO approach uses chain structures to reduce computational complexity of evaluation but at the expense of ignoring the richer structure that is explored in the K2ACO approach. The novel algorithms presented here are ACO versions of previously published GA approaches. We are therefore able to compare ACO vs GA algorithms and Chain vs K2 evaluations. We present a series of experiments on three well-known benchmark problems. Our results show problem-specific trade-offs between solution quality and computational effort. However it seems that the ACO-based approaches might be favored for larger problems, achieving better fitnesses and success rate than their GA counterparts on the largest network studied in our experiments.

[1]  Francisco Javier Díez,et al.  Networks of probabilistic events in discrete time , 2002, Int. J. Approx. Reason..

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[4]  Stephen F. Smith,et al.  Ant colony control for autonomous decentralized shop floor routing , 2001, Proceedings 5th International Symposium on Autonomous Decentralized Systems.

[5]  Thomas A. Runkler,et al.  Using a Local Discovery Ant Algorithm for Bayesian Network Structure Learning , 2009, IEEE Transactions on Evolutionary Computation.

[6]  G. Di Caro,et al.  Ant colony optimization: a new meta-heuristic , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[7]  Thomas Stützle,et al.  Guest editorial: special section on ant colony optimization , 2002, IEEE Trans. Evol. Comput..

[8]  José A. Gámez,et al.  Learning Bayesian networks by Ant Colony Optimisation: searching in two different spaces , 2002 .

[9]  J. Gutiérrez,et al.  Applications of Bayesian Networks in Meteorology , 2004 .

[10]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[11]  Jose Miguel Puerta,et al.  Ant colony optimization for learning Bayesian networks , 2002, Int. J. Approx. Reason..

[12]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[13]  Christian Blum,et al.  Ant colony optimization: Introduction and recent trends , 2005 .

[14]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[15]  Guoliang Xue,et al.  Applying two-level simulated annealing on Bayesian structure learning to infer genetic networks , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[16]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[17]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[18]  F. Harary New directions in the theory of graphs , 1973 .

[19]  Marco Dorigo,et al.  Ant colony optimization theory: A survey , 2005, Theor. Comput. Sci..

[20]  Guoliang Xue,et al.  Applying two-level simulated annealing on Bayesian structure learning to infer genetic networks , 2004 .

[21]  Gregory F. Cooper,et al.  The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[22]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[23]  Pedro Larrañaga,et al.  Structure Learning of Bayesian Networks by Hybrid Genetic Algorithms , 1995, AISTATS.

[24]  Jose Miguel Puerta,et al.  Stochastic Local Algorithms for Learning Belief Networks: Searching in the Space of the Orderings , 2001, ECSQARU.

[25]  Thomas A. Runkler,et al.  Learning of Bayesian networks by a local discovery ant colony algorithm , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[26]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[27]  Luis M. de Campos,et al.  A new approach for learning belief networks using independence criteria , 2000, Int. J. Approx. Reason..

[28]  Qiang Shen,et al.  Learning Bayesian Network Equivalence Classes with Ant Colony Optimization , 2009, J. Artif. Intell. Res..

[29]  Marco Dorigo,et al.  The ant colony optimization meta-heuristic , 1999 .

[30]  Thomas A. Runkler,et al.  Online Optimization of a Color Sorting Assembly Buffer Using Ant Colony Optimization , 2007, OR.

[31]  José Mira Mira,et al.  NasoNet, modeling the spread of nasopharyngeal cancer with networks of probabilistic events in discrete time , 2002, Artif. Intell. Medicine.

[32]  C. J. Eyckelhof,et al.  Ant Systems for a Dynamic TSP , 2002, Ant Algorithms.

[33]  Pedro Larrañaga,et al.  Learning Bayesian network structures by searching for the best ordering with genetic algorithms , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[34]  Qiang Shen,et al.  Using ant colony optimisation in learning Bayesian network equivalence classes , 2006 .

[35]  Cheng-Fa Tsai,et al.  A new approach for solving large traveling salesman problem using evolutionary ant rules , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[36]  John A. W. McCall,et al.  A chain-model genetic algorithm for Bayesian network structure learning , 2007, GECCO '07.