Learning Bayesian Belief Network Classifiers: Algorithms and System

This paper investigates the methods for learning predictive classifiers based on Bayesian belief networks (BN) - primarily unrestricted Bayesian networks and Bayesian multi-nets. We present our algorithms for learning these classifiers, and discuss how these methods address the overfitting problem and provide a natural method for feature subset selection. Using a set of standard classification problems, we empirically evaluate the performance of various BN-based classifiers. The results show that the proposed BN and Bayes multinet classifiers are competitive with (or superior to) the best known classifiers, based on both BN and other formalisms; and that the computational time for learning and using these classifiers is relatively small. These results argue that BN-based classifiers deserve more attention in the data mining community.

[1]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[2]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[3]  Michael J. Pazzani,et al.  Searching for Dependencies in Bayesian Classifiers , 1995, AISTATS.

[4]  Judea Pearl,et al.  Chapter 2 – BAYESIAN INFERENCE , 1988 .

[5]  Ron Kohavi,et al.  MLC++: a machine learning library in C++ , 1994, Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94.

[6]  Weiru Liu,et al.  Learning belief networks from data: an information theory based approach , 1997, CIKM '97.

[7]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[8]  Deborah Bruss,et al.  Book! Book! Book! , 2001 .

[9]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[10]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[11]  Igor Kononenko,et al.  Semi-Naive Bayesian Classifier , 1991, EWSL.

[12]  David Heckerman,et al.  Knowledge Representation and Inference in Similarity Networks and Bayesian Multinets , 1996, Artif. Intell..

[13]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[14]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[15]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[16]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[17]  Dale Schuurmans,et al.  Learning Bayesian Nets that Perform Well , 1997, UAI.

[18]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.