A Clustering Based Bayesian Network Classifier

Building high-accuracy and efficient Bayesian network classifiers is a hot theme of Bayesian network classifier in recent years. It is often the case that building unrestricted Bayesian network classifier with large number of attributes is time-consuming and always gets poor result, since the searching space of network structure is huge. This paper proposes a clustering based Bayesian network structure learning algorithm(CBNA), which uses mutual information to measure the distances between attributes so as to divide them into groups by hierarchical clustering. Then the network searching is running under these low dimensional spaces while the relation of attributes of different group is ignored instead of finding the network from one high dimensional space. Experimental results suggested that this algorithm is more accurate and efficient when compared to other Bayesian network classifiers and enables to obtain the optimal structure by unrestricted searching.

[1]  David Heckerman,et al.  Causal independence for probability assessment and inference using Bayesian networks , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[2]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[3]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[4]  Robert P. Goldman,et al.  A Language for Construction of Belief Networks , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[6]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[7]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[8]  F. El-Matouat,et al.  Using optimal variables for Bayesian network classifiers , 2000, Proceedings of the Third International Conference on Information Fusion.

[9]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[10]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[11]  Zhou Yan Research for the classifiers based on Bayesian networks , 2003 .