Probabilistic Networks and Fuzzy Clustering as Generalizations of Naive Bayes Classifiers

Although probabilistic networks and fuzzy clustering appear at first sight to be disparate areas of research, a closer look reveals that both can be seen as generalizations of naive Bayes classifiers. If all attributes except the class attribute are numeric, naive Bayes classifiers often assume an axis-parallel multidimensional normal distribution for each class as the underlying model. Probabilistic networks remove the requirement that the distributions be axis-parallel by taking the covariance of the attributes into account where necessary. Fuzzy clustering is an unsupervised method that tries to find general or axis-parallel distributions to cluster the data. Although it does not take the class information into account, it can be used to improve the results of naive Bayes classifiers and probabilistic networks by removing the restriction that there can be only one distribution per class.
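The difference between the axis-parallel model and a model that takes covariances into account can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy data, and the use of a single full covariance matrix per class as a stand-in for the covariance-aware model are all assumptions made for illustration.

```python
import numpy as np

def fit_gaussian_classes(X, y, axis_parallel=True):
    """Estimate one normal distribution per class (hypothetical helper).

    axis_parallel=True keeps only the variances (naive Bayes assumption);
    axis_parallel=False keeps the full covariance matrix.
    """
    model = {}
    for c in np.unique(y):
        Xc = X[y == c]
        cov = np.atleast_2d(np.cov(Xc, rowvar=False))
        if axis_parallel:
            cov = np.diag(np.diag(cov))  # discard all covariances
        prior = len(Xc) / len(X)
        model[c] = (prior, Xc.mean(axis=0), cov)
    return model

def log_density(x, mean, cov):
    """Log density of a multivariate normal distribution at x."""
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = d @ np.linalg.solve(cov, d)  # squared Mahalanobis distance
    return -0.5 * (len(x) * np.log(2 * np.pi) + logdet + maha)

def classify(model, x):
    """Return the class with the highest log prior plus log density."""
    return max(
        model,
        key=lambda c: np.log(model[c][0]) + log_density(x, model[c][1], model[c][2]),
    )

# Toy data (assumed): class 0 is positively correlated, class 1 negatively.
X = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.2], [3.0, 2.9],
              [0.0, 3.0], [1.0, 2.1], [2.0, 1.0], [3.0, 0.1]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

naive = fit_gaussian_classes(X, y, axis_parallel=True)
full = fit_gaussian_classes(X, y, axis_parallel=False)

print(classify(naive, np.array([3.0, 3.0])))
print(classify(full, np.array([3.0, 3.0])))
```

In this toy data the two classes have almost identical means and variances, so the axis-parallel model can barely separate them, whereas the full-covariance model uses the opposite orientation of the two distributions. Removing the restriction to one distribution per class, as the abstract suggests for fuzzy clustering, would correspond to fitting several such distributions per class; that step is not shown here.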
