论文信息 - A Linear-Bayes Classifier

A Linear-Bayes Classifier

Naive Bayes is a well known and studied algorithm both in statistics and machine learning. Although its limitations with respect to expressive power, this procedure has a surprisingly good performance in a wide variety of domains, including many where there are clear dependencies between attributes. In this paper we address its main perceived limitation - its inability to deal with attribute dependencies. We present Linear Bayes that uses, for the continuous attributes, a multivariate normal distribution to compute the require probabilities. In this way, the interdependencies between the continuous attributes are considered. On the empirical evaluation, we compare Linear Bayes against a naive-Bayes that discretize continuous attributes, a naive-Bayes that assumes a univariate Gaussian for continuous attributes, and a standard Linear discriminant function. We show that Linear Bayes is a plausible algorithm, that competes quite well against other well established techniques.

João Gama

[1] Pat Langley,et al. Tractable Average-Case Analysis of Naive Bayesian Classifiers , 1999, ICML.

[2] Igor Kononenko,et al. Semi-Naive Bayesian Classifier , 1991, EWSL.

[3] Zijian Zheng,et al. Naive Bayesian Classifier Committees , 1998, ECML.

[4] George H. John. Enhancements to the data mining process , 1997 .

[5] Ron Kohavi,et al. Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[6] Ron Kohavi,et al. Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid , 1996, KDD.

[7] Pat Langley,et al. Induction of Recursive Bayesian Classifiers , 1993, ECML.

[8] Ron Kohavi,et al. Improving simple Bayes , 1997 .

[9] Nir Friedman,et al. Bayesian Network Classification with Continuous Attributes: Getting the Best of Both Discretization and Parametric Fitting , 1998, ICML.

[10] David J. Spiegelhalter,et al. Machine Learning, Neural and Statistical Classification , 2009 .

[11] João Gama,et al. Iterative Bayes , 2000, Intell. Data Anal..

[12] João Gama. Iterative Naive Bayes , 1999, Discovery Science.

[13] Geoffrey I. Webb,et al. Adjusted Probability Naive Bayesian Induction , 1998, Australian Joint Conference on Artificial Intelligence.

[14] M. Pazzani. Constructive Induction of Cartesian Product Attributes , 1998 .

[15] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .

[16] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.