论文信息 - The Learnability of Naive Bayes

The Learnability of Naive Bayes

Naive Bayes is an efficient and effective learning algorithm, but previous results show that its representation ability is severely limited since it can only represent certain linearly separable functions in the binary domain. We give necessary and sufficient conditions on linearly separable functions in the binary domain to be learnable by Naive Bayes under uniform representation. We then show that the learnability (and error rates) of Naive Bayes can be affected dramatically by sampling distributions. Our results help us to gain a much deeper understanding of this seemingly simple, yet powerful learning algorithm.

[1] Michael J. Pazzani,et al. Syskill & Webert: Identifying Interesting Web Sites , 1996, AAAI/IAAI, Vol. 1.

[2] Ron Kohavi,et al. Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[3] Pedro M. Domingos,et al. Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier , 1996, ICML.

[4] Pat Langley,et al. An Analysis of Bayesian Classifiers , 1992, AAAI.

[5] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6] Brian R. Gaines,et al. Current Trends in Knowledge Acquisition , 1990 .

[7] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[8] Peter E. Hart,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.