Why Discretization Works for Naive Bayesian Classifiers

This paper explains why well-known discretization methods, such as entropy-based and ten-bin, work well for naive Bayesian classifiers with continuous variables, regardless of their complexity. These methods usually assume that discretized variables have Dirichlet priors. Since perfect aggregation holds for Dirichlets, we can show that, in general, a wide variety of discretization methods perform well, with insignificant differences among them. We identify situations where discretization may cause performance degradation and show that they are unlikely to arise for well-known methods. We empirically test our explanation on synthesized and real data sets and obtain confirming results. Our analysis leads to a lazy discretization method that can simplify training for naive Bayes. This new method performs as well as well-known methods in our experiments.
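
To make the setting concrete, the following is a minimal sketch, not taken from the paper, of the ten-bin (equal-width) discretization scheme mentioned above, feeding class-conditional bin frequencies for a naive Bayes estimate. The synthetic data, the helper name `ten_bin_discretize`, and the use of add-one smoothing (which corresponds to a symmetric Dirichlet prior, the assumption the paper builds on) are illustrative assumptions.

```python
# Illustrative sketch only; not the paper's implementation.
import numpy as np

def ten_bin_discretize(x, n_bins=10):
    """Equal-width discretization: split [min, max] into n_bins intervals."""
    edges = np.linspace(x.min(), x.max(), n_bins + 1)
    # np.digitize maps each value to the index of the interval it falls in;
    # using only the interior edges yields bin indices 0..n_bins-1.
    return np.clip(np.digitize(x, edges[1:-1]), 0, n_bins - 1)

# Hypothetical data: a continuous feature x and a binary class label y.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = (x + rng.normal(scale=0.5, size=100) > 0).astype(int)

bins = ten_bin_discretize(x)
for c in (0, 1):
    counts = np.bincount(bins[y == c], minlength=10)
    # Add-one (Laplace) smoothing == symmetric Dirichlet prior on bin probabilities.
    probs = (counts + 1) / (counts.sum() + 10)
    print(f"class {c}: P(bin | class) =", np.round(probs, 2))
```

Under a Dirichlet prior, the smoothed bin frequencies above are exactly the posterior mean estimates of P(X in bin | class), which is the quantity a discretized naive Bayes classifier multiplies across features.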