Improved Class Probability Estimates from Decision Tree Models

Decision tree models typically give good classification decisions but poor probability estimates. In many applications, it is important to have good probability estimates as well. This chapter introduces a new algorithm, Bagged Lazy Option Trees (B-LOTs), for constructing decision trees and compares it to an alternative, Bagged Probability Estimation Trees (B-PETs). The quality of the class probability estimates produced by the two methods is evaluated in two ways. First, we compare the ability of the two methods to make good classification decisions when the misclassification costs are asymmetric. Second, we compare the absolute accuracy of the estimates themselves. The experiments show that B-LOTs produce better decisions and more accurate probability estimates than B-PETs.
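
To make the comparison concrete, the sketch below shows a minimal B-PET-style baseline under stated assumptions: bagged, unpruned decision trees whose leaf class frequencies are Laplace-corrected and averaged across the ensemble, followed by the standard minimum-expected-cost decision rule used when misclassification costs are asymmetric. The function names (`fit_bpet`, `predict_proba`, `min_expected_cost_decision`), the use of scikit-learn, and the assumption that labels are encoded as integers 0..k-1 are all illustrative choices, not the chapter's implementation; B-LOTs themselves involve lazy, option-node tree construction and are not reproduced here.

```python
# Illustrative sketch only: a B-PET-style bagged probability estimator
# (unpruned trees + Laplace-corrected leaf frequencies, averaged over bags)
# and the minimum-expected-cost decision rule. Not the chapter's B-LOT code.
# Assumes y contains integer class labels 0..n_classes-1.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.utils import resample

def fit_bpet(X, y, n_classes, n_trees=30, seed=0):
    """Fit bagged, unpruned trees; store per-leaf class counts for smoothing."""
    rng = np.random.RandomState(seed)
    ensemble = []
    for _ in range(n_trees):
        Xb, yb = resample(X, y, random_state=rng)          # bootstrap sample
        tree = DecisionTreeClassifier(criterion="entropy").fit(Xb, yb)
        counts = np.zeros((tree.tree_.node_count, n_classes))
        for leaf, label in zip(tree.apply(Xb), yb):
            counts[leaf, label] += 1                       # class counts at each leaf
        ensemble.append((tree, counts))
    return ensemble

def predict_proba(ensemble, X, n_classes):
    """Average Laplace-corrected leaf frequencies: (n_c + 1) / (n + k)."""
    probs = np.zeros((len(X), n_classes))
    for tree, counts in ensemble:
        c = counts[tree.apply(X)]          # counts at the leaf each example reaches
        probs += (c + 1.0) / (c.sum(axis=1, keepdims=True) + n_classes)
    return probs / len(ensemble)

def min_expected_cost_decision(probs, cost):
    """Choose the class i minimizing sum_j P(j|x) * cost[i, j]."""
    return np.argmin(probs @ cost.T, axis=1)
```

For example, on a two-class problem with `cost = np.array([[0, 10], [1, 0]])`, the decision rule shifts predictions toward the class whose misclassification is ten times costlier; this asymmetric-cost regime is exactly the first of the two evaluations described above, where poor probability estimates translate directly into poor decisions.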
