A Wrapper Method for Cost-Sensitive Learning via Stratification

Many machine learning applications require classifiers that minimize an asymmetric loss function rather than the raw misclassification rate. We introduce a wrapper method based on data stratification for incorporating arbitrary cost matrices into learning algorithms. One way to implement stratification for C4.5 decision tree learners is to manipulate the weights assigned to the examples from different classes. For 2-class problems, this works for any cost matrix with zero values on the diagonal, but in general, for k > 2 classes, it is not sufficient. Nonetheless, we ask what set of class weights best approximates an arbitrary k × k cost matrix. We test and compare the new wrapper method against several heuristic methods. The results show that the best method is the wrapper method that directly optimizes the loss on a hold-out data set.
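To make the class-weight idea concrete, here is a minimal sketch in Python using scikit-learn's DecisionTreeClassifier as a stand-in for C4.5. It shows one common heuristic (weighting each class by the row sum of its misclassification costs) alongside a hold-out wrapper that selects, from a set of candidate weight vectors, the one minimizing average cost. The function names, the row-sum heuristic, and the candidate-search loop are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def row_sum_weights(cost_matrix):
    """Heuristic: weight class i by the total cost of misclassifying it
    (the sum of row i of the cost matrix), normalized to sum to 1."""
    C = np.asarray(cost_matrix, dtype=float)
    w = C.sum(axis=1)
    return w / w.sum()

def holdout_cost(clf, X_val, y_val, cost_matrix):
    """Average misclassification cost of clf's predictions on a hold-out set.
    Assumes class labels are integers 0..k-1 indexing the cost matrix."""
    pred = clf.predict(X_val)
    return float(np.mean([cost_matrix[t][p] for t, p in zip(y_val, pred)]))

def wrapper_search(X_tr, y_tr, X_val, y_val, cost_matrix, candidates):
    """Wrapper: train one tree per candidate class-weight vector and
    keep the vector whose tree has the lowest hold-out cost."""
    best_w, best_cost = None, np.inf
    for w in candidates:
        clf = DecisionTreeClassifier(
            class_weight={i: wi for i, wi in enumerate(w)}
        )
        clf.fit(X_tr, y_tr)
        cost = holdout_cost(clf, X_val, y_val, cost_matrix)
        if cost < best_cost:
            best_w, best_cost = w, cost
    return best_w, best_cost
```

In use, the candidate set might include the row-sum heuristic plus random points on the probability simplex, so the wrapper can only improve on the heuristic baseline it starts from.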