Cost-sensitive Classification : Status and Beyond

The rows represent the actual patient status, and the columns represent the diagnosis made by the doctor. For instance, on any correct diagnosis, the society pays no (additional) cost. However, if an H1N1-infected patient is predicted as coldinfected or healthy, the whole society may suffer from a huge amount of cost. On the other hand, if a cold-infected patient is predicted as healthy, the society needs to pay some cost—but not as serious as the ones paid in the previous scenario. These different costs are important for a human doctor when making any diagnosis. For instance, the doctor would be very careful on the slightest H1N1 symptom to prevent the “1000000” level mis-prediction. If we were to build an automatic system—a “computer doctor”—to make the diagnosis, how can the system use the cost information appropriately? Many real-world applications that share similar needs can be found in medical decision making, target marketing, and object recognition. Those applications belong to cost-sensitive classification. In fact, costsensitive classification can be used to express any finite-choice and bounded-loss machine learning problems [2]. Thus, it has been attracting much research attention in the past decade [3], [4], [5], [6], [7], [8], [2], [9], [10], [11], [12], [1], [13].

[1]  John Langford,et al.  An iterative method for multi-class cost-sensitive learning , 2004, KDD.

[2]  John Langford,et al.  Sensitive Error Correcting Output Codes , 2005, COLT.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Hsuan-Tien Lin,et al.  One-sided Support Vector Regression for Multiclass Cost-sensitive Classification , 2010, ICML.

[5]  Zhi-Hua Zhou,et al.  Learning with cost intervals , 2010, KDD '10.

[6]  Hsuan-Tien Lin,et al.  A Simple Cost-sensitive Multiclass Classification Algorithm Using One-versus-one Comparisons , 2010 .

[7]  John Langford,et al.  Cost-sensitive learning by cost-proportionate example weighting , 2003, Third IEEE International Conference on Data Mining.

[8]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[9]  Thomas G. Dietterich,et al.  Methods for cost-sensitive learning , 2002 .

[10]  Hsuan-Tien Lin,et al.  From ordinal ranking to binary classification , 2008 .

[11]  Zhi-Hua Zhou,et al.  ON MULTI‐CLASS COST‐SENSITIVE LEARNING , 2006, Comput. Intell..

[12]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[13]  Hsuan-Tien Lin,et al.  An Ensemble Ranking Solution for the Yahoo ! Learning to Rank Challenge , 2010 .

[14]  A. Beygelzimer Multiclass Classification with Filter Trees , 2007 .

[15]  Thomas P. Hayes,et al.  Error limiting reductions between classification tasks , 2005, ICML.