Progressive random k-labelsets for cost-sensitive multi-label classification

In multi-label classification, an instance is associated with multiple relevant labels, and the goal is to predict all of them simultaneously. Real-world applications of multi-label classification are evaluated under diverse performance criteria, so it is important to design general methods that can flexibly take different criteria into account. Such methods tackle the problem of cost-sensitive multi-label classification (CSMLC). Most existing CSMLC methods either suffer from high computational complexity or optimize only certain specific criteria. In this work, we propose a novel CSMLC method, progressive random k-labelsets (PRAkEL), to resolve both issues. PRAkEL extends the popular random k-labelsets (RAkEL) method and hence inherits its efficiency. Furthermore, it handles arbitrary example-based evaluation criteria by progressively transforming the CSMLC problem into a series of cost-sensitive multi-class classification problems. Experimental results demonstrate that PRAkEL is competitive with existing methods under the specific criteria they optimize, and is superior under other criteria.
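The core reduction underlying k-labelsets methods can be sketched as follows: each random labelset of size k turns the multi-label problem restricted to those labels into a single multi-class problem whose classes are the (up to 2^k) label combinations. The sketch below is illustrative only; the function names (`sample_labelsets`, `powerset_encode`) are our own, and it covers just the labelset encoding, not the cost-sensitive or progressive training aspects of PRAkEL.

```python
# A minimal sketch of the k-labelsets reduction, assuming binary label
# vectors. Each labelset of size k maps the k relevance bits of an
# instance to one multi-class label (an integer in [0, 2^k)).
import random
from itertools import combinations

def sample_labelsets(num_labels, k, num_sets, seed=0):
    """Draw `num_sets` distinct random labelsets of size k."""
    rng = random.Random(seed)
    all_sets = list(combinations(range(num_labels), k))
    return rng.sample(all_sets, num_sets)

def powerset_encode(y, labelset):
    """Pack the relevance bits of `labelset` in y into one class index."""
    code = 0
    for i, label in enumerate(labelset):
        code |= y[label] << i
    return code

# Example: 6 labels, two labelsets of size 3.
labelsets = sample_labelsets(num_labels=6, k=3, num_sets=2)
y = [1, 0, 1, 1, 0, 0]  # a multi-label target vector
codes = [powerset_encode(y, s) for s in labelsets]
```

One multi-class classifier is then trained per labelset, and predictions for the full label vector are obtained by decoding and aggregating (e.g., voting) across labelsets; PRAkEL additionally attaches example-based costs to these multi-class subproblems.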