Condensed Filter Tree for Cost-Sensitive Multi-Label Classification

Different real-world applications of multi-label classification often demand different evaluation criteria. We formalize this demand with a general setup, cost-sensitive multi-label classification (CSMLC), which takes the evaluation criterion into account during learning. Nevertheless, most existing algorithms focus on optimizing only a few specific evaluation criteria and cannot systematically handle different ones. In this paper, we propose a novel algorithm, called condensed filter tree (CFT), for optimizing any criterion in CSMLC. CFT is derived by reducing CSMLC to the well-known filter tree algorithm for cost-sensitive multi-class classification via the label powerset construction. We cope with the difficulty of having exponentially many extended classes within the powerset for representation, training, and prediction by carefully designing the tree structure and focusing on the key nodes. Experimental results across many real-world datasets validate that CFT is competitive with special-purpose algorithms on their targeted criteria and reaches better performance on general criteria.
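The label-powerset reduction the abstract refers to maps each binary label vector to a single extended class, so a cost-sensitive multi-class learner can be applied. The following is a minimal illustrative sketch of that encoding and of one evaluation criterion (Hamming loss) that the CSMLC setup covers; the function names are hypothetical and not from the paper, and the actual CFT algorithm additionally designs the tree structure to avoid enumerating all 2^K classes.

```python
# Illustrative sketch of the label-powerset construction (names are hypothetical).
# A multi-label vector y in {0,1}^K is encoded as one of 2^K extended classes,
# turning multi-label classification into multi-class classification.

def powerset_index(y):
    """Encode a binary label vector as an integer class index in [0, 2^K)."""
    idx = 0
    for bit in y:
        idx = (idx << 1) | bit
    return idx

def powerset_vector(idx, K):
    """Decode an extended-class index back into a K-bit label vector."""
    return [(idx >> (K - 1 - i)) & 1 for i in range(K)]

def hamming_cost(y_true, y_pred):
    """Hamming loss: one example of a criterion CSMLC can optimize."""
    return sum(a != b for a, b in zip(y_true, y_pred))
```

For example, with K = 3 labels the vector [1, 0, 1] corresponds to extended class 5, and decoding recovers the original vector; a naive powerset learner would need to score all 2^K such classes, which is the blow-up CFT is designed to avoid.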
