A Comparative Analysis of Supervised Multi-label Text Classification Methods

Multi-label classification methods are getting more popular now a days because of their increasing demand in various application domains such as text classification , image classification , functional genomics , music categorization , emotion recognition etc. Multi-label classification methods are falling under two broader categories of problem transformation methods and algorithm adaptation methods. From machine learning perspective both of these types are working under the roof of supervised classification methods wherein the labels are already provided in the training data set. An attempt is made through this paper to present the state of the art supervised text classification techniques and there comparison. The paper also discusses the important results reported so far in text classification domain and also tried to highlight the beneficial directions of the research till date. The experiments are conducted on standard bench mark datasets such as Enron, Bibtex and Slashdot. Moreover, the paper also contains a comprehensive bibliography of selected papers appeared in reputed journals and conference proceedings as an aid for the researchers working in the field of multi-label classification domain.

[1]  Geoff Holmes,et al.  Multi-label Classification Using Ensembles of Pruned Sets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[2]  Grigorios Tsoumakas,et al.  Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[3]  Min-Ling Zhang,et al.  Ml-rbf: RBF Neural Networks for Multi-Label Learning , 2009, Neural Processing Letters.

[4]  Grigorios Tsoumakas,et al.  Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[5]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[6]  Naonori Ueda,et al.  Parametric Mixture Models for Multi-Labeled Text , 2002, NIPS.

[7]  Antonino Feitosa Neto,et al.  A Comparative Analysis of Classification Methods to Multi-label Tasks in Different Application Domains , 2011 .

[8]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[9]  Grigorios Tsoumakas,et al.  Multilabel Text Classification for Automated Tag Suggestion , 2008 .

[10]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[11]  Andrew McCallum,et al.  Collective multi-label classification , 2005, CIKM '05.

[12]  Saso Dzeroski,et al.  Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics , 2006, PKDD.

[13]  Jufu Feng,et al.  Parallel and Sequential Support Vector Machines for Multi-label Classification , 2005 .

[14]  Renato R. O. da Silva,et al.  Comparing Methods for Multilabel Classification of Proteins Using Machine Learning Techniques , 2009, BSB.

[15]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[16]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[17]  Shenghuo Zhu,et al.  Empirical Studies on Multi-label Classification , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[18]  Claudio Gentile,et al.  Incremental Algorithms for Hierarchical Classification , 2004, J. Mach. Learn. Res..

[19]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[20]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[21]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[22]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.