A Novel Graph Based Framework to build Multi Label Text Classifier

Text document is multifaceted object and associated with many properties such as multi labeledness. Under this a single text document can inherently belongs to more than one category simultaneously. Traditional single label and multi class text class ification paradigms cannot efficiently classify such multifaceted text corpus. Through our paper we are proposing a graph based frame work for Multi Label Text Classification paradigm. Representing text documents in the form of graph vertices rather than the vec tor representation like Bag of Words allows pre-computing and storing of necessary information. It also models the relationship between text documents and class labels. We are using semi supervised learning technique in our proposed approach for effectively utilizing labeled and unlabeled data for classification .Our proposed approach promises better classification accuracy and handling of complexity. Our propos ed framework is elaborated on the basis of standard dataset such as Enron, Slashdot, Bibtex and Reuters.

[1]  Antonino Feitosa Neto,et al.  A Comparative Analysis of Classification Methods to Multi-label Tasks in Different Application Domains , 2011 .

[2]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[3]  Yong Wang,et al.  Semi-supervised Multi-label Learning Algorithm Using Dependency Among Labels , 2011 .

[4]  Tao Mei,et al.  Graph-based semi-supervised learning with multi-label , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[5]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[6]  Gerhard Weikum,et al.  Graph-based text classification: learn from your neighbors , 2006, SIGIR.

[7]  Parag Kulkarni,et al.  Analysis of semi supervised learning methods towards multi label text classification , 2012 .

[8]  Grigorios Tsoumakas,et al.  Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[9]  Shih-Fu Chang,et al.  Graph construction and b-matching for semi-supervised learning , 2009, ICML '09.

[10]  Nello Cristianini,et al.  Scalable Corpus Annotation by Graph Construction and Label Propagation , 2012, ICPRAM.

[11]  Gang Chen,et al.  Semi-supervised Multi-label Learning by Solving a Sylvester Equation , 2008, SDM.

[12]  Yi Liu,et al.  Semi-supervised Multi-label Learning by Constrained Non-negative Matrix Factorization , 2006, AAAI.

[13]  Renato R. O. da Silva,et al.  Comparing Methods for Multilabel Classification of Proteins Using Machine Learning Techniques , 2009, BSB.

[14]  Parag Kulkarni,et al.  A Comparative Analysis of Supervised Multi-label Text Classification Methods , 2011 .

[15]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[16]  Partha Pratim Talukdar,et al.  Experiments in Graph-Based Semi-Supervised Learning Methods for Class-Instance Acquisition , 2010, ACL.

[17]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.