Collective Document Classification with Implicit Inter-document Semantic Relationships

This paper addresses the question of how document classifiers can exploit implicit information about document similarity to improve document classifier accuracy. We infer document similarity using simple n-gram overlap, and demonstrate that this improves overall document classification performance over two datasets. As part of this, we find that collective classification based on simple iterative classifiers outperforms the more complex and computationally-intensive dual classifier approach.

[1]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[2]  Stephen J. Wright,et al.  Dissimilarity in Graph-Based Semi-Supervised Classification , 2007, AISTATS.

[3]  S. T. Buckland,et al.  Computer-Intensive Methods for Testing Hypotheses. , 1990 .

[4]  S. Kotsiantis,et al.  Discretization Techniques: A recent survey , 2006 .

[5]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[6]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[7]  Kalyan Moy Gupta,et al.  Case-Based Collective Classification , 2007, FLAIRS.

[8]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[9]  Yiming Yang,et al.  A Study of Approaches to Hypertext Categorization , 2002, Journal of Intelligent Information Systems.

[10]  Veselin Stoyanov,et al.  Minimum-Risk Training of Approximate CRF-Based NLP Systems , 2012, NAACL.

[11]  Wei-Hao Lin,et al.  Which Side are You on? Identifying Perspectives at the Document and Sentence Levels , 2006, CoNLL.

[12]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[13]  Clinton Burford,et al.  Collective document classification using explicit and implicit inter-document relationships , 2013 .

[14]  Matt Thomas,et al.  Get out the vote: Determining support or opposition from Congressional floor-debate transcripts , 2006, EMNLP.

[15]  C. Lee Giles,et al.  CiteSeer: an automatic citation indexing system , 1998, DL '98.

[16]  M. Opper,et al.  Comparing the Mean Field Method and Belief Propagation for Approximate Inference in MRFs , 2001 .

[17]  Kalyan Moy Gupta,et al.  Cautious Collective Classification , 2009, J. Mach. Learn. Res..

[18]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[19]  Timothy Baldwin,et al.  Collective Classification of Congressional Floor-Debate Transcripts , 2011, ACL.

[20]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[21]  David Jurgens,et al.  That's What Friends Are For: Inferring Location in Online Social Media Platforms Based on Social Relationships , 2013, ICWSM.

[22]  Ramakrishnan Srikant,et al.  Mining newsgroups using networks arising from social behavior , 2003, WWW '03.

[23]  Dirk Hovy,et al.  More or less supervised supersense tagging of Twitter , 2014, *SEMEVAL.

[24]  Vikas Sindhwani,et al.  Document-Word Co-regularization for Semi-supervised Sentiment Analysis , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[25]  Massimiliano Ciaramita,et al.  Supersense Tagging of Unknown Nouns in WordNet , 2003, EMNLP.

[26]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[27]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[28]  Avrim Blum,et al.  Learning from Labeled and Unlabeled Data using Graph Mincuts , 2001, ICML.

[29]  Takashi Inui,et al.  Extracting Semantic Orientations of Phrases from Dictionary , 2007, NAACL.

[30]  Sung-Hyon Myaeng,et al.  A practical hypertext catergorization method using links and incrementally available class information , 2000, SIGIR '00.

[31]  Beata Beigman Klebanov,et al.  Vocabulary Choice as an Indicator of Perspective , 2010, ACL.

[32]  Koby Crammer,et al.  Confidence-weighted linear classification , 2008, ICML '08.

[33]  William T. Freeman,et al.  Constructing free-energy approximations and generalized belief propagation algorithms , 2005, IEEE Transactions on Information Theory.

[34]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[35]  Mark Craven,et al.  Combining Statistical and Relational Methods for Learning in Hypertext Domains , 1998, ILP.

[36]  Lise Getoor,et al.  Opinion Graphs for Polarity and Discourse Classification , 2009, Graph-based Methods for Natural Language Processing.

[37]  L. Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.