Collective multi-label classification

Common approaches to multi-label classification learn independent classifiers for each category and employ ranking or thresholding schemes for classification. Because they do not exploit dependencies between labels, such techniques are only well-suited to problems in which the categories are independent. In many domains, however, labels are highly interdependent. This paper explores multi-label conditional random field (CRF) classification models that directly parameterize label co-occurrences in multi-label classification. Experiments show that the models outperform their single-label counterparts on standard text corpora. Even when multi-labels are sparse, the models reduce subset classification error by as much as 40%.
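
As an illustrative sketch (the notation below is not taken from the paper itself), one way to parameterize label co-occurrences in a conditional model is a pairwise CRF over binary label indicators $y_1, \dots, y_L$ for a document $\mathbf{x}$:

\[
p(\mathbf{y} \mid \mathbf{x}) \;=\; \frac{1}{Z(\mathbf{x})} \exp\!\Big( \sum_{j,k} \lambda_{j,k}\, f_k(\mathbf{x})\, y_j \;+\; \sum_{j < j'} \mu_{j,j'}\, y_j\, y_{j'} \Big),
\]

where the $f_k(\mathbf{x})$ are document features, the weights $\lambda_{j,k}$ score feature–label associations, the weights $\mu_{j,j'}$ score label co-occurrence, and $Z(\mathbf{x})$ normalizes over all label subsets. This is only a generic pairwise parameterization meant to convey the idea; the paper's own model definitions should be consulted for the exact feature and factor structure.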
