Paper Information - Learning Condensed Feature Representations from Large Unsupervised Data Sets for Supervised Learning

This paper proposes a novel approach for effectively utilizing unsupervised data in addition to supervised data for supervised learning. We use unsupervised data to generate informative "condensed feature representations" from the original feature set used in supervised NLP systems. The main contribution of our method is that it offers dense, low-dimensional feature spaces for NLP tasks while maintaining the state-of-the-art performance of a recently developed high-performance semi-supervised learning technique. Our method matches the results of current state-of-the-art systems with very few features: an F-score of 90.72 with 344 features on the CoNLL-2003 NER data, and an unlabeled attachment score (UAS) of 93.55 with 12.5K features on dependency parsing data derived from PTB-III.

Masaaki Nagata | Jun Suzuki | Hideki Isozaki
