Semi-supervised condensed nearest neighbor for part-of-speech tagging

This paper introduces a new training set condensation technique designed for mixtures of labeled and unlabeled data. It finds a condensed set of labeled and unlabeled data points, typically smaller than what is obtained using condensed nearest neighbor on the labeled data only, and improves classification accuracy. We evaluate the algorithm on semi-supervised part-of-speech tagging and present the best published result on the Wall Street Journal data set.

[1]  Xavier Carreras,et al.  An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing , 2009, EMNLP.

[2]  Fabrizio Angiulli,et al.  Fast condensed nearest neighbor rule , 2005, ICML.

[3]  Christian Biemann,et al.  Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering , 2006, ACL.

[4]  Jan Hajic,et al.  Semi-Supervised Training for the Averaged Perceptron POS Tagger , 2009, EACL.

[5]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Anders Søgaard,et al.  Simple Semi-Supervised Training of Part-Of-Speech Taggers , 2010, ACL.

[7]  M. I. Jordan Leo Breiman , 2011, 1101.0929.

[8]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[9]  ci UniversityTR Voting over Multiple Condensed Nearest Neighbors , 1995 .

[10]  G. Gates The Reduced Nearest Neighbor Rule , 1998 .

[11]  Sandra Kübler,et al.  Semi-Supervised Learning for Word Sense Disambiguation: Quality vs. Quantity , 2009, RANLP.

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[14]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[15]  Steven Abney,et al.  Semisupervised Learning for Computational Linguistics , 2007 .

[16]  Gordon T. Wilfong Nearest neighbor problems , 1991, SCG '91.

[17]  Lluís Màrquez i Villodre,et al.  SVMTool: A general POS Tagger Generator Based on Support Vector Machines , 2004, LREC.

[18]  Walter Daelemans,et al.  MBT: A Memory-Based Part of Speech Tagger-Generator , 1996, VLC@COLING.

[19]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[20]  Walter Daelemans,et al.  Forgetting Exceptions is Harmful in Language Learning , 1998, Machine Learning.