Word Representations: A Simple and General Method for Semi-Supervised Learning
暂无分享,去创建一个
[1] S. T. Dumais,et al. Using latent semantic analysis to improve access to textual information , 1988, CHI '88.
[2] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[3] Naftali Tishby,et al. Distributional Clustering of English Words , 1993, ACL.
[4] J. Elman. Learning and development in neural networks: the importance of starting small , 1993, Cognition.
[5] Timo Honkela,et al. Contextual Relations of Words in Grimm Tales, Analyzed by Self-Organizing Map , 1995 .
[6] Hermann Ney,et al. Algorithms for bigram and trigram word clustering , 1995, Speech Commun..
[7] Curt Burgess,et al. Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .
[8] Akira Ushioda,et al. Hierarchical Clustering of Words , 1996, COLING.
[9] T. Honkela. Self-Organizing Maps of Words for Natural Language Processing Applications , 1997 .
[10] Samuel Kaski,et al. Dimensionality reduction by random mapping: fast similarity computation for clustering , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).
[11] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..
[12] Sabine Buchholz,et al. Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.
[13] Magnus Sahlgren,et al. Vector-based semantic analysis: representing word meanings based on random labels , 2001 .
[14] Jean-Luc Gauvain,et al. Connectionist language modeling for large vocabulary continuous speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[15] Tong Zhang,et al. A Robust Risk Minimization based Named Entity Recognition System , 2003, CoNLL.
[16] Blockin Blockin. Quick Training of Probabilistic Neural Nets by Importance Sampling , 2003 .
[17] Fernando Pereira,et al. Shallow Parsing with Conditional Random Fields , 2003, NAACL.
[18] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[19] Jaakko J. Väyrynen,et al. Word Category Maps based on Emergent Features Created by ICA , 2004 .
[20] T. Kohonen,et al. Self-organizing semantic maps , 1989, Biological Cybernetics.
[21] Scott Miller,et al. Name Tagging with Word Clusters and Discriminative Training , 2004, NAACL.
[22] Percy Liang,et al. Semi-Supervised Learning for Natural Language , 2005 .
[23] Magnus Sahlgren,et al. An Introduction to Random Indexing , 2005 .
[24] Jaakko J. Väyrynen,et al. Comparison of Independent Component Analysis and Singular Value Decomposition in Word Context Analysis , 2005 .
[25] Tong Zhang,et al. A High-Performance Semi-Supervised Learning Method for Text Chunking , 2005, ACL.
[26] Wei Li,et al. Semi-Supervised Sequence Modeling with Syntactic Topic Models , 2005, AAAI.
[27] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.
[28] Magnus Sahlgren,et al. The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .
[29] Christopher D. Manning,et al. An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition , 2006, ACL.
[30] Timo Honkela,et al. Towards explicit semantic features using independent component analysis , 2007 .
[31] Geoffrey E. Hinton,et al. Three new graphical models for statistical language modelling , 2007, ICML '07.
[32] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.
[33] Xavier Carreras,et al. Simple Semi-supervised Dependency Parsing , 2008, ACL.
[34] Jun Suzuki,et al. Semi-Supervised Sequential Labeling and Segmentation Using Giga-Word Scale Unlabeled Data , 2008, ACL.
[35] Yoshua Bengio,et al. Neural net language models , 2008, Scholarpedia.
[36] Geoffrey E. Hinton,et al. A Scalable Hierarchical Distributed Language Model , 2008, NIPS.
[37] Joseph P. Turian. A preliminary evaluation of word representations for named-entity recognition , 2009 .
[38] Patrick F. Reidy. An Introduction to Latent Semantic Analysis , 2009 .
[39] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[40] Marie Candito,et al. Improving generative statistical parsing with semi-supervised word clustering , 2009, IWPT.
[41] Hai Zhao,et al. Multilingual Dependency Learning: A Huge Feature Engineering Method to Semantic Dependency Parsing , 2009, CoNLL Shared Task.
[42] Dan Roth,et al. Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.
[43] Marie-Francine Moens,et al. Semi-supervised Semantic Role Labeling Using the Latent Words Language Model , 2009, EMNLP.
[44] Xavier Carreras,et al. An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing , 2009, EMNLP.
[45] Dekang Lin,et al. Phrase Clustering for Discriminative Learning , 2009, ACL.
[46] Alexander Yates,et al. Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling , 2009, ACL.
[47] Reut Tsarfaty,et al. Enhancing Unlexicalized Parsing Performance Using a Wide Coverage Lexicon, Fuzzy Tag-Set Mapping, and EM-HMM-Based Lexical Probabilities , 2009, EACL.
[48] Patrick Pantel,et al. From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..
[49] Valentin I. Spitkovsky,et al. From Baby Steps to Leapfrog: How “Less is More” in Unsupervised Dependency Parsing , 2010, NAACL.
[50] Petr Sojka,et al. Software Framework for Topic Modelling with Large Corpora , 2010 .