Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings

The number of word embedding models grows every year. Most of them are based on co-occurrence information between words and their contexts. However, it remains an open question which definition of context is best. We provide a systematic investigation of 4 different syntactic context types and context representations for learning word embeddings. Comprehensive experiments are conducted to evaluate their effectiveness on 6 extrinsic and intrinsic tasks. We hope that this paper, along with the published code, will be helpful for choosing the best context type and representation for a given task.
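To make the distinction between context types and context representations concrete, the sketch below (not the authors' released code; the example sentence and its dependency arcs are hand-written for illustration) extracts (target, context) training pairs in two ways: linear contexts from a fixed word window, and dependency contexts from parse arcs, in either an "unbound" (bare word) or "bound" (word tagged with its relation) representation.

```python
# Minimal sketch of two context definitions for word-embedding training pairs:
#   - linear contexts: neighbouring words within a fixed symmetric window
#   - dependency contexts: words linked to the target by a dependency arc,
#     either "unbound" (context word only) or "bound" (word + relation label)

def linear_contexts(tokens, window=2):
    """Yield (target, context) pairs from a symmetric word window."""
    for i, target in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                yield target, tokens[j]

def dependency_contexts(tokens, arcs, bound=True):
    """Yield (target, context) pairs from dependency arcs.

    arcs: list of (head_index, dependent_index, relation) triples.
    bound=True appends the relation label to the context word;
    bound=False uses the bare context word ("unbound" representation).
    """
    for head, dep, rel in arcs:
        h, d = tokens[head], tokens[dep]
        if bound:
            yield h, f"{d}/{rel}"
            yield d, f"{h}/{rel}-1"  # inverse-direction context for the dependent
        else:
            yield h, d
            yield d, h

# Toy example (hypothetical parse, indices into `tokens`):
tokens = ["australian", "scientist", "discovers", "star", "with", "telescope"]
arcs = [(2, 1, "nsubj"), (1, 0, "amod"), (2, 3, "dobj"), (3, 5, "prep_with")]

print(list(linear_contexts(tokens, window=1)))
print(list(dependency_contexts(tokens, arcs, bound=True)))
```

The resulting pairs would then be fed to any co-occurrence-based embedding learner (e.g., a skip-gram-style model); only the context extraction step differs between the settings compared in the paper.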
