A Survey On Neural Word Embeddings

Understanding human language has long been a core challenge on the path toward intelligent machines. The study of meaning in natural language processing (NLP) rests on the distributional hypothesis, according to which language elements derive their meaning from the words that co-occur with them in context. The revolutionary idea of distributed representation mirrors the workings of the human mind in that the meaning of a concept is spread across many neurons, so a loss of activation only slightly degrades memory retrieval. Neural word embeddings transformed the field of NLP by yielding substantial improvements across a wide range of NLP tasks. In this survey, we provide a comprehensive literature review of neural word embeddings. We lay out the theoretical foundations and organize existing work around the interplay between word embeddings and language modeling. We cover neural word embeddings broadly, including early word embeddings, embeddings targeting specific semantic relations, sense embeddings, morpheme embeddings, and, finally, contextual representations. We conclude by describing the benchmark datasets used to evaluate word embeddings, as well as downstream tasks and the performance gains attributable to word embeddings.
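To make the distributional hypothesis concrete, the sketch below trains skip-gram word vectors on a toy corpus and compares words by cosine similarity. This is a minimal illustration only, assuming the gensim 4.x library; the corpus and hyperparameters are chosen purely for demonstration, not as recommendations.

```python
# Minimal sketch: skip-gram embeddings on a toy corpus (illustrative only).
from gensim.models import Word2Vec

# Tiny corpus: "cat" and "dog" share contexts ("sat", "chased"), so the
# distributional hypothesis predicts their vectors should end up close.
corpus = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
    ["a", "cat", "chased", "a", "mouse"],
    ["a", "dog", "chased", "a", "ball"],
]

# sg=1 selects the skip-gram objective; vector_size, window, and epochs
# are toy settings for a corpus this small.
model = Word2Vec(corpus, vector_size=50, window=2, min_count=1, sg=1, epochs=200)

print(model.wv.similarity("cat", "dog"))     # cosine similarity of the two vectors
print(model.wv.most_similar("cat", topn=3))  # nearest neighbors in the vector space
```

On such a small corpus the similarities are noisy, but the mechanism — words acquiring similar vectors because they occur in similar contexts — is the same one that large-scale embedding methods such as word2vec and GloVe exploit.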
