Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources. Attract-Repel facilitates the use of constraints from mono- and cross-lingual resources, yielding semantically specialised cross-lingual vector spaces. Our evaluation shows that the method can make use of existing cross-lingual lexicons to construct high-quality vector spaces for a plethora of different languages, facilitating semantic transfer from high- to lower-resource ones. The effectiveness of our approach is demonstrated with state-of-the-art results on semantic similarity datasets in six languages. We next show that Attract-Repel-specialised vectors boost performance in the downstream task of dialogue state tracking (DST) across multiple languages. Finally, we show that cross-lingual vector spaces produced by our algorithm facilitate the training of multilingual DST models, which brings further performance improvements.
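
The idea of injecting lexical constraints into a vector space can be illustrated with a toy sketch. The abstract does not give the actual Attract-Repel objective, so the update rule below is an assumption made purely for illustration: pairs listed as "attract" constraints (for example synonyms or cross-lingual translations) are pulled together, and pairs listed as "repel" constraints (for example antonyms) are pushed apart. The words, the language prefixes, the 2-dimensional vectors, and the `specialise` function are all hypothetical.

```python
import math


def normalise(v):
    """Scale a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]


def cosine(u, v):
    """Cosine similarity between two vectors."""
    u, v = normalise(u), normalise(v)
    return sum(a * b for a, b in zip(u, v))


def specialise(vectors, attract, repel, lr=0.1, steps=20):
    """Toy constraint injection (an assumed update rule, not the paper's).

    Each attract pair takes a small step towards each other and each
    repel pair takes a small step away from each other; vectors are
    re-normalised after every update.
    """
    vecs = {w: normalise(v) for w, v in vectors.items()}
    for _ in range(steps):
        for a, b in attract:
            va, vb = vecs[a], vecs[b]
            vecs[a] = normalise([x + lr * (y - x) for x, y in zip(va, vb)])
            vecs[b] = normalise([y + lr * (x - y) for x, y in zip(va, vb)])
        for a, b in repel:
            va, vb = vecs[a], vecs[b]
            vecs[a] = normalise([x - lr * (y - x) for x, y in zip(va, vb)])
            vecs[b] = normalise([y - lr * (x - y) for x, y in zip(va, vb)])
    return vecs


# Hypothetical toy data: an English word, its Italian translation,
# and an English antonym that starts out distributionally close.
TOY_VECTORS = {
    "en_cheap": [1.0, 0.0],
    "it_economico": [0.3, 1.0],
    "en_expensive": [0.98, -0.2],
}
ATTRACT = [("en_cheap", "it_economico")]   # cross-lingual synonym pair
REPEL = [("en_cheap", "en_expensive")]     # monolingual antonym pair

specialised = specialise(TOY_VECTORS, ATTRACT, REPEL)
for a, b in ATTRACT + REPEL:
    before = cosine(TOY_VECTORS[a], TOY_VECTORS[b])
    after = cosine(specialised[a], specialised[b])
    print(f"{a} / {b}: {before:.3f} -> {after:.3f}")
```

In this sketch the cross-lingual attract pair is what ties the two languages into one shared space, while the repel pair separates an antonym that the starting vectors place close to its opposite.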

Diarmuid Ó Séaghdha | S. Young | A. Korhonen | Roi Reichart | N. Mrksic | Ivan Vulic | Milica Gasic | Ira Leviant
