Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations

Using a dictionary to map independently trained word embeddings to a shared space has been shown to be an effective approach for learning bilingual word embeddings. In this work, we propose a multi-step framework of linear transformations that generalizes a substantial body of previous work. The core step of the framework is an orthogonal transformation, and existing methods can be explained in terms of the additional normalization, whitening, re-weighting, de-whitening, and dimensionality reduction steps. This allows us to gain new insights into the behavior of existing methods, including the effectiveness of inverse regression, and to design a novel variant that obtains the best published results in zero-shot bilingual lexicon extraction. The corresponding software is released as an open source project.
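To make the framework steps concrete, the following is a minimal NumPy sketch of one plausible instantiation: length normalization, whitening, an SVD-based orthogonal mapping over a seed dictionary, singular-value re-weighting, and optional dimensionality reduction (de-whitening is omitted for brevity). All function and variable names (map_embeddings, src_emb, reweight_power, etc.) are illustrative placeholders, not identifiers from the paper or its released code.

```python
# Illustrative sketch of a multi-step linear mapping pipeline (not the paper's code).
import numpy as np

def length_normalize(m):
    """Normalize each embedding (row) to unit length."""
    norms = np.linalg.norm(m, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    return m / norms

def whitening_matrix(m):
    """Estimate a symmetric (ZCA-style) whitening transform from the rows of m."""
    cov = m.T @ m / m.shape[0]
    vals, vecs = np.linalg.eigh(cov)
    vals = np.clip(vals, 1e-12, None)
    return vecs @ np.diag(vals ** -0.5) @ vecs.T

def map_embeddings(src_emb, trg_emb, dict_pairs, reweight_power=0.5, keep_dims=None):
    """Map source and target embeddings into a shared space.

    dict_pairs: list of (source_index, target_index) seed dictionary entries.
    The core step is an orthogonal transformation obtained from the SVD of the
    cross-covariance of the whitened dictionary vectors; the other steps are
    the optional pre-/post-processing discussed in the abstract.
    """
    src_emb = length_normalize(src_emb)
    trg_emb = length_normalize(trg_emb)

    idx_s, idx_t = zip(*dict_pairs)
    xs, zt = src_emb[list(idx_s)], trg_emb[list(idx_t)]

    # Whitening, estimated on the dictionary entries only.
    w_src, w_trg = whitening_matrix(xs), whitening_matrix(zt)
    xs_w, zt_w = xs @ w_src, zt @ w_trg

    # Orthogonal mapping: SVD of the cross-covariance matrix.
    u, s, vt = np.linalg.svd(xs_w.T @ zt_w)

    # Re-weighting by the singular values raised to a tunable power.
    scale = np.diag(s ** reweight_power)

    src_mapped = src_emb @ w_src @ u @ scale
    trg_mapped = trg_emb @ w_trg @ vt.T @ scale

    # Optional dimensionality reduction: keep only the leading components.
    if keep_dims is not None:
        src_mapped, trg_mapped = src_mapped[:, :keep_dims], trg_mapped[:, :keep_dims]
    return src_mapped, trg_mapped
```

In a typical zero-shot lexicon-extraction setup, one would then retrieve, for each mapped source vector, its nearest neighbors among the mapped target vectors (e.g. by cosine similarity) to induce candidate translation pairs.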
