Paper Information - Translating Embeddings for Modeling Multi-relational Data

Translating Embeddings for Modeling Multi-relational Data

We consider the problem of embedding entities and relationships of multi-relational data in low-dimensional vector spaces. Our objective is to propose a canonical model that is easy to train, contains a reduced number of parameters, and can scale up to very large databases. Hence, we propose TransE, a method that models relationships by interpreting them as translations operating on the low-dimensional embeddings of the entities. Despite its simplicity, this assumption proves to be powerful, since extensive experiments show that TransE significantly outperforms state-of-the-art methods in link prediction on two knowledge bases. Moreover, it can be successfully trained on a large-scale data set with 1M entities, 25k relationships, and more than 17M training samples.
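The translation idea in the abstract can be illustrated with a minimal sketch: a triple (head, relation, tail) is treated as plausible when the head embedding plus the relation vector lands close to the tail embedding. The function names `translation_distance` and `rank_tails`, the choice of the L2 norm, and the toy 2-dimensional vectors are assumptions made for illustration only. The vectors are hand-picked rather than learned, and this is not the authors' implementation.

```python
import math


def translation_distance(head, relation, tail):
    """L2 distance between (head + relation) and tail; lower means more plausible."""
    return math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))


# Toy 2-dimensional embeddings, hand-picked for illustration (not learned).
entities = {
    "Paris": [1.0, 2.0],
    "France": [2.0, 4.0],
    "Berlin": [3.0, 1.0],
    "Germany": [4.0, 3.0],
}

# Hypothetical relation vector: translating a capital lands on its country.
capital_of = [1.0, 2.0]


def rank_tails(head_name, relation):
    """Rank every entity as a candidate tail, closest to head + relation first."""
    head = entities[head_name]
    scored = [
        (translation_distance(head, relation, vector), name)
        for name, vector in entities.items()
    ]
    return [name for _, name in sorted(scored)]


if __name__ == "__main__":
    # Link prediction by nearest neighbour of head + relation.
    print(rank_tails("Paris", capital_of)[0])
    print(rank_tails("Berlin", capital_of)[0])
```

In this sketch link prediction reduces to a nearest-neighbour search around head + relation, which is why each relation needs only a single vector of parameters.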
