Learning Embeddings to lexicalise RDF Properties

A difficult task when generating text from knowledge bases (KB) consists in finding appropriate lexicalisations for KB symbols. We present an approach for lexicalis-ing knowledge base relations and apply it to DBPedia data. Our model learns low-dimensional embeddings of words and RDF resources and uses these representations to score RDF properties against candidate lexicalisations. Training our model using (i) pairs of RDF triples and automatically generated verbalisations of these triples and (ii) pairs of paraphrases extracted from various resources, yields competitive results on DBPedia data.

[1]  Philipp Cimiano,et al.  Linking Lexical Resources and Ontologies on the Semantic Web with Lemon , 2011, ESWC.

[2]  Ellen Riloff,et al.  Automatically Generating Extraction Patterns from Untagged Text , 1996, AAAI/IAAI, Vol. 2.

[3]  Jason Weston,et al.  Open Question Answering with Weakly Supervised Embedding Models , 2014, ECML/PKDD.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Chris Callison-Burch,et al.  Paraphrasing with Bilingual Parallel Corpora , 2005, ACL.

[6]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[7]  Robert Stevens,et al.  OWL Pizzas: Practical Experience of Teaching OWL-DL: Common Errors & Common Patterns , 2004, EKAW.

[8]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[9]  Philipp Cimiano,et al.  ATOLL - A framework for the automatic induction of ontology lexica , 2014, Data Knowl. Eng..

[10]  Philipp Cimiano,et al.  M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages , 2014, International Semantic Web Conference.

[11]  Oren Etzioni,et al.  Open Information Extraction: The Second Generation , 2011, IJCAI.

[12]  Philipp Cimiano,et al.  A Corpus-Based Approach for the Induction of Ontology Lexica , 2013, NLDB.

[13]  Estevam R. Hruschka,et al.  Discovering Relations between Noun Categories , 2011, EMNLP.

[14]  Stephen Soderland,et al.  Learning Information Extraction Rules for Semi-Structured and Free Text , 1999, Machine Learning.

[15]  Gerhard Weikum,et al.  Discovering and Exploring Relations on the Web , 2012, Proc. VLDB Endow..

[16]  Oren Etzioni,et al.  Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.