Learning a Cross-Lingual Semantic Representation of Relations Expressed in Text

Learning cross-lingual semantic representations of relations from textual data is useful for tasks like cross-lingual information retrieval and question answering. So far, research has been mainly focused on cross-lingual entity linking, which is confined to linking between phrases in a text document and their corresponding entities in a knowledge base but cannot link to relations. In this paper, we present an approach for inducing clusters of semantically related relations expressed in text, where relation clusters i can be extracted from text of different languages, ii are embedded in a semantic representation of the context, and iii can be linked across languages to properties in a knowledge base. This is achieved by combining multi-lingual semantic role labeling SRL with cross-lingual entity linking followed by spectral clustering of the annotated SRL graphs. With our initial implementation we learned a cross-lingual lexicon of relation expressions from English and Spanish Wikipedia articles. To demonstrate its usefulness we apply it to cross-lingual question answering over linked data.

[1]  Mark Steedman,et al.  Unsupervised Induction of Cross-Lingual Semantic Relations , 2013, EMNLP.

[2]  Ariadna Quattoni,et al.  Learning Task-specific Bilexical Embeddings , 2014, COLING.

[3]  Enrico Motta,et al.  SemSearch: A Search Engine for the Semantic Web , 2006, EKAW.

[4]  Amit P. Sheth,et al.  TcruziKB: Enabling Complex Queries for Genomic Data Exploration , 2008, 2008 IEEE International Conference on Semantic Computing.

[5]  Raffaella Bernardi,et al.  Acquiring Relational Patterns from Wikipedia : A Case Study , 2012 .

[6]  Hoifung Poon,et al.  Grounded Unsupervised Semantic Parsing , 2013, ACL.

[7]  Philipp Cimiano,et al.  M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages , 2014, International Semantic Web Conference.

[8]  Patrick Pantel,et al.  DIRT @SBT@discovery of inference rules from text , 2001, KDD '01.

[9]  Xavier Carreras,et al.  Joint Arc-factored Parsing of Syntactic and Semantic Dependencies , 2013, Transactions of the Association for Computational Linguistics.

[10]  Achim Rettinger,et al.  X-LiSA: Cross-lingual Semantic Annotation , 2014, Proc. VLDB Endow..

[11]  Philipp Cimiano,et al.  Towards portable natural language interfaces to knowledge bases - The case of the ORAKEL system , 2008, Data Knowl. Eng..

[12]  Axel-Cyrille Ngonga Ngomo,et al.  Extracting Multilingual Natural-Language Patterns for RDF Predicates , 2012, EKAW.

[13]  Horacio Rodríguez,et al.  WRPA: A System for Relational Paraphrase Acquisition from Wikipedia , 2010, Proces. del Leng. Natural.

[14]  James Fan,et al.  Large Scale Relation Detection , 2010, HLT-NAACL 2010.

[15]  Dekang Lin,et al.  DIRT – Discovery of Inference Rules from Text , 2001 .

[16]  Jitendra Malik,et al.  Contour and Texture Analysis for Image Segmentation , 2001, International Journal of Computer Vision.

[17]  Pierre Nugues,et al.  Ontology matching: from PropBank to DBpedia , 2012 .

[18]  Hamish Cunningham,et al.  FREyA: An Interactive Way of Querying Linked Data Using Natural Language , 2011, ESWC Workshops.

[19]  Fabien L. Gandon,et al.  QAKiS: an Open Domain QA System based on Relational Patterns , 2012, SEMWEB.

[20]  Gerhard Weikum,et al.  PATTY: A Taxonomy of Relational Patterns with Semantic Types , 2012, EMNLP.