Entity Reference Resolution via Spreading Activation on RDF-Graphs

The use of natural language identifiers as reference for ontology elements—in addition to the URIs required by the Semantic Web standards—is of utmost importance because of their predominance in the human everyday life, i.e.speech or print media. Depending on the context, different names can be chosen for one and the same element, and the same element can be referenced by different names. Here homonymy and synonymy are the main cause of ambiguity in perceiving which concrete unique ontology element ought to be referenced by a specific natural language identifier describing an entity. We propose a novel method to resolve entity references under the aspect of ambiguity which explores only formal background knowledge represented in RDF graph structures. The key idea of our domain independent approach is to build an entity network with the most likely referenced ontology elements by constructing steiner graphs based on spreading activation. In addition to exploiting complex graph structures, we devise a new ranking technique that characterises the likelihood of entities in this network, i.e. interpretation contexts. Experiments in a highly polysemic domain show the ability of the algorithm to retrieve the correct ontology elements in almost all cases.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  Michalis Vazirgiannis,et al.  Word Sense Disambiguation with Spreading Activation Networks Generated from Thesauri , 2007, IJCAI.

[3]  Jorge García-Vidal,et al.  Wireless Systems and Mobility in Next Generation Internet, Third International Workshop of the EURO-NGI Network of Excellence, Sitges, Spain, June 6-9, 2006, Revised Selected Papers , 2007, EuroNGI Workshop.

[4]  Raphael Volz,et al.  Ontology based entity disambiguation with natural language patterns , 2009, 2009 Fourth International Conference on Digital Information Management.

[5]  A Min Tjoa,et al.  Word Sense Disambiguation as the Primary Step of Ontology Integration , 2008, DEXA.

[6]  Abraham Bernstein,et al.  The Semantic Web - ISWC 2009, 8th International Semantic Web Conference, ISWC 2009, Chantilly, VA, USA, October 25-29, 2009. Proceedings , 2009, SEMWEB.

[7]  Véronique Malaisé,et al.  Disambiguating automatic semantic annotation based on a thesaurus structure , 2007 .

[8]  Mehmet A. Orgun,et al.  AI 2007: Advances in Artificial Intelligence, 20th Australian Joint Conference on Artificial Intelligence, Gold Coast, Australia, December 2-6, 2007, Proceedings , 2007, Australian Conference on Artificial Intelligence.

[9]  M. Ross Quillian,et al.  A revised design for an understanding machine , 1962, Mech. Transl. Comput. Linguistics.

[10]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[11]  Tru H. Cao,et al.  Named entity disambiguation on an ontology enriched by Wikipedia , 2008, 2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing and Communication Technologies.

[12]  Tru H. Cao,et al.  A Knowledge-Based Approach to Named Entity Disambiguation in News Articles , 2007, Australian Conference on Artificial Intelligence.

[13]  Ismailcem Budak Arpinar,et al.  Ontology-Driven Automatic Entity Disambiguation in Unstructured Text , 2006, SEMWEB.

[14]  Rada Mihalcea,et al.  Unsupervised Graph-basedWord Sense Disambiguation Using Measures of Word Semantic Similarity , 2007 .

[15]  Ansgar Bernardi,et al.  IdentityRank: Named Entity Disambiguation in the Context of the NEWS Project , 2007, ESWC.

[16]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[17]  Philip S. Yu,et al.  BLINKS: ranked keyword searches on graphs , 2007, SIGMOD '07.

[18]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[19]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[20]  Md Maruf Hasan,et al.  A Spreading Activation Framework for Ontology-Enhanced Adaptive Information Access within Organisations , 2003, AMKM.

[21]  Lise Getoor,et al.  Entity Resolution in Graphs , 2005 .

[22]  Bruno Pouliquen,et al.  Multilingual and cross-lingual news topic tracking , 2004, COLING.

[23]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[24]  Rada Mihalcea,et al.  Unsupervised graph-based word sense disambiguation , 2009 .

[25]  Andreas Abecker,et al.  Agent-Mediated Knowledge Management , 2004, Lecture Notes in Computer Science.

[26]  Raphael Volz,et al.  Towards Ontology-based Disambiguation of Geographical Identifiers , 2007, I3.

[27]  Amit P. Sheth,et al.  Context and Domain Knowledge Enhanced Entity Spotting in Informal Text , 2009, SEMWEB.

[28]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[29]  Nancy Ide,et al.  Word Sense Disambiguation with Very Large Neural Networks Extracted from Machine Readable Dictionaries , 1990, COLING.

[30]  Haofen Wang,et al.  Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[31]  S. Sudarshan,et al.  Bidirectional Expansion For Keyword Search on Graph Databases , 2005, VLDB.

[32]  Norberto Fernández García,et al.  Semantic Annotation of Web Resources Using IdentityRank and Wikipedia , 2007, AWIC.

[33]  S. Sudarshan,et al.  Keyword searching and browsing in databases using BANKS , 2002, Proceedings 18th International Conference on Data Engineering.

[34]  Dean Allemang,et al.  The Semantic Web - ISWC 2006, 5th International Semantic Web Conference, ISWC 2006, Athens, GA, USA, November 5-9, 2006, Proceedings , 2006, SEMWEB.

[35]  John R. Anderson A Spreading Activation Theory of Memory , 1988 .

[36]  Andrew McCallum,et al.  An Entity Based Model for Coreference Resolution , 2009, SDM.