Combining a co-occurrence-based and a semantic measure for entity linking

One key feature of the Semantic Web lies in the ability to link related Web resources. However, while relations within particular datasets are often well-defined, links between disparate datasets and corpora of Web resources are rare. The increasingly widespread use of cross-domain reference datasets, such as Freebase and DBpedia for annotating and enriching datasets as well as documents, opens up opportunities to exploit their inherent semantic relationships to align disparate Web resources. In this paper, we present a combined approach to uncover relationships between disparate entities which exploits (a) graph analysis of reference datasets together with (b) entity co-occurrence on the Web with the help of search engines. In (a), we introduce a novel approach adopted and applied from social network theory to measure the connectivity between given entities in reference datasets. The connectivity measures are used to identify connected Web resources. Finally, we present a thorough evaluation of our approach using a publicly available dataset and introduce a comparison with established measures in the field.

[1]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[2]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[3]  Amit P. Sheth,et al.  Relationship Web: Blazing Semantic Trails between Web Resources , 2007, IEEE Internet Computing.

[4]  Amit P. Sheth,et al.  SemRank: ranking complex relationship search results on the semantic web , 2005, WWW '05.

[5]  Andrzej Sieminski Fast algorithm for assessing semantic similarity of texts , 2012, Int. J. Intell. Inf. Database Syst..

[6]  Amit P. Sheth,et al.  Ρ-Queries: enabling querying for semantic associations on the semantic web , 2003, WWW '03.

[7]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[8]  Pabitra Mitra,et al.  Feature weighting in content based recommendation system using social network analysis , 2008, WWW.

[9]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[10]  Yannis Stavrakas,et al.  Exploiting the Social and Semantic Web for Guided Web Archiving , 2012, TPDL.

[11]  James A. Hendler,et al.  A Method to Rank Nodes in an RDF Graph , 2008, International Semantic Web Conference.

[12]  Wolfgang Nejdl,et al.  Can Entities be Friends? , 2012, WoLE@ISWC.

[13]  Deborah L. McGuinness,et al.  When owl: sameAs Isn't the Same: An Analysis of Identity in Linked Data , 2010, SEMWEB.

[14]  Seong-Bae Park,et al.  Ranking Entities Similar to an Entity for a Given Relationship , 2010, PRICAI.

[15]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[16]  Alexandre Passant,et al.  Measuring Semantic Distance on Linking Data and Using it for Resources Recommendations , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[17]  Jens Lehmann,et al.  Discovering Unknown Connections - the DBpedia Relationship Finder , 2007, CSSW.

[18]  Cong Yu,et al.  REX: Explaining Relationships between Entity Pairs , 2011, Proc. VLDB Endow..

[19]  Frank van Harmelen,et al.  Using Google distance to weight approximate ontology matches , 2007, WWW '07.

[20]  Enrico Motta,et al.  Relation Discovery from the Semantic Web , 2008, International Semantic Web Conference.

[21]  François Scharffe,et al.  Data Linking for the Semantic Web , 2011, Int. J. Semantic Web Inf. Syst..

[22]  Alexandre Passant,et al.  dbrec - Music Recommendations Using DBpedia , 2010, SEMWEB.

[23]  Milan Stankovic,et al.  Linked Data-Based Concept Recommendation: Comparison of Different Methods in Open Innovation Scenario , 2012, ESWC.

[24]  Won-Kyung Sung,et al.  Efficient Finding Relationship between Individuals in a Mass Ontology Database , 2011, FGIT-UNESST.

[25]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[26]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[27]  Enrico Motta,et al.  Exploring the Semantic Web as Background Knowledge for Ontology Matching , 2008, J. Data Semant..

[28]  Aristides Gionis,et al.  Estimating entity importance via counting set covers , 2012, KDD.

[29]  Daniela Giordano,et al.  Linked education: interlinking educational resources and the Web of data , 2012, SAC '12.