Combining a Co-occurrence-Based and a Semantic Measure for Entity Linking

One key feature of the Semantic Web lies in the ability to link related Web resources. However, while relations within particular datasets are often well-defined, links between disparate datasets and corpora of Web resources are rare. The increasingly widespread use of cross-domain reference datasets, such as Freebase and DBpedia for annotating and enriching datasets as well as documents, opens up opportunities to exploit their inherent semantic relationships to align disparate Web resources. In this paper, we present a combined approach to uncover relationships between disparate entities which exploits (a) graph analysis of reference datasets together with (b) entity co-occurrence on the Web with the help of search engines. In (a), we introduce a novel approach adopted and applied from social network theory to measure the connectivity between given entities in reference datasets. The connectivity measures are used to identify connected Web resources. Finally, we present a thorough evaluation of our approach using a publicly available dataset and introduce a comparison with established measures in the field.

[1]  Andrzej Sieminski Fast algorithm for assessing semantic similarity of texts , 2012, Int. J. Intell. Inf. Database Syst..

[2]  Deborah L. McGuinness,et al.  When owl: sameAs Isn't the Same: An Analysis of Identity in Linked Data , 2010, SEMWEB.

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Seong-Bae Park,et al.  Ranking Entities Similar to an Entity for a Given Relationship , 2010, PRICAI.

[5]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[6]  Frank van Harmelen,et al.  Using Google distance to weight approximate ontology matches , 2007, WWW '07.

[7]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[8]  Yannis Stavrakas,et al.  Exploiting the Social and Semantic Web for Guided Web Archiving , 2012, TPDL.

[9]  Alexandre Passant,et al.  dbrec - Music Recommendations Using DBpedia , 2010, SEMWEB.

[10]  Enrico Motta,et al.  Relation Discovery from the Semantic Web , 2008, International Semantic Web Conference.

[11]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[12]  François Scharffe,et al.  Data Linking for the Semantic Web , 2011, Int. J. Semantic Web Inf. Syst..

[13]  Amit P. Sheth,et al.  Relationship Web: Blazing Semantic Trails between Web Resources , 2007, IEEE Internet Computing.

[14]  Aristides Gionis,et al.  Estimating entity importance via counting set covers , 2012, KDD.

[15]  Milan Stankovic,et al.  Linked Data-Based Concept Recommendation: Comparison of Different Methods in Open Innovation Scenario , 2012, ESWC.

[16]  Daniela Giordano,et al.  Linked education: interlinking educational resources and the Web of data , 2012, SAC '12.

[17]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[18]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[19]  Amit P. Sheth,et al.  Ρ-Queries: enabling querying for semantic associations on the semantic web , 2003, WWW '03.

[20]  Won-Kyung Sung,et al.  Efficient Finding Relationship between Individuals in a Mass Ontology Database , 2011, FGIT-UNESST.

[21]  Cong Yu,et al.  REX: Explaining Relationships between Entity Pairs , 2011, Proc. VLDB Endow..

[22]  Jens Lehmann,et al.  Discovering Unknown Connections - the DBpedia Relationship Finder , 2007, CSSW.

[23]  Amit P. Sheth,et al.  SemRank: ranking complex relationship search results on the semantic web , 2005, WWW '05.

[24]  Enrico Motta,et al.  Exploring the Semantic Web as Background Knowledge for Ontology Matching , 2008, J. Data Semant..

[25]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[26]  Pabitra Mitra,et al.  Feature weighting in content based recommendation system using social network analysis , 2008, WWW.

[27]  Alexandre Passant,et al.  Measuring Semantic Distance on Linking Data and Using it for Resources Recommendations , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[28]  James A. Hendler,et al.  A Method to Rank Nodes in an RDF Graph , 2008, International Semantic Web Conference.

[29]  Wolfgang Nejdl,et al.  Can Entities be Friends? , 2012, WoLE@ISWC.

[30]  Jakub Simko,et al.  Data linking for the Semantic Web , 2015 .