On the relation between keys and link keys for data interlinking

Both keys and their generalisation, link keys, may be used to perform data interlinking, i.e. finding identical resources in different RDF datasets. However, the precise relationship between keys and link keys has not been fully determined yet. A common formal framework encompassing both keys and link keys is necessary to ensure the correctness of data interlinking tools based on them, and to determine their scope and possible overlapping. In this paper, we provide a semantics for keys and link keys within description logics. We determine under which conditions they are legitimate to generate links. We provide conditions under which link keys are logically equivalent to keys. In particular, we show that data interlinking with keys and ontology alignments can be reduced to data interlinking with link keys, but not the other way around.

[1]  Robert Isele,et al.  Efficient Multidimensional Blocking for Link Discovery without losing Recall , 2011, WebDB.

[2]  Jérôme Euzenat,et al.  Tableau extensions for reasoning with link keys , 2016, OM@ISWC.

[3]  Markus Nentwig,et al.  A survey of current Link Discovery frameworks , 2016, Semantic Web.

[4]  Jérôme David,et al.  Data interlinking through robust linkkey extraction , 2014, ECAI.

[5]  Sebastian Rudolph,et al.  Foundations of Description Logics , 2011, Reasoning Web.

[6]  Peter Christen,et al.  Data Matching , 2012, Data-Centric Systems and Applications.

[7]  Sören Auer,et al.  LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data , 2011, IJCAI.

[8]  Jérôme David,et al.  The Alignment API 4.0 , 2011, Semantic Web.

[9]  Ian Horrocks,et al.  Keys, Nominals, and Concrete Domains , 2003, IJCAI.

[10]  HoganAidan,et al.  Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora , 2012 .

[11]  Nathalie Pernelle,et al.  Defining Key Semantics for the RDF Datasets: Experiments and Evaluations , 2014, ICCS.

[12]  Axel-Cyrille Ngonga Ngomo,et al.  EAGLE: Efficient Active Learning of Link Specifications Using Genetic Programming , 2012, ESWC.

[13]  Martin Gaedke,et al.  Discovering and Maintaining Links on the Web of Data , 2009, SEMWEB.

[14]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[15]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[16]  Nathalie Pernelle,et al.  SAKey: Scalable Almost Key Discovery in RDF Data , 2014, SEMWEB.

[17]  Jürgen Umbrich,et al.  Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora , 2012, J. Web Semant..

[18]  Luciano Serafini,et al.  Distributed Description Logics: Assimilating Information from Peer Sources , 2003, J. Data Semant..

[19]  Axel-Cyrille Ngonga Ngomo,et al.  ROCKER: A Refinement Operator for Key Discovery , 2015, WWW.

[20]  Diego Calvanese,et al.  Keys for Free in Description Logics , 2000, Description Logics.

[21]  Jérôme David,et al.  Linkex: A Tool for Link Key Discovery Based on Pattern Structures , 2019, ICFCA.

[22]  François Scharffe,et al.  Data Linking for the Semantic Web , 2011, Int. J. Semantic Web Inf. Syst..

[23]  Jens Lehmann,et al.  Wombat - A Generalization Approach for Automatic Link Discovery , 2017, ESWC.

[24]  Manuel Atencia,et al.  Inferring Same-As Facts from Linked Data: An Iterative Import-by-Query Approach , 2015, AAAI.

[25]  Alexander Borgida,et al.  Adding Uniqueness Constraints to Description Logics (Preliminary Report) , 1997, DOOD.

[26]  Jérôme Euzenat,et al.  Three Semantics for Distributed Systems and Their Relations with Alignment Composition , 2006, SEMWEB.

[27]  Jérôme David,et al.  Uncertainty-Sensitive Reasoning for Inferring sameAs Facts in Linked Data , 2016, ECAI.

[28]  Carsten Lutz,et al.  Description Logics with Concrete Domains and Functional Dependencies , 2004, ECAI.

[29]  W. W. Armstrong,et al.  Dependency Structures of Data Base Relationships , 1974, IFIP Congress.

[30]  Konstantin Todorov,et al.  Automatic Key Selection for Data Linking , 2016, EKAW.

[31]  Konstantin Todorov,et al.  KeyRanker: Automatic RDF Key Ranking for Data Linking , 2017, K-CAP.

[32]  Jérôme David,et al.  Keys and Pseudo-Keys Detection for Web Datasets Cleansing and Interlinking , 2012, EKAW.

[33]  Diego Calvanese,et al.  Identification Constraints and Functional Dependencies in Description Logics , 2001, IJCAI.

[34]  Jérôme David,et al.  A Guided Walk into Link Key Candidate Extraction with Relational Concept Analysis , 2020, JT@ISWC.

[35]  David Toman,et al.  On Keys and Functional Dependencies as First-Class Citizens in Description Logics , 2007, Journal of Automated Reasoning.