The number of datasets published in the Web of Data as part of the Linked Data Cloud is constantly increasing. The Linked Data paradigm is based on the unconstrained publication of information by different publishers, and the interlinking of Web resources across knowledge bases. In most cases, the cross-dataset links are not explicit in the dataset and must be automatically determined using Instance Matching (IM) tools (also known as record linkage [1], duplicate detection [2] and, entity resolution [3]) amongst others. The large variety of techniques requires their comparative evaluation to determine which one is best suited for a given context. Performing such an assessment generally requires well-defined and widely accepted benchmarks to determine the weak and strong points of the proposed techniques and/or tools. A number of real and synthetic benchmarks that address different data linking challenges have been proposed for evaluating the performance of such systems. Those include, but are not limited to, IIMB 2012 [4], Sandbox 2012 [4], RDFT 2013 [5], ID-REC 2014 [6], ONTOBI 2010 [7], Author Task 2015 [8] and Lance 2015 [9] to mention few. A more complete survey can be found in [10].
[1]
Lise Getoor,et al.
Entity Resolution in Graphs
,
2005
.
[2]
Fabien Duchateau,et al.
PABench: Designing a Taxonomy and Implementing a Benchmark for Spatial Entity Matching
,
2015
.
[3]
Ahmed K. Elmagarmid,et al.
Duplicate Record Detection: A Survey
,
2007,
IEEE Transactions on Knowledge and Data Engineering.
[4]
Chen Li,et al.
Supporting Efficient Record Linkage for Large Data Sets Using Mapping Techniques
,
2006,
World Wide Web.
[5]
Heiner Stuckenschmidt,et al.
Results of the Ontology Alignment Evaluation Initiative 2007
,
2006,
OM.
[6]
Stefan Conrad,et al.
A Benchmark for Testing Instance-based Ontology Matching Methods
,
2010,
EKAW.
[7]
Irini Fundulaki,et al.
Instance matching benchmarks in the era of Linked Data
,
2016,
J. Web Semant..
[8]
Heiner Stuckenschmidt,et al.
Results of the Ontology Alignment Evaluation Initiative
,
2007
.
[9]
Axel-Cyrille Ngonga Ngomo,et al.
LANCE: Piercing to the Heart of Instance Matching Tools
,
2015,
SEMWEB.