论文信息 - Identifying Relevant Sources for Data Linking using a Semantic Web Index

Identifying Relevant Sources for Data Linking using a Semantic Web Index

With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. While catalogs of data repositories and meta-level descriptors such as VoiD provide valuable information to take these decisions, more detailed information about the instances included into repositories is often required to assess the relevance of datasets and the part of the dataset to link to. However, retrieving and processing such information for a potentially large number of datasets is practically unfeasible. In this paper, we examine how using an existing semantic web index can help identifying candidate datasets for linking. We further apply ontology schema matching techniques to rank these candidate datasets and extract the sub-dataset to use for linking, in the form of classes with instances more likely to match the ones of the local dataset.

Andriy Nikolov | Mathieu d'Aquin | M. d’Aquin | A. Nikolov

[1] Enrico Motta,et al. Integration of Semantically Annotated Data by the KnoFuss Architecture , 2008, EKAW.

[2] Stefan Decker,et al. Sig.ma: live views on the web of data , 2010, WWW '10.

[3] Eduardo Mena,et al. Ontology Matching with CIDER: Evaluation Report for the OAEI 2008 , 2008, OM.

[4] Stefan Decker,et al. Sig.ma: Live views on the Web of Data , 2010, J. Web Semant..

[5] Vipul Kashyap,et al. OBSERVER: An Approach for Query Processing in Global Information Systems Based on Interoperation Across Pre-Existing Ontologies , 2000, Distributed and Parallel Databases.

[6] Martin Gaedke,et al. Discovering and Maintaining Links on the Web of Data , 2009, SEMWEB.

[7] Enrico Motta,et al. Capturing Emerging Relations between Schema Ontologies on the Web of Data , 2010, COLD.

[8] Deborah L. McGuinness,et al. When owl: sameAs Isn't the Same: An Analysis of Identity in Linked Data , 2010, SEMWEB.

[9] Eyal Oren,et al. Sindice.com: Weaving the Open Linked Data , 2007, ISWC/ASWC.

[10] Vipul Kashyap,et al. Observer: an approach for query processing in global information systems based on interoperation across pre-existing ontologies , 1996, Proceedings First IFCIS International Conference on Cooperative Information Systems.