Knowledge Reconciliation of n-ary Relations

In the expanding Semantic Web, a growing number of data and knowledge sources are accessible to both human and software agents. Sources may differ in granularity or completeness, and may therefore be complementary. Consequently, unlocking the full potential of the available knowledge requires combining these sources. To this end, we define the task of knowledge reconciliation, which consists of identifying, within and across sources, knowledge units that are equivalent, more specific, or similar. This task is challenging because knowledge units are represented heterogeneously across sources (e.g., with different vocabularies). In this paper, we propose a rule-based methodology for the reconciliation of n-ary relations. To alleviate the heterogeneity in representation, we rely on domain knowledge expressed in ontologies. We tested our method in the biomedical domain of pharmacogenomics by reconciling 50,435 n-ary relations from four different real-world sources; this experiment highlighted noteworthy agreements and discrepancies within and across sources.
