Towards Causal Knowledge Graphs - Position Paper

In this position paper, we highlight that being able to analyse the cause-effect relationships for determining the causal status among a set of events is an essential requirement in many contexts and argue that cannot be overlooked when building systems targeting real-world use cases. This is especially true for medical contexts where the understanding of the cause(s) of a symptom, or observation, is of vital importance. However, most approaches purely based on Machine Learning (ML) do not explicitly represent and reason with causal relations, and may therefore mistake correlation for causation. In the paper, we therefore argue for an approach to extract causal relations from text, and represent them in the form of Knowledge Graphs (KG), to empower downstream ML applications, or AI systems in general, with the ability to distinguish correlation from causation and reason with causality in an explicit manner. So far, the bottlenecks in KG creation have been scalability and accuracy of automated methods, hence, we argue that two novel features are required from methods for addressing these challenges, i.e. (i) the use of Knowledge Patterns to guide the KG generation process towards a certain resulting knowledge structure, and (ii) the use of a semantic referee to automatically curate the extracted knowledge. We claim that this will be an important step forward for supporting interpretable AI systems, and integrating ML and knowledge representation approaches, such as KGs, which should also generalise well to other types of relations, apart from causality.

[1]  Johannes Gehrke,et al.  Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[2]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[3]  Lipika Dey,et al.  Automatic Extraction of Causal Relations from Text using Linguistically Informed Deep Neural Networks , 2018, SIGDIAL Conference.

[4]  Xinlei Chen,et al.  Never-Ending Learning , 2012, ECAI.

[5]  Eva Blomqvist,et al.  Engineering Ontologies with Patterns - The eXtreme Design Methodology , 2016, Ontology Engineering with Ontology Design Patterns.

[6]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[7]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[8]  Mélanie Frappier,et al.  The Book of Why: The New Science of Cause and Effect , 2018, Science.

[9]  Júlio Cesar dos Reis,et al.  Generating Knowledge Graphs from Scientific Literature of Degenerative Diseases , 2019, SEPDA@ISWC.

[10]  Diego Reforgiato Recupero,et al.  Semantic Web Machine Reading with FRED , 2017, Semantic Web.

[11]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[12]  Yejin Choi,et al.  COMET: Commonsense Transformers for Automatic Knowledge Graph Construction , 2019, ACL.

[13]  Claudio Giuliano,et al.  N-ary relation extraction for simultaneous T-Box and A-Box knowledge base augmentation , 2018, Semantic Web.

[14]  Steffen Staab,et al.  Knowledge graphs , 2020, Commun. ACM.

[15]  Chengsheng Mao,et al.  KG-BERT: BERT for Knowledge Graph Completion , 2019, ArXiv.

[16]  知秀 柴田 5分で分かる!? 有名論文ナナメ読み:Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding , 2020 .

[17]  J. Pearl Causal diagrams for empirical research , 1995 .

[18]  Mark Steedman,et al.  Learning Typed Entailment Graphs with Global Soft Constraints , 2018, Transactions of the Association for Computational Linguistics.

[19]  S. Goodman,et al.  Causal inference in public health. , 2013, Annual review of public health.

[20]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[21]  Suchi Saria,et al.  Reliable Decision Support using Counterfactual Models , 2017, NIPS.

[22]  E. W. Schneider,et al.  Course Modularization Applied: The Interface System and Its Implications For Sequence Control and Data Analysis. , 1973 .

[23]  Peter Spirtes,et al.  Introduction to Causal Inference , 2010, J. Mach. Learn. Res..

[24]  Fei Wang,et al.  Deep learning for healthcare: review, opportunities and challenges , 2018, Briefings Bioinform..

[25]  Achim Rettinger,et al.  Linked data quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO , 2017, Semantic Web.

[26]  Aldo Gangemi,et al.  Towards a pattern science for the Semantic Web , 2010, Semantic Web.

[27]  Jeffrey Ling,et al.  Matching the Blanks: Distributional Similarity for Relation Learning , 2019, ACL.

[28]  David Sontag,et al.  Learning a Health Knowledge Graph from Electronic Medical Records , 2017, Scientific Reports.

[29]  Amy Loutfi,et al.  Semantic Referee: A Neural-Symbolic Framework for Enhancing Geospatial Semantic Segmentation , 2019, Semantic Web.

[30]  V. Carretta Zamborlini,et al.  Knowledge Representation for Clinical Guidelines: with applications to Multimorbidity Analysis and Literature Search , 2017 .

[31]  Kurt Sandkuhl,et al.  Patterns in Ontology Engineering: Classification of Ontology Patterns , 2005, ICEIS.

[32]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.