Tell Me Why Is It So? Explaining Knowledge Graph Relationships by Finding Descriptive Support Passages

We address the problem of finding descriptive explanations of facts stored in a knowledge graph. This is important in high-risk domains such as healthcare, intelligence, etc. where users need additional information for decision making and is especially crucial for applications that rely on automatically constructed knowledge bases where machine learned systems extract facts from an input corpus and working of the extractors is opaque to the end-user. We follow an approach inspired from information retrieval and propose a simple and efficient, yet effective solution that takes into account passage level as well as document level properties to produce a ranked list of passages describing a given input relation. We test our approach using Wikidata as the knowledge base and Wikipedia as the source corpus and report results of user studies conducted to study the effectiveness of our proposed model.

[1]  Haofen Wang,et al.  Building and Exploring an Enterprise Knowledge Graph for Investment Analysis , 2016, SEMWEB.

[2]  Ping Zhang,et al.  Predicting Drug-Drug Interactions Through Large-Scale Similarity-Based Link Prediction , 2016, ESWC.

[3]  Jörg Tiedemann Comparing Document Segmentation Strategies for Passage Retrieval in Question Answering , 2007 .

[4]  M. de Rijke,et al.  Learning to Explain Entity Relationships in Knowledge Graphs , 2015, ACL.

[5]  Ziyang Liu,et al.  Query biased snippet generation in XML search , 2008, SIGMOD Conference.

[6]  Dimitra Gkatzia,et al.  Natural Language Generation enhances human decision-making with uncertain information , 2016, ACL.

[7]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[8]  Luo Si,et al.  Knowledge Transfer and Opinion Detection in the TREC2006 Blog Track , 2006 .

[9]  W. Bruce Croft,et al.  Combining the language model and inference network approaches to retrieval , 2004, Inf. Process. Manag..

[10]  Luke S. Zettlemoyer,et al.  Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves , 2013, EMNLP.

[11]  Cong Yu,et al.  REX: Explaining Relationships between Entity Pairs , 2011, Proc. VLDB Endow..

[12]  Michael D. Iannacone,et al.  Developing an Ontology for Cyber Security Knowledge Graphs , 2015, CISR.

[13]  W. Scott Spangler,et al.  Generating and Browsing Multiple Taxonomies Over a Document Collection , 2003, J. Manag. Inf. Syst..

[14]  Alfio Gliozzo,et al.  An Entity-Focused Approach to Generating Company Descriptions , 2016, ACL.

[15]  Grace Hui Yang,et al.  Knowledge Transfer and Opinion Detection in the TREC 2006 Blog Track , 2006, TREC.

[16]  Christopher Ré,et al.  DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference , 2012, VLDS.

[17]  Giuseppe Pirrò,et al.  Explaining and Suggesting Relatedness in Knowledge Graphs , 2015, SEMWEB.

[18]  Peter J. Haas,et al.  Predicting Future Scientific Discoveries Based on a Networked Analysis of the Past Literature , 2015, KDD.

[19]  Carol Tenopir,et al.  Finding and using journal-article components: Impacts of disaggregation on teaching and research practice , 2008, J. Assoc. Inf. Sci. Technol..

[20]  Roi Blanco,et al.  Finding support sentences for entities , 2010, SIGIR.

[21]  Charles L. A. Clarke,et al.  The influence of caption features on clickthrough patterns in web search , 2007, SIGIR.

[22]  Suzan Verberne,et al.  Passage Retrieval for Question Answering using Sliding Windows , 2008, COLING 2008.

[23]  James P. Callan,et al.  Passage-level evidence in document retrieval , 1994, SIGIR '94.

[24]  Hugh E. Williams,et al.  Fast generation of result snippets in web search , 2007, SIGIR.

[25]  Luo Si,et al.  Discriminative probabilistic models for passage based retrieval , 2008, SIGIR '08.

[26]  Sumit Bhatia,et al.  Separating Wheat from the Chaff - A Relationship Ranking Algorithm , 2016, ESWC.

[27]  Mark Sanderson,et al.  Advantages of query biased summaries in information retrieval , 1998, SIGIR '98.

[28]  George R. Thoma,et al.  Annotation and retrieval of clinically relevant images , 2009, Int. J. Medical Informatics.

[29]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[30]  Jörg Tiedemann,et al.  Simple is Best: Experiments with Different Document Segmentation Strategies for Passage Retrieval , 2008, COLING 2008.

[31]  M. de Rijke,et al.  Generating Descriptions of Entity Relationships , 2017, ECIR.

[32]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[33]  Haofen Wang,et al.  Snippet Generation for Semantic Web Search Engines , 2008, ASWC.

[34]  Wei Zhang,et al.  TimeMachine: Timeline Generation for Knowledge-Base Entities , 2015, KDD.

[35]  Nitish Aggarwal,et al.  Connecting the Dots: Explaining Relationships Between Unconnected Entities in a Knowledge Graph , 2016, ESWC.

[36]  José Manuél Gómez-Pérez,et al.  HAVAS 18 Labs: A Knowledge Graph for Innovation in the Media Industry , 2014, International Semantic Web Conference.

[37]  Prasenjit Mitra,et al.  Summarizing figures, tables, and algorithms in scientific publications to augment search results , 2012, TOIS.

[38]  Steffen Lohmann,et al.  Interactive Relationship Discovery via the Semantic Web , 2010, ESWC.

[39]  Amit P. Sheth,et al.  Semantic Association Identification and Knowledge Discovery for National Security Applications , 2005, J. Database Manag..

[40]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.