Demographic Aware Probabilistic Medical Knowledge Graph Embeddings of Electronic Medical Records

Medical knowledge graphs (KGs) constructed from Electronic Medical Records (EMR) contain abundant information about patients and medical entities. Applying KG embedding models to these data has proven effective for a range of medical tasks. However, existing models do not properly incorporate patient demographics, and most of them ignore the probabilistic features of the medical KG. In this paper, we propose DARLING (Demographic Aware pRobabiListic medIcal kNowledge embeddinG), a demographic-aware medical KG embedding framework that explicitly incorporates demographics into the medical entity space by associating each patient demographic with a corresponding hyperplane. Our framework leverages the probabilistic features of the medical entities to learn their representations under demographic guidance. We evaluate DARLING on link prediction for treatments and medicines, using a medical KG constructed from EMR data, and show that it outperforms existing KG embedding models.
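To make the hyperplane idea concrete, the following is a minimal sketch of how an entity embedding can be projected onto a hyperplane before a translation-based score is computed. It follows the general TransH-style formulation (projection with a unit normal vector, then a distance between translated head and tail) and is an assumption for illustration only; the abstract does not state DARLING's actual scoring function or how probabilities enter it. The function names, the use of one normal vector per demographic group, and the toy vectors are all hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(w):
    """Scale w to unit length so it can serve as a hyperplane normal."""
    norm = math.sqrt(dot(w, w))
    return [a / norm for a in w]


def project(e, w):
    """Project embedding e onto the hyperplane with unit normal w: e - (w . e) w."""
    scale = dot(w, e)
    return [a - scale * b for a, b in zip(e, w)]


def score(head, relation, tail, demographic_normal):
    """Distance-style score (lower means more plausible).

    Assumption: each demographic group has its own normal vector, and
    both entities are projected onto that group's hyperplane first.
    """
    w = normalize(demographic_normal)
    h_p = project(head, w)
    t_p = project(tail, w)
    return math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(h_p, relation, t_p)))


# Toy example: the hyperplane is the x-y plane (normal along z).
normal = [0.0, 0.0, 1.0]
disease = [1.0, 0.0, 5.0]    # hypothetical head entity
treats = [1.0, 1.0, 0.0]     # hypothetical relation vector
medicine = [2.0, 1.0, -3.0]  # hypothetical tail entity

print(score(disease, treats, medicine, normal))
```

In this toy case the two entities differ only along the normal direction once the relation is applied, so the projection removes the difference and the triple scores as a perfect fit on that hyperplane. A different normal vector would generally yield a different score for the same triple, which is the sense in which a hyperplane can make entity representations depend on the demographic group.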
