Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning

Clinical diagnosis is a critical and non-trivial aspect of patient care which often requires significant medical research and investigation based on an underlying clinical scenario. This paper proposes a novel approach by formulating clinical diagnosis as a reinforcement learning problem. During training, the reinforcement learning agent mimics the clinician’s cognitive process and learns the optimal policy to obtain the most appropriate diagnoses for a clinical narrative. This is achieved through an iterative search for candidate diagnoses from external knowledge sources via a sentence-by-sentence analysis of the inherent clinical context. A deep Q-network architecture is trained to optimize a reward function that measures the accuracy of the candidate diagnoses. Experiments on the TREC CDS datasets demonstrate the effectiveness of our system over various non-reinforcement learning-based systems.

[1]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[2]  Sanda M. Harabagiu,et al.  Medical Question Answering for Clinical Decision Support , 2016, CIKM.

[3]  Jason Weston,et al.  Reading Wikipedia to Answer Open-Domain Questions , 2017, ACL.

[4]  Anselmo Cardoso de Paiva,et al.  Application on Reinforcement Learning for Diagnosis Based on Medical Image , 2008 .

[5]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Nancy F. Lenfestey,et al.  Cognitive interventions to reduce diagnostic error: a narrative review , 2012, BMJ quality & safety.

[8]  G H Guyatt,et al.  Users' guides to the medical literature: XVII. How to use guidelines and recommendations about screening. Evidence-Based Medicine Working Group. , 1999, JAMA.

[9]  Rahul Gupta,et al.  Knowledge base completion via search-based question answering , 2014, WWW.

[10]  Ellen M. Voorhees,et al.  Overview of the TREC 2014 Clinical Decision Support Track , 2014, TREC.

[11]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[12]  Alexander Kotov,et al.  Optimization Method for Weighting Explicit and Latent Concepts in Clinical Decision Support Queries , 2016, ICTIR.

[13]  Mathieu Nendaz,et al.  Diagnostic errors and flaws in clinical reasoning: mechanisms and prevention in practice. , 2012, Swiss medical weekly.

[14]  M. Kosorok,et al.  Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer , 2011, Biometrics.

[15]  Lee Brooks,et al.  Non‐analytical models of clinical reasoning: the role of experience , 2007, Medical education.

[16]  R. Bellman A Markovian Decision Process , 1957 .

[17]  Oladimeji Farri,et al.  Diagnostic Inferencing via Improving Clinical Concept Extraction with Deep Reinforcement Learning: A Preliminary Study , 2017, MLHC.

[18]  H. A. David,et al.  The Paired t Test Under Artificial Pairing , 1997 .

[19]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[20]  Ellen M. Voorhees,et al.  State-of-the-art in biomedical literature retrieval for clinical cases: a survey of the TREC 2014 CDS track , 2016, Information Retrieval Journal.

[21]  Walter F. Stewart,et al.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks , 2015, MLHC.

[22]  Joelle Pineau,et al.  Informing sequential clinical decision-making through reinforcement learning: an empirical study , 2010, Machine Learning.

[23]  André Kushniruk,et al.  Analysis of Complex Decision-Making Processes in Health Care: Cognitive Approaches to Health Informatics , 2001, J. Biomed. Informatics.

[24]  G H Guyatt,et al.  Users' guides to the medical literature: XV. How to use an article about disease probability for differential diagnosis. Evidence-Based Medicine Working Group. , 1999, JAMA.

[25]  Jason Weston,et al.  Key-Value Memory Networks for Directly Reading Documents , 2016, EMNLP.

[26]  Bernard Charlin,et al.  An analysis of clinical reasoning through a recent and comprehensive approach: the dual-process theory , 2011, Medical education online.

[27]  Jimeng Sun,et al.  RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism , 2016, NIPS.

[28]  Xiaojun Wan,et al.  Graph-Based Multi-Modality Learning for Clinical Decision Support , 2016, CIKM.

[29]  G. Guyatt,et al.  Users' guides to the medical literature. , 1993, JAMA.

[30]  Daniel Bakkelund An LCS-based string metric , 2009 .

[31]  Gary C. Borchardt,et al.  External Knowledge Sources for Question Answering , 2005, TREC.

[32]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[33]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[34]  Regina Barzilay,et al.  Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning , 2016, EMNLP.

[35]  Yuan Ling,et al.  Improving Clinical Diagnosis Inference through Integration of Structured and Unstructured Knowledge , 2017 .

[36]  Esko Ukkonen,et al.  Approximate String Matching with q-grams and Maximal Matches , 1992, Theor. Comput. Sci..

[37]  Ming Zhou,et al.  Question Answering over Freebase with Multi-Column Convolutional Neural Networks , 2015, ACL.

[38]  Erica Bell,et al.  Evidence in clinical reasoning: a computational linguistics analysis of 789,712 medical case summaries 1983–2012 , 2015, BMC Medical Informatics and Decision Making.

[39]  Tiejun Zhao,et al.  Knowledge-Based Question Answering as Machine Translation , 2014, ACL.

[40]  Qin Zhang,et al.  Clinical diagnosis expert system based on dynamic uncertain causality graph , 2014, 2014 IEEE 7th Joint International Information Technology and Artificial Intelligence Conference.

[41]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[42]  Grzegorz Kondrak,et al.  N-Gram Similarity and Distance , 2005, SPIRE.

[43]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[44]  Siddharth Patwardhan,et al.  WatsonPaths: Scenario-Based Question Answering and Inference over Unstructured Information , 2017, AI Mag..

[45]  S. Mamede,et al.  Cognitive diagnostic error in internal medicine. , 2013, European journal of internal medicine.

[46]  Gang Pan,et al.  Semantic Health Knowledge Graph: Semantic Integration of Heterogeneous Medical Knowledge and Services , 2017, BioMed research international.

[47]  Radhika Poolla,et al.  A reinforcement learning approach to obtain treatment strategies in sequential medical decision problems , 2003 .

[48]  William E. Winkler 20. Matching and Record Linkage , 2011 .

[49]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[50]  Oladimeji Farri,et al.  Condensed Memory Networks for Clinical Diagnostic Inferencing , 2016, AAAI.

[51]  Erik T. Mueller,et al.  Watson: Beyond Jeopardy! , 2013, Artif. Intell..

[52]  William E. Winkler,et al.  Matching and record linkage , 2011 .