Multi-disease prediction using LSTM recurrent neural networks

Abstract Prediction of future clinical events (e.g., disease diagnoses) is an important machine learning task in healthcare informatics research. In this work, we propose a deep learning approach to perform multi-disease prediction for intelligent clinical decision support. The proposed approach utilizes a long short-term memory network and extends it with two mechanisms (i.e., time-aware and attention-based) to conduct multi-label classification based on patients’ clinical visit records. The former mechanism (time-aware) is used to handle the temporal irregularity across clinical visits whereas the latter mechanism (attention-based) assists in determining the importance of each visit for the prediction task. Using a large clinical record data set (over 5 million records) collected from a hospital in Southeast China, we show that our proposed approach outperforms a variety of traditional and deep learning methods in predicting future disease diagnoses. We further study the impacts of different time interval choices for the time-aware mechanism and compare the performances of existing attention-based mechanisms with the one proposed in our study. Our work has implications for supporting physician diagnoses via the use of intelligent systems and more broadly for improving the quality of healthcare service.

[1]  Denise R. Aberle,et al.  An Interpretable Deep Hierarchical Semantic Convolutional Neural Network for Lung Nodule Malignancy Classification , 2018, Expert Syst. Appl..

[2]  Bernard C. Jiang,et al.  Using data mining techniques for multi-diseases prediction modeling of hypertension and hyperlipidemia by common risk factors , 2011, Expert Syst. Appl..

[3]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[4]  David Vandyke,et al.  Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.

[5]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[6]  Young-Jin Kim,et al.  An architecture for emergency event prediction using LSTM recurrent neural networks , 2018, Expert Syst. Appl..

[7]  Jiun-Hung Chen,et al.  A multi-label classification based approach for sentiment classification , 2015, Expert Syst. Appl..

[8]  A. Hawkes,et al.  A cluster process representation of a self-exciting process , 1974, Journal of Applied Probability.

[9]  Quansheng Ge,et al.  East Asian warm season temperature variations over the past two millennia , 2018, Scientific reports.

[10]  Andrea Montanari,et al.  A Low-Cost Method for Multiple Disease Prediction , 2015, AMIA.

[11]  Truyen Tran,et al.  Predicting healthcare trajectories from medical records: A deep learning approach , 2017, J. Biomed. Informatics.

[12]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[13]  Antonio Sarasa Cabezuelo,et al.  Application of Artificial Intelligence Techniques to Predict Survival in Kidney Transplantation: A Review , 2020, Journal of clinical medicine.

[14]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Deep learning for biological image classification , 2017, Expert Syst. Appl..

[15]  Kaori Ito,et al.  Disease progression meta-analysis model in Alzheimer's disease , 2010, Alzheimer's & Dementia.

[16]  Alessandro Tredicucci,et al.  Corrigendum: Universal lineshapes at the crossover between weak and strong critical coupling in Fano-resonant coupled oscillators , 2016, Scientific Reports.

[17]  F. Nelson,et al.  Use of prediction markets to forecast infectious disease activity. , 2007, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[18]  D. Bates,et al.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients. , 2014, Health affairs.

[19]  Robert R. Bies,et al.  Disease Progression Modeling: Key Concepts and Recent Developments , 2016, Current Pharmacology Reports.

[20]  David W. Bates,et al.  Implementation of a scalable, web-based, automated clinical decision support risk-prediction tool for chronic kidney disease using C-CDA and application programming interfaces , 2017, J. Am. Medical Informatics Assoc..

[21]  Ha Young Kim,et al.  Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models , 2018, Expert Syst. Appl..

[22]  D. Mould Models for Disease Progression: New Approaches and Uses , 2012, Clinical pharmacology and therapeutics.

[23]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[24]  H. Katus,et al.  High-sensitivity cardiac troponin T for early prediction of evolving non-ST-segment elevation myocardial infarction in patients with suspected acute coronary syndrome and negative troponin results on admission. , 2010, Clinical chemistry.

[25]  R. Thomson,et al.  Informal Health Provider and Practical Approach to Lung Health interventions to improve the detection of chronic airways disease and tuberculosis at primary care level in Malawi: study protocol for a randomised controlled trial , 2015, Trials.

[26]  R. Emsley,et al.  Beyond total treatment effects in RCTs: why we need to measure outcomes at baseline when investigating mediation , 2015, Trials.

[27]  N. Tangri,et al.  A predictive model for progression of chronic kidney disease to kidney failure. , 2011, JAMA.

[28]  Hye Jin Kam,et al.  Learning representations for the early detection of sepsis with deep neural networks , 2017, Comput. Biol. Medicine.

[29]  Lichuan Yang,et al.  A simple risk score for prediction of sepsis associated-acute kidney injury in critically ill patients , 2019, Journal of Nephrology.

[30]  Nello Cristianini,et al.  Efficient classification of multi-labeled text streams by clashing , 2014, Expert Syst. Appl..

[31]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[32]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[33]  Lucila Ohno-Machado,et al.  NIH's Big Data to Knowledge initiative and the advancement of biomedical informatics , 2014, J. Am. Medical Informatics Assoc..