3D-CNN-SPP: A Patient Risk Prediction System From Electronic Health Records via 3D CNN and Spatial Pyramid Pooling

The problem of extracting useful clinical representations from longitudinal electronic health record (EHR) data, also known as the computational phenotyping problem, is an important yet challenging task in the health-care academia and industry. Recent progress in the design and applications of deep learning methods has shown promising results towards solving this problem. In this paper, we propose 3D-CNN-SPP (3D Convolutional Neural Networks and Spatial Pyramid Pooling), a novel patient risk prediction system, to investigate the application of deep neural networks in modeling longitudinal EHR data. Particularly, we propose a 3D CNN structure, which is featured by SPP. Compared with 2D CNN methods, our proposed method can capture the complex relationships in EHRs more effectively and efficiently. Furthermore, previous works handle the issue of variable length in patient records by padding zeros to all vectors so that they have a fixed length. In our work, the proposed spatial pyramid pooling divides the records into several length sections for respective pooling processing, hence handling the variable length problem easily and naturally. We take heart failure and diabetes as examples to test the performance of the system, and the experiment results demonstrate great effectiveness in patient risk prediction, compared with several strong baselines.

[1]  Lynda C Burton,et al.  Using electronic health records to help coordinate care. , 2004, The Milbank quarterly.

[2]  Jieh-Haur Chen,et al.  Developing an SVM based risk hedging prediction model for construction material suppliers , 2010 .

[3]  George Hripcsak,et al.  Next-generation phenotyping of electronic health records , 2012, J. Am. Medical Informatics Assoc..

[4]  Parisa Rashidi,et al.  Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis , 2017, IEEE Journal of Biomedical and Health Informatics.

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Joel Scanlan,et al.  Risk prediction using natural language processing of electronic mental health records in an inpatient forensic psychiatry setting , 2018, J. Biomed. Informatics.

[7]  D. Blumenthal,et al.  The "meaningful use" regulation for electronic health records. , 2010, The New England journal of medicine.

[8]  Murthy V. Devarakonda,et al.  An NLP-based cognitive system for disease status identification in electronic health records , 2017, 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[9]  I. Kohane Using electronic health records to drive discovery in disease genomics , 2011, Nature Reviews Genetics.

[10]  R. Cebul,et al.  Electronic health records and quality of diabetes care. , 2011, The New England journal of medicine.

[11]  Adel Said Elmaghraby,et al.  Scalable Healthcare Assessment for Diabetic Patients Using Deep Learning on Multiple GPUs , 2019, IEEE Transactions on Industrial Informatics.

[12]  Fenglong Ma,et al.  Risk Prediction on Electronic Health Records with Prior Medical Knowledge , 2018, KDD.

[13]  Fei Wang,et al.  A methodology for interactive mining and visual analysis of clinical event patterns using electronic health record data , 2014, J. Biomed. Informatics.

[14]  Fenglong Ma,et al.  Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks , 2017, KDD.

[15]  Zhi-Ping Liu,et al.  Prediction of protein-RNA binding sites by a random forest method with combined features , 2010, Bioinform..

[16]  Jimeng Sun,et al.  Multi-layer Representation Learning for Medical Concepts , 2016, KDD.

[17]  Pan Zhou,et al.  Spatial Pyramid Pooling Mechanism in 3D Convolutional Network for Sentence-Level Classification , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[18]  Yan Liu,et al.  Deep Computational Phenotyping , 2015, KDD.

[19]  J. Valderas,et al.  Comparison of the information provided by electronic health records data and a population health survey to estimate prevalence of selected health conditions and multimorbidity , 2013, BMC Public Health.

[20]  Ping Zhang,et al.  Risk Prediction with Electronic Health Records: A Deep Learning Approach , 2016, SDM.

[21]  Fenglong Ma,et al.  KAME: Knowledge-based Attention Model for Diagnosis Prediction in Healthcare , 2018, CIKM.

[22]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Max A. Viergever,et al.  ConvNet-Based Localization of Anatomical Structures in 3-D Medical Images , 2017, IEEE Transactions on Medical Imaging.

[24]  Wenpeng Yin,et al.  Multichannel Variable-Size Convolution for Sentence Classification , 2015, CoNLL.

[25]  Yu Cheng,et al.  Exploiting Convolutional Neural Network for Risk Prediction with Medical Feature Embedding , 2017, ArXiv.

[26]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[27]  Sara Rosenbaum,et al.  Electronic health records' limited successes suggest more targeted uses. , 2010, Health affairs.

[28]  Xiaopeng Wei,et al.  Predicting the Risk of Heart Failure With EHR Sequential Data Modeling , 2018, IEEE Access.

[29]  Taghi M. Khoshgoftaar,et al.  Melanoma Risk Prediction with Structured Electronic Health Records , 2018, BCB.

[30]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[31]  Yong Xiang,et al.  Cost Efficient Resource Management in Fog Computing Supported Medical Cyber-Physical System , 2017, IEEE Transactions on Emerging Topics in Computing.

[32]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Fei Wang,et al.  Readmission prediction via deep contextual embedding of clinical concepts , 2018, PloS one.

[34]  D. Bates Getting in Step: Electronic Health Records and their Role in Care Coordination , 2010, Journal of General Internal Medicine.

[35]  Peng Liu,et al.  DeepDisc: Optic Disc Segmentation Based on Atrous Convolution and Spatial Pyramid Pooling , 2018, COMPAY/OMIA@MICCAI.

[36]  Y. Tabak,et al.  An Automated Model to Identify Heart Failure Patients at Risk for 30-Day Readmission or Death Using Electronic Medical Record Data , 2010, Medical care.

[37]  D Kalra,et al.  Electronic health records: new opportunities for clinical research , 2013, Journal of internal medicine.

[38]  Di Zhao,et al.  Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction , 2011, J. Biomed. Informatics.

[39]  Huilong Duan,et al.  A Regularized Deep Learning Approach for Clinical Risk Prediction of Acute Coronary Syndrome Using Electronic Health Records , 2018, IEEE Transactions on Biomedical Engineering.

[40]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[41]  Jyotishman Pathak,et al.  Developing EHR-driven heart failure risk prediction models using CPXR(Log) with the probabilistic loss function , 2016, J. Biomed. Informatics.

[42]  Jeffrey Dean,et al.  Scalable and accurate deep learning with electronic health records , 2018, npj Digital Medicine.

[43]  R. Stafford,et al.  Electronic health records and clinical decision support systems: impact on national ambulatory care quality. , 2011, Archives of internal medicine.

[44]  Atsushi Suzuki,et al.  Risk Prediction of Diabetic Nephropathy via Interpretable Feature Extraction from EHR Using Convolutional Autoencoder , 2018, MIE.

[45]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[46]  Michael Klompas,et al.  Uses of electronic health records for public health surveillance to advance public health. , 2015, Annual review of public health.

[47]  Hua Xu,et al.  Portability of an algorithm to identify rheumatoid arthritis in electronic health records , 2012, J. Am. Medical Informatics Assoc..

[48]  Jeffrey Soar,et al.  Multimedia data mining using deep learning , 2015, 2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC).

[49]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[50]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[51]  Fei Wang,et al.  Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach , 2012, KDD.

[52]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[53]  Euijoon Ahn,et al.  Sparsity-based Convolutional Kernel Network for Unsupervised Medical Image Analysis , 2018, Medical image analysis.

[54]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Jimeng Sun,et al.  Limestone: High-throughput candidate phenotype generation via tensor factorization , 2014, J. Biomed. Informatics.

[56]  Stephen B. Johnson,et al.  A review of approaches to identifying patient phenotype cohorts using electronic health records , 2013, J. Am. Medical Informatics Assoc..