Automatic infection detection based on electronic medical records

Making the most relevant patient care decision, as early as possible, is a constant challenge, especially for physicians in the emergency department. The increasing volumes of electronic medical records (EMR) open new horizons for automatic diagnosis. In this paper, we propose to use machine learning approaches for automatic infection detection using EMR. Three categories of features are extracted, including text-based features, vital signs and clinic tests. Experimental results on a newly constructed EMR dataset from emergency department showed that our model can achieve a decent performance for infection detection with F1 score of 0.88 using text features only. Further analysis reveals that our model can identify indicative symptom expressions about infection.

[1]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[2]  M. Pencina,et al.  Electronic Health Records and Pharmacokinetic Modeling to Assess the Relationship between Ampicillin Exposure and Seizure Risk in Neonates. , 2016, The Journal of pediatrics.

[3]  Lixia Yao,et al.  Electronic health records: Implications for drug discovery. , 2011, Drug discovery today.

[4]  Riccardo Miotto,et al.  Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials , 2015, J. Am. Medical Informatics Assoc..

[5]  Xuanjing Huang,et al.  FudanNLP: A Toolkit for Chinese Natural Language Processing , 2013, ACL.

[6]  R. Epstein,et al.  TEPAPA: a novel in silico feature learning pipeline for mining prognostic and associative factors from text-based electronic medical records , 2017, Scientific Reports.

[7]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[8]  I. Kohane Using electronic health records to drive discovery in disease genomics , 2011, Nature Reviews Genetics.

[9]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[10]  George Hripcsak,et al.  Defining and measuring completeness of electronic health records for secondary use , 2013, J. Biomed. Informatics.

[11]  Alan E. Jones,et al.  Surviving Sepsis Campaign: International Guidelines for Management of Sepsis and Septic Shock: 2016 , 2017, Intensive Care Medicine.

[12]  Fangfang Xia,et al.  Antimicrobial Resistance Prediction in PATRIC and RAST , 2016, Scientific Reports.

[13]  Loes M. M. Braun,et al.  Natural Language Processing in Radiology: A Systematic Review. , 2016, Radiology.

[14]  L. Flashman,et al.  Predicting the Risk of Suicide by Analyzing the Text of Clinical Notes , 2014, PloS one.

[15]  Chunhua Weng,et al.  Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research , 2013, J. Am. Medical Informatics Assoc..