On-time clinical phenotype prediction based on narrative reports

In this paper, we describe a natural language processing system that predicts whether a patient exhibits a specific phenotype using information extracted from the narrative reports associated with that patient. Because the phenotypic annotations in our report dataset were made at the report level, we can predict the clinical phenotype at any point during the patient's hospitalization. Our experiments indicate that an important factor in achieving better results on this problem is determining how much information to extract from the reports written between the patient's admission time and the current prediction time.
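To make the time-window idea concrete, the following is a minimal sketch of how the reports available at a given prediction time might be selected. It is not the system described in this paper: the `Report` type, the `reports_in_window` function, and the `max_reports` parameter are hypothetical names introduced only for illustration, and capping the number of most recent reports is just one assumed way of controlling how much information is extracted.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Report:
    """Hypothetical container for one narrative report."""
    timestamp: datetime
    text: str


def reports_in_window(
    reports: List[Report],
    admission: datetime,
    prediction_time: datetime,
    max_reports: Optional[int] = None,
) -> List[Report]:
    """Return reports written between admission and prediction time.

    Reports are returned in chronological order. If max_reports is given,
    only the most recent max_reports reports in the window are kept
    (an assumed way to limit how much information is used).
    """
    in_window = sorted(
        (r for r in reports if admission <= r.timestamp <= prediction_time),
        key=lambda r: r.timestamp,
    )
    if max_reports is not None:
        # Guard against max_reports == 0, since in_window[-0:] is the full list.
        in_window = in_window[-max_reports:] if max_reports > 0 else []
    return in_window
```

Under these assumptions, a classifier would only ever see text from `reports_in_window(...)`, so varying `max_reports` is one way to study how the amount of extracted information affects prediction at a given time.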
