An Introduction to Natural Language Processing: How You Can Get More From Those Electronic Notes You Are Generating.

Electronically stored clinical documents may contain both structured and unstructured data. The use of structured clinical data varies by facility, but clinicians are familiar with coded data such as International Classification of Diseases, Ninth Revision and Systematized Nomenclature of Medicine-Clinical Terms codes and, commonly, other data such as patient chief complaints or laboratory results. Most electronic health records store far more clinical information as unstructured data, for example, clinical narrative such as the history of present illness, procedure notes, and clinical decision making. Despite the importance of this information, electronic capture or retrieval of unstructured clinical data has been challenging. The field of natural language processing (NLP) is undergoing rapid development, and existing tools can be successfully used for quality improvement, research, healthcare coding, and even billing compliance. In this brief review, we provide examples of successful uses of NLP on emergency medicine physician visit notes for various projects, discuss the challenges of retrieving specific data, and present practical methods ranging from those that can run on a standard personal computer to high-end, state-of-the-art funded processes run by leading NLP informatics researchers.
