Supporting information retrieval from electronic health records: A report of University of Michigan's nine-year experience in developing and using the Electronic Medical Record Search Engine (EMERSE)

OBJECTIVE This paper describes the University of Michigan's nine-year experience in developing and using a full-text search engine designed to facilitate information retrieval (IR) from narrative documents stored in electronic health records (EHRs). The system, called the Electronic Medical Record Search Engine (EMERSE), functions similar to Google but is equipped with special functionalities for handling challenges unique to retrieving information from medical text. MATERIALS AND METHODS Key features that distinguish EMERSE from general-purpose search engines are discussed, with an emphasis on functions crucial to (1) improving medical IR performance and (2) assuring search quality and results consistency regardless of users' medical background, stage of training, or level of technical expertise. RESULTS Since its initial deployment, EMERSE has been enthusiastically embraced by clinicians, administrators, and clinical and translational researchers. To date, the system has been used in supporting more than 750 research projects yielding 80 peer-reviewed publications. In several evaluation studies, EMERSE demonstrated very high levels of sensitivity and specificity in addition to greatly improved chart review efficiency. DISCUSSION Increased availability of electronic data in healthcare does not automatically warrant increased availability of information. The success of EMERSE at our institution illustrates that free-text EHR search engines can be a valuable tool to help practitioners and researchers retrieve information from EHRs more effectively and efficiently, enabling critical tasks such as patient case synthesis and research data abstraction. CONCLUSION EMERSE, available free of charge for academic use, represents a state-of-the-art medical IR tool with proven effectiveness and user acceptance.

[1]  S. Saini,et al.  Risk Models for Post–Endoscopic Retrograde Cholangiopancreatography Pancreatitis (PEP): Smoking and Chronic Liver Disease Are Predictors of Protection Against PEP , 2013, Pancreas.

[2]  Heather Walters,et al.  Validation of key behaviourally based mental health diagnoses in administrative data: suicide attempt, alcohol abuse, illicit drug abuse and tobacco use , 2012, BMC Health Services Research.

[3]  A. Jha,et al.  The promise of electronic records: around the corner or down the road? , 2011, JAMA.

[4]  Charles P. Friedman,et al.  Conceptualising and creating a global learning health system , 2013, Int. J. Medical Informatics.

[5]  Hong Yu,et al.  Biomedical negation scope detection with conditional random fields , 2010, J. Am. Medical Informatics Assoc..

[6]  Arun Krishnaraj,et al.  Automated before-procedure electronic health record screening to assess appropriateness for GI endoscopy and sedation. , 2012, Gastrointestinal endoscopy.

[7]  Anders Grimsmo,et al.  Instant availability of patient records, but diminished availability of patient information: A multi-method study of GP's use of electronic patient records , 2008, BMC Medical Informatics Decis. Mak..

[8]  David A. Hanauer,et al.  Enhanced identification of eligibility for depression research using an electronic medical record search engine , 2009, Int. J. Medical Informatics.

[9]  Joi L. Moore,et al.  “I Don't Have Time to Dig Back Through This”: The Role of Semantic Search in Supporting Physician Information Seeking in an Electronic Health Record , 2014 .

[10]  Robert A. Jenders,et al.  A systematic literature review of automated clinical coding and classification systems , 2010, J. Am. Medical Informatics Assoc..

[11]  David A Hanauer,et al.  Retrospective Database Research in Pediatric Cardiology and Congenital Heart Surgery , 2012, World journal for pediatric & congenital heart surgery.

[12]  Michael Gao,et al.  A comparison of particulate and onyx embolization in preoperative devascularization of juvenile nasopharyngeal angiofibromas , 2013, Neuroradiology.

[13]  Dario A. Giuse,et al.  StarTracker: An Integrated, Web-based Clinical Search Engine , 2003, AMIA.

[14]  Heather Walters,et al.  Predictors of suicide in patient charts among patients with depression in the Veterans Health Administration health system: importance of prescription drug and alcohol abuse. , 2012, The Journal of clinical psychiatry.

[15]  Naren Ramakrishnan,et al.  Describing the Relationship between Cat Bites and Human Depression Using Data from an Electronic Health Record , 2013, PloS one.

[16]  Kanakadurga Singer,et al.  Clinical course of sepsis in children with acute leukemia admitted to the pediatric intensive care unit* , 2011, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[17]  Lei Yang,et al.  Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[18]  C Friedman,et al.  Electronic chart review as an aid to postdischarge surgical site surveillance: increased case finding. , 2001, American journal of infection control.

[19]  S. Saini,et al.  Do Clinical Characteristics Predict the Presence of Small Bowel Angioectasias on Capsule Endoscopy? , 2011, Digestive Diseases and Sciences.

[20]  David Hanauer,et al.  Implementation of the Quality Oncology Practice Initiative at a university comprehensive cancer center. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[21]  K M Jensen,et al.  Health care in adults with Down syndrome: a longitudinal cohort study. , 2013, Journal of intellectual disability research : JIDR.

[22]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[23]  Lei Yang,et al.  Query log analysis of an electronic health record search engine. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[24]  Wendy W. Chapman,et al.  Anaphoric reference in clinical reports: Characteristics of an annotated corpus , 2012, J. Biomed. Informatics.

[25]  Thomas M Braun,et al.  Elafin Is a Biomarker of Graft-Versus-Host Disease of the Skin , 2008, Science Translational Medicine.

[26]  R. Chervin,et al.  Sleep-disordered breathing in multiple sclerosis , 2012, Neurology.

[27]  Hua Xu,et al.  A hybrid system for temporal information extraction from clinical text , 2013, J. Am. Medical Informatics Assoc..

[28]  D. Bates,et al.  How many medication orders are entered through free-text in EHRs?--a study on hypoglycemic agents. , 2012, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[29]  Hua Xu,et al.  Data from clinical notes: a perspective on the tension between structure and flexible documentation , 2011, J. Am. Medical Informatics Assoc..

[30]  Matthew M Davis,et al.  Fidelity of Administrative Data When Researching Down Syndrome , 2014, Medical care.

[31]  D. Blumenthal,et al.  Achieving a Nationwide Learning Health System , 2010, Science Translational Medicine.

[32]  Kai Zheng,et al.  Hedging their Mets: The Use of Uncertainty Terms in Clinical Documents and its Potential Implications when Sharing the Documents with Patients , 2012, AMIA.

[33]  Eric Fosler-Lussier,et al.  How essential are unstructured clinical narratives and information fusion to clinical trial recruitment? , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[34]  David A Hanauer,et al.  Informatics and the American College of Surgeons National Surgical Quality Improvement Program: automated processes could replace manual record review. , 2009, Journal of the American College of Surgeons.

[35]  Thomas M Braun,et al.  A biomarker panel for acute graft-versus-host disease. , 2009, Blood.

[36]  P. Biron,et al.  An information retrieval system for computerized patient records in the context of a daily hospital practice: the example of the Léon Bérard Cancer Center (France) , 2014, Applied Clinical Informatics.

[37]  B. Thompson,et al.  The effect of age on arteriovenous malformations in children and young adults undergoing magnetic resonance imaging , 2011, Child's Nervous System.

[38]  William R. Hersh,et al.  Barriers to Retrieving Patient Information from Electronic Health Record Data: Failure Analysis from the TREC Medical Records Track , 2012, AMIA.

[39]  N. Ghaziuddin,et al.  Retrospective chart review of catatonia in child and adolescent psychiatric patients , 2012, Acta psychiatrica Scandinavica.

[40]  W ChapmanWendy,et al.  Anaphoric reference in clinical reports , 2012 .

[41]  Carol Friedman,et al.  Limited parsing of notational text visit notes: ad-hoc vs. NLP approaches , 2000, AMIA.

[42]  Marcia Valenstein,et al.  Suicide risk assessment received prior to suicide death by Veterans Health Administration patients with a history of depression. , 2013, The Journal of clinical psychiatry.

[43]  Vasudevan Jagannathan,et al.  Assessment of commercial NLP engines for medication information extraction from dictated clinical notes , 2009, Int. J. Medical Informatics.

[44]  Daniel M. Stein,et al.  An analysis of clinical queries in an electronic health record search utility , 2010, Int. J. Medical Informatics.

[45]  Kai Zheng,et al.  Handling anticipated exceptions in clinical care: investigating clinician use of 'exit strategies' in an electronic health records system , 2011, J. Am. Medical Informatics Assoc..

[46]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[47]  Scott T. Weiss,et al.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system , 2006, BMC Medical Informatics Decis. Mak..

[48]  K. Cooney,et al.  Hypertension, obesity and prostate cancer biochemical recurrence after radical prostatectomy , 2012, Prostate Cancer and Prostatic Diseases.

[49]  Thomas M Braun,et al.  Regenerating islet-derived 3-alpha is a biomarker of gastrointestinal graft-versus-host disease. , 2011, Blood.

[50]  Kai Zheng,et al.  Collaborative search in electronic health records , 2011, J. Am. Medical Informatics Assoc..

[51]  Rob Koeling,et al.  Optimising the use of electronic health records to estimate the incidence of rheumatoid arthritis in primary care: what information is hidden in free text? , 2013, BMC Medical Research Methodology.

[52]  Thomas M Braun,et al.  Plasma biomarkers of lower gastrointestinal and liver acute GVHD. , 2012, Blood.

[53]  M. Gardner,et al.  Information retrieval for patient care , 1997, BMJ.

[54]  K M Jensen,et al.  Primary care for adults with Down syndrome: adherence to preventive healthcare recommendations. , 2013, Journal of intellectual disability research : JIDR.

[55]  Trivellore E Raghunathan,et al.  Alcohol use and cigarette smoking as risk factors for post-endoscopic retrograde cholangiopancreatography pancreatitis. , 2009, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[56]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[57]  Michael Zalis,et al.  Advanced search of the electronic medical record: augmenting safety and efficiency in radiology. , 2010, Journal of the American College of Radiology : JACR.

[58]  Susan C. Weber,et al.  STRIDE - An Integrated Standards-Based Translational Research Informatics Platform , 2009, AMIA.

[59]  Jules J Berman,et al.  Implementation and evaluation of a negation tagger in a pipeline-based system for information extract from pathology reports. , 2004, Studies in health technology and informatics.

[60]  Carol E Chenoweth,et al.  Accuracy of Hospital Administrative Data in Reporting Central Line–Associated Bloodstream Infections in Newborns , 2013, Pediatrics.

[61]  Carol Friedman,et al.  Natural language processing: State of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine , 2013, J. Biomed. Informatics.