Development and empirical user-centered evaluation of semantically-based query recommendation for an electronic health record search engine

OBJECTIVE The utility of biomedical information retrieval environments can be severely limited when users lack expertise in constructing effective search queries. To address this issue, we developed a computer-based query recommendation algorithm that suggests semantically interchangeable terms based on an initial user-entered query. In this study, we assessed the value of this approach, which has broad applicability in biomedical information retrieval, by demonstrating its application as part of a search engine that facilitates retrieval of information from electronic health records (EHRs). MATERIALS AND METHODS The query recommendation algorithm utilizes MetaMap to identify medical concepts from search queries and indexed EHR documents. Synonym variants from UMLS are used to expand the concepts along with a synonym set curated from historical EHR search logs. The empirical study involved 33 clinicians and staff who evaluated the system through a set of simulated EHR search tasks. User acceptance was assessed using the widely used technology acceptance model. RESULTS The search engine's performance was rated consistently higher with the query recommendation feature turned on vs. off. The relevance of computer-recommended search terms was also rated high, and in most cases the participants had not thought of these terms on their own. The questions on perceived usefulness and perceived ease of use received overwhelmingly positive responses. A vast majority of the participants wanted the query recommendation feature to be available to assist in their day-to-day EHR search tasks. DISCUSSION AND CONCLUSION Challenges persist for users to construct effective search queries when retrieving information from biomedical documents including those from EHRs. This study demonstrates that semantically-based query recommendation is a viable solution to addressing this challenge.

[1]  Julia Adler-Milstein,et al.  More than half of US hospitals have at least a basic EHR, but stage 2 criteria remain challenging for most. , 2014, Health affairs.

[2]  Qing Zeng-Treitler,et al.  Research Paper: Assisting Consumer Health Information Retrieval with Query Recommendations , 2006, J. Am. Medical Informatics Assoc..

[3]  Fabrizio Silvestri,et al.  Generating suggestions for queries in the long tail with an inverted index , 2012, Inf. Process. Manag..

[4]  Franklin Dexter,et al.  Difficulties and Challenges Associated with Literature Searches in Operating Room Management, Complete with Recommendations , 2013, Anesthesia and analgesia.

[5]  Richard J. Holden,et al.  The Technology Acceptance Model: Its past and its future in health care , 2010, J. Biomed. Informatics.

[6]  David A. Hanauer,et al.  Enhanced identification of eligibility for depression research using an electronic medical record search engine , 2009, Int. J. Medical Informatics.

[7]  Ellen M. Voorhees,et al.  Overview of the TREC 2012 Medical Records Track , 2012, TREC.

[8]  Kun Lu,et al.  Explicitly integrating MeSH thesaurus help into health information retrieval systems: An empirical user study , 2014, Inf. Process. Manag..

[9]  Kai Zheng,et al.  Collaborative search in electronic health records , 2011, J. Am. Medical Informatics Assoc..

[10]  Yan Zhang,et al.  Health information searching behavior in MedlinePlus and the impact of tasks , 2012, IHI '12.

[11]  Stéfan Jacques Darmoni,et al.  Performance evaluation of unified medical language system®'s synonyms expansion to query PubMed , 2012, BMC Medical Informatics and Decision Making.

[12]  Zhenyu Liu,et al.  Knowledge-based query expansion to support scenario-specific retrieval of medical free text , 2005, SAC '05.

[13]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.

[14]  Thomas C. Rindflesch,et al.  Query expansion using the UMLS Metathesaurus , 1997, AMIA.

[15]  P. Biron,et al.  An information retrieval system for computerized patient records in the context of a daily hospital practice: the example of the Léon Bérard Cancer Center (France). , 2014, Applied clinical informatics.

[16]  K. Davies,et al.  The information-seeking behaviour of doctors: a review of the evidence. , 2007, Health information and libraries journal.

[17]  S. Saini,et al.  Risk Models for Post–Endoscopic Retrograde Cholangiopancreatography Pancreatitis (PEP): Smoking and Chronic Liver Disease Are Predictors of Protection Against PEP , 2013, Pancreas.

[18]  K. Bretonnel Cohen,et al.  MetaMap is a Superior Baseline to a Standard Document Retrieval Engine for the Task of Finding Patient Cohorts in Clinical Free Text , 2011, TREC.

[19]  Kai Zheng,et al.  Supporting information retrieval from electronic health records: A report of University of Michigan's nine-year experience in developing and using the Electronic Medical Record Search Engine (EMERSE) , 2015, J. Biomed. Informatics.

[20]  Ben Carterette,et al.  Improving health records search using multiple query expansion collections , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[21]  William R. Hersh,et al.  Barriers to Retrieving Patient Information from Electronic Health Record Data: Failure Analysis from the TREC Medical Records Track , 2012, AMIA.

[22]  K. Charmaz,et al.  Constructing Grounded Theory , 2014 .

[23]  William R. Hersh,et al.  Assessing thesaurus-based query expansion using the UMLS Metathesaurus , 2000, AMIA.

[24]  Carol Friedman,et al.  Limited parsing of notational text visit notes: ad-hoc vs. NLP approaches , 2000, AMIA.

[25]  D Fitzgerald,et al.  How good are clinical MEDLINE searches? A comparative study of clinical end-user and librarian searches. , 1990, Computers and biomedical research, an international journal.

[26]  Elmer V. Bernstam,et al.  Research paper: A Day in the Life of PubMed: Analysis of a Typical Day's Query Log , 2007, J. Am. Medical Informatics Assoc..

[27]  Sanna Salanterä,et al.  Overview of the ShARe/CLEF eHealth Evaluation Lab 2013 , 2013, CLEF.

[28]  Arantxa Otegi,et al.  Improving search over Electronic Health Records using UMLS-based query expansion through random walks , 2014, J. Biomed. Informatics.

[29]  Hsinchun Chen,et al.  An end user evaluation of query formulation and results review tools in three medical meta-search engines , 2007, Int. J. Medical Informatics.

[30]  Sooyoung Yoo,et al.  On the query reformulation technique for effective MEDLINE document retrieval , 2010, J. Biomed. Informatics.

[31]  Marc Overhage,et al.  An evaluation of the UMLS in representing corpus derived clinical concepts. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[32]  Joi L. Moore,et al.  “I Don't Have Time to Dig Back Through This”: The Role of Semantic Search in Supporting Physician Information Seeking in an Electronic Health Record , 2014 .

[33]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[34]  Gordon B. Davis,et al.  User Acceptance of Information Technology: Toward a Unified View , 2003, MIS Q..

[35]  Dario A. Giuse,et al.  StarTracker: An Integrated, Web-based Clinical Search Engine , 2003, AMIA.

[36]  Lei Yang,et al.  Query log analysis of an electronic health record search engine. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[37]  Barry Smyth,et al.  Improving Web Search through Collaborative Query Recommendation , 2004, ECAI.

[38]  Thomas M Braun,et al.  Elafin Is a Biomarker of Graft-Versus-Host Disease of the Skin , 2008, Science Translational Medicine.

[39]  R. Chervin,et al.  Sleep-disordered breathing in multiple sclerosis , 2012, Neurology.

[40]  Anthony N. Nguyen,et al.  Identify Disorders in Health Records using Conditional Random Fields and Metamap AEHRC at ShARe/CLEF 2013 eHealth Evaluation Lab Task 1 , 2013, CLEF.

[41]  Michael Zalis,et al.  Advanced search of the electronic medical record: augmenting safety and efficiency in radiology. , 2010, Journal of the American College of Radiology : JACR.

[42]  Susan C. Weber,et al.  STRIDE - An Integrated Standards-Based Translational Research Informatics Platform , 2009, AMIA.

[43]  Tao Tao,et al.  A formal study of information retrieval heuristics , 2004, SIGIR '04.

[44]  Daniel M. Stein,et al.  An analysis of clinical queries in an electronic health record search utility , 2010, Int. J. Medical Informatics.

[45]  Thomas C. Rindflesch,et al.  Synonym, Topic Model and Predicate-Based Query Expansion for Retrieving Clinical Documents , 2012, AMIA.

[46]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[47]  Eta S. Berner,et al.  Viewpoint Paper: Informatics Challenges for the Impending Patient Information Explosion , 2005, J. Am. Medical Informatics Assoc..

[48]  Matthew M Davis,et al.  The Factors Associated With High-Quality Communication for Critically Ill Children , 2013, Pediatrics.

[49]  Fred D. Davis Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology , 1989, MIS Q..

[50]  Matthew M Davis,et al.  Fidelity of Administrative Data When Researching Down Syndrome , 2014, Medical care.