Query log analysis of an electronic health record search engine.

We analyzed a longitudinal collection of query logs of a full-text search engine designed to facilitate information retrieval in electronic health records (EHR). The collection, 202,905 queries and 35,928 user sessions recorded over a course of 4 years, represents the information-seeking behavior of 533 medical professionals, including frontline practitioners, coding personnel, patient safety officers, and biomedical researchers for patient data stored in EHR systems. In this paper, we present descriptive statistics of the queries, a categorization of information needs manifested through the queries, as well as temporal patterns of the users' information-seeking behavior. The results suggest that information needs in medical domain are substantially more sophisticated than those that general-purpose web search engines need to accommodate. Therefore, we envision there exists a significant challenge, along with significant opportunities, to provide intelligent query recommendations to facilitate information retrieval in EHR.

[1]  Silviu Cucerzan,et al.  Acronym-Expansion Recognition and Ranking on the Web , 2007, 2007 IEEE International Conference on Information Reuse and Integration.

[2]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[3]  Yinglian Xie,et al.  Locality in search engine queries and its implications for caching , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[4]  Daniel M. Stein,et al.  An analysis of clinical queries in an electronic health record search utility , 2010, Int. J. Medical Informatics.

[5]  Yan Zhang,et al.  Searching : an Analysis of Questions in a Social Q & A Community , 2010 .

[6]  Zhiyong Lu,et al.  Evaluation of query expansion using MeSH in PubMed , 2009, Information Retrieval.

[7]  Elmer V. Bernstam,et al.  A day in the life of PubMed: analysis of a typical day's query log. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[8]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[9]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[10]  Thomas Braun,et al.  Change in plasma tumor necrosis factor receptor 1 levels in the first week after myeloablative allogeneic transplantation correlates with severity and incidence of GVHD and survival. , 2008, Blood.

[11]  Qing Zeng-Treitler,et al.  Research Paper: Assisting Consumer Health Information Retrieval with Query Recommendations , 2006, J. Am. Medical Informatics Assoc..

[12]  Kent A. Spackman,et al.  SNOMED clinical terms: overview of the development process and project status , 2001, AMIA.

[13]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[14]  Yin Yang,et al.  A study of medical and health queries to web search engines. , 2004, Health information and libraries journal.

[15]  Gang Luo Lessons learned from building the iMED intelligent medical search engine , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[16]  Christian Köhler,et al.  How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews , 2002, BMJ : British Medical Journal.

[17]  Gang Luo,et al.  Design and Evaluation of the iMed Intelligent Medical Search Engine , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[18]  Aysu Betin Can,et al.  MedicoPort: A medical search engine for all , 2007, Comput. Methods Programs Biomed..

[19]  Eric Brill,et al.  Spelling Correction as an Iterative Process that Exploits the Collective Knowledge of Web Users , 2004, EMNLP.

[20]  M. Englesbe,et al.  Early Urologic Complications After Pediatric Renal Transplant: A Single-Center Experience , 2008, Transplantation.

[21]  Kai Zheng,et al.  Collaborative search in electronic health records , 2011, J. Am. Medical Informatics Assoc..

[22]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[23]  Christian Köhler,et al.  What is the prevalence of health-related searches on the World Wide Web? Qualitative and quantitative analysis of search engine queries on the Internet , 2003, AMIA.

[24]  Robert Krovetz,et al.  Viewing morphology as an inference process , 1993, Artif. Intell..

[25]  David A. Hanauer,et al.  EMERSE: The Electronic Medical Record Search Engine , 2006, AMIA.

[26]  Chunqiang Tang,et al.  Challenging issues in iterative intelligent medical search , 2008, 2008 19th International Conference on Pattern Recognition.

[27]  Hao Yang,et al.  MedSearch: a specialized search engine for medical information retrieval , 2008, CIKM '08.

[28]  Qing Zeng-Treitler,et al.  Research Paper: A Frequency-based Technique to Improve the Spelling Suggestion Rank in Medical Queries , 2004, J. Am. Medical Informatics Assoc..

[29]  Trivellore E Raghunathan,et al.  Alcohol use and cigarette smoking as risk factors for post-endoscopic retrograde cholangiopancreatography pancreatitis. , 2009, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[30]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[31]  Shuang-Hong Yang,et al.  Dialect topic modeling for improved consumer medical search. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[32]  Rey-Long Liu,et al.  Medical query generation by term-category correlation , 2011, Inf. Process. Manag..

[33]  Kenneth Ward Church,et al.  Query suggestion using hitting time , 2008, CIKM '08.

[34]  Ravi Kumar,et al.  A Characterization of Online Search Behavior , 2009, IEEE Data Eng. Bull..