Leveraging medical context to recommend semantically similar terms for chart reviews

Background Information retrieval (IR) help clinicians answer questions posed to large collections of electronic medical records (EMRs), such as how best to identify a patient’s cancer stage. One of the more promising approaches to IR for EMRs is to expand a keyword query with similar terms (e.g., augmenting cancer with mets ). However, there is a large range of clinical chart review tasks, such that fixed sets of similar terms is insufficient. Current language models, such as Bidirectional Encoder Representations from Transformers (BERT) embeddings, do not capture the full non-textual context of a task. In this study, we present new methods that provide similar terms dynamically by adjusting with the context of the chart review task. Methods We introduce a vector space for medical-context in which each word is represented by a vector that captures the word’s usage in different medical contexts (e.g., how frequently cancer is used when ordering a prescription versus describing family history) beyond the context learned from the surrounding text. These vectors are transformed into a vector space for customizing the set of similar terms selected for different chart review tasks. We evaluate the vector space model with multiple chart review tasks, in which supervised machine learning models learn to predict the preferred terms of clinically knowledgeable reviewers. To quantify the usefulness of the predicted similar terms to a baseline of standard word2vec embeddings, we measure (1) the prediction performance of the medical-context vector space model using the area under the receiver operating characteristic curve (AUROC) and (2) the labeling effort required to train the models. Results The vector space outperformed the baseline word2vec embeddings in all three chart review tasks with an average AUROC of 0.80 versus 0.66, respectively. Additionally, the medical-context vector space significantly reduced the number of labels required to learn and predict the preferred similar terms of reviewers. Specifically, the labeling effort was reduced to 10% of the entire dataset in all three tasks. Conclusions The set of preferred similar terms that are relevant to a chart review task can be learned by leveraging the medical context of the task.

[1]  B. Anthony,et al.  Mental Health Screening Quality Improvement Learning Collaborative in Pediatric Primary Care , 2017, Pediatrics.

[2]  Fei Li,et al.  Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)–Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study , 2019, JMIR medical informatics.

[3]  Uta S Guo,et al.  Racing Against the Clock: Internal Medicine Residents' Time Spent On Electronic Health Records. , 2016, Journal of graduate medical education.

[4]  Daniel Fabbri,et al.  Predicting Discharge Dates From the NICU Using Progress Note Data , 2015, Pediatrics.

[5]  Joi L. Moore,et al.  “I Don't Have Time to Dig Back Through This”: The Role of Semantic Search in Supporting Physician Information Seeking in an Electronic Health Record , 2014 .

[6]  Publisher's Note , 2018, Anaesthesia.

[7]  Travis R. Goodwin,et al.  UTD at TREC 2014: Query Expansion for Clinical Decision Support , 2014, TREC.

[8]  Yevgeniy Vorobeychik,et al.  A Crowdsourcing Framework for Medical Data Sets , 2018, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[9]  Mandar Mitra,et al.  Exploring Query Categorisation for Query Expansion: A Study , 2015, ArXiv.

[10]  Paul A. Harris,et al.  Secondary use of clinical data: The Vanderbilt approach , 2014, J. Biomed. Informatics.

[11]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[12]  Martin Hofmann-Apitius,et al.  ADO: A disease ontology representing the domain knowledge specific to Alzheimer's disease , 2014, Alzheimer's & Dementia.

[13]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[14]  Arantxa Otegi,et al.  Improving search over Electronic Health Records using UMLS-based query expansion through random walks , 2014, J. Biomed. Informatics.

[15]  George Hripcsak,et al.  Use of electronic clinical documentation: time spent and team interactions , 2011, J. Am. Medical Informatics Assoc..

[16]  Daniel M. Stein,et al.  An analysis of clinical queries in an electronic health record search utility , 2010, Int. J. Medical Informatics.

[17]  Daniel Fabbri,et al.  Extracting similar terms from multiple EMR-based semantic embeddings to support chart reviews , 2018, J. Biomed. Informatics.

[18]  Hongfang Liu,et al.  A Comparison of Word Embeddings for the Biomedical Natural Language Processing , 2018, J. Biomed. Informatics.

[19]  Chris North,et al.  Bridging the gap between user intention and model parameters for human-in-the-loop data analytics , 2016, HILDA '16.

[20]  Randolph A. Miller,et al.  Research Paper: Evaluation of a Method to Identify and Categorize Section Headers in Clinical Documents , 2009, J. Am. Medical Informatics Assoc..

[21]  Luke V Rasmussen,et al.  The Electronic Health Record for Translational Research , 2014, Journal of Cardiovascular Translational Research.

[22]  Dario A. Giuse,et al.  StarTracker: An Integrated, Web-based Clinical Search Engine , 2003, AMIA.

[23]  Yue Wang,et al.  Interactive medical word sense disambiguation through informed learning , 2018, J. Am. Medical Informatics Assoc..

[24]  Hongwu Qin,et al.  A survey of query expansion, query suggestion and query refinement techniques , 2015, 2015 4th International Conference on Software Engineering and Computer Systems (ICSECS).

[25]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[26]  Michael Zalis,et al.  Advanced search of the electronic medical record: augmenting safety and efficiency in radiology. , 2010, Journal of the American College of Radiology : JACR.

[27]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[28]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[29]  Jeffrey S. Simonoff,et al.  Tree Induction Vs Logistic Regression: A Learning Curve Analysis , 2001, J. Mach. Learn. Res..

[30]  Kai Zheng,et al.  Supporting information retrieval from electronic health records: A report of University of Michigan's nine-year experience in developing and using the Electronic Medical Record Search Engine (EMERSE) , 2015, J. Biomed. Informatics.

[31]  P. Biron,et al.  An information retrieval system for computerized patient records in the context of a daily hospital practice: the example of the Léon Bérard Cancer Center (France) , 2014, Applied Clinical Informatics.