Exploring methods for identifying related patient safety events using structured and unstructured data

Most healthcare systems have implemented patient safety event reporting systems to identify safety hazards. Searching the safety event data to find related patient safety reports and identify trends is challenging given the complexity and quantity of these reports. Structured data elements selected by the event reporter may be inaccurate and the free-text narrative descriptions are difficult to analyze. In this paper we present and explore methods for utilizing both the unstructured free-text and structured data elements in safety event reports to identify and rank similar events. We evaluate the results of three different free-text search methods, including a unique topic modeling adaptation, and structured element weights, using a patient fall use case. The various search techniques and weight combinations tended to prioritize different aspects of the event reports leading to different search and ranking results. These search and prioritization methods have the potential to greatly improve patient safety officers, and other healthcare workers, understanding of which safety event reports are related.

[1]  Farah Magrabi,et al.  Automated identification of extreme-risk events in clinical incident reports , 2012, J. Am. Medical Informatics Assoc..

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  George Hripcsak,et al.  Automated detection of adverse events using natural language processing of discharge summaries. , 2005, Journal of the American Medical Informatics Association : JAMIA.

[4]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[5]  Peter J Pronovost,et al.  Toward learning from patient safety reporting systems. , 2006, Journal of critical care.

[6]  Marianthi Markatou,et al.  Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection , 2011, J. Am. Medical Informatics Assoc..

[7]  Farah Magrabi,et al.  Using FDA reports to inform a classification for health information technology safety problems , 2012, J. Am. Medical Informatics Assoc..

[8]  Qiaozhu Mei,et al.  Understanding the Limiting Factors of Topic Modeling via Posterior Contraction Analysis , 2014, ICML.

[9]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[10]  Albert W Wu,et al.  ICU incident reporting systems. , 2002, Journal of critical care.

[11]  Janet M. Corrigan,et al.  Patient Safety Reporting Systems and Applications , 2004 .

[12]  Aziz A. Boxwala,et al.  Organization and representation of patient safety data: current status and issues around generalizability and scalability. , 2004, Journal of the American Medical Informatics Association : JAMIA.

[13]  E-Step Structural Topic Models for Open Ended Survey Responses , 2022 .

[14]  John R. Clarke,et al.  How a System for Reporting Medical Errors Can and Cannot Improve Patient Safety , 2006, The American surgeon.

[15]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[16]  Matthew Smith,et al.  On the ecological validity of a password study , 2013, SOUPS.

[17]  Juan Enrique Ramos,et al.  Using TF-IDF to Determine Word Relevance in Document Queries , 2003 .

[18]  Czeslaw Danilowicz,et al.  Re-ranking method based on inter-document distances , 2005, Inf. Process. Manag..

[19]  Edoardo M. Airoldi,et al.  Summarizing topical content with word frequency and exclusivity , 2012, ICML 2012.

[20]  Margaret E. Roberts,et al.  The structural topic model and applied social science , 2013, ICONIP 2013.

[21]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.