Initializing and Growing a Database of Health Information Technology (HIT) Events by Using TF-IDF and Biterm Topic Modeling

Health information technology (HIT) events have been listed among the top 10 technology-related hazards, as one in six patient safety events (PSEs) is related to HIT. Although event reporting is widely recognized as an effective way to accumulate typical cases for learning, the lack of HIT event databases remains a challenge. Aiming to retrieve HIT events from the millions of medical device event reports in the FDA Manufacturer and User Facility Device Experience (MAUDE) database, we proposed a novel identification strategy composed of a structured data-based filter and an unstructured data-based classifier using both TF-IDF and biterm topic modeling. A dataset consisting of 97% HIT events was retrieved from the raw 2015 FDA MAUDE database, in which only approximately 0.4% to 0.9% of reports are HIT events. This strategy holds promise for initializing and growing an HIT database to meet the challenges of collecting, analyzing, sharing, and learning from HIT events at an aggregated level.
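The two-stage idea can be illustrated with a minimal Python sketch. It is not the authors' implementation. The field names (`generic_name`, `text`), the keyword list, and all function names are hypothetical, chosen only for illustration. The sketch shows a keyword filter over a structured field, a plain TF-IDF weighting of the narrative text, and the extraction of biterms (unordered word pairs co-occurring in one short text), which are the input unit of a biterm topic model. Fitting the topic model and training the classifier are omitted.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical keyword list; the filter terms actually used are not given in the abstract.
HIT_KEYWORDS = ("software", "computer", "system")


def structured_filter(reports, keywords=HIT_KEYWORDS):
    """Keep reports whose structured 'generic_name' field mentions any keyword."""
    return [
        r for r in reports
        if any(k in r["generic_name"].lower() for k in keywords)
    ]


def tokenize(text):
    """Lowercase whitespace tokenizer (a real pipeline would do more cleaning)."""
    return text.lower().split()


def tf_idf(docs):
    """Return one {term: weight} dict per document, with weight = tf * log(N / df)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        vectors.append({
            term: (count / len(toks)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def biterms(text):
    """Unordered pairs of distinct words that co-occur in one short text."""
    return list(combinations(sorted(set(tokenize(text))), 2))


# Toy reports, invented for illustration only.
reports = [
    {"generic_name": "Infusion pump software", "text": "software displayed wrong dose"},
    {"generic_name": "Hip implant", "text": "implant fractured"},
    {"generic_name": "Radiology information system", "text": "system displayed wrong patient"},
]
candidates = structured_filter(reports)
weights = tf_idf([r["text"] for r in candidates])
pairs = [biterms(r["text"]) for r in candidates]
```

In a fuller pipeline, the TF-IDF weights and the topic proportions inferred from the biterms would presumably serve as features for a classifier that separates HIT from non-HIT reports among the filtered candidates.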
