Laplacian SVM Based Feature Selection Improves Medical Event Reports Classification

Timely reporting and analysis of adverse events and medical errors is critical to driving forward programs in patient-safety, however, due to the large numbers of event reports accumulating daily in health institutions, manually finding and labeling certain types of errors or events is becoming increasingly challenging. We propose to automatically classify/label event reports via semi-supervised learning which utilizes labeled as well as unlabeled event reports to complete the classification task. We focused on classifying two types of event reports: patient mismatches and weight errors. We downloaded 9405 reports from the Connecticut Children's Medical Center reporting system. We generated two samples of labeled and unlabeled reports containing 3155 and 255 for the patient mismatch and the weight error use cases respectively. We developed feature based Laplacian Support Vector machine (FS-LapSVM), a hybrid framework that combines feature selection with Laplacian Support Vector machine classifier (LapSVM). Superior performance of FS-LapSVM in finding patient weight error reports compared to LapSVM. Also, FS-LapSVM classifier outperformed standard LapSVM in classifying patient mismatch reports across all metrics.

[1]  Shawn Wolfe,et al.  Wordplay: An Examination of Semantic Approaches to Classify Safety Reports , 2007 .

[2]  Bernhard Schölkopf,et al.  Cluster Kernels for Semi-Supervised Learning , 2002, NIPS.

[3]  Avrim Blum,et al.  Learning from Labeled and Unlabeled Data using Graph Mincuts , 2001, ICML.

[4]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[5]  Paul J Sharek,et al.  Adverse Events in the Neonatal Intensive Care Unit: Development, Testing, and Findings of an NICU-Focused Trigger Tool to Identify Harm in North American NICUs , 2006, Pediatrics.

[6]  Eric S. Kirkendall,et al.  Phenotyping for patient safety: algorithm development for electronic health record based automated adverse event and medical error detection in neonatal intensive care , 2014, Journal of the American Medical Informatics Association : JAMIA.

[7]  Claudia R Campbell,et al.  A comparison of hospital adverse events identified by three widely used detection methods. , 2009, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[8]  Eric S. Kirkendall,et al.  Measuring Adverse Events and Levels of Harm in Pediatric Inpatients With the Global Trigger Tool , 2012, Pediatrics.

[9]  George Hripcsak,et al.  Automated detection of adverse events using natural language processing of discharge summaries. , 2005, Journal of the American Medical Informatics Association : JAMIA.

[10]  P. Conway,et al.  Identifying causes of adverse events detected by an automated trigger tool through in-depth analysis , 2010, Quality and Safety in Health Care.

[11]  Dawn M. McIntosh,et al.  Automatic Discovery of Anomalies Reported in Aerospace Systems Health and Safety Documents , 2007 .

[12]  N. Laird,et al.  Incidence of Adverse Drug Events and Potential Adverse Drug Events: Implications for Prevention , 1995 .

[13]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[14]  M. Cantor,et al.  Using trigger phrases to detect adverse drug reactions in ambulatory care notes , 2007, Quality and Safety in Health Care.

[15]  D. Bates,et al.  Medication errors and adverse drug events in pediatric inpatients. , 2001, JAMA.

[16]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[17]  Farah Magrabi,et al.  Automated categorisation of clinical incident reports using statistical text classification , 2010, Quality and Safety in Health Care.

[18]  Vikas Sindhwani,et al.  On Manifold Regularization , 2005, AISTATS.

[19]  Medication Errors : Significance of Accurate Patient Weights , 2010 .

[20]  Tommi S. Jaakkola,et al.  Partially labeled classification with Markov random walks , 2001, NIPS.

[21]  Cynthia Brandt,et al.  Semi-supervised clinical text classification with Laplacian SVMs: An application to cancer case management , 2013, J. Biomed. Informatics.

[22]  Mikhail Belkin,et al.  Laplacian Support Vector Machines Trained in the Primal , 2009, J. Mach. Learn. Res..