Using multiclass classification to automate the identification of patient safety incident reports by type and severity

BackgroundApproximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals.MethodsText based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_Type = 2860, n_SeverityLevel = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_Type = 6000, n_SeverityLevel =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall.ResultsThe most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3).ConclusionsBinary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type.

[1]  R. Thomson,et al.  Towards an International Classification for Patient Safety: key concepts and terms , 2009, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[2]  R Ratwani,et al.  An Evaluation of Patient Safety Event Report Categories Using Unsupervised Topic Modeling. , 2015, Methods of information in medicine.

[3]  W. Runciman,et al.  An integrated framework for safety, quality and risk management: an information and incident management system based on a universal patient safety classification , 2006, Quality and Safety in Health Care.

[4]  Raj M. Ratwani,et al.  Exploring methods for identifying related patient safety events using structured and unstructured data , 2015, J. Biomed. Informatics.

[5]  Hung-Yi Lin,et al.  Efficient classifiers for multi-class classification problems , 2012, Decis. Support Syst..

[6]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[7]  Vincent Liu,et al.  Automated identification of pneumonia in chest radiograph reports in critically ill patients , 2013, BMC Medical Informatics and Decision Making.

[8]  Alex A. T. Bui,et al.  Automatic Identification & Classification of Surgical Margin Status from Pathology Reports Following Prostate Cancer Surgery , 2007, AMIA.

[9]  P. Pronovost,et al.  Patient safety incident reporting: a qualitative study of thoughts and perceptions of experts 15 years after ‘To Err is Human’ , 2015, BMJ Quality & Safety.

[10]  Farah Magrabi,et al.  Automated identification of extreme-risk events in clinical incident reports , 2012, J. Am. Medical Informatics Assoc..

[11]  R. Conroy,et al.  Adverse events in healthcare: learning from mistakes. , 2015, QJM : monthly journal of the Association of Physicians.

[12]  Farah Magrabi,et al.  Clinical safety of England's national programme for IT: A retrospective analysis of all reported safety events 2005 to 2011 , 2015, Int. J. Medical Informatics.

[13]  Peter J. Pronovost,et al.  Improving the Value of Patient Safety Reporting Systems , 2008 .

[14]  Farah Magrabi,et al.  Using statistical text classification to identify health information technology incidents , 2013, J. Am. Medical Informatics Assoc..

[15]  Nathalie Japkowicz,et al.  The Class Imbalance Problem: Significance and Strategies , 2000 .

[16]  R. Mahajan,et al.  Critical incident reporting and learning. , 2010, British journal of anaesthesia.

[17]  Abeed Sarker,et al.  Portable automatic text classification for adverse drug reaction detection via multi-corpus training , 2015, J. Biomed. Informatics.

[18]  Albert Y. Zomaya,et al.  A Survey of Mobile Device Virtualization , 2016, ACM Comput. Surv..

[19]  John Xavier Rolley Bn Rn Mcn Mrcna Safety and Ethics in Healthcare: A Guide to Getting it Right , 2007 .

[20]  J. Rolley Safety and Ethics in Healthcare: A Guide to Getting it Right , 2007 .

[21]  J. Braithwaite,et al.  Implementation of a patient safety incident management system as viewed by doctors, nurses and allied health professionals , 2009, Health.

[22]  W. J. Russell,et al.  The Australian Incident Monitoring Study: An Analysis of 2000 Incident Reports , 1993, Anaesthesia and intensive care.

[23]  J Gosbee,et al.  Developing and deploying a patient safety program in a large health care delivery system: you can't fix what you don't know about. , 2001, The Joint Commission journal on quality improvement.

[24]  Yang Gong Data Consistency in a Voluntary Medical Incident Reporting System , 2009, Journal of Medical Systems.

[25]  Allan Fong,et al.  'Connecting the dots': leveraging visual analytics to make sense of patient safety event reports , 2015, J. Am. Medical Informatics Assoc..

[26]  Johanna I. Westbrook,et al.  Improving the identification and management of chronic kidney disease in primary care: lessons from a staged improvement collaborative , 2014, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[27]  Pernille Warrer,et al.  Using text-mining techniques in electronic patient records to identify ADRs from medicine use. , 2012, British Journal of Clinical Pharmacology.

[28]  Anant Madabhushi,et al.  An active learning based classification strategy for the minority class problem: application to histopathology annotation , 2011, BMC Bioinformatics.

[29]  Luís Torgo,et al.  A Survey of Predictive Modeling on Imbalanced Domains , 2016, ACM Comput. Surv..

[30]  Farah Magrabi,et al.  Automated categorisation of clinical incident reports using statistical text classification , 2010, Quality and Safety in Health Care.

[31]  D. Black On the Rationale of Group Decision-making , 1948, Journal of Political Economy.

[32]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[33]  Martti Juhola,et al.  Stemming and lemmatization in the clustering of finnish text documents , 2004, CIKM '04.

[34]  D. Ashcroft,et al.  Medication errors: how reliable are the severity ratings reported to the national reporting and learning system? , 2009, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[35]  Erin Sparnon,et al.  Screening Electronic Health Record–Related Patient Safety Reports Using Machine Learning , 2017, Journal of patient safety.

[36]  Sebastián Ventura,et al.  A Tutorial on Multilabel Learning , 2015, ACM Comput. Surv..

[37]  Hua Xu,et al.  Applying active learning to assertion classification of concepts in clinical text , 2012, J. Biomed. Informatics.

[38]  Andrew Zisserman,et al.  Efficient Visual Search of Videos Cast as Text Retrieval , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Özlem Uzuner,et al.  Machine learning and rule-based approaches to assertion classification. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[40]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[41]  Farah Magrabi,et al.  Using FDA reports to inform a classification for health information technology safety problems , 2012, J. Am. Medical Informatics Assoc..

[42]  Charles P. Friedman,et al.  Viewpoint Paper: A "Fundamental Theorem" of Biomedical Informatics , 2009, J. Am. Medical Informatics Assoc..

[43]  Jun Wang,et al.  Enhancing multi-label classification by modeling dependencies among labels , 2014, Pattern Recognit..

[44]  Francisco Herrera,et al.  An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes , 2011, Pattern Recognit..

[45]  Inconsistency in Classification and Reporting of In‐Hospital Falls , 2009, Journal of the American Geriatrics Society.

[46]  Farah Magrabi,et al.  Identifying patient safety problems associated with information technology in general practice: an analysis of incident reports , 2015, BMJ Quality & Safety.