Development of a predictive model for Clostridium difficile infection incidence in hospitals using Gaussian mixture model and Dempster–Shafer theory

Clostridium difficile infection is one of the major patient safety concerns in hospitals worldwide. Clostridium difficile infection can have high economic burden to patients, hospitals, and government. Limited work has been done in the area of predictive modeling. In this article, A new predictive model based on Gaussian mixture model and Dempster–Shafter theory is proposed to predict Clostridium difficile infection incidence in hospitals. First, the Gaussian mixture model and expectation–maximization algorithms are used to generate explicit probability criteria of risk factors based on the given data. Second, Dempster–Shafter theory is used to predict the Clostridium difficile infection incidence based on the generated probability criteria that have different beliefs attributing to their different credits. The main procedure includes (1) generate the probability criteria model using Gaussian mixture model and expectation–maximization algorithm; (2) determine the credit of the probability criteria; (3) generate the basic probability assignment; (4) discount the evidences; (5) aggregate the evidences using Dempster combining rule; (6) predict Clostridium difficile infection incidence using pignistic probability transformation. Results show that the model has a higher accuracy than an existing model. The proposed model can generate the criteria ratings of risk factors automatically, which would potentially prevent the imprecision caused by the subjective judgement of experts. The proposed model can assist risk managers and hospital administrators in the prediction and control of Clostridium difficile infection incidence with optimizing their resources.

[1]  Zeshui Xu,et al.  Probabilistic dual hesitant fuzzy set and its application in risk evaluation , 2017, Knowl. Based Syst..

[2]  Maria E. Orlowska,et al.  Projected outlier detection in high-dimensional mixed-attributes data set , 2009, Expert Syst. Appl..

[3]  Chee Khiang Pang,et al.  Gaussian Mixture Model Using Semisupervised Learning for Probabilistic Fault Diagnosis Under New Data Categories , 2017, IEEE Transactions on Instrumentation and Measurement.

[4]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[5]  Simon Lacroix,et al.  Classification of Outdoor 3D Lidar Data Based on Unsupervised Gaussian Mixture Models , 2017, IEEE Transactions on Automation Science and Engineering.

[6]  Rehan Sadiq,et al.  Clostridium difficile infection incidence prediction in hospitals (CDIIPH): a predictive model based on decision tree and fuzzy techniques , 2017, Stochastic Environmental Research and Risk Assessment.

[7]  Pavel V. Sevastjanov,et al.  An interpretation of intuitionistic fuzzy sets in terms of evidence theory: Decision making aspect , 2010, Knowl. Based Syst..

[8]  Joshua A. Doherty,et al.  Development and Validation of a Clostridium difficile Infection Risk Prediction Model Author ( s ) : , 2018 .

[9]  Jian-Bo Yang,et al.  The evidential reasoning approach for MADA under both probabilistic and fuzzy uncertainties , 2006, Eur. J. Oper. Res..

[10]  Xinge You,et al.  A Batch Rival Penalized Expectation-Maximization Algorithm for Gaussian Mixture Clustering with Automatic Model Selection , 2012, Comput. Math. Methods Medicine.

[11]  Karl Weiss,et al.  Prevalence of Colonization and Infection with Methicillin-Resistant Staphylococcus aureus and Vancomycin-Resistant Enterococcus and of Clostridium difficile Infection in Canadian Hospitals , 2013, Infection Control & Hospital Epidemiology.

[12]  P. Smets Data fusion in the transferable belief model , 2000, Proceedings of the Third International Conference on Information Fusion.

[13]  M. Febrero,et al.  Outlier detection in functional data by depth measures, with application to identify abnormal NOx levels , 2008 .

[14]  Sankaran Mahadevan,et al.  Evidential cognitive maps , 2012, Knowl. Based Syst..

[15]  Rehan Sadiq,et al.  Fate of drilling waste discharges and ecological risk assessment in the Egyptian Red Sea: an aquivalence-based fuzzy analysis , 2012, Stochastic Environmental Research and Risk Assessment.

[16]  Fernanda C. Lessa,et al.  Current Status of Clostridium difficile Infection Epidemiology , 2012, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[17]  Yong Deng,et al.  A method to determine basic probability assignment in the open world and its application in data fusion and classification , 2017, Applied Intelligence.

[18]  Mahdi Mahfouf,et al.  Hybrid-modelling of compact tension energy in high strength pipeline steel using a Gaussian Mixture Model based error compensation , 2016, Appl. Soft Comput..

[19]  Mark Miller,et al.  Health care-associated Clostridium difficile infection in adults admitted to acute care hospitals in Canada: a Canadian Nosocomial Infection Surveillance Program Study. , 2009, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[20]  Raghav Gupta,et al.  Density Based Outlier Detection Technique , 2016 .

[21]  Margaret A. Olsen,et al.  Burden of Clostridium difficile on the Healthcare System , 2012, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[22]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[23]  Brian Veitch,et al.  RISK-BASED DECISION-MAKING FOR DRILLING WASTE DISCHARGES USING A FUZZY SYNTHETIC EVALUATION TECHNIQUE , 2004 .

[24]  Clara Pizzuti,et al.  Distance-based detection and prediction of outliers , 2006, IEEE Transactions on Knowledge and Data Engineering.

[25]  Zhen Li,et al.  An Evidential Failure Mode and Effects Analysis Using Linguistic Terms , 2016, Qual. Reliab. Eng. Int..

[26]  Rashmi Chauhan,et al.  A Deviation Based Outlier Intrusion Detection System , 2010, CNSA.

[27]  Rahim Moineddin,et al.  The Magnitude and Duration of Clostridium difficile Infection Risk Associated with Antibiotic Therapy: A Hospital Cohort Study , 2014, PloS one.

[28]  Frédéric Barbut,et al.  Epidemiology and control of Clostridium difficile infections in healthcare settings: an update , 2011, Current opinion in infectious diseases.

[29]  L. Gentry,et al.  A clinical risk index for Clostridium difficile infection in hospitalised patients receiving broad-spectrum antibiotics. , 2008, The Journal of hospital infection.

[30]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[31]  Adrian E. Raftery,et al.  Model-based clustering and data transformations for gene expression data , 2001, Bioinform..

[32]  Helen Burstin,et al.  Strategies to Prevent Clostridium difficile Infections in Acute Care Hospitals , 2008, Infection Control & Hospital Epidemiology.

[33]  Ho-Sub Yoon,et al.  Automated Speaker Recognition for Home Service Robots Using Genetic Algorithm and Dempster–Shafer Fusion Technique , 2009, IEEE Transactions on Instrumentation and Measurement.

[34]  Timoteo Carletti,et al.  The Stochastic Evolution of a Protocell: The Gillespie Algorithm in a Dynamically Varying Volume , 2011, Comput. Math. Methods Medicine.

[35]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[36]  J. Tanner,et al.  Waterlow score to predict patients at risk of developing Clostridium difficile-associated disease. , 2009, The Journal of hospital infection.

[37]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[38]  J. Starr,et al.  Mathematical modeling of Clostridium difficile infection. , 2001, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[39]  Yong Hu,et al.  An evidential game theory framework in multi-criteria decision making process , 2014, Appl. Math. Comput..

[40]  Piero P. Bonissone,et al.  A classification and regression technique to handle heterogeneous and imperfect information , 2010, Soft Comput..