Machine Learning Models for Early Dengue Severity Prediction

Infection by dengue-virus is prevalent and a public health issue in tropical countries worldwide. Also, in developing nations, child populations remain at risk of adverse events following an infection by dengue virus, as the necessary care is not always accessible, or health professionals are without means to cheaply and reliably predict how likely is for a patient to experience severe Dengue. Here, we propose a classification model based on Machine Learning techniques, which predicts whether or not a pediatric patient will be admitted into the pediatric Intensive Care Unit, as a proxy for Dengue severity. Different Machine Learning techniques were trained and validated using Stratified 5-Fold Cross-Validation, and the best model was evaluated on a disjoint test set. Cross-Validation results showed an SVM with Gaussian Kernel outperformed the other models considered, with an 0.81 Receiver Operating Characteristic Area Under the Curve (ROC AUC) score. Subsequent results over the test set showed a 0.75 ROC AUC score. Validation and test results are promising and support further research and development.

[1]  M. Blettner,et al.  Vascular leakage in severe dengue virus infections: a potential role for the nonstructural viral protein NS1 and complement. , 2006, The Journal of infectious diseases.

[2]  Tin Wee Tan,et al.  Classification of Dengue Fever Patients Based on Gene Expression Data Using Support Vector Machines , 2010, PloS one.

[3]  S. Halstead,et al.  Secondary infection as a risk factor for dengue hemorrhagic fever/dengue shock syndrome: an historical perspective and role of antibody-dependent enhancement of infection , 2013, Archives of Virology.

[4]  J. Farrar,et al.  Epidemiological Factors Associated with Dengue Shock Syndrome and Mortality in Hospitalized Dengue Patients in Ho Chi Minh City, Vietnam , 2011, The American journal of tropical medicine and hygiene.

[5]  Luis Roman Carrasco,et al.  Predictive Tools for Severe Dengue Conforming to World Health Organization 2009 Criteria , 2014, PLoS neglected tropical diseases.

[6]  Tom Solomon,et al.  Clinical diagnosis and assessment of severity of confirmed dengue infections in Vietnamese children: is the world health organization classification system helpful? , 2004, The American journal of tropical medicine and hygiene.

[7]  P. Brasil,et al.  Is Pregnancy Associated with Severe Dengue? A Review of Data from the Rio de Janeiro Surveillance Information System , 2013, PLoS neglected tropical diseases.

[8]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[9]  K. Kesorn,et al.  Morbidity Rate Prediction of Dengue Hemorrhagic Fever (DHF) Using the Support Vector Machine and the Aedes aegypti Infection Rate in Similar Climates and Geographical Areas , 2015, PloS one.

[10]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[11]  Mark J. Schreiber,et al.  Decision Tree Algorithms Predict the Diagnosis and Outcome of Dengue Fever in the Early Phase of Illness , 2008, PLoS neglected tropical diseases.

[12]  Nguyen Tien Huy,et al.  Development of clinical decision rules to predict recurrent shock in dengue , 2013, Critical Care.

[13]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[14]  John S. Brownstein,et al.  The global distribution and burden of dengue , 2013, Nature.

[15]  Stephenie C. Lemon,et al.  Prediction of Dengue Disease Severity among Pediatric Thai Patients Using Early Clinical Laboratory Indicators , 2010, PLoS neglected tropical diseases.

[16]  A. Nisalak,et al.  Differing influences of virus burden and immune activation on disease severity in secondary dengue-3 virus infections. , 2002, The Journal of infectious diseases.

[17]  K. Shekhar,et al.  Epidemiology of Dengue/Dengue Hemorrhagic Fever in Malaysia — A Retrospective Epidemiological Study 1973-1987. Part I: Dengue hemorrhagic fever (DHF) , 1992, Asia-Pacific journal of public health.

[18]  K. Shekhar,et al.  Epidemiology of Dengue/Dengue Hemorrhagic Fever in Malaysia-A Retrospective Epidemiological Study. 1973-1987. Part II: Dengue Fever (DF). , 1992, Asia-Pacific journal of public health.

[19]  Marcel Wolbers,et al.  Clinical Features of Dengue in a Large Vietnamese Cohort: Intrinsically Lower Platelet Counts and Greater Risk for Bleeding in Adults than Children , 2012, PLoS neglected tropical diseases.

[20]  S. Kalayanarooj,et al.  Is dengue severity related to nutritional status? , 2005, The Southeast Asian journal of tropical medicine and public health.

[21]  A. Nisalak,et al.  High circulating levels of the dengue virus nonstructural protein NS1 early in dengue illness correlate with the development of dengue hemorrhagic fever. , 2002, The Journal of infectious diseases.

[22]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[23]  A. Nisalak,et al.  Dengue viremia titer, antibody response pattern, and virus serotype correlate with disease severity. , 2000, The Journal of infectious diseases.

[24]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[25]  Tom Solomon,et al.  Evaluation of the World Health Organization standard tourniquet test and a modified tourniquet test in the diagnosis of dengue infection in Viet Nam , 2002, Tropical medicine & international health : TM & IH.

[26]  E. C. Duarte,et al.  Determinants of mortality from severe dengue in Brazil: a population-based case-control study. , 2013, The American journal of tropical medicine and hygiene.