Classification of Dengue Fever Patients Based on Gene Expression Data Using Support Vector Machines

Background Symptomatic infection by dengue virus (DENV) can range from dengue fever (DF) to dengue haemorrhagic fever (DHF), however, the determinants of DF or DHF progression are not completely understood. It is hypothesised that host innate immune response factors are involved in modulating the disease outcome and the expression levels of genes involved in this response could be used as early prognostic markers for disease severity. Methodology/Principal Findings mRNA expression levels of genes involved in DENV innate immune responses were measured using quantitative real time PCR (qPCR). Here, we present a novel application of the support vector machines (SVM) algorithm to analyze the expression pattern of 12 genes in peripheral blood mononuclear cells (PBMCs) of 28 dengue patients (13 DHF and 15 DF) during acute viral infection. The SVM model was trained using gene expression data of these genes and achieved the highest accuracy of ∼85% with leave-one-out cross-validation. Through selective removal of gene expression data from the SVM model, we have identified seven genes (MYD88, TLR7, TLR3, MDA5, IRF3, IFN-α and CLEC5A) that may be central in differentiating DF patients from DHF, with MYD88 and TLR7 observed to be the most important. Though the individual removal of expression data of five other genes had no impact on the overall accuracy, a significant combined role was observed when the SVM model of the two main genes (MYD88 and TLR7) was re-trained to include the five genes, increasing the overall accuracy to ∼96%. Conclusions/Significance Here, we present a novel use of the SVM algorithm to classify DF and DHF patients, as well as to elucidate the significance of the various genes involved. It was observed that seven genes are critical in classifying DF and DHF patients: TLR3, MDA5, IRF3, IFN-α, CLEC5A, and the two most important MYD88 and TLR7. While these preliminary results are promising, further experimental investigation is necessary to validate their specific roles in dengue disease.

[1]  Mark J. Schreiber,et al.  Decision Tree Algorithms Predict the Diagnosis and Outcome of Dengue Fever in the Early Phase of Illness , 2008, PLoS neglected tropical diseases.

[2]  L. Platanias Mechanisms of type-I- and type-II-interferon-mediated signalling , 2005, Nature Reviews Immunology.

[3]  N. Bhardwaj,et al.  Plasmacytoid Dendritic Cells: Linking Innate and Adaptive Immunity , 2005, Journal of Virology.

[4]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[5]  Xin Yao,et al.  Gene selection algorithms for microarray data based on least squares support vector machine , 2006, BMC Bioinformatics.

[6]  M. Yoneyama,et al.  RNA recognition and signal transduction by RIG‐I‐like receptors , 2009, Immunological reviews.

[7]  E. Holmes,et al.  The causes and consequences of genetic variation in dengue virus. , 2000, Trends in microbiology.

[8]  Chuhsing Kate Hsiao,et al.  A new regularized least squares support vector regression for gene selection , 2009, BMC Bioinformatics.

[9]  M. Diamond,et al.  The host immunologic response to West Nile encephalitis virus. , 2009, Frontiers in bioscience.

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  Hideo Negishi,et al.  IRF-7 is the master regulator of type-I interferon-dependent immune responses , 2005, Nature.

[12]  Shizuo Akira,et al.  Toll‐like Receptor and RIG‐1‐like Receptor Signaling , 2008, Annals of the New York Academy of Sciences.

[13]  Yanqing Zhang,et al.  Development of Two-Stage SVM-RFE Gene Selection Strategy for Microarray Expression Data Analysis , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Ulisses Braga-Neto,et al.  Gene Expression Profiling during Early Acute Febrile Stage of Dengue Infection Can Predict the Disease Outcome , 2009, PloS one.

[15]  G. Foster,et al.  Dengue Virus Inhibits Alpha Interferon Signaling by Reducing STAT2 Expression , 2005, Journal of Virology.

[16]  Chi-Huey Wong,et al.  CLEC5A is critical for dengue-virus-induced lethal disease , 2008, Nature.

[17]  Pieter H. Reitsma,et al.  Differential Gene Expression Changes in Children with Severe Dengue Virus Infections , 2008, PLoS neglected tropical diseases.

[18]  Tin Wee Tan,et al.  SVM-based prediction of caspase substrate cleavage sites , 2006, BMC Bioinformatics.

[19]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[20]  Philippe Després,et al.  Human genetic determinants of dengue virus susceptibility. , 2009, Microbes and infection.

[21]  G. Crane Dengue haemorrhagic fever: diagnosis, treatment, prevention and control , 1999 .

[22]  Ulisses Braga-Neto,et al.  Reliable Classifier to Differentiate Primary and Secondary Acute Dengue Infection Based on IgG ELISA , 2009, PloS one.

[23]  Jeerayut Chaijaruwanich,et al.  Differences in global gene expression in peripheral blood mononuclear cells indicate a significant role of the innate responses in progression of dengue fever but not dengue hemorrhagic fever. , 2008, The Journal of infectious diseases.

[24]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[25]  Kevin R Porter,et al.  Functional characterization of ex vivo blood myeloid and plasmacytoid dendritic cells after infection with dengue virus. , 2009, Virology.

[26]  J. Muñoz-Jordán Subversion of interferon by dengue virus. , 2010, Current topics in microbiology and immunology.

[27]  E. Nascimento,et al.  Characterization of a dengue patient cohort in Recife, Brazil. , 2007, The American journal of tropical medicine and hygiene.