A Computational Study Identifies HIV Progression-Related Genes Using mRMR and Shortest Path Tracing

Since statistical relationships between HIV load and CD4+ T cell loss have been demonstrated to be weak, searching for host factors contributing to the pathogenesis of HIV infection becomes a key point for both understanding the disease pathology and developing treatments. We applied Maximum Relevance Minimum Redundancy (mRMR) algorithm to a set of microarray data generated from the CD4+ T cells of viremic non-progressors (VNPs) and rapid progressors (RPs) to identify host factors associated with the different responses to HIV infection. Using mRMR algorithm, 147 gene had been identified. Furthermore, we constructed a weighted molecular interaction network with the existing protein-protein interaction data from STRING database and identified 1331 genes on the shortest-paths among the genes identified with mRMR. Functional analysis shows that the functions relating to apoptosis play important roles during the pathogenesis of HIV infection. These results bring new insights of understanding HIV progression.

[1]  Manish Kumar,et al.  Heat Shock Protein 40 Is Necessary for Human Immunodeficiency Virus-1 Nef-mediated Enhancement of Viral Gene Expression and Replication* , 2005, Journal of Biological Chemistry.

[2]  Curtis D. Klaassen,et al.  Xenobiotic, Bile Acid, and Cholesterol Transporters: Function and Regulation , 2010, Pharmacological Reviews.

[3]  G. Blatch,et al.  Heat shock protein 40 (Hsp40) plays a key role in the virus life cycle. , 2011, Virus research.

[4]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[5]  S. Khoo,et al.  Intracellular accumulation of efavirenz and nevirapine is independent of P-glycoprotein activity in cultured CD4 T cells and primary human lymphocytes. , 2009, The Journal of antimicrobial chemotherapy.

[6]  X. Wang,et al.  Predicting hepatitis B virus–positive metastatic hepatocellular carcinomas using gene expression profiling and supervised machine learning , 2003, Nature Medicine.

[7]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[8]  Adriano Lazzarin,et al.  Treatment of primary HIV-1 infection with cyclosporin A coupled with highly active antiretroviral therapy. , 2002, The Journal of clinical investigation.

[9]  G. Sirugo,et al.  Mortality in HIV infection is independently predicted by host iron status and SLC11A1 and HP genotypes, with new evidence of a gene-nutrient interaction. , 2009, The American journal of clinical nutrition.

[10]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[11]  P. Krammer,et al.  Sensitization of T cells to CD95-mediated apoptosis by HIV-1 Tat and gp120 , 1995, Nature.

[12]  G. Pantaleo,et al.  Effects of mycophenolic acid on human immunodeficiency virus infection in vitro and in vivo , 2000, Nature Medicine.

[13]  N. Jones,et al.  Depletion of Regulatory T Cells in HIV Infection Is Associated with Immune Activation1 , 2005, The Journal of Immunology.

[14]  David Heckerman,et al.  Influence of HLA-C Expression Level on HIV Control , 2013, Science.

[15]  K. Taylor,et al.  Genes linked to energy metabolism and immunoregulatory mechanisms are associated with subcutaneous adipose tissue distribution in HIV-infected men , 2011, Pharmacogenetics and genomics.

[16]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[17]  P. D. de Bakker,et al.  Comparative transcriptomics of extreme phenotypes of human HIV-1 infection and SIV infection in sooty mangabey and rhesus macaque. , 2011, The Journal of clinical investigation.

[18]  Peter Hunt,et al.  Immune activation set point during early HIV infection predicts subsequent CD4+ T-cell changes independent of viral load. , 2004, Blood.

[19]  Martin Meier-Schellersheim,et al.  CD4 T Cell Depletion Is Linked Directly to Immune Activation in the Pathogenesis of HIV-1 and HIV-2 but Only Indirectly to the Viral Load1 , 2002, The Journal of Immunology.

[20]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[21]  F. Miedema,et al.  Persistent immune activation in HIV-1 infection is associated with progression to AIDS , 2003, AIDS.

[22]  M. Wainberg,et al.  Gene expression profiling of the host response to HIV-1 B, C, or A/E infection in monocyte-derived dendritic cells. , 2006, Virology.

[23]  P. Charneau,et al.  The Maturation of Dendritic Cells Results in Postintegration Inhibition of HIV-1 Replication1 , 2001, The Journal of Immunology.

[24]  Jack T Stapleton,et al.  The Major Genetic Determinants of HIV-1 Control Affect HLA Class I Peptide Presentation , 2010, Science.

[25]  Benjamin Haibe-Kains,et al.  mRMRe: an R package for parallelized mRMR ensemble feature selection , 2013, Bioinform..

[26]  V. Metelev,et al.  Induction of apoptosis in uninfected lymphocytes by HIV-1 Tat protein. , 1995, Science.

[27]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[28]  Kristen K. Dang,et al.  Genome-Wide mRNA Expression Correlates of Viral Control in CD4+ T-Cells from HIV-1-Infected Individuals , 2010, PLoS pathogens.

[29]  Kuo-Chen Chou,et al.  Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-Nearest Neighbor classifiers. , 2006, Journal of proteome research.

[30]  Jacques Fellay,et al.  A Whole-Genome Association Study of Major Determinants for Host Control of HIV-1 , 2007, Science.

[31]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[32]  K. Chou,et al.  Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms , 2008, Nature Protocols.

[33]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[34]  M. E. Perry,et al.  mdm2 Is Critical for Inhibition of p53 during Lymphopoiesis and the Response to Ionizing Irradiation , 2003, Molecular and Cellular Biology.

[35]  J.C. Rajapakse,et al.  SVM-RFE With MRMR Filter for Gene Selection , 2010, IEEE Transactions on NanoBioscience.

[36]  A. Fauci,et al.  Analysis of apoptosis in lymph nodes of HIV-infected persons. Intensity of apoptosis correlates with the general state of activation of the lymphoid tissue and not with stage of disease or viral burden. , 1995, Journal of immunology.

[37]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[38]  Bonaventura Clotet,et al.  Drug uptake transporters in antiretroviral therapy. , 2011, Pharmacology & therapeutics.

[39]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[40]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[41]  D. Follmann,et al.  HIV infection-associated immune activation occurs by two distinct pathways that differentially affect CD4 and CD8 T cells , 2008, Proceedings of the National Academy of Sciences.

[42]  Stephen N. Jones,et al.  Regulation of p53 stability by Mdm2 , 1997, Nature.

[43]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[44]  K. Chou,et al.  Prediction of protein structural classes. , 1995, Critical reviews in biochemistry and molecular biology.