Identification of HIV-1 Vif Protein Attributes Associated With CD4 T Cell Numbers and Viral Loads Using Artificial Intelligence Algorithms

The Human Immunodeficiency Virus (HIV) Viral Infectivity Factor (Vif) is a 192-amino acid accessory protein essential to viral replication which counteracts host APOBEC3 proteins. APOBEC3 proteins interfere with the replication of HIV, hepatitis C virus, hepatitis B virus and retrotransposons. Vif is a recent candidate target for therapeutic and preventative interventions in HIV/AIDS yet little is known about its clinical relevance. We describe the results of applying different machine learning algorithms (Apriori, Multifactor Dimensionality Reductor, C4.5, Artificial Neural Networks and ID3) to the search of associations between HIV-1 Vif protein attributes and clinical endpoints. Final iterations showed that the presence of mutations in BC Boxes, APOBEC motifs and Cullin5 binding motifs were together associated with higher initial CD4 T cells while mutations of specific APOBEC motifs coupled with the conservation of other APOBEC motifs were associated with lower historic CD4 T cells. Conservation of specific APOBEC motifs and BC boxes were linked to lower initial viral loads while different combinations of mutations in the Nuclear Localisation Inhibition Signal and BC Boxes were associated with higher historic viral loads. Further scrutiny of these combinations through traditional statistical methods revealed striking differences in both CD4 T cells and viral loads in patients stratified into those having the previous combinations. While artificial intelligence algorithms do not phase out traditional statistical methods, our Artificial Intelligence (AI)-based approach highlights their use at reducing the dimensionality of large and complex datasets and at proposing novel, unimaginable, associations of biological patterns with functional relevance or clinical roles.

[1]  Juan Carlos Cuevas-Tello,et al.  Support vector machine algorithms in the search of KIR gene associations with disease , 2013, Comput. Biol. Medicine.

[2]  Edward L. Evans,et al.  HIV-1 Vif's Capacity To Manipulate the Cell Cycle Is Species Specific , 2018, Journal of Virology.

[3]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[4]  Y. Benjamini,et al.  Controlling the false discovery rate in behavior genetics research , 2001, Behavioural Brain Research.

[5]  William Stafford Noble,et al.  How does multiple testing correction work? , 2009, Nature Biotechnology.

[6]  U. Liebert,et al.  HIV-1C proviral DNA for detection of drug resistance mutations , 2018, PloS one.

[7]  C. Rouzioux,et al.  High frequency of X4/DM-tropic viruses in PBMC samples from patients with primary HIV-1 subtype-B infection in 1996-2007: the French ANRS CO06 PRIMO Cohort Study. , 2009, The Journal of antimicrobial chemotherapy.

[8]  Krzysztof J. Cios,et al.  A machine learning method for generation of a neural network architecture: a continuous ID3 algorithm , 1992, IEEE Trans. Neural Networks.

[9]  A. Godinho-Santos,et al.  Host Factors and HIV-1 Replication: Clinical Evidence and Potential Therapeutic Approaches , 2013, Front. Immunol..

[10]  V. Simon,et al.  Evolutionarily conserved pressure for the existence of distinct G2/M cell cycle arrest and A3H inactivation functions in HIV-1 Vif , 2015, Cell cycle.

[11]  L. Vandekerckhove,et al.  Concordance between HIV‐1 genotypic coreceptor tropism predictions based on plasma RNA and proviral DNA , 2011, HIV medicine.

[12]  Leonardo de Oliveira Martins,et al.  Codon pairs of the HIV-1 vif gene correlate with CD4+ T cell count , 2013, BMC Infectious Diseases.

[13]  D. Hernández-Ramírez,et al.  Association of KIR3DL1/S1 and HLA-Bw4 with CD4 T cell counts in HIV-infected Mexican mestizos , 2015, Immunogenetics.

[14]  W. Greene,et al.  APOBEC3G and HIV-1: Strike and counterstrike , 2006, Current infectious disease reports.

[15]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[16]  J Forsström,et al.  Using the ID3 Algorithm to find Discrepant Diagnoses from laboratory Databases of Thyroid Patients , 1991, Medical decision making : an international journal of the Society for Medical Decision Making.

[17]  V. Calvez,et al.  Factors associated with proviral DNA HIV-1 tropism in antiretroviral therapy-treated patients with fully suppressed plasma HIV viral load: implications for the clinical use of CCR5 antagonists. , 2010, The Journal of antimicrobial chemotherapy.

[18]  V. Simon,et al.  The Structural Interface between HIV-1 Vif and Human APOBEC3H , 2016, Journal of Virology.

[19]  M. Kawahara,et al.  HIV-1 viral infectivity factor interacts with TP53 to induce G2 cell cycle arrest and positively regulate viral replication , 2010, Proceedings of the National Academy of Sciences.

[20]  Dong-Ling Tong,et al.  Hybrid genetic algorithm-neural network: Feature extraction for unpreprocessed microarray data , 2011, Artif. Intell. Medicine.

[21]  V. Simon,et al.  The Activity Spectrum of Vif from Multiple HIV-1 Subtypes against APOBEC3G, APOBEC3F, and APOBEC3H , 2011, Journal of Virology.

[22]  Christophe Lemetre,et al.  Artificial Neural Network Inference (ANNI): A Study on Gene-Gene Interaction for Biomarkers in Childhood Sarcomas , 2014, PloS one.

[23]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[24]  R. Harris,et al.  Human Immunodeficiency Virus Type 1 Vif Induces Cell Cycle Delay via Recruitment of the Same E3 Ubiquitin Ligase Complex That Targets APOBEC3 Proteins for Degradation , 2008, Journal of Virology.

[25]  A. Wensing,et al.  European guidelines on the clinical management of HIV-1 tropism testing. , 2011, The Lancet. Infectious diseases.

[26]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[27]  M. Malim,et al.  Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein , 2002, Nature.

[28]  A. Moris,et al.  AID and APOBECs span the gap between innate and adaptive immunity , 2014, Front. Microbiol..

[29]  A. Rafael,et al.  Molecular characterization of Mexican HIV-1 Vif sequences , 2016 .

[30]  Hua Xu,et al.  Genetic studies of complex human diseases: Characterizing SNP-disease associations using Bayesian networks , 2012, BMC Systems Biology.

[31]  Xianghui Yu,et al.  Selective assembly of HIV-1 Vif-Cul5-ElonginB-ElonginC E3 ubiquitin ligase complex through a novel SOCS box and upstream cysteines. , 2004, Genes & development.

[32]  Ivan Bratko,et al.  Analyzing Attribute Dependencies , 2003, PKDD.

[33]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[34]  M. Hercigonja-Szekeres,et al.  Decision Support Systems in Health Care - Velocity of Apriori Algorithm. , 2017, Studies in health technology and informatics.

[35]  Rui Jiang,et al.  A random forest approach to the detection of epistatic interactions in case-control studies , 2009, BMC Bioinformatics.

[36]  Bin Zhao,et al.  KIR3DS1/L1 and HLA-Bw4-80I are associated with HIV disease progression among HIV typical progressors and long-term nonprogressors , 2013, BMC Infectious Diseases.

[37]  Jon Doyle,et al.  Bayesian neural networks for detecting epistasis in genetic association studies , 2014, BMC Bioinformatics.

[38]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[39]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[40]  L. Chelico,et al.  Suppression of APOBEC3-mediated restriction of HIV-1 by Vif , 2014, Front. Microbiol..

[41]  R. Diaz,et al.  Genetic Diversity of HIV-1 Gene vif Among Treatment-Naive Brazilians. , 2017, AIDS research and human retroviruses.

[42]  D. Noyola,et al.  Molecular Characterization of Mexican HIV-1 Vif Sequences. , 2016, AIDS research and human retroviruses.