A study of the coevolutionary patterns operating within the env gene of the HIV-1 group M subtypes.

The env gene of human immunodeficiency virus (HIV) is a functionally important gene responsible for the production of protein products (gp120 and gp41) involved in host cell recognition, binding, and entry. This occurs through a complex and, as yet, not fully understood process of protein-protein interaction and within and between protein functional communication. Exposure on the surface of active HIV virions means the gp120-gp41 complexes are subjected to intense immune system pressure and have, therefore, evolved mechanisms to avoid neutralization. Using protein-coding sequences representing all the HIV type-1 (HIV-1) group M subtypes, we have identified amino acids within the env gene whose evolution is inextricably linked over the entire HIV-1 group M epidemic. We identified 848 pairs of coevolving residues (involving 263 out of 764 amino acid sites), which represent 0.29% of all possible pairs. Of the coevolving pairs, 68% were significantly correlated by hydrophobicity, molecular weight, or both hydrophobicity and molecular weight. Subsequent grouping of coevolving pairs resulted in the identification of 290 groups of amino acid residues, with the size of these groups ranging from 2 to 10 amino acid residues. Many of these dependencies are correlated by function including CD4 binding, coreceptor binding, glycosylation, and protein-protein interaction. This analysis provides important information regarding the functional dependencies observed within all the HIV-1 group M subtypes and may assist in the identification of functional protein domains and therapeutic targets within the HIV-1 env gene.

[1]  Austin L. Hughes,et al.  Extraepitopic Compensatory Substitutions Partially Restore Fitness to Simian Immunodeficiency Virus Variants That Escape from an Immunodominant Cytotoxic-T-Lymphocyte Response , 2004, Journal of Virology.

[2]  J. Sodroski,et al.  Analysis of the interaction of the human immunodeficiency virus type 1 gp120 envelope glycoprotein with the gp41 transmembrane glycoprotein , 1997, Journal of virology.

[3]  W A Hendrickson,et al.  Structures of HIV-1 gp120 envelope glycoproteins from laboratory-adapted and primary isolates. , 2000, Structure.

[4]  P. Vernazza,et al.  Characterization of V3 Sequence Heterogeneity in Subtype C Human Immunodeficiency Virus Type 1 Isolates from Malawi: Underrepresentation of X4 Variants , 1999, Journal of Virology.

[5]  Ying Sun,et al.  A conserved HIV gp120 glycoprotein structure involved in chemokine receptor binding. , 1998, Science.

[6]  M. Fares Computational and Statistical Methods to Explore the Various Dimensions of Protein Evolution , 2006 .

[7]  C. Sander,et al.  Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? , 1994, Protein engineering.

[8]  T. F. Rinke de Wit,et al.  HIV-1 subtype C syncytium- and non-syncytium-inducing phenotypes and coreceptor usage among Ethiopian patients with AIDS. , 1999, AIDS.

[9]  R A Olshen,et al.  Covariability of V3 loop amino acids. , 1996, AIDS research and human retroviruses.

[10]  Ben Berkhout,et al.  Second site escape of a T20-dependent HIV-1 variant by a single amino acid change in the CD4 binding region of the envelope glycoprotein , 2006, Retrovirology.

[11]  David L. Robertson,et al.  Comparative Study of Adaptive Molecular Evolution in Different Human Immunodeficiency Virus Groups and Subtypes , 2004, Journal of Virology.

[12]  Lynn Morris,et al.  The CCR5 and CXCR4 Coreceptors Are Both Used by Human Immunodeficiency Virus Type 1 Primary Isolates from Subtype C , 2003, Journal of Virology.

[13]  G. Gloor,et al.  Mutual information in protein multiple sequence alignments reveals two classes of coevolving positions. , 2005, Biochemistry.

[14]  E. Holmes,et al.  Convergent and divergent sequence evolution in the surface envelope glycoprotein of human immunodeficiency virus type 1 within a single infected patient. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[15]  L Pritchard,et al.  Evaluation of a novel method for the identification of coevolving protein residues. , 2001, Protein engineering.

[16]  Paul E. Kennedy,et al.  HIV-1 Entry Cofactor: Functional cDNA Cloning of a Seven-Transmembrane, G Protein-Coupled Receptor , 1996, Science.

[17]  B. Cullen,et al.  Identification of the envelope V3 loop as the primary determinant of cell tropism in HIV-1. , 1991, Science.

[18]  Anne-Mieke Vandamme,et al.  Mapping Sites of Positive Selection and Amino Acid Diversification in the HIV Genome , 2004, Genetics.

[19]  Oliver Hartley,et al.  V3: HIV's switch-hitter. , 2005, AIDS research and human retroviruses.

[20]  David Posada,et al.  MODELTEST: testing the model of DNA substitution , 1998, Bioinform..

[21]  K. Hatrick,et al.  Compensating changes in protein multiple sequence alignments. , 1994, Protein engineering.

[22]  J. Sodroski,et al.  Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody , 1998, Nature.

[23]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[24]  M. Steel,et al.  A covariotide model explains apparent phylogenetic structure of oxygenic photosynthetic lineages. , 1998, Molecular biology and evolution.

[25]  Bette Korber,et al.  Structure of a V3-Containing HIV-1 gp120 Core , 2005, Science.

[26]  Peter D. Kwong,et al.  The antigenic structure of the HIV gp120 envelope glycoprotein , 1998, Nature.

[27]  Simon A. A. Travers,et al.  Evidence for Heterogeneous Selective Pressures in the Evolution of the env Gene in Different Human Immunodeficiency Virus Type 1 Subtypes , 2005, Journal of Virology.

[28]  Stephen C. Peiper,et al.  Identification of a major co-receptor for primary isolates of HIV-1 , 1996, Nature.

[29]  R. F. Smith,et al.  Mutational trends in V3 loop protein sequences observed in different genetic lineages of human immunodeficiency virus type 1 , 1994, Journal of virology.

[30]  Miguel Ángel Martínez,et al.  Fitness variations and their impact on the evolution of antiretroviral drug resistance. , 2003, Current drug targets. Infectious disorders.

[31]  A. Trkola,et al.  Human monoclonal antibody 2G12 defines a distinctive neutralization epitope on the gp120 glycoprotein of human immunodeficiency virus type 1 , 1996, Journal of virology.

[32]  Ursula Rothlisberger,et al.  Drug resistance in HIV‐1 protease: Flexibility‐assisted mechanism of compensatory mutations , 2002, Protein science : a publication of the Protein Society.

[33]  Valentina Svicher,et al.  Novel drug resistance mutations in HIV: recognition and clinical relevance. , 2006, AIDS reviews.

[34]  M. Fares,et al.  Functional coevolutionary networks of the Hsp70-Hop-Hsp90 system revealed through computational analyses. , 2007, Molecular biology and evolution.

[35]  Don C. Wiley,et al.  Structure of an unliganded simian immunodeficiency virus gp120 core , 2005, Nature.

[36]  Wen-Hsiung Li Unbiased estimation of the rates of synonymous and nonsynonymous substitution , 2006, Journal of Molecular Evolution.

[37]  Martin A. Nowak,et al.  Antibody neutralization and escape by HIV-1 , 2003, Nature.

[38]  R. Compans,et al.  Oligomerization of the hydrophobic heptad repeat of gp41 , 1995, Journal of virology.

[39]  J. Bonifacino,et al.  Interaction of endocytic signals from the HIV-1 envelope glycoprotein complex with members of the adaptor medium chain family. , 1997, Virology.

[40]  Michael P. Cummings,et al.  PAUP* [Phylogenetic Analysis Using Parsimony (and Other Methods)] , 2004 .

[41]  E. Freed,et al.  The Role of Human Immunodeficiency Virus Type 1 Envelope Glycoproteins in Virus Infection (*) , 1995, The Journal of Biological Chemistry.

[42]  D R Burton,et al.  gp120: Biologic aspects of structural features. , 2001, Annual review of immunology.

[43]  Simon A. A. Travers,et al.  A Novel Method for Detecting Intramolecular Coevolution: Adding a Further Dimension to Selective Constraints Analyses , 2006, Genetics.

[44]  Virginia Litwin,et al.  HIV-1 entry into CD4+ cells is mediated by the chemokine receptor CC-CKR-5 , 1996, Nature.

[45]  Sergei L. Kosakovsky Pond,et al.  Evolutionary Interactions between N-Linked Glycosylation Sites in the HIV-1 Envelope , 2006, PLoS Comput. Biol..

[46]  W. Taylor,et al.  Effectiveness of correlation analysis in identifying protein residues undergoing correlated evolution. , 1997, Protein engineering.

[47]  G. Chelvanayagam,et al.  An analysis of simultaneous variation in protein structures. , 1997, Protein engineering.

[48]  A. Jean-Marie,et al.  A model-based approach for detecting coevolving positions in a molecule. , 2005, Molecular biology and evolution.

[49]  T. Gojobori,et al.  Reevaluation of Amino Acid Variability of the Human Immunodeficiency Virus Type 1 gp120 Envelope Glycoprotein and Prediction of New Discontinuous Epitopes , 2000, Journal of Virology.

[50]  E. Tillier,et al.  Neighbor Joining and Maximum Likelihood with RNA Sequences: Addressing the Interdependence of Sites , 1995 .

[51]  J. Bonifacino,et al.  A Membrane-proximal Tyrosine-based Signal Mediates Internalization of the HIV-1 Envelope Glycoprotein via Interaction with the AP-2 Clathrin Adaptor* , 1998, The Journal of Biological Chemistry.

[52]  D. Burton,et al.  GP120: target for neutralizing HIV-1 antibodies. , 2006, Annual review of immunology.

[53]  W. Atchley,et al.  Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis. , 2000, Molecular biology and evolution.

[54]  R. Doms,et al.  Mutations in the C3 region of human and simian immunodeficiency virus envelope have differential effects on viral infectivity, replication, and CD4-dependency. , 2003, Virology.

[55]  A. Hughes,et al.  Natural selection on the gag, pol, and env genes of human immunodeficiency virus 1 (HIV-1). , 1995, Molecular biology and evolution.

[56]  W R Taylor,et al.  Coevolving protein residues: maximum likelihood identification and relationship to structure. , 1999, Journal of molecular biology.

[57]  J. G. Burleigh,et al.  Covarion structure in plastid genome evolution: a new statistical test. , 2005, Molecular biology and evolution.

[58]  J. Sodroski,et al.  Oligomeric Modeling and Electrostatic Analysis of the gp120 Envelope Glycoprotein of Human Immunodeficiency Virus , 2000, Journal of Virology.

[59]  Thomas W. H. Lui,et al.  Using multiple interdependency to separate functional from phylogenetic correlations in protein alignments , 2003, Bioinform..

[60]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .

[61]  Peter B Gilbert,et al.  Covariability of selected amino acid positions for HIV type 1 subtypes C and B. , 2005, AIDS research and human retroviruses.

[62]  R. Swanstrom,et al.  Improved success of phenotype prediction of the human immunodeficiency virus type 1 from envelope variable loop 3 sequence using neural networks. , 2001, Virology.

[63]  R. Siliciano,et al.  Endocytosis of endogenously synthesized HIV-1 envelope protein. Mechanism and role in processing for association with class II MHC. , 1995, Journal of immunology.

[64]  Ziheng Yang,et al.  Maximum Likelihood Analysis of Adaptive Evolution in HIV-1 Gp120 env Gene , 2000, Pacific Symposium on Biocomputing.

[65]  S Gnanakaran,et al.  Clade-Specific Differences between Human Immunodeficiency Virus Type 1 Clades B and C: Diversity and Correlations in C3-V4 Regions of gp120 , 2007, Journal of Virology.

[66]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[67]  A. Lapedes,et al.  Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[68]  A. Gronenborn,et al.  Three‐dimensional solution structure of the 44 kDa ectodomain of SIV gp41 , 1998, The EMBO journal.

[69]  Joseph P. Bielawski,et al.  Widespread Adaptive Evolution in the Human Immunodeficiency Virus Type 1 Genome , 2003, Journal of Molecular Evolution.

[70]  Mario A. Fares,et al.  CAPS: coevolution analysis using protein sequences , 2006, Bioinform..

[71]  Feng Gao,et al.  Diversity Considerations in HIV-1 Vaccine Selection , 2002, Science.

[72]  P. Sharp,et al.  Genetic diversity of HIV-1: the moving target. , 2000, AIDS.

[73]  M. Steel,et al.  Modeling the covarion hypothesis of nucleotide substitution. , 1998, Mathematical biosciences.

[74]  N. Dimmock,et al.  The C-terminal tail of the gp41 transmembrane envelope glycoprotein of HIV-1 clades A, B, C, and D may exist in two conformations: an analysis of sequence, structure, and function , 2005, Virology.

[75]  John R Mascola,et al.  Changes in the V3 region of gp120 contribute to unusually broad coreceptor usage of an HIV-1 isolate from a CCR5 Delta32 heterozygote. , 2007, Virology.

[76]  N. Galtier Sampling properties of the bootstrap support in molecular phylogeny: influence of nonindependence among sites. , 2004, Systematic biology.

[77]  N. Letvin,et al.  Compensatory Substitutions Restore Normal Core Assembly in Simian Immunodeficiency Virus Isolates with Gag Epitope Cytotoxic T-Lymphocyte Escape Mutations , 2006, Journal of Virology.

[78]  Andrew Rambaut,et al.  A phylogenetic method for detecting positive epistasis in gene sequences and its application to RNA virus evolution. , 2006, Molecular biology and evolution.