Residue-residue contact prediction in the HIV-1 envelope glycoprotein complex

HIV-1 Env glycoprotein complex is the key protein that mediates binding and entry of HIV-1 into human host cells. The complex entry process involves three main steps. First, the attachment, the interaction of gp120 and CD4. Second, the coreceptor binding, where gp120 binds the chemokine receptor CCR5 or CXCR4, and finally, the fusion of viral and host cell membranes. Despite the fact that several coordinate structures of HIV-1 Env in unliganded state exist (as well as in complex with CD4, CD4 mimics, or various antibodies, and of gp41 in intermediate and post-fusion state), a comprehensive understanding of structural arrangements and communication within gp120 and gp41 domains during the entry is far from complete. In this study, we applied a direct amino acid interaction detecting method to analyse the function and structure of HIV-1 Env protein sequences representing all group M subtypes. We identified more than 400 coevolving residue pairs within Env, of which the majority are real contacts and proximal in the available coordinate structures, or have functional implications such as receptor binding, variable loop, gp120-gp41, and interdomain interactions. This work provides a new dimension of information in HIV research, important in assisting protein coordinate structure prediction and in designing new and effective entry inhibitors.

[1]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[2]  Simon A. A. Travers,et al.  A study of the coevolutionary patterns operating within the env gene of the HIV-1 group M subtypes. , 2007, Molecular biology and evolution.

[3]  R A Olshen,et al.  Covariability of V3 loop amino acids. , 1996, AIDS research and human retroviruses.

[4]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[5]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[6]  Burkhard Rost,et al.  FreeContact: fast and free software for protein contact prediction from residue co-evolution , 2014, BMC Bioinformatics.

[7]  Massimiliano Pontil,et al.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments , 2012, Bioinform..

[8]  R. Aldrich,et al.  Influence of conservation on calculations of amino acid covariance in multiple sequence alignments , 2004, Proteins.

[9]  Alfonso Valencia,et al.  Emerging methods in protein co-evolution , 2013 .

[10]  Tongqing Zhou,et al.  Structure and immune recognition of trimeric prefusion HIV-1 Env , 2014, Nature.

[11]  A. Lapedes,et al.  Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[12]  C. Sander,et al.  A method to predict functional residues in proteins , 1995, Nature Structural Biology.

[13]  Bette Korber,et al.  Structure of a V3-Containing HIV-1 gp120 Core , 2005, Science.

[14]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[15]  Gregory B. Gloor,et al.  Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction , 2008, Bioinform..

[16]  Peter B Gilbert,et al.  Covariability of selected amino acid positions for HIV type 1 subtypes C and B. , 2005, AIDS research and human retroviruses.

[17]  S Gnanakaran,et al.  The implications of patterns in HIV diversity for neutralizing antibody induction and susceptibility , 2009, Current opinion in HIV and AIDS.