Strong Selection Significantly Increases Epistatic Interactions in the Long-Term Evolution of a Protein

The human immuno-deficiency virus sub-type 1 (HIV-1) is evolving to keep up with a changing fitness landscape, due to the various drugs introduced to stop the virus’s replication. As the virus adapts, the information the virus encodes about its environment must change, and this change is reflected in the amino-acid composition of proteins, as well as changes in viral RNAs, binding sites, and splice sites. Information can also be encoded in the interaction between residues in a single protein as well as across proteins, leading to a change in the epistatic patterns that can affect how the virus can change in the future. Measuring epistasis usually requires fitness measurements that are difficult to obtain in high-throughput. Here we show that epistasis can be inferred from the pair-wise information between residues, and study how epistasis and information have changed over the long-term. Using HIV-1 protease sequence data from public databases covering the years 1998-2006 (from both treated and untreated subjects), we show that drug treatment has increased the protease’s per-site entropies on average. At the same time, the sum of mutual entropies across all pairs of residues within the protease shows a significant increase over the years, indicating an increase in epistasis in response to treatment, a trend not seen within sequences from untreated subjects. Our findings suggest that information theory can be an important tool to study long-term trends in the evolution of macromolecules.

[1]  R. Siliciano,et al.  Recent trends in HIV-1 drug resistance. , 2013, Current opinion in virology.

[2]  Dan S. Tawfik,et al.  Robustness–epistasis link shapes the fitness landscape of a randomly drifting protein , 2006, Nature.

[3]  A. Velázquez‐Campoy,et al.  Structural and thermodynamic basis of resistance to HIV-1 protease inhibition: implications for inhibitor design. , 2003, Current drug targets. Infectious disorders.

[4]  E. Ortlund,et al.  Crystal Structure of an Ancient Protein: Evolution by Conformational Epistasis , 2007, Science.

[5]  J. Louis,et al.  HIV-1 protease with 20 mutations exhibits extreme resistance to clinical inhibitors through coordinated structural rearrangements. , 2012, Biochemistry.

[6]  Christoph Adami,et al.  Annals of the New York Academy of Sciences the Use of Information Theory in Evolutionary Biology , 2022 .

[7]  G. Basharin On a Statistical Estimate for the Entropy of a Sequence of Independent Random Variables , 1959 .

[8]  A. Chao,et al.  Nonparametric estimation of Shannon’s index of diversity when there are unseen species in sample , 2004, Environmental and Ecological Statistics.

[9]  Christopher J. Lee,et al.  Distinguishing Functional Amino Acid Covariation from Background Linkage Disequilibrium in HIV Protease and Reverse Transcriptase , 2007, PloS one.

[10]  F. Mammano,et al.  Resistance-Associated Loss of Viral Fitness in Human Immunodeficiency Virus Type 1: Phenotypic Analysis of Protease andgag Coevolution in Protease Inhibitor-Treated Patients , 1998, Journal of Virology.

[11]  Michael M. Desai,et al.  Global epistasis makes adaptation predictable despite sequence-level stochasticity , 2014, Science.

[12]  B. Korber,et al.  Associations between amino acids in the evolution of HIV type 1 protease sequences under indinavir therapy. , 1999, AIDS research and human retroviruses.

[13]  Korbinian Strimmer,et al.  Entropy Inference and the James-Stein Estimator, with Application to Nonlinear Gene Association Networks , 2008, J. Mach. Learn. Res..

[14]  C. Adami,et al.  Physical complexity of symbolic sequences , 1996, adap-org/9605002.

[15]  C. Petropoulos,et al.  Evidence for Positive Epistasis in HIV-1 , 2004, Science.

[16]  Ursula Rothlisberger,et al.  Drug resistance in HIV‐1 protease: Flexibility‐assisted mechanism of compensatory mutations , 2002, Protein science : a publication of the Protein Society.

[17]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[18]  J. Dushoff,et al.  Prevalence of Epistasis in the Evolution of Influenza A Surface Proteins , 2011, PLoS genetics.

[19]  E. Freire,et al.  A major role for a set of non-active site mutations in the development of HIV-1 protease drug resistance. , 2003, Biochemistry.

[20]  B. Torbett,et al.  Accessory mutations maintain stability in drug-resistant HIV-1 protease. , 2011, Journal of molecular biology.

[21]  Raphail E. Krichevsky,et al.  The performance of universal encoding , 1981, IEEE Trans. Inf. Theory.

[22]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[23]  Mill Johannes G.A. Van,et al.  Transmission Of Information , 1961 .

[24]  Sebastian Bonhoeffer,et al.  A systems analysis of mutational effects in HIV-1 protease and reverse transcriptase , 2011, Nature Genetics.

[25]  Christoph Adami,et al.  Information theory in molecular biology , 2004, q-bio/0405004.

[26]  Wei Shao,et al.  Frequent polymorphism at drug resistance sites in HIV-1 protease and reverse transcriptase , 2008, AIDS.

[27]  D. Richman,et al.  2022 update of the drug resistance mutations in HIV-1. , 2022, Topics in antiviral medicine.

[28]  Jonathan M. Schapiro,et al.  Genotypic Changes in Human Immunodeficiency Virus Type 1 Protease Associated with Reduced Susceptibility and Virologic Response to the Protease Inhibitor Tipranavir , 2006, Journal of Virology.

[29]  C. Schiffer,et al.  How does a symmetric dimer recognize an asymmetric substrate? A substrate complex of HIV-1 protease. , 2000, Journal of molecular biology.

[30]  Frances H. Arnold,et al.  In the Light of Evolution III: Two Centuries of Darwin Sackler Colloquium: In the light of directed evolution: Pathways of adaptive protein evolution , 2009 .

[31]  D. Anastassiou Computational analysis of the synergy among multiple interacting genes , 2007, Molecular systems biology.

[32]  Sebastian Bonhoeffer,et al.  Exploring the Complexity of the HIV-1 Fitness Landscape , 2012, PLoS genetics.

[33]  Ga Miller,et al.  Note on the bias of information estimates , 1955 .

[34]  C. Adami,et al.  Impact of epistasis and pleiotropy on evolutionary adaptation , 2009, Proceedings of the Royal Society B: Biological Sciences.

[35]  T. Oas,et al.  The structural distribution of cooperative interactions in proteins: analysis of the native state ensemble. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[36]  W. Bialek Biophysics: Searching for Principles , 2012 .

[37]  P. Gorry,et al.  Transmission of HIV-1 drug-resistant variants: prevalence and effect on treatment outcome. , 2010, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[38]  Thanat Chookajorn,et al.  Stepwise acquisition of pyrimethamine resistance in the malaria parasite , 2009, Proceedings of the National Academy of Sciences.

[39]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[40]  D. J. Kiviet,et al.  Empirical fitness landscapes reveal accessible evolutionary paths , 2007, Nature.

[41]  Celia A Schiffer,et al.  Covariation of amino acid positions in HIV-1 protease. , 2003, Virology.

[42]  Robert B. Heckendorn,et al.  Should evolutionary geneticists worry about higher-order epistasis? , 2013, Current opinion in genetics & development.

[43]  L. Bourgon,et al.  Selection and characterization of HIV-1 showing reduced susceptibility to the non-peptidic protease inhibitor tipranavir. , 2005, Antiviral research.

[44]  S. Yerly,et al.  Transmission of antiretroviral-drug-resistant HIV-1 variants , 1999, The Lancet.

[45]  A. Lapedes,et al.  Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[46]  R. Lenski,et al.  Microbial genetics: Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation , 2003, Nature Reviews Genetics.

[47]  J. Louis,et al.  Enhanced stability of monomer fold correlates with extreme drug resistance of HIV-1 protease. , 2013, Biochemistry.

[48]  Sebastian Bonhoeffer,et al.  Epistasis between deleterious mutations and the evolution of recombination. , 2007, Trends in ecology & evolution.

[49]  R. Hartley Transmission of information , 1928 .

[50]  Chi‐Huey Wong,et al.  HIV-1 protease: mechanism and drug discovery. , 2003, Organic & biomolecular chemistry.

[51]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[52]  Thomas D. Wu,et al.  Mutation Patterns and Structural Correlates in Human Immunodeficiency Virus Type 1 Protease following Different Protease Inhibitor Treatments , 2003, Journal of Virology.

[53]  Jesse D. Bloom,et al.  Epistatically Interacting Substitutions Are Enriched during Adaptive Protein Evolution , 2014, PLoS genetics.

[54]  Christoph Endres,et al.  Introduction to Artificial Life , 2000, Künstliche Intell..

[55]  J. da Silva Amino Acid Covariation in a Functionally Important Human Immunodeficiency Virus Type 1 Protein Region Is Associated With Population Subdivision , 2009, Genetics.

[56]  Jeffrey E. Barrick,et al.  Genome evolution and adaptation in a long-term experiment with Escherichia coli , 2009, Nature.

[57]  Tim F. Cooper,et al.  The Environment Affects Epistatic Interactions to Alter the Topology of an Empirical Fitness Landscape , 2013, PLoS genetics.

[58]  Christoph Adami,et al.  Thermodynamic prediction of protein neutrality. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[59]  Ronald M. Levy,et al.  Pairwise and higher-order correlations among drug-resistance mutations in HIV-1 subtype B protease , 2009, BMC Bioinformatics.

[60]  Peter Grassberger,et al.  Entropy estimation of symbol sequences. , 1996, Chaos.

[61]  V. Trouplin,et al.  Retracing the Evolutionary Pathways of Human Immunodeficiency Virus Type 1 Resistance to Protease Inhibitors: Virus Fitness in the Absence and in the Presence of Drug , 2000, Journal of Virology.

[62]  William Bialek,et al.  Entropy and Inference, Revisited , 2001, NIPS.

[63]  Joachim Krug,et al.  Evolutionary Accessibility of Mutational Pathways , 2011, PLoS Comput. Biol..

[64]  R. Shafer,et al.  Update of the drug resistance mutations in HIV-1: March 2013. , 2013, Topics in antiviral medicine.

[65]  W. P. Russ,et al.  Evolutionary information for specifying a protein fold , 2005, Nature.

[66]  Marianne Manchester,et al.  Complete mutagenesis of the HIV-1 protease , 1989, Nature.

[67]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[68]  Charles Ofria,et al.  Evolutionary dynamics, epistatic interactions, and biological information. , 2010, Journal of Theoretical Biology.

[69]  A. Sousa,et al.  Positive Epistasis Drives the Acquisition of Multidrug Resistance , 2009, PLoS genetics.