MPL resolves genetic linkage in fitness inference from complex evolutionary histories

Genetic linkage causes the fate of new mutations in a population to be contingent on the genetic background on which they appear. This makes it challenging to identify how individual mutations affect fitness. To overcome this challenge, we developed marginal path likelihood (MPL), a method to infer selection from evolutionary histories that resolves genetic linkage. Validation on real and simulated data sets shows that MPL is fast and accurate, outperforming existing inference approaches. We found that resolving linkage is crucial for accurately quantifying selection in complex evolving populations, which we demonstrate through a quantitative analysis of intrahost HIV-1 evolution using multiple patient data sets. Linkage effects generated by variants that sweep rapidly through the population are particularly strong, extending far across the genome. Taken together, our results argue for the importance of resolving linkage in studies of natural selection. A new method models the influence of genetic background on the fitness effects of mutations.

[1]  John P. Barton,et al.  The Fitness Landscape of HIV-1 Gag: Advanced Modeling Approaches and Validation of Model Predictions by In Vitro Testing , 2014, PLoS Comput. Biol..

[2]  Sebastian Bonhoeffer,et al.  Stochastic or deterministic: what is the effective population size of HIV-1? , 2006, Trends in microbiology.

[3]  Richard E. Lenski,et al.  Tempo and mode of genome evolution in a 50,000-generation experiment , 2016, Nature.

[4]  F. Zanini,et al.  In vivo mutation rates and the landscape of fitness costs of HIV-1 , 2017, Virus evolution.

[5]  A. Hobolth,et al.  Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation , 2015, Genetics.

[6]  Carlo C. Maley,et al.  Clonal evolution in cancer , 2012, Nature.

[7]  Benjamin H. Good,et al.  The Dynamics of Molecular Evolution Over 60,000 Generations , 2017, Nature.

[8]  J. Schraiber A path integral formulation of the Wright-Fisher process with genic selection. , 2013, Theoretical population biology.

[9]  Christopher J. R. Illingworth,et al.  Distinguishing Driver and Passenger Mutations in an Evolutionary History Categorized by Interference , 2011, Genetics.

[10]  Vineet Bafna,et al.  Clear: Composition of Likelihoods for Evolve and Resequence Experiments , 2016, Genetics.

[11]  D. Hartl,et al.  An Equivalence Principle for the Incorporation of Favorable Mutations in Asexual Populations , 2006, Science.

[12]  M. Lässig,et al.  Fitness flux and ubiquity of adaptive evolution , 2010, Proceedings of the National Academy of Sciences.

[13]  A. Futschik,et al.  Quantifying Selection with Pool-Seq Time Series Data , 2017, Molecular biology and evolution.

[14]  Andrew L. Ferguson,et al.  Translating HIV sequences into quantitative fitness landscapes predicts viral vulnerabilities for rational immunogen design. , 2013, Immunity.

[15]  Michael M. Desai,et al.  Pervasive Genetic Hitchhiking and Clonal Interference in 40 Evolving Yeast Populations , 2013, Nature.

[16]  M. Lässig,et al.  A predictive fitness model for influenza , 2014, Nature.

[17]  C. Swanton,et al.  Resolving genetic heterogeneity in cancer , 2019, Nature Reviews Genetics.

[18]  Orestis Malaspinas,et al.  Estimating Allele Age and Selection Coefficient from Time-Serial Data , 2012, Genetics.

[19]  W. P. Russ,et al.  Evolutionary information for specifying a protein fold , 2005, Nature.

[20]  J. Plotkin,et al.  Identifying Signatures of Selection in Genetic Time Series , 2013, Genetics.

[21]  J. Luban,et al.  Cyclophilin A promotes HIV-1 reverse transcription but its effect on transduction correlates best with its effect on nuclear entry of viral cDNA , 2014, Retrovirology.

[22]  Motoo Kimura,et al.  Diffusion models in population genetics , 1964, Journal of Applied Probability.

[23]  K. Metzner,et al.  Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data , 2012, Front. Microbio..

[24]  Vitaly V. Ganusov,et al.  Broad CTL Response in Early HIV Infection Drives Multiple Concurrent CTL Escapes , 2015, PLoS Comput. Biol..

[25]  Alan S. Perelson,et al.  Inferring HIV Escape Rates from Multi-Locus Genotype Data , 2013, Front. Immunol..

[26]  C. Seoighe,et al.  Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection , 2014, Genetics.

[27]  P. Deloukas,et al.  Signatures of mutation and selection in the cancer genome , 2010, Nature.

[28]  Chaim A. Schramm,et al.  Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus , 2013, Nature.

[29]  Thomas Leitner,et al.  Recombination Rate and Selection Strength in HIV Intra-patient Evolution , 2009, PLoS Comput. Biol..

[30]  W. P. Russ,et al.  Natural-like function in artificial WW domains , 2005, Nature.

[31]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[32]  Jan Albert,et al.  Population genomics of intrapatient HIV-1 evolution , 2015, eLife.

[33]  Rob J de Boer,et al.  Reliable reconstruction of HIV-1 whole genome haplotypes reveals clonal interference and genetic hitchhiking among immune escape variants , 2013, Retrovirology.

[34]  Rebecca Batorsky,et al.  Estimate of effective recombination rate and average selection coefficient for HIV in chronic infection , 2011, Proceedings of the National Academy of Sciences.

[35]  Matthew R. McKay,et al.  Fitness landscape of the human immunodeficiency virus envelope protein that is targeted by antibodies , 2018, Proceedings of the National Academy of Sciences.

[36]  David C. Nickle,et al.  Selection on the Human Immunodeficiency Virus Type 1 Proteome following Primary Infection , 2006, Journal of Virology.

[37]  Brian T. Foley,et al.  Retrieval and on-the-fly alignment of sequence fragments from the HIV database , 2001, Bioinform..

[38]  Christian Brander,et al.  Selective Escape from CD8+ T-Cell Responses Represents a Major Driving Force of Human Immunodeficiency Virus Type 1 (HIV-1) Sequence Diversity and Reveals Constraints on HIV-1 Evolution , 2005, Journal of Virology.

[39]  Christopher J. R. Illingworth,et al.  Identifying Selection in the Within-Host Evolution of Influenza Using Viral Sequence Data , 2014, PLoS Comput. Biol..

[40]  Daniel Wegmann,et al.  An Approximate Markov Model for the Wright–Fisher Diffusion and Its Application to Time Series Data , 2015, Genetics.

[41]  Raymond H. Y. Louie,et al.  Identifying immunologically-vulnerable regions of the HCV E2 glycoprotein and broadly neutralizing antibodies that target them , 2019, Nature Communications.

[42]  Maria Simonsen,et al.  Statistical Inference in the Wright–Fisher Model Using Allele Frequency Data , 2016, Systematic biology.

[43]  A. McKenna,et al.  Evolution and Impact of Subclonal Mutations in Chronic Lymphocytic Leukemia , 2012, Cell.

[44]  Jonathan P. Bollback,et al.  Estimation of 2Nes From Temporal Allele Frequency Data , 2008, Genetics.

[45]  A. Chakraborty,et al.  Deconvolving mutational patterns of poliovirus outbreaks reveals its intrinsic fitness landscape , 2020, Nature Communications.

[46]  Christopher J. R. Illingworth,et al.  Quantifying Selection Acting on a Complex Trait Using Allele Frequency Time Series Data , 2011, Molecular biology and evolution.

[47]  Gil McVean,et al.  Estimating Selection Coefficients in Spatially Structured Populations from Time Series Data of Allele Frequencies , 2013, Genetics.

[48]  Chaim A. Schramm,et al.  Developmental pathway for potent V1V2-directed HIV-neutralizing antibodies , 2014, Nature.

[49]  Richard A Neher,et al.  Mathematical modeling of escape of HIV from cytotoxic T lymphocyte responses , 2012, Journal of statistical mechanics.

[50]  Persephone Borrow,et al.  The immune response during acute HIV-1 infection: clues for vaccine development , 2009, Nature Reviews Immunology.

[51]  B. Walker,et al.  Relative rate and location of intra-host HIV evolution to evade cellular immunity are predictable , 2016, Nature Communications.

[52]  M. Beaumont,et al.  Effects of the Ordering of Natural Selection and Population Regulation Mechanisms on Wright-Fisher Models , 2016, G3: Genes, Genomes, Genetics.

[53]  Christian Schlötterer,et al.  Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution , 2014, bioRxiv.

[54]  Feng Gao,et al.  Vertical T cell immunodominance and epitope entropy determine HIV-1 escape. , 2012, The Journal of clinical investigation.

[55]  Matthieu Foll,et al.  WFABC: a Wright-Fisher ABC-based approach for inferring effective population sizes and selection coefficients from time-sampled data , 2014, bioRxiv.

[56]  M. Lässig,et al.  Clonal Interference in the Evolution of Influenza , 2012, Genetics.

[57]  N. McGranahan,et al.  The causes and consequences of genetic heterogeneity in cancer evolution , 2013, Nature.

[58]  John Maynard Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[59]  Mehran Kardar,et al.  Manipulating the selection forces during affinity maturation to generate cross-reactive HIV antibodies , 2015, Cell.

[60]  B. Korber,et al.  Human retroviruses and AIDS 1997 , 1997 .

[61]  Simona Cocco,et al.  Inverse statistical physics of protein sequences: a key issues review , 2017, Reports on progress in physics. Physical Society.

[62]  Gavin Sherlock,et al.  Quantitative evolutionary dynamics using high-resolution lineage tracking , 2015, Nature.

[63]  A. Levine,et al.  A neoantigen fitness model predicts tumour response to checkpoint blockade immunotherapy , 2017, Nature.

[64]  M. Weigt,et al.  Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1 , 2015, bioRxiv.

[65]  T. Hwa,et al.  Identification of direct residue contacts in protein–protein interaction by message passing , 2009, Proceedings of the National Academy of Sciences.

[66]  L. Morris,et al.  Multiple Pathways of Escape from HIV Broadly Cross-Neutralizing V2-Dependent Antibodies , 2013, Journal of Virology.

[67]  Thomas A. Hopf,et al.  Mutation effects predicted from sequence co-variation , 2017, Nature Biotechnology.

[68]  Alan S. Perelson,et al.  Fitness Costs and Diversity of the Cytotoxic T Lymphocyte (CTL) Response Determine the Rate of CTL Escape during Acute and Chronic Phases of HIV Infection , 2011, Journal of Virology.

[69]  A. Børresen-Dale,et al.  The Life History of 21 Breast Cancers , 2012, Cell.

[70]  H. Muller THE RELATION OF RECOMBINATION TO MUTATIONAL ADVANCE. , 1964, Mutation research.