Quantifying Parallel Evolution

Parallel evolution is consistently observed across the tree of life. However, the degree of parallelism between replicate populations in evolution experiments is rarely quantified at the gene level. Here we examine parallel evolution as the degree of covariance between replicate populations, providing a justification for the use of dimensionality reduction. We examine the extent that signals of gene-level covariance can be inferred in microbial evolve-and-resequence evolution experiments, finding that deviations from parallelism are difficult to quantify at a given point in time. However, this low statistical signal means that covariance between replicate populations is unlikely to interfere with the ability to detect divergent evolutionary trajectories for populations in different environments. Finally, we find evidence suggesting that temporal patterns of parallelism are comparatively easier to detect and that these patterns may reflect the evolutionary dynamics of microbial populations.

[1]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[2]  W. Patefield,et al.  An Efficient Method of Generating Random R × C Tables with Given Row and Column Totals , 1981 .

[3]  N. Roy,et al.  The Classification and Evolution of Bacterial Cross-Feeding , 2019, Front. Ecol. Evol..

[4]  A. F. Bennett,et al.  The Molecular Diversity of Adaptive Convergence , 2012, Science.

[5]  Dominique Schneider,et al.  Tests of parallel molecular evolution in a long-term experiment with Escherichia coli. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[7]  George M. Church,et al.  Adaptive evolution of genomically recoded Escherichia coli , 2018, Proceedings of the National Academy of Sciences.

[8]  P. Wilson,et al.  Parallel evolution of site‐specific changes in divergent caribou lineages , 2018, Ecology and evolution.

[9]  V. Cooper Experimental Evolution as a High-Throughput Screen for Genetic Adaptations , 2018, mSphere.

[10]  Ulrike von Luxburg,et al.  Clustering Stability: An Overview , 2010, Found. Trends Mach. Learn..

[11]  Marti J. Anderson,et al.  Some solutions to the multivariate Behrens–Fisher problem for dissimilarity‐based analyses , 2017 .

[12]  Jeremy Schmutz,et al.  Widespread Parallel Evolution in Sticklebacks by Repeated Fixation of Ectodysplasin Alleles , 2005, Science.

[13]  Chaitanya S. Gokhale,et al.  Repeatability of evolution on epistatic landscapes , 2015, Scientific Reports.

[14]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .

[15]  Tom Minka,et al.  Automatic Choice of Dimensionality for PCA , 2000, NIPS.

[16]  G. I. Lang,et al.  Detecting genetic interactions using parallel evolution in experimental populations , 2019, Philosophical Transactions of the Royal Society B.

[17]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[18]  Benjamin H. Good,et al.  The Dynamics of Molecular Evolution Over 60,000 Generations , 2017, Nature.

[19]  Benjamin H. Good,et al.  The Impact of Macroscopic Epistasis on Long-Term Evolutionary Dynamics , 2014, Genetics.

[20]  T. Tao Topics in Random Matrix Theory , 2012 .

[21]  R. Kassen,et al.  What drives parallel evolution? , 2017, BioEssays : news and reviews in molecular, cellular and developmental biology.

[22]  B. Pickersgill Parallel vs. Convergent Evolution in Domestication and Diversification of Crops in the Americas , 2018, Front. Ecol. Evol..

[23]  Robert B. Allen,et al.  Trajectory analysis in community ecology , 2019, Ecological Monographs.

[24]  M. Lynch,et al.  Escherichia coli cultures maintain stable subpopulation structure during long-term evolution , 2018, Proceedings of the National Academy of Sciences.

[25]  E. Karakoç,et al.  Population size changes and selection drive patterns of parallel evolution in a host–virus system , 2018, Nature Communications.

[26]  Cole J. Thompson,et al.  Contrasting effects of environment and genetics generate a continuum of parallel evolution , 2017, Nature Ecology &Evolution.

[27]  David T. Fraebel,et al.  Environment determines evolutionary trajectory in a constrained phenotypic space , 2016, bioRxiv.

[28]  V. Cooper,et al.  Parallel genetic adaptation across environments differing in mode of growth or resource availability , 2018, Evolution Letters.

[29]  L. Devroye Discrete Univariate Distributions , 1986 .

[30]  G. McVean A Genealogical Interpretation of Principal Components Analysis , 2009, PLoS genetics.

[31]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[32]  A. Soshnikov A Note on Universality of the Distribution of the Largest Eigenvalues in Certain Sample Covariance Matrices , 2001, math/0104113.

[33]  S. Qin,et al.  Determining the number of principal components for best reconstruction , 2000 .

[34]  T. Chou,et al.  Diversity in biology: definitions, quantification and models , 2019, Physical biology.

[35]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[36]  Beom Jun Kim,et al.  Growing scale-free networks with tunable clustering. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  D. Bolnick,et al.  A Multivariate View of Parallel Evolution , 2020, bioRxiv.

[38]  B. Gaut,et al.  The phenotypic signature of adaptation to thermal stress in Escherichia coli , 2015, BMC Evolutionary Biology.

[39]  R. Lenski,et al.  Long-Term Experimental Evolution in Escherichia coli. I. Adaptation and Divergence During 2,000 Generations , 1991, The American Naturalist.

[40]  R. Milo,et al.  Conversion of Escherichia coli to Generate All Biomass Carbon from CO2 , 2019, Cell.

[41]  David L. Donoho,et al.  The Optimal Hard Threshold for Singular Values is 4/sqrt(3) , 2013, 1305.5870.

[42]  Yvonne Feierabend,et al.  Population Genetics A Concise Guide , 2016 .

[43]  B. L. Welch THE SIGNIFICANCE OF THE DIFFERENCE BETWEEN TWO MEANS WHEN THE POPULATION VARIANCES ARE UNEQUAL , 1938 .

[44]  K. Metzner,et al.  Parallel Evolution of HIV-1 in a Long-Term Experiment , 2019, Molecular biology and evolution.

[45]  R. Fisher THE FIDUCIAL ARGUMENT IN STATISTICAL INFERENCE , 1935 .

[46]  R Bro,et al.  Cross-validation of component models: A critical look at current methods , 2008, Analytical and bioanalytical chemistry.

[47]  Identifying drivers of parallel evolution: A regression model approach , 2017 .

[48]  Richard E. Lenski,et al.  Tempo and mode of genome evolution in a 50,000-generation experiment , 2016, Nature.

[49]  Diana J. Rennison,et al.  (Non)Parallel Evolution , 2018, Annual Review of Ecology, Evolution, and Systematics.