Using Classical Population Genetics Tools with Heterochroneous Data: Time Matters!

Background New polymorphism datasets from heterochroneous data have arisen thanks to recent advances in experimental and microbial molecular evolution, and the sequencing of ancient DNA (aDNA). However, classical tools for population genetics analyses do not take into account heterochrony between subsets, despite potential bias on neutrality and population structure tests. Here, we characterize the extent of such possible biases using serial coalescent simulations. Methodology/Principal Findings We first use a coalescent framework to generate datasets assuming no or different levels of heterochrony and contrast most classical population genetic statistics. We show that even weak levels of heterochrony (∼10% of the average depth of a standard population tree) affect the distribution of polymorphism substantially, leading to overestimate the level of polymorphism θ, to star like trees, with an excess of rare mutations and a deficit of linkage disequilibrium, which are the hallmark of e.g. population expansion (possibly after a drastic bottleneck). Substantial departures of the tests are detected in the opposite direction for more heterochroneous and equilibrated datasets, with balanced trees mimicking in particular population contraction, balancing selection, and population differentiation. We therefore introduce simple corrections to classical estimators of polymorphism and of the genetic distance between populations, in order to remove heterochrony-driven bias. Finally, we show that these effects do occur on real aDNA datasets, taking advantage of the currently available sequence data for Cave Bears (Ursus spelaeus), for which large mtDNA haplotypes have been reported over a substantial time period (22–130 thousand years ago (KYA)). Conclusions/Significance Considering serial sampling changed the conclusion of several tests, indicating that neglecting heterochrony could provide significant support for false past history of populations and inappropriate conservation decisions. We therefore argue for systematically considering heterochroneous models when analyzing heterochroneous samples covering a large time scale.

[1]  F. Depaulis,et al.  Neutrality tests based on the distribution of haplotypes under an infinite-site model. , 1998, Molecular biology and evolution.

[2]  Cristina E. Valdiosera,et al.  Surprising migration and population size dynamics in ancient Iberian brown bears (Ursus arctos) , 2008, Proceedings of the National Academy of Sciences.

[3]  Hans-Jürgen Bandelt,et al.  Mitochondrial genomes of extinct aurochs survive in domestic cattle , 2008, Current Biology.

[4]  G. McVean,et al.  A genealogical interpretation of linkage disequilibrium. , 2002, Genetics.

[5]  Uma Ramakrishnan,et al.  Genetic Response to Climatic Change: Insights from Ancient DNA and Phylochronology , 2004, PLoS biology.

[6]  R. Hudson,et al.  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. , 1985, Genetics.

[7]  W. Li,et al.  Statistical tests of neutrality of mutations. , 1993, Genetics.

[8]  S. Ho,et al.  The Effect of Inappropriate Calibration: Three Case Studies in Molecular Ecology , 2008, PloS one.

[9]  Cristina E. Valdiosera,et al.  Typing single polymorphic nucleotides in mitochondrial DNA as a way to access Middle Pleistocene DNA , 2006, Biology Letters.

[10]  G Achaz,et al.  A robust measure of HIV-1 population turnover within chronically infected individuals. , 2004, Molecular biology and evolution.

[11]  S. Pääbo,et al.  Lack of phylogeography in European mammals before the last glaciation. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  C. Gissi,et al.  Nucleotide Substitution Rate of Mammalian Mitochondrial Genomes , 1999, Journal of Molecular Evolution.

[13]  S. Ho,et al.  Elevated substitution rates estimated from ancient DNA sequences , 2007, Biology Letters.

[14]  J. Mountain,et al.  Detecting past population bottlenecks using temporal genetic data , 2005, Molecular ecology.

[15]  J K Kelly,et al.  A test of neutrality based on interlocus associations. , 1997, Genetics.

[16]  R. Wayne,et al.  Ancient DNA Evidence for Old World Origin of New World Dogs , 2002, Science.

[17]  R. Hudson,et al.  A statistical test for detecting geographic subdivision. , 1992, Molecular biology and evolution.

[18]  M. Whitlock,et al.  Indirect measures of gene flow and migration: FST not equal to 1/(4Nm + 1). , 1999, Heredity.

[19]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[20]  James Haile,et al.  Ancient Biomolecules from Deep Ice Cores Reveal a Forested Southern Greenland , 2007, Science.

[21]  D. Turnbull,et al.  Analysis of European mtDNAs for recombination. , 2001, American journal of human genetics.

[22]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[23]  P. Taberlet,et al.  Ancient DNA analysis reveals divergence of the cave bear, Ursus spelaeus, and brown bear, Ursus arctos, lineages , 2001, Current Biology.

[24]  Richard E. Lenski,et al.  Rates of DNA Sequence Evolution in Experimental Populations of Escherichia coli During 20,000 Generations , 2003, Journal of Molecular Evolution.

[25]  Beth Shapiro,et al.  Rise and Fall of the Beringian Steppe Bison , 2004, Science.

[26]  A. Lambert,et al.  Experimental Estimation of Mutation Rates in a Wheat Population With a Gene Genealogy Approach , 2008, Genetics.

[27]  Michael C Whitlock,et al.  The incomplete natural history of mitochondria , 2004, Molecular ecology.

[28]  S. Pääbo,et al.  No Evidence of Neandertal mtDNA Contribution to Early Modern Humans , 2004, PLoS biology.

[29]  R. Wayne,et al.  A genetic record of population isolation in pocket gophers during Holocene climatic change. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[30]  J. Hein,et al.  Tree measures and the number of segregating sites in time-structured population samples , 2005, BMC Genetics.

[31]  W. Schröder,et al.  Sequencing mtDNA of the cave bearUrsus spelaeus from the Bavarian Alps is feasible by nested and touchdown PCR , 2001, Acta Theriologica.

[32]  Justin C. Fay,et al.  Hitchhiking under positive Darwinian selection. , 2000, Genetics.

[33]  M. Nordborg,et al.  Recombination or mutational hot spots in human mtDNA? , 2002, Molecular biology and evolution.

[34]  Cristina E. Valdiosera,et al.  Staying out in the cold: glacial refugia and mitochondrial DNA phylogeography in ancient European brown bears , 2007, Molecular ecology.

[35]  P. Awadalla,et al.  Linkage disequilibrium and recombination in hominid mitochondrial DNA. , 1999, Science.

[36]  J. Mountain,et al.  Serial coalescent simulations suggest a weak genealogical relationship between Etruscans and modern Tuscans. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[37]  T. Ohta,et al.  Linkage disequilibrium with the island model. , 1982, Genetics.

[38]  P. Taberlet,et al.  Tracking the origins of the cave bear (Ursus spelaeus) by mitochondrial DNA sequencing. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[39]  E. G. Shpaer,et al.  Coalescent estimates of HIV-1 generation time in vivo. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[40]  M. Slatkin,et al.  Using maximum likelihood to estimate population size from temporal changes in allele frequencies. , 1999, Genetics.

[41]  M. Slatkin,et al.  Estimation of levels of gene flow from DNA sequence data. , 1992, Genetics.

[42]  L. Orlando,et al.  Ancient DNA and the population genetics of cave bears (Ursus spelaeus) through space and time. , 2002, Molecular biology and evolution.

[43]  Erik Axelsson,et al.  The effect of ancient DNA damage on inferences of demographic histories. , 2008, Molecular biology and evolution.

[44]  C. Hänni,et al.  Ancient DNA evidence for the loss of a highly divergent brown bear clade during historical times , 2008, Molecular ecology.

[45]  C. Wiuf,et al.  Crosslinks Rather Than Strand Breaks Determine Access to Ancient DNA Sequences From Frozen Sediments , 2006, Genetics.

[46]  J. Taubenberger,et al.  Characterization of the 1918 "Spanish" influenza virus neuraminidase gene. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[47]  A. Eyre-Walker,et al.  The correlation between linkage disequilibrium and distance: implications for recombination in hominid mitochondria. , 2001, Molecular biology and evolution.

[48]  Molly Przeworski,et al.  Evidence for population growth in humans is confounded by fine-scale population structure. , 2002, Trends in genetics : TIG.

[49]  C. Millar,et al.  Rates of Evolution in Ancient DNA from Adélie Penguins , 2002, Science.

[50]  F. Depaulis,et al.  Detecting Selective Sweeps with Haplotype Tests , 2005 .

[51]  B. Shapiro,et al.  Dynamics of Pleistocene Population Extinctions in Beringian Brown Bears , 2002, Science.

[52]  U. Ramakrishnan,et al.  Studying the effect of environmental change on biotic evolution: past genetic contributions, current work and future directions , 2004, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[53]  M. Stoneking,et al.  Questioning evidence for recombination in human mitochondrial DNA. , 2000, Science.

[54]  L. Waits,et al.  Ancient DNA analyses reveal high mitochondrial DNA sequence diversity and parallel morphological evolution of late pleistocene cave bears. , 2002, Molecular biology and evolution.

[55]  M. Whitlock,et al.  Indirect measures of gene flow and migration: FST≠1/(4Nm+1) , 1999, Heredity.

[56]  Graziano Pesole,et al.  Mitochondrial DNA in metazoa: degree of freedom in a frozen event. , 2002, Gene.

[57]  Alexei J Drummond,et al.  Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. , 2002, Genetics.

[58]  Michel Veuille,et al.  Power of Neutrality Tests to Detect Bottlenecks and Hitchhiking , 2003, Journal of Molecular Evolution.

[59]  Yun-Xin Fu,et al.  Summary statistics of neutral mutations in longitudinal DNA samples. , 2008, Theoretical population biology.

[60]  Hey,et al.  Human mitochondrial DNA recombination: can it be true? , 2000, Trends in ecology & evolution.

[61]  B. Kurtén The Cave Bear Story: Life and Death of a Vanished Animal , 1976 .

[62]  M. Stoneking Hypervariable sites in the mtDNA control region are mutational hotspots. , 2000, American journal of human genetics.

[63]  R. Wayne,et al.  Widespread origins of domestic horse lineages. , 2001, Science.

[64]  Svante Pääbo,et al.  Evidence for Reproductive Isolation between Cave Bear Populations , 2004, Current Biology.

[65]  A. Eyre-Walker,et al.  A reanalysis of the indirect evidence for recombination in human mitochondrial DNA , 2004, Heredity.

[66]  E. Hadly,et al.  Bayesian Estimation of the Timing and Severity of a Population Bottleneck from Ancient DNA , 2006, PLoS genetics.

[67]  E. Willerslev,et al.  Barking up the wrong tree: Modern northern European dogs fail to explain their origin , 2008, BMC Evolutionary Biology.

[68]  F. Depaulis,et al.  Effect of misoriented sites on neutrality tests with outgroup. , 2003, Genetics.

[69]  C. Wiuf Recombination in Human Mitochondrial DNA , 2000, Science.

[70]  S. Wright,et al.  Evolution in Mendelian Populations. , 1931, Genetics.

[71]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[72]  A. Rodrigo,et al.  Measurably evolving populations , 2003 .