The Confounding Effect of Population Structure on Bayesian Skyline Plot Inferences of Demographic History

Many coalescent-based methods aiming to infer the demographic history of populations assume a single, isolated and panmictic population (i.e. a Wright-Fisher model). While this assumption may be reasonable under many conditions, several recent studies have shown that the results can be misleading when it is violated. Among the most widely applied demographic inference methods are Bayesian skyline plots (BSPs), which are used across a range of biological fields. Violations of the panmixia assumption are to be expected in many biological systems, but the consequences for skyline plot inferences have so far not been addressed and quantified. We simulated DNA sequence data under a variety of scenarios involving structured populations with variable levels of gene flow and analysed them using BSPs as implemented in the software package BEAST. Results revealed that BSPs can show false signals of population decline under biologically plausible combinations of population structure and sampling strategy, suggesting that the interpretation of several previous studies may need to be re-evaluated. We found that a balanced sampling strategy whereby samples are distributed on several populations provides the best scheme for inferring demographic change over a typical time scale. Analyses of data from a structured African buffalo population demonstrate how BSP results can be strengthened by simulations. We recommend that sample selection should be carefully considered in relation to population structure previous to BSP analyses, and that alternative scenarios should be evaluated when interpreting signals of population size change.

[1]  Mary K. Kuhner,et al.  LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters , 2006, Bioinform..

[2]  N. J. Fagundes,et al.  A Reevaluation of the Native American MtDNA Genome Diversity and Its Bearing on the Models of Early Colonization of Beringia , 2008, PloS one.

[3]  W. Stephan,et al.  The Impact of Sampling Schemes on the Site Frequency Spectrum in Nonequilibrium Subdivided Populations , 2009, Genetics.

[4]  M. Beaumont,et al.  Recent developments in genetic data analysis: what can they tell us about human demographic history? , 2004, Heredity.

[5]  Stephen M. Krone,et al.  Separation of time scales and convergence to the coalescent in structured populations ∗ , 2001 .

[6]  James Haile,et al.  Species-specific responses of Late Quaternary megafauna to climate and humans , 2011, Nature.

[7]  H. Siegismund,et al.  Cape buffalo mitogenomics reveals a Holocene shift in the African human–megafauna dynamics , 2012, Molecular ecology.

[8]  S. Ho,et al.  Skyline‐plot methods for estimating demographic history from nucleotide sequences , 2011, Molecular ecology resources.

[9]  M. Whitlock,et al.  Indirect measures of gene flow and migration: FST≠1/(4Nm+1) , 1999, Heredity.

[10]  J. Wakeley,et al.  Nonequilibrium migration in human history. , 1999, Genetics.

[11]  Christopher R. Gignoux,et al.  Rapid, global demographic expansions after the origins of agriculture , 2011, Proceedings of the National Academy of Sciences.

[12]  O. Pybus,et al.  Bayesian coalescent inference of past population dynamics from molecular sequences. , 2005, Molecular biology and evolution.

[13]  O. Pybus,et al.  An integrated framework for the inference of viral population history from reconstructed genealogies. , 2000, Genetics.

[14]  B. Goossens,et al.  The Confounding Effects of Population Structure, Genetic Diversity and the Sampling Scheme on the Detection and Quantification of Population Size Changes , 2010, Genetics.

[15]  Beth Shapiro,et al.  Rise and Fall of the Beringian Steppe Bison , 2004, Science.

[16]  M. Nordborg,et al.  Coalescent Theory , 2019, Handbook of Statistical Genomics.

[17]  E. Lorenzen,et al.  Mid‐Holocene decline in African buffalos inferred from Bayesian coalescent‐based analyses of microsatellites and mitochondrial DNA , 2008, Molecular ecology.

[18]  L. Excoffier,et al.  Intra-deme molecular diversity in spatially expanding populations. , 2003, Molecular biology and evolution.

[19]  M. Beaumont Detecting population expansion and decline using microsatellites. , 1999, Genetics.

[20]  Stephen M. Krone,et al.  On the Meaning and Existence of an Effective Population Size , 2005, Genetics.

[21]  H. Prins,et al.  Pan-African Genetic Structure in the African Buffalo (Syncerus caffer): Investigating Intraspecific Divergence , 2013, PloS one.

[22]  E. Hadly,et al.  Bayesian Estimation of the Timing and Severity of a Population Bottleneck from Ancient DNA , 2006, PLoS genetics.

[23]  Mark A Beaumont,et al.  Statistical inferences in phylogeography , 2009, Molecular ecology.

[24]  John R Pannell COALESCENCE IN A METAPOPULATION WITH RECURRENT LOCAL EXTINCTION AND RECOLONIZATION , 2003, Evolution; international journal of organic evolution.

[25]  Noah A. Rosenberg,et al.  Genealogical trees, coalescent theory and the analysis of genetic polymorphisms , 2002, Nature Reviews Genetics.

[26]  M. Slatkin,et al.  Estimation of levels of gene flow from DNA sequence data. , 1992, Genetics.

[27]  Alexei J Drummond,et al.  mtDNA variation predicts population size in humans and reveals a major Southern Asian chapter in human prehistory. , 2008, Molecular biology and evolution.

[28]  A. Drummond,et al.  Bayesian inference of population size history from multiple loci , 2008, BMC Evolutionary Biology.

[29]  Erik Axelsson,et al.  Ancient DNA analyses exclude humans as the driving force behind late Pleistocene musk ox (Ovibos moschatus) population dynamics , 2010, Proceedings of the National Academy of Sciences.

[30]  Jody Hey,et al.  Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics , 2007, Proceedings of the National Academy of Sciences.

[31]  Laurent Excoffier,et al.  Distinguishing between population bottleneck and population subdivision by a Bayesian model choice procedure , 2010, Molecular ecology.