Improved multi-type birth-death phylodynamic inference in BEAST 2

The multi-type birth-death model with sampling is a phylodynamic model which enables quantification of past population dynamics in structured populations, based on phylogenetic trees. The BEAST 2 package bdmm implements an algorithm for numerically computing the probability density of a phylogenetic tree given the population dynamic parameters under this model. In the initial release of bdmm, analyses were limited to trees consisting of up to approximately 250 genetic samples for numerical reasons. We implemented important algorithmic changes to bdmm which dramatically increase the number of genetic samples that can be analyzed, and improve the numerical robustness and efficiency of the calculations. Being able to use bigger datasets leads to improved precision of parameter estimates. Furthermore, we report on several model extensions to bdmm, inspired by properties common to empirical datasets. We apply this improved algorithm to two partly overlapping datasets of Influenza A virus HA sequences sampled around the world, one with 500 samples, the other with only 175, for comparison. We report and compare the global migration patterns and seasonal dynamics inferred from each dataset. Availability The latest release with our updates, bdmm 0.3.5, is freely available as an open access package of BEAST 2. The source code can be accessed at https://github.com/denisekuehnert/bdmm.

[1]  E. Volz,et al.  Sampling through time and phylodynamic inference with coalescent and birth–death models , 2014, Journal of The Royal Society Interface.

[2]  Veronika Boskova,et al.  Inference of Epidemiological Dynamics Based on Simulated Phylogenies Using Birth-Death and Coalescent Models , 2014, PLoS Comput. Biol..

[3]  S. Hodges Floral Nectar Spurs and Diversification , 1997, International Journal of Plant Sciences.

[4]  Peter E Midford,et al.  Estimating a binary character's effect on speciation and extinction. , 2007, Systematic biology.

[5]  Lesley T Lancaster,et al.  Phylogenetic inference of reciprocal effects between geographic range evolution and diversification. , 2011, Systematic biology.

[6]  Boris Igić,et al.  Species Selection Maintains Self-Incompatibility , 2010, Science.

[7]  Trevor Bedford,et al.  Virus genomes reveal factors that spread and sustained the Ebola epidemic , 2017, Nature.

[8]  Itay Mayrose,et al.  Recently Formed Polyploid Plants Diversify at Lower Rates Supporting Online Material , 2022 .

[9]  Tanja Stadler,et al.  Phylodynamics with Migration: A Computational Framework to Quantify Population Structure from Genomic Data , 2016, Molecular biology and evolution.

[10]  Alexei J. Drummond,et al.  Phylogenetic and epidemic modeling of rapidly evolving infectious diseases , 2011, Infection, Genetics and Evolution.

[11]  Jody Hey,et al.  The study of structured populations — new hope for a difficult and divided science , 2003, Nature Reviews Genetics.

[12]  R. FitzJohn Diversitree: comparative phylogenetic analyses of diversification in R , 2012 .

[13]  S. Bonhoeffer,et al.  Birth–death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV) , 2012, Proceedings of the National Academy of Sciences.

[14]  O. Pybus,et al.  Unifying the Epidemiological and Evolutionary Dynamics of Pathogens , 2004, Science.

[15]  U. Obolski,et al.  Genomic and epidemiological monitoring of yellow fever virus transmission potential , 2018, Science.

[16]  W. Maddison CONFOUNDING ASYMMETRIES IN EVOLUTIONARY DIVERSIFICATION AND CHARACTER CHANGE , 2006, Evolution; international journal of organic evolution.

[17]  Tanja Stadler,et al.  Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[18]  D. Rabosky,et al.  Model inadequacy and mistaken inferences of trait-dependent speciation. , 2014, Systematic biology.

[19]  J. Felsenstein,et al.  Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates. , 1992, Genetical research.

[20]  David Welch,et al.  Efficient Bayesian inference under the structured coalescent , 2014, Bioinform..

[21]  Dong Xie,et al.  BEAST 2: A Software Platform for Bayesian Evolutionary Analysis , 2014, PLoS Comput. Biol..