Rapid maximum likelihood ancestral state reconstruction of continuous characters: A rerooting‐free algorithm

Abstract Ancestral state reconstruction is a method used to study the evolutionary trajectories of quantitative characters on phylogenies. Although efficient methods for univariate ancestral state reconstruction under a Brownian motion model have been described for at least 25 years, to date no generalization has been described to allow more complex evolutionary models, such as multivariate trait evolution, non‐Brownian models, missing data, and within‐species variation. Furthermore, even for simple univariate Brownian motion models, most phylogenetic comparative R packages compute ancestral states via inefficient tree rerooting and full tree traversals at each tree node, making ancestral state reconstruction extremely time‐consuming for large phylogenies. Here, a computationally efficient method for fast maximum likelihood ancestral state reconstruction of continuous characters is described. The algorithm has linear complexity relative to the number of species and outperforms the fastest existing R implementations by several orders of magnitude. The described algorithm is capable of performing ancestral state reconstruction on a 1,000,000‐species phylogeny in fewer than 2 s using a standard laptop, whereas the next fastest R implementation would take several days to complete. The method is generalizable to more complex evolutionary models, such as phylogenetic regression, within‐species variation, non‐Brownian evolutionary models, and multivariate trait evolution. Because this method enables fast repeated computations on phylogenies of virtually any size, implementation of the described algorithm can drastically alleviate the computational burden of many otherwise prohibitively time‐consuming tasks requiring reconstruction of ancestral states, such as phylogenetic imputation of missing data, bootstrapping procedures, Expectation‐Maximization algorithms, and Bayesian estimation. The described ancestral state reconstruction algorithm is implemented in the Rphylopars functions anc.recon and phylopars.

[1]  J. Felsenstein Phylogenies and the Comparative Method , 1985, The American Naturalist.

[2]  David L. Swofford,et al.  Reconstructing ancestral character states under Wagner parsimony , 1987 .

[3]  W. Maddison Squared-Change Parsimony Reconstructions of Ancestral States for Continuous-Valued Characters on a Phylogenetic Tree , 1991 .

[4]  T. F. Hansen,et al.  Phylogenies and the Comparative Method: A General Approach to Incorporating Phylogenetic Information into the Analysis of Interspecific Data , 1997, The American Naturalist.

[5]  D. Schluter,et al.  LIKELIHOOD OF ANCESTOR STATES IN ADAPTIVE RADIATION , 1997, Evolution; international journal of organic evolution.

[6]  Anthony R. Ives,et al.  An Introduction to Phylogenetically Based Statistical Methods, with a New Method for Confidence Intervals on Ancestral Values , 1999 .

[7]  Anthony R. Ives,et al.  Using the Past to Predict the Present: Confidence Intervals for Regression Equations in Phylogenetic Comparative Methods , 2000, The American Naturalist.

[8]  F J Rohlf,et al.  COMPARATIVE METHODS FOR THE ANALYSIS OF CONTINUOUS VARIABLES: GEOMETRIC INTERPRETATIONS , 2001, Evolution; international journal of organic evolution.

[9]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[10]  T. Garland,et al.  Within-species variation and measurement error in phylogenetic comparative methods. , 2007, Systematic biology.

[11]  J. Felsenstein Comparative Methods with Sampling Error and Within‐Species Variation: Contrasts Revisited and Revised , 2008, The American Naturalist.

[12]  Jaap Heringa,et al.  PhyloPars: estimation of missing parameter values using phylogeny , 2009, Nucleic Acids Res..

[13]  J. Hadfield,et al.  General quantitative genetic methods for comparative biology: phylogenies, taxonomies and multi‐trait models for continuous and categorical characters , 2010, Journal of evolutionary biology.

[14]  Carl Boettiger,et al.  IS YOUR PHYLOGENY INFORMATIVE? MEASURING THE POWER OF COMPARATIVE METHODS , 2011, Evolution; international journal of organic evolution.

[15]  Carl Boettiger,et al.  Is your phylogeny informative , 2012 .

[16]  Liam J. Revell,et al.  phytools: an R package for phylogenetic comparative biology (and other things) , 2012 .

[17]  L. Revell,et al.  A NEW BAYESIAN METHOD FOR FITTING EVOLUTIONARY MODELS TO COMPARATIVE DATA WITH INTRASPECIFIC VARIATION , 2012, Evolution; international journal of organic evolution.

[18]  R. FitzJohn Diversitree: comparative phylogenetic analyses of diversification in R , 2012 .

[19]  Carl Boettiger,et al.  IS YOUR PHYLOGENY INFORMATIVE? MEASURING THE POWER OF COMPARATIVE METHODS , 2012, Evolution; international journal of organic evolution.

[20]  Robert P. Freckleton,et al.  Fast likelihood calculations for comparative analyses , 2012 .

[21]  C. Ané,et al.  A linear-time algorithm for Gaussian and non-Gaussian trait evolution models. , 2014, Systematic biology.

[22]  M. Elliot,et al.  Identical inferences about correlated evolution arise from ancestral state reconstruction and independent contrasts. , 2013, Journal of theoretical biology.

[23]  Eric W Goolsby,et al.  Likelihood-Based Parameter Estimation for High-Dimensional Phylogenetic Comparative Models: Overcoming the Limitations of "Distance-Based" Methods. , 2016, Systematic biology.

[24]  J. Bruggeman,et al.  Rphylopars: fast multivariate phylogenetic comparative methods for missing data and within‐species variation , 2017 .