Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability.

Owing to the exponential growth of genome databases, phylogenetic trees are now widely used to test a variety of evolutionary hypotheses. Nevertheless, computation time burden limits the application of methods such as maximum likelihood nonparametric bootstrap to assess reliability of evolutionary trees. As an alternative, the much faster Bayesian inference of phylogeny, which expresses branch support as posterior probabilities, has been introduced. However, marked discrepancies exist between nonparametric bootstrap proportions and Bayesian posterior probabilities, leading to difficulties in the interpretation of sometimes strongly conflicting results. As an attempt to reconcile these two indices of node reliability, we apply the nonparametric bootstrap resampling procedure to the Bayesian approach. The correlation between posterior probabilities, bootstrap maximum likelihood percentages, and bootstrapped posterior probabilities was studied for eight highly diverse empirical data sets and were also investigated using experimental simulation. Our results show that the relation between posterior probabilities and bootstrapped maximum likelihood percentages is highly variable but that very strong correlations always exist when Bayesian node support is estimated on bootstrapped character matrices. Moreover, simulations corroborate empirical observations in suggesting that, being more conservative, the bootstrap approach might be less prone to strongly supporting a false phylogenetic hypothesis. Thus, apparent conflicts in topology recovered by the Bayesian approach were reduced after bootstrapping. Both posterior probabilities and bootstrap supports are of great interest to phylogeny as potential upper and lower bounds of node reliability, but they are surely not interchangeable and cannot be directly compared.

[1]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[2]  J. Bull,et al.  An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis , 1993 .

[3]  Joseph Felsenstein,et al.  Is there something wrong with the bootstrap on phylogenies? A reply to Hillis and Bull , 1993 .

[4]  B. Efron,et al.  Bootstrap confidence levels for phylogenetic trees. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[5]  K. Strimmer,et al.  Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies , 1996 .

[6]  W. Maddison Gene Trees in Species Trees , 1997 .

[7]  Andrew Rambaut,et al.  Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees , 1997, Comput. Appl. Biosci..

[8]  B. Rannala,et al.  Bayesian phylogenetic inference using DNA sequences: a Markov Chain Monte Carlo Method. , 1997, Molecular biology and evolution.

[9]  David Posada,et al.  MODELTEST: testing the model of DNA substitution , 1998, Bioinform..

[10]  B. Larget,et al.  Markov Chain Monte Carlo Algorithms for the Bayesian Analysis of Phylogenetic Trees , 2000 .

[11]  M. Goodman,et al.  Molecular phylogeny of ateline new world monkeys (Platyrrhini, atelinae) based on gamma-globin gene sequences: evidence that brachyteles is the sister group of lagothrix. , 1999, Molecular phylogenetics and evolution.

[12]  M. Chase,et al.  Molecular phylogenetics of Diseae (Orchidaceae): a contribution from nuclear ribosomal ITS sequences. , 1999, American journal of botany.

[13]  Bin Ma,et al.  From Gene Trees to Species Trees , 2000, SIAM J. Comput..

[14]  A. Rodrigo,et al.  Likelihood-based tests of topologies in phylogenetics. , 2000, Systematic biology.

[15]  Charles F. Delwiche,et al.  The Closest Living Relatives of Land Plants , 2001, Science.

[16]  P. Waddell,et al.  A phylogenetic foundation for comparative mammalian genomics. , 2001, Genome informatics. International Conference on Genome Informatics.

[17]  Jonathan P. Bollback,et al.  Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology , 2001, Science.

[18]  John P. Huelsenbeck,et al.  MRBAYES: Bayesian inference of phylogenetic trees , 2001, Bioinform..

[19]  D. Horner,et al.  Unique phylogenetic relationships of glucokinase and glucosephosphate isomerase of the amitochondriate eukaryotes Giardia intestinalis, Spironucleus barkhanus and Trichomonas vaginalis. , 2001, Gene.

[20]  W. Doolittle,et al.  Bacterial origin for the isoprenoid biosynthesis enzyme HMG-CoA reductase of the archaeal orders Thermoplasmatales and Archaeoglobales. , 2001, Molecular biology and evolution.

[21]  W. Murphy,et al.  Resolution of the Early Placental Mammal Radiation Using Bayesian Phylogenetics , 2001, Science.

[22]  H. Morrison,et al.  Three retrotransposon families in the genome of Giardia lamblia: Two telomeric, one dead , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Mark Pagel,et al.  Major fungal lineages are derived from lichen symbiotic ancestors , 2022 .

[24]  J. Huelsenbeck Testing a covariotide model of DNA substitution. , 2002, Molecular biology and evolution.

[25]  Derrick J. Zwickl,et al.  Phylogenetic relationships of the dwarf boas and a comparison of Bayesian and bootstrap measures of phylogenetic support. , 2002, Molecular phylogenetics and evolution.

[26]  D. Swofford PAUP*: Phylogenetic analysis using parsimony (*and other methods), Version 4.0b10 , 2002 .

[27]  ohn,et al.  Potential Applications and Pitfalls of Bayesian Inference of Phylogeny , 2002 .

[28]  C. Cunningham,et al.  The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support. , 2002, Molecular biology and evolution.

[29]  M. Stanhope,et al.  Molecular phylogeny of living xenarthrans and the impact of character and taxon sampling on the placental tree rooting. , 2002, Molecular biology and evolution.

[30]  D. Winkler,et al.  Phylogeny of the tree swallow genus, Tachycineta (Aves: Hirundinidae), by Bayesian analysis of mitochondrial DNA sequences. , 2002, Molecular phylogenetics and evolution.

[31]  T. Buckley,et al.  Model misspecification and probabilistic tests of topology: evidence from empirical data sets. , 2002, Systematic biology.

[32]  Peter Arensburger,et al.  Combined data, Bayesian phylogenetics, and the origin of the New Zealand cicada genera. , 2002, Systematic biology.

[33]  Dam,et al.  Molecular Systematics of the Eastern Fence Lizard ( Sceloporus undulatus ): A Comparison of Parsimony, Likelihood, and Bayesian Approaches , 2002 .

[34]  M. Stanhope,et al.  Molecular phylogenetic evidence refuting the hypothesis of Batoidea (rays and skates) as derived sharks. , 2003, Molecular Phylogenetics and Evolution.

[35]  Michael P. Cummings,et al.  PAUP* [Phylogenetic Analysis Using Parsimony (and Other Methods)] , 2004 .

[36]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[37]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.