Retraction: Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees

In PLoS Computational Biology, volume 3, issue 3, doi:10.1371/journal.pcbi.0030051: As a result of a bug in the Perl script used to compare estimated trees with true trees, the clade confidence measures were sometimes associated with the incorrect clades. The error was detected by the sharp eye of Professor Sarah P. Otto of the University of British Columbia. She noticed a discrepancy between the example tree in Figure 1B and the results reported for the gene nuoK in Table 1, and requested that she be sent all ten nuoK Bayesian trees. She painstakingly did a manual comparison of those trees with the true trees, concluded that for that dataset there was a strong correlation between clade confidence and the probability of a clade being true, and suggested the possibility of a bug in the Perl script. Dr. Otto put in considerable effort, and we want to acknowledge the generosity of that effort. The major conclusion of our paper, as given in its title, is therefore invalid, and the paper must be retracted. It is important to stress that the responsibility for the necessity of retracting our paper is entirely mine (Barry Hall), and that my coauthor Stephen J. Salipante bears none of the responsibility. I wrote the Perl script and failed to check its accuracy sufficiently. We have now corrected the script and reanalyzed the trees in Tables 1–6. The results show that there are strong correlations between clade confidence and the probability that a clade is valid for Bayesian posterior probabilities and for Maximum Likelihood bootstrap percentages and weaker correlations for Maximum Likelihood aLRT values. We have prepared a new paper describing this reanalysis and the results achieved and have submitted it for publication.

[1]  Derrick J. Zwickl,et al.  Phylogenetic relationships of the dwarf boas and a comparison of Bayesian and bootstrap measures of phylogenetic support. , 2002, Molecular phylogenetics and evolution.

[2]  Masatoshi Nei,et al.  Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M A Newton,et al.  Bayesian Phylogenetic Inference via Markov Chain Monte Carlo Methods , 1999, Biometrics.

[4]  A. Zharkikh,et al.  Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock. , 1992, Molecular biology and evolution.

[5]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[6]  A. Oskooi Molecular Evolution and Phylogenetics , 2008 .

[7]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[8]  J. Huelsenbeck,et al.  MRBAYES : Bayesian inference of phylogeny , 2001 .

[9]  S. Jeffery Evolution of Protein Molecules , 1979 .

[10]  W. Doolittle,et al.  Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability. , 2003, Molecular biology and evolution.

[11]  R. Doolittle Similar amino acid sequences: chance or common ancestry? , 1981, Science.

[12]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[13]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[14]  J. Huelsenbeck,et al.  Potential applications and pitfalls of Bayesian inference of phylogeny. , 2002, Systematic biology.

[15]  J. Bull,et al.  An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis , 1993 .

[16]  Barry G. Hall,et al.  Phylogenetic Trees Made Easy: A How-To Manual for Molecular Biologists , 2001 .

[17]  F. Lutzoni,et al.  Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence. , 2003, Molecular biology and evolution.

[18]  Olivier Poch,et al.  A comprehensive comparison of multiple sequence alignment programs , 1999, Nucleic Acids Res..

[19]  B. Rannala,et al.  Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference , 1996, Journal of Molecular Evolution.

[20]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[21]  O. Gascuel,et al.  Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. , 2006, Systematic biology.

[22]  B. Hall Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences. , 2005, Molecular biology and evolution.

[23]  B. Hall,et al.  Simple and accurate estimation of ancestral protein sequences. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  W. H. Piel,et al.  An assessment of accuracy, error, and conflict with support values from genome-scale phylogenetic data. , 2004, Molecular biology and evolution.

[25]  Sudhir Kumar,et al.  MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment , 2004, Briefings Bioinform..