A frequency-dependent significance test for parsimony.

We describe techniques for assessing evolutionary trees constructed by the parsimony criteria, when sequences exhibit irregular base compositions. In particular, we extend a recently described frequency-dependent significance test to handle any number of taxa and describe a modification of the Kishino-Hasegawa sites test. These modifications are useful for detecting historical signals beyond those patterns which arise purely from irregular base compositions between the compared sequences. We apply the test to extend our earlier studies on chloroplast origins using 16S rDNA sequences, where a failure to compensate for irregular base compositions between the compared sequences provides statistically significant support for unjustified phylogenetic inferences. We also describe how the techniques can be modified to determine how "tree-like" data are, given independent variation in the base frequencies.

[1]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[2]  J. Hartigan MINIMUM MUTATION FITS TO A GIVEN TREE , 1973 .

[3]  Elizabeth A. Thompson,et al.  Human Evolutionary Trees , 1975 .

[4]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[5]  D. Penny,et al.  Branch and bound algorithms to determine minimal evolutionary trees , 1982 .

[6]  R. Graham,et al.  Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time , 1982 .

[7]  J A Lake,et al.  A rate-independent technique for analysis of nucleic acid sequences: evolutionary parsimony. , 1987, Molecular biology and evolution.

[8]  James W. Archie,et al.  A randomization test for phylogenetic information in systematic data , 1989 .

[9]  S. Pääbo,et al.  Chance marsupial relationships , 1990, Nature.

[10]  J. Oliver,et al.  The general stochastic model of nucleotide substitution. , 1990, Journal of theoretical biology.

[11]  Nicholas C. Wormald,et al.  On the Distribution of Lengths of Evolutionary Trees , 1990, SIAM J. Discret. Math..

[12]  G. Olsen,et al.  The ribosomal RNA database project. , 1991, Nucleic acids research.

[13]  J. Palmer,et al.  Gene phylogenies and the endosymbiotic origin of plastids. , 1992, Bio Systems.

[14]  Michael D. Hendy,et al.  Significance of the length of the shortest tree , 1992 .

[15]  D. Penny,et al.  Controversy on chloroplast origins , 1992, FEBS Letters.

[16]  Michael D. Hendy,et al.  Parsimony Can Be Consistent , 1993 .

[17]  G. Olsen,et al.  Ribosomal RNA: a key to phylogeny , 1993, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[18]  C. Stewart The powers and pitfalls of parsimony , 1993, Nature.

[19]  J. Huelsenbeck,et al.  SUCCESS OF PHYLOGENETIC METHODS IN THE FOUR-TAXON CASE , 1993 .

[20]  M. A. Steel,et al.  Confidence in evolutionary trees from biological sequence data , 1993, Nature.

[21]  H. Klenk,et al.  DNA-dependent RNA polymerases as phylogenetic marker molecules , 1993 .

[22]  Mike A. Steel Distributions on Bicoloured Binary Trees Arising from the Principle of Parsimony , 1993, Discret. Appl. Math..

[23]  Masami Hasegawa,et al.  Ribosomal RNA trees misleading? , 1993, Nature.

[24]  M. Steel,et al.  Recovering evolutionary trees under a more realistic model of sequence evolution. , 1994, Molecular biology and evolution.

[25]  J. Lake,et al.  Reconstructing evolutionary trees from DNA and protein sequences: paralinear distances. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[26]  M. Steel Recovering a tree from the leaf colourations it generates under a Markov model , 1994 .

[27]  Mike Steel,et al.  Five surprising properties of parsimoniously colored trees , 1995 .

[28]  M. Waterman,et al.  A central limit theorem for the parsimony length of trees , 1996, Advances in Applied Probability.