NULL MODELS FOR THE NUMBER OF EVOLUTIONARY STEPS IN A CHARACTER ON A PHYLOGENETIC TREE

Random trees and random characters can be used in null models for testing phylogenetic hypothesis. We consider three interpretations of random trees: first, that trees are selected from the set of all possible trees with equal probability; second, that trees are formed by random speciation or coalescence (equivalent); and third, that trees are formed by a series of random partitions of the taxa. We consider two interpretations of random characters: first, that the number of taxa with each state is held constant, but the states are randomly reshuffled among the taxa; and second, that the probability each taxon is assigned a particular state is constant from one taxon to the next. Under null models representing various combinations of randomizations of trees and characters, exact recursion equations are given to calculate the probability distribution of the number of character state changes required by a phylogenetic tree. Possible applications of these probability distributions are discussed. They can be used, for example, to test for a panmictic population structure within a species or to test phylogenetic inertia in a character's evolution. Whether and how a null model incorporates tree randomness makes little difference to the probability distribution in many but not all circumstances. The null model's sense of character randomness appears more critical. The difficult issue of choosing a null model is discussed.

[1]  Nicholas C. Wormald,et al.  On the Distribution of Lengths of Evolutionary Trees , 1990, SIAM J. Discret. Math..

[2]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[3]  Rohlf Fj Numbering binary trees with labeled terminal vertices , 1983 .

[4]  M. Steel Distributions on bicoloured evolutionary trees , 1990, Bulletin of the Australian Mathematical Society.

[5]  W. Maddison RECONSTRUCTING CHARACTER EVOLUTION ON POLYTOMOUS CLADOGRAMS , 1989, Cladistics : the international journal of the Willi Hennig Society.

[6]  F J Rohlf,et al.  Numbering binary trees with labeled terminal vertices. , 1983, Bulletin of mathematical biology.

[7]  Adolfo J. Quiroz,et al.  Fast random generation of binary, t-ary and other types of trees , 1989 .

[8]  James W. Archie,et al.  A randomization test for phylogenetic information in systematic data , 1989 .

[9]  W. J. Quesne,et al.  A Method of Selection of Characters in Numerical Taxonomy , 1969 .

[10]  N. Oden,et al.  An algorithm to equiprobably generate all directed trees with kappa labeled terminal nodes and unlabeled interior nodes. , 1984, Bulletin of mathematical biology.

[11]  Michael D. Hendy,et al.  Significance of the length of the shortest tree , 1992 .

[12]  Daniel Simberloff,et al.  CALCULATING PROBABILITIES THAT CLADOGRAMS MATCH: A METHOD OF BIOGEOGRAPHICAL INFERENCE , 1987 .

[13]  G. Furnas The generation of random, binary unordered trees , 1984 .

[14]  J. Farris Methods for Computing Wagner Trees , 1970 .

[15]  M. Stoneking,et al.  Mitochondrial DNA and human evolution , 1987, Nature.

[16]  Joseph Felsenstein,et al.  The number of evolutionary trees , 1978 .

[17]  N. Takahata Gene genealogy in three related populations: consistency probability between gene and population trees. , 1989, Genetics.

[18]  Neal L. Oden,et al.  An algorithm to equiprobably generate all directed trees withk labeled terminal nodes and unlabeled interior nodes , 1984 .

[19]  E. Harding The probabilities of rooted tree-shapes generated by random bifurcation , 1971, Advances in Applied Probability.

[20]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[21]  Joseph B. Slowinski,et al.  Testing the Stochasticity of Patterns of Organismal Diversity: An Improved Null Model , 1989, The American Naturalist.

[22]  C. Meacham A probability measure for character compatibility , 1981 .

[23]  H. M. Savage The shape of evolution: systematic tree topology , 1983 .

[24]  Mike A. Steel Distributions on Bicoloured Binary Trees Arising from the Principle of Parsimony , 1993, Discret. Appl. Math..

[25]  Earl D. McCoy,et al.  There Have Been No Statistical Tests of Cladistic Biogeographical Hypotheses , 1981 .

[26]  M Slatkin,et al.  A cladistic measure of gene flow inferred from the phylogenies of alleles. , 1989, Genetics.

[27]  M. Frohlich Common-Is-Primitive: A Partial Validation by Tree Counting , 1987 .

[28]  F. Tajima DNA polymorphism in a subdivided population: the expected number of segregating sites in the two-subpopulation model. , 1989, Genetics.

[29]  F. Rohlf,et al.  Sampling Distribution of Consensus Indices when All Bifurcating Trees are Equally Likely , 1983 .

[30]  R. Thorne,et al.  Phenetic and Phylogenetic Classification , 1964, Nature.

[31]  D. Rosen Vicariant Patterns and Historical Explanation in Biogeography , 1978 .