CENTRAL MOMENTS AND PROBABILITY DISTRIBUTION OF COLLESS'S COEFFICIENT OF TREE IMBALANCE

The great increase in the number of phylogenetic studies of a wide variety of organisms in recent decades has focused considerable attention on the balance of phylogenetic trees—the degree to which sister clades within a tree tend to be of equal size—for at least two reasons: (1) the degree of balance of a tree may affect the accuracy of estimates of it; (2) the degree of balance, or imbalance, of a tree may reveal something about the macroevolutionary processes that produced it. In particular, variation among lineages in rates of speciation or extinction is expected to produce trees that are less balanced than those that result from phylogenetic evolution in which each extant species of a group has the same probability of speciation or extinction. Several coefficients for measuring the balance or imbalance of phylogenetic trees have been proposed. I focused on Colless's coefficient of imbalance (7) for its mathematical tractability and ease of interpretation. Earlier work on this statistic produced exact methods only for calculating the expected value. In those studies, the variance and confidence limits, which are necessary for testing the departure of observed values of I from the expected, were estimated by Monte Carlo simulation. I developed recursion equations that allow exact calculation of the mean, variance, skewness, and complete probability distribution of I for two different probability‐generating models for bifurcating tree shapes. The Equal‐Rates Markov (ERM) model assumes that trees grow by the random speciation and extinction of extant species, with all species that are extant at a given time having the same probability of speciation or extinction. The Equal Probability (EP) model assumes that all possible labeled trees for a given number of terminal taxa have the same probability of occurring. Examples illustrate how these theoretically derived probabilities and parameters may be used to test whether the evolution of a monophyletic group or set of monophyletic groups has proceeded according to a Markov model with equal rates of speciation and extinction among species, that is, whether there has been significant variation among lineages in expected rates of speciation or extinction.

[1]  L. Sucheston Modern Probability Theory and its Applications. , 1961 .

[2]  R. Page RANDOM DENDROGRAMS AND NULL HYPOTHESES IN CLADISTIC BIOGEOGRAPHY , 1991 .

[3]  M. Slatkin,et al.  SEARCHING FOR EVOLUTIONARY PATTERNS IN THE SHAPE OF A PHYLOGENETIC TREE , 1993, Evolution; international journal of organic evolution.

[4]  J. H. M. Wedderburn The Functional Equation g(x 2 ) = 2 αx + [g(x)] 2 , 1922 .

[5]  D. Futuyma,et al.  PHYLOGENY AND THE EVOLUTION OF HOST PLANT ASSOCIATIONS IN THE LEAF BEETLE GENUS OPHRAELLA (COLEOPTERA, CHRYSOMELIDAE) , 1990, Evolution; international journal of organic evolution.

[6]  Joseph B. Slowinski,et al.  PROBABILITIES OF n-TREES UNDER TWO MODELS: A DEMONSTRATION THAT ASYMMETRICAL INTERIOR NODES ARE NOT IMPROBABLE , 1990 .

[7]  S. Heard,et al.  PATTERNS IN TREE BALANCE AMONG CLADISTIC, PHENETIC, AND RANDOMLY GENERATED PHYLOGENETIC TREES , 1992, Evolution; international journal of organic evolution.

[8]  L. Cavalli-Sforza,et al.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES , 1967, Evolution; international journal of organic evolution.

[9]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[10]  H. M. Savage The shape of evolution: systematic tree topology , 1983 .

[11]  Earl D. McCoy,et al.  There Have Been No Statistical Tests of Cladistic Biogeographical Hypotheses , 1981 .

[12]  James S. Rogers,et al.  Response of Colless's tree imbalance to number of terminal taxa , 1993 .

[13]  D. H. Colless,et al.  Phylogenetics: The Theory and Practice of Phylogenetic Systematics. , 1982 .

[14]  C. Guyer,et al.  COMPARISONS OF OBSERVED PHYLOGENETIC TOPOLOGIES WITH NULL EXPECTATIONS AMONG THREE MONOPHYLETIC LINEAGES , 1991, Evolution; international journal of organic evolution.

[15]  E. Harding The probabilities of rooted tree-shapes generated by random bifurcation , 1971, Advances in Applied Probability.

[16]  Brian D. Farrell,et al.  PHYLOGENESIS OF INSECT/PLANT INTERACTIONS: HAVE PHYLLOBROTICA LEAF BEETLES (CHRYSOMELIDAE) AND THE LAMIALES DIVERSIFIED IN PARALLEL? , 1990, Evolution; international journal of organic evolution.

[17]  Gareth Nelson,et al.  Vicariance Biogeography: A Critique , 1982 .