ACCURACY OF ESTIMATED PHYLOGENIES: EFFECTS OF TREE TOPOLOGY AND EVOLUTIONARY MODEL

A simulation study was carried out to investigate the relative importance of tree topology (both balance and stemminess), evolutionary rates (constant, varying among characters, and varying among lineages), and evolutionary models in determining the accuracy with which phylogenetic trees can be estimated. The three evolutionary context models were phyletic (characters can change at each simulated time step), speciational (changes are possible only at the time of speciation into two daughter lineages), and punctuational (changes occur at the time of speciation but only in one of the daughter lineages). UPGMA clustering and maximum parsimony (“Wagner trees”) methods for estimating phylogenies were compared. All trees were based on eight recent OTUs. The three evolutionary context models were found to have the largest influence on accuracy of estimates by both methods. The next most important effect was that of the stemminess × context interaction. Maximum parsimony and UPGMA performed worst under the punctuational models. Under the phyletic model, trees with high stemminess values could be estimated more accurately and balanced trees were slightly easier to estimate than unbalanced ones. Overall, maximum parsimony yielded more accurate trees than UPGMA—but that was expected for these simulations since many more characters than OTUs were used. Our results suggest that the great majority of estimated phylogenetic trees are likely to be quite inaccurate; they underscore the inappropriateness of characterizing current phylogenetic methods as being for reconstruction rather than for estimation.

[1]  Donald H. Colless,et al.  Congruence Between Morphometric and Allozyme Data for Menidia Species: A Reappraisal , 1980 .

[2]  E. Wiley Phylogenetics: The Theory and Practice of Phylogenetic Systematics , 1981 .

[3]  M. Lynch PHYLOGENETIC HYPOTHESES UNDER THE ASSUMPTION OF NEUTRAL QUANTITATIVE‐GENETIC VARIATION , 1989, Evolution; international journal of organic evolution.

[4]  R. Sokal,et al.  OTU stability and factors determining taxonomic stability: examples from the caminalcules and the leptopodomorpha , 1984 .

[5]  G. Estabrook,et al.  Application of compatibility analysis in numerical cladistics at the infraspecific level , 1978 .

[6]  J. Farris Estimating Phylogenetic Trees from Distance Matrices , 1972, The American Naturalist.

[7]  R. Sokal,et al.  FACTORS DETERMINING THE ACCURACY OF CLADOGRAM ESTIMATION: EVALUATION USING COMPUTER SIMULATION , 1985, Evolution; international journal of organic evolution.

[8]  F. Rohlf,et al.  EVALUATION OF THE RESTRICTED MAXIMUM‐LIKELIHOOD METHOD FOR ESTIMATING PHYLOGENETIC TREES USING SIMULATED ALLELE‐FREQUENCY DATA , 1988, Evolution; international journal of organic evolution.

[9]  A. Tversky,et al.  Additive similarity trees , 1977 .

[10]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[11]  Bernard R. Baum Relationships between Transformation Series and Some Numerical Cladistic Methods at the Infraspecific Level, when Genealogies are Known , 1983 .

[12]  D. H. Colless,et al.  Phylogenetics: The Theory and Practice of Phylogenetic Systematics. , 1982 .

[13]  F. Rohlf Consensus indices for comparing classifications , 1982 .

[14]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[15]  Fred R. McMorris,et al.  Consensusn-trees , 1981 .

[16]  L. Cavalli-Sforza,et al.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES , 1967, Evolution; international journal of organic evolution.

[17]  C. Krimbas,et al.  Accuracy of phylogenetic trees estimated from DNA sequence data. , 1987, Molecular biology and evolution.