Estimating Phylogenetic Trees from Distance Matrices

1. The distance Wagner procedure, presented here, is a modification of the original Wagner algorithm of Kluge and Farris (1969). Unlike previous techniques for calculating most parsimonious trees, it does not require a character-state matrix for the OTUs, but depends only upon an OTU × OTU matrix of phenetic differences. For this reason, the distance Wagner procedure is applicable to data matrices of immunological distance, such as that of Sarich (1969a), in which between-OTU comparisons are evaluated but for which no attributes of the OTUs themselves are directly observable. The distance Wagner procedure has the advantage over other available techniques for processing such data that it is free of the assumption of homogeneity of evolutionary rates over phyletic lines. 2. The estimated evolutionary trees produced by the distance Wagner procedure are undirected. Their roots may be estimated either by assuming the evolutionary rates of the two most divergent phyletic lines on the tree to be equal, or by connecting to the estimating tree an OTU external to the holophyletic group for which the root is to be estimated. The former technique has the advantage that it can be applied even in poorly understood groups. The latter technique is probably somewhat more reliable in cases in which an OTU external to the desired holophyletic group can be safety identified. 3. An example of the use of the distance Wagner procedure is provided in an analysis of the immunological-distance data of Sarich (1969a). For a subset of the OTUs of Sarich's study, the tree calculated by the distance Wagner procedure is found to give a better fit to the data than the tree published by Sarich (1969b). 4. A second tree, calculated for the largest set of OTUs for which Sarich's data are complete, is used to investigate the degree of homogeneity of the evolutionary rates in different carnivore phyletic lines. It is found that phyletic lines may differ by at least 20%-30% in actual rate of evolution over a time span of about 55 million years. Over shorter time spans, phyletic lines may show at least a 4.5-fold difference in rate of divergence. 5. Because of the likelihood of heterogeneities in amounts of divergence between different phyletic lines, it is inadvisable to use phenetic-similarity clustering techniques to estimate evolutionary trees. An unweighted pair-group analysis of Sarich's data yields a taxonomic placement of dogs which is almost certainly incorrect. 6. The amounts of evolutionary divergence between pairs of OTUs estimated by the distance Wagner procedure have the property that they are always greater than or equal to the observed between-OTU differences. They can thus be interpreted as the number of evolutionary changes occurring in the lines of descent connecting the OTUs. Estimated evolutionary trees on which the length of the branches are assigned using the procedure of Sarich (1969a), Fitch and Margoliash (1967), or Goodman et al. (1971) do not necessarily indicate amounts of divergence with the above property For this reason, "amounts of divergence" calculated from such trees cannot logically be given a physical interpretation and should not be used as estimates of actual amounts of divergence.

[1]  H. Ross Principles of Numerical Taxonomy , 1964 .

[2]  Seymour Lipschutz,et al.  Schaum's outline of theory and problems of general topology , 1965 .

[3]  R. Sokal,et al.  A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY , 1965 .

[4]  J. Gower Some distance properties of latent root and vector methods used in multivariate analysis , 1966 .

[5]  Rates of albumin evolution in primates. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[6]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[7]  James S. Farris,et al.  The Meaning of Relationship and Taxonomic Procedure , 1967 .

[8]  L. Cavalli-Sforza,et al.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES , 1967, Evolution; international journal of organic evolution.

[9]  P. Walker The specificity of molecular hybridization in relation to studies on higher organisms. , 1969, Progress in nucleic acid research and molecular biology.

[10]  J. Farris,et al.  Quantitative Phyletics and the Evolution of Anurans , 1969 .

[11]  V. Sarich Pinniped origins and the rate of evolution of carnivore albumins. , 1969, Systematic zoology.

[12]  Arnold G. Kluge,et al.  A Numerical Approach to Phylogenetic Systematics , 1970 .

[13]  B. Mccarthy,et al.  DNA Comparisons among Barley, Oats, Rye, and Wheat. , 1970, Genetics.

[14]  D. H. Colless The Phenogram as an Estimate of Phylogeny , 1970 .

[15]  J. Farris Methods for Computing Wagner Trees , 1970 .

[16]  G. Moore,et al.  Molecular Evolution in the Descent of Man , 1971, Nature.

[17]  P. Ashlock Monophyly and Associated Terms , 1971 .

[18]  G. Moore,et al.  Immunodiffusion Systematics of the Primates I. the Catarrhini , 1971 .

[19]  Peter H. A. Sneath,et al.  Numerical Taxonomy: The Principles and Practice of Numerical Classification , 1973 .