Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.

The relative efficiencies of the maximum parsimony (MP) and distance-matrix methods in obtaining the correct tree (topology) were studied by using computer simulation. The distance-matrix methods examined are the neighbor-joining, distance-Wagner, Tateno et al. modified Farris, Faith, and Li methods. In the computer simulation, six or eight DNA sequences were assumed to evolve following a given model tree, and the evolutionary changes of the sequences were followed. Both constant and varying rates of nucleotide substitution were considered. From the sequences thus obtained, phylogenetic trees were constructed using the six tree-making methods and compared with the model (true) tree. This process was repeated 300 times for each different set of parameters. The results obtained indicate that when the number of nucleotide substitutions per site is small and a relatively small number of nucleotides are used, the probability of obtaining the correct topology (P1) is generally lower in the MP method than in the distance-matrix methods. The P1 value for the MP method increases with increasing number of nucleotides but is still generally lower than the value for the NJ or DW method. Essentially the same conclusion was obtained whether or not the rate of nucleotide substitution was constant or whether or not a transition bias in nucleotide substitution existed. The relatively poor performance of the MP method for these cases is due to the fact that information from singular sites is not used in this method. The MP method also showed a relatively low P1 value when the model of varying rate of nucleotide substitution was used and the number of substitutions per site was large. However, the MP method often produced cases in which the correct tree was one of several equally parsimonious trees. When these cases were included in the class of "success," the MP method performed better than the other methods, provided that the number of nucleotide substitutions per site was small.

[1]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[2]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[3]  P. Meisel Margaret O. Dayhoff: Atlas of Protein Sequence and Structure 1969 (Volume 4) XXIV u. 361 S., 21 Ausklapptafeln, 68 Abb. und zahlreiche Tabellen. National Biomedical Research Foundation, Silver Spring/Maryland 1969. Preis $ 12,50 , 1971 .

[4]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[5]  J. Farris Estimating Phylogenetic Trees from Distance Matrices , 1972, The American Naturalist.

[6]  D. Boulter,et al.  Use of amino acid sequence data in phylogeny and evaluation of methods using computer simulation. , 1975, Journal of molecular biology.

[7]  J. Farris On the Phenetic Approach to Vertebrate Classification , 1977 .

[8]  W. F. Blair,et al.  Major Patterns in Vertebrate Evolution. , 1978 .

[9]  Walter M. Fitch,et al.  On the Problem of Discovering the Most Parsimonious Tree , 1977, The American Naturalist.

[10]  Peter Charles Goody,et al.  Major Patterns in Vertebrate Evolution. , 1978 .

[11]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[12]  L. Klotz,et al.  A practical method for calculating evolutionary trees from sequence data. , 1981, Journal of theoretical biology.

[13]  W. Li,et al.  Simple method for constructing phylogenetic trees from distance matrices. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[14]  D. Penny Towards a basis for classification: the incompleteness of distance measures, incompatibility analysis and phenetic classification. , 1982, Journal of theoretical biology.

[15]  D. Faith Distance Methods and the Approximation of Most-Parsimonious Trees , 1985 .

[16]  M. Nei Molecular Evolutionary Genetics , 1987 .

[17]  C. Krimbas,et al.  Accuracy of phylogenetic trees estimated from DNA sequence data. , 1987, Molecular biology and evolution.

[18]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.