Biases in Maximum Likelihood and Parsimony: A Simulation Approach to a 10-Taxon Case

Biases present in maximum likelihood and parsimony are investigated through a simulation study in a 10-taxon case in which several long branches coexist with short branches in the modeled topology. The performance of these methods is explored while increasing the length of the long branches with different amounts of data. Also, simulations with different taxonomic sampling schemes are examined through this study. The presence of a strong bias in parsimony is corroborated: the well-known long-branch attraction. Likelihood performance is found to be sensitive to the mere presence extreme of branch length disparity, retrieving topologies compatible with long-branch attraction and long-branch repulsion, irrespective of the correctness of the model used.

[1]  N. Platnick,et al.  Advances in cladistics , 1983 .

[2]  J. Wiens,et al.  War of the Iguanas: conflicting molecular and morphological phylogenies and long-branch attraction in iguanid lizards. , 2000, Systematic biology.

[3]  M. Miyamoto,et al.  Testing phylogenetic approaches with empirical data, as illustrated with the parsimony method. , 1992, Molecular biology and evolution.

[4]  James Lyons-Weiler,et al.  Escaping from the Felsenstein zone by detecting long branches in phylogenetic data. , 1997, Molecular phylogenetics and evolution.

[5]  B. Hall,et al.  Long-branch attraction and the rDNA model of early eukaryotic evolution. , 1999, Molecular biology and evolution.

[6]  J. Huelsenbeck Systematic bias in phylogenetic analysis: is the Strepsiptera problem solved? , 1998, Systematic biology.

[7]  P. Lewis,et al.  Success of maximum likelihood phylogeny inference in the four-taxon case. , 1995, Molecular biology and evolution.

[8]  H Philippe,et al.  Phylogeny of eukaryotes based on ribosomal RNA: long-branch attraction and models of sequence evolution. , 2000, Molecular biology and evolution.

[9]  W. Bruno,et al.  Topological bias and inconsistency of maximum likelihood using wrong models. , 1999, Molecular biology and evolution.

[10]  James Lyons-Weiler,et al.  Branch Length Heterogeneity Leads to Nonindependent Branch Length Estimates and Can Decrease the Efficiency of Methods of Phylogenetic Inference , 1999, Journal of Molecular Evolution.

[11]  Joseph T. Chang,et al.  Inconsistency of evolutionary tree topology reconstruction methods when substitution rates vary across characters. , 1996, Mathematical biosciences.

[12]  J. Huelsenbeck Is the Felsenstein zone a fly trap? , 1997, Systematic biology.

[13]  M. Sanderson,et al.  Error, bias, and long-branch attraction in data for two chloroplast photosystem genes in seed plants. , 2000, Molecular biology and evolution.

[14]  Junhyong Kim,et al.  Large-scale phylogenies and measuring the performance of phylogenetic estimators. , 1998, Systematic biology.

[15]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[16]  J. Felsenstein,et al.  A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. , 1994, Molecular biology and evolution.

[17]  Masatoshi Nei,et al.  Assessing molecular phylogenies , 1995, Science.

[18]  D. Penny,et al.  Spectral analysis of phylogenetic data , 1993 .

[19]  J. Huelsenbeck,et al.  SUCCESS OF PHYLOGENETIC METHODS IN THE FOUR-TAXON CASE , 1993 .

[20]  N Takezaki,et al.  Efficiencies of different genes and different tree-building methods in recovering a known vertebrate phylogeny. , 1996, Molecular biology and evolution.

[21]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[22]  M. Nei,et al.  Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site. , 1994, Molecular biology and evolution.

[23]  M. Nei,et al.  Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used. , 2000, Molecular biology and evolution.

[24]  M. Siddall,et al.  Long‐Branch Abstractions , 1999 .

[25]  J. Huelsenbeck Performance of Phylogenetic Methods in Simulation , 1995 .

[26]  M. Siddall,et al.  Success of Parsimony in the Four‐Taxon Case: Long‐Branch Repulsion by Likelihood in the Farris Zone , 1998 .

[27]  Z. Yang,et al.  How often do wrong models produce better phylogenies? , 1997, Molecular biology and evolution.

[28]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[29]  Junhyong Kim Improving the Accuracy of Phylogenetic Estimation by Combining Different Methods , 1993 .

[30]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[31]  J. Farris Likelihood and Inconsistency , 1999, Cladistics : the international journal of the Willi Hennig Society.