Multiple maxima of likelihood in phylogenetic trees: an analytic approach

Maximum likelihood (ML) is a widely used criterion for selecting optimal evolutionary trees. However, little is known on the nature of the likelihood surface for trees, especially as to the frequency of multiple optima. We initiate an analytic study for identifying sequences that generate multiple optima. We report a new approach to calculating ML directly, which we have used to find large families of sequences that have multiple optima, including sequences with a continuum of optimal points. Such datasets are best supported by different (two or more) phylogenies that vary significantly in their timings of evolutionary events Some standard biological processes can lead to data with multiple optima and consequently the field needs further investigation. Our results imply that hill climbing techniques, as currently implemented in various software packages, cannot guarantee to find the global ML point, even if it is unique.

[1]  J. S. Rogers,et al.  Multiple local maxima for likelihoods of phylogenetic trees: a simulation study. , 1999, Molecular biology and evolution.

[2]  Dan Pelleg,et al.  Constructing Phylogenies from Quartets: Elucidation of Eutherian Superordinal Relationships , 1998, J. Comput. Biol..

[3]  D. Penny,et al.  Spectral analysis of phylogenetic data , 1993 .

[4]  Mike Steel,et al.  The Maximum Likelihood Point for a Phylogenetic Tree is Not Unique , 1994 .

[5]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[6]  J. Neyman MOLECULAR STUDIES OF EVOLUTION: A SOURCE OF NOVEL STATISTICAL PROBLEMS* , 1971 .

[7]  Neil J. A. Sloane,et al.  The theory of error-correcting codes (north-holland , 1977 .

[8]  Asami,et al.  Towards Resolving the Interordinal Relationships of Placental Mammals , 2001 .

[9]  Tandy J. Warnow,et al.  A few logs suffice to build (almost) all trees (I) , 1999, Random Struct. Algorithms.

[10]  M.A. CHARLESTON,et al.  Toward a Characterization of Landscapes of Combinatorial Optimization Problems, with Special Attention to the Phylogeny Problem , 1995, J. Comput. Biol..

[11]  Michael D. Hendy,et al.  A combinatorial description of the closest tree algorithm for finding evolutionary trees , 1991, Discret. Math..

[12]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[13]  D. Penny,et al.  Spectral Analysis, Systematic Bias, and the Evolution of Chloroplasts , 1999 .

[14]  Z. Yang,et al.  Complexity of the simplest phylogenetic estimation problem , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Daniel Barry,et al.  [Statistical Analysis of Hominoid Molecular Evolution]: Rejoinder , 1987 .

[16]  P. Erdös,et al.  A few logs suffice to build (almost) all trees (l): part I , 1997 .

[17]  J. Felsenstein,et al.  Invariants of phylogenies in a simple case with discrete states , 1987 .

[18]  Mike Steel,et al.  Links between maximum likelihood and maximum parsimony under a simple model of site substitution , 1997 .

[19]  Nick Goldman,et al.  MAXIMUM LIKELIHOOD INFERENCE OF PHYLOGENETIC TREES, WITH SPECIAL REFERENCE TO A POISSON PROCESS MODEL OF DNA SUBSTITUTION AND TO PARSIMONY ANALYSES , 1990 .

[20]  G. Lothian,et al.  Spectral Analysis , 1971, Nature.

[21]  Joseph T. Chang,et al.  Full reconstruction of Markov models on evolutionary trees: identifiability and consistency. , 1996, Mathematical biosciences.

[22]  D Penny,et al.  Parsimony, likelihood, and the role of models in molecular phylogenetics. , 2000, Molecular biology and evolution.

[23]  N. Saitou,et al.  Maximum likelihood methods. , 1990, Methods in enzymology.

[24]  S J Willson Measuring inconsistency in phylogenetic trees. , 1998, Journal of theoretical biology.

[25]  D Penny,et al.  A discrete Fourier analysis for evolutionary trees. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[26]  A. Dress,et al.  Reconstructing the shape of a tree from observed dissimilarity data , 1986 .

[27]  K. Strimmer,et al.  Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies , 1996 .

[28]  M. Tristem Molecular Evolution — A Phylogenetic Approach. , 2000, Heredity.

[29]  M. Kimura The Neutral Theory of Molecular Evolution: From Lamarck to population genetics , 1983 .

[30]  Ziheng Yang Statistical Properties of the Maximum Likelihood Method of Phylogenetic Estimation and Comparison With Distance Matrix Methods , 1994 .

[31]  J. S. Rogers,et al.  On the consistency of maximum likelihood estimation of phylogenetic trees from nucleotide sequences. , 1997, Systematic biology.