MAXIMUM LIKELIHOOD INFERENCE OF PHYLOGENETIC TREES, WITH SPECIAL REFERENCE TO A POISSON PROCESS MODEL OF DNA SUBSTITUTION AND TO PARSIMONY ANALYSES

Maximum likelihood inference is discussed, and some of its advantages and dis- advantages are noted. The application of maximum likelihood inference to phylogenetics is examined, and a simple Poisson process model of DNA substitution is used as one example. Further examples follow from the clarification of implicit models underlying traditional "par- simony" and "compatibility" analyses. From the elucidation of these models and analyses, it is seen that Poisson process analysis gives a statistically consistent estimate of phylogeny, and that parsimony methods do indeed have a maximum likelihood foundation but give potentially incorrect estimates of phylogeny. The maximum likelihood formulation provides a common framework within which these analyses are discussed and compared. (Phylogenetic inference; maximum likelihood inference; Poisson process model; parsimony analysis; DNA substitution.)

[1]  John D. Kalbfleisch,et al.  Application of Likelihood Methods to Models Involving Large Numbers of Parameters , 1970 .

[2]  R. Fisher Statistical Methods for Research Workers , 1971 .

[3]  A. Wald The Fitting of Straight Lines if Both Variables are Subject to Error , 1940 .

[4]  M. Forster,et al.  STATISTICAL COVARIANCE AS A MEASURE OF PHYLOGENETIC RELATIONSHIP , 1986, Cladistics : the international journal of the Willi Hennig Society.

[5]  Joseph Felsenstein,et al.  Parsimony and likelihood: an exchange , 1986 .

[6]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[7]  L. Cavalli-Sforza,et al.  ANALYSIS OF HUMAN EVOLUTION UNDER RANDOM GENETIC DRIFT. , 1964, Cold Spring Harbor symposia on quantitative biology.

[8]  M. Kendall,et al.  The advanced theory of statistics , 1945 .

[9]  David S. Johnson,et al.  The computational complexity of inferring rooted phylogenies by parsimony , 1986 .

[10]  P. H. A. Sneath,et al.  Detecting Evolutionary Incompatibilities From Protein Sequences , 1975 .

[11]  L. Cavalli-Sforza,et al.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES , 1967, Evolution; international journal of organic evolution.

[12]  J. Farris A Probability Model for Inferring Evolutionary Trees , 1973 .

[13]  F. McMorris,et al.  A Mathematical Foundation for the Analysis of Cladistic Character Compatibility , 1976 .

[14]  E. Wilson A Consistency Test for Phylogenies Based on Contemporaneous Species , 1965 .

[15]  R. Graham,et al.  Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time , 1982 .

[16]  R. Sokal,et al.  A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY , 1965 .

[17]  M. Bishop,et al.  Evolutionary trees from nucleic acid and protein sequences , 1985, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[18]  Michael D. Hendy,et al.  A Framework for the Quantitative Study of Evolutionary Trees , 1989 .

[19]  W. H. Day Computationally difficult parsimony problems in phylogenetic systematics , 1983 .

[20]  R. Fisher 014: On the "Probable Error" of a Coefficient of Correlation Deduced from a Small Sample. , 1921 .

[21]  M. Kimura,et al.  The rate of molecular evolution considered from the standpoint of population genetics. , 1969, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Bishop,et al.  Maximum likelihood alignment of DNA sequences. , 1986, Journal of molecular biology.

[23]  M. Snyder,et al.  Reconstructing the Past: Some Cognitive Consequences of Person Perception , 1978 .

[24]  Joseph Felsenstein,et al.  Maximum Likelihood and Minimum-Steps Methods for Estimating Evolutionary Trees from Data on Discrete Characters , 1973 .

[25]  E. B. Andersen,et al.  Asymptotic Properties of Conditional Maximum‐Likelihood Estimators , 1970 .

[26]  Elizabeth A. Thompson,et al.  Human Evolutionary Trees , 1975 .

[27]  W. J. Quesne The Uniquely Evolved Character Concept and its Cladistic Application , 1974 .

[28]  A. Edwards,et al.  Estimation of the Branch Points of a Branching Diffusion Process , 1970 .

[29]  Robert C. Elston,et al.  Genealogical and Genetic Structure. , 1982 .

[30]  H. D. Miller,et al.  The Theory Of Stochastic Processes , 1977, The Mathematical Gazette.

[31]  Walter M. Fitch,et al.  On the Problem of Discovering the Most Parsimonious Tree , 1977, The American Naturalist.

[32]  J. Farris Phylogenetic Analysis Under Dollo's Law , 1977 .

[33]  G. Estabrook,et al.  An idealized concept of the true cladistic character , 1975 .

[34]  Joseph Felsenstein,et al.  Alternative Methods of Phylogenetic Inference and their Interrelationship , 1979 .

[35]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[36]  P. J. Green,et al.  Probability and Statistical Inference , 1978 .

[37]  G. F. Estabrook,et al.  An algebraic analysis of cladistic characters , 1976, Discret. Math..

[38]  J. Kiefer,et al.  CONSISTENCY OF THE MAXIMUM LIKELIHOOD ESTIMATOR IN THE PRESENCE OF INFINITELY MANY INCIDENTAL PARAMETERS , 1956 .

[39]  W. J. Quesne,et al.  A Method of Selection of Characters in Numerical Taxonomy , 1969 .

[40]  G. Estabrook,et al.  A SIMPLE TEST FOR THE POSSIBLE SIMULTANEOUS EVOLUTIONARY DIVERGENCE OF TWO AMINO ACID POSITIONS , 1975 .