A new method of inference of ancestral nucleotide and amino acid sequences.

A statistical method was developed for reconstructing the nucleotide or amino acid sequences of extinct ancestors, given the phylogeny and sequences of the extant species. A model of nucleotide or amino acid substitution was employed to analyze data of the present-day sequences, and maximum likelihood estimates of parameters such as branch lengths were used to compare the posterior probabilities of assignments of character states (nucleotides or amino acids) to interior nodes of the tree; the assignment having the highest probability was the best reconstruction at the site. The lysozyme c sequences of six mammals were analyzed by using the likelihood and parsimony methods. The new likelihood-based method was found to be superior to the parsimony method. The probability that the amino acids for all interior nodes at a site reconstructed by the new method are correct was calculated to be 0.91, 0.86, and 0.73 for all, variable, and parsimony-informative sites, respectively, whereas the corresponding probabilities for the parsimony method were 0.84, 0.76, and 0.51, respectively. The probability that an amino acid in an ancestral sequence is correctly reconstructed by the likelihood analysis ranged from 91.3 to 98.7% for the four ancestral sequences.

[1]  Linus Pauling,et al.  Chemical Paleogenetics. Molecular "Restoration Studies" of Extinct Forms of Life. , 1963 .

[2]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[3]  M. O. Dayhoff,et al.  22 A Model of Evolutionary Change in Proteins , 1978 .

[4]  Allan C. Wilson,et al.  Adaptive evolution in the stomach lysozymes of foregut fermenters , 1987, Nature.

[5]  J. Hartigan,et al.  Statistical Analysis of Hominoid Molecular Evolution , 1987 .

[6]  Nick Goldman,et al.  MAXIMUM LIKELIHOOD INFERENCE OF PHYLOGENETIC TREES, WITH SPECIAL REFERENCE TO A POISSON PROCESS MODEL OF DNA SUBSTITUTION AND TO PARSIMONY ANALYSES , 1990 .

[7]  Brian W. Matthews,et al.  Ancestral lysozymes reconstructed, neutrality tested, and thermostability linked to hydrocarbon packing , 1990, Nature.

[8]  Wayne P. Maddison,et al.  Macclade: Analysis of Phylogeny and Character Evolution/Version 3 , 1992 .

[9]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[10]  Z. Yang,et al.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. , 1993, Molecular biology and evolution.

[11]  G. Naylor,et al.  COMPOSITIONAL BIAS, CHARACTER-STATE BIAS, AND CHARACTER-STATE RECONSTRUCTION USING PARSIMONY , 1994 .

[12]  A. Sparks,et al.  Molecular resurrection of an extinct ancestral promoter for mouse L1. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Z. Yang,et al.  On the use of nucleic acid sequences to infer early branchings in the tree of life. , 1995, Molecular biology and evolution.

[14]  C. Stewart Active ancestral molecules , 1995, Nature.

[15]  Steven A. Benner,et al.  Reconstructing the evolutionary history of the artiodactyl ribonuclease superfamily , 1995, Nature.

[16]  Z. Yang,et al.  A space-time process model for the evolution of DNA sequences. , 1995, Genetics.