A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families

MOTIVATION We developed an algorithm to reconstruct ancestral sequences, taking into account the rate variation among sites of the protein sequences. Our algorithm maximizes the joint probability of the ancestral sequences, assuming that the rate is gamma distributed among sites. Our algorithm probably finds the global maximum. The use of 'joint' reconstruction is motivated by studies that use the sequences at all the internal nodes in a phylogenetic tree, such as, for instance, the inference of patterns of amino-acid replacement, or tracing the biochemical changes that occurred during the evolution of a given protein family. RESULTS We give an algorithm that guarantees finding the global maximum. The efficient search method makes our method applicable to datasets with large number sequences. We analyze ancestral sequences of five gene families, exploring the effect of the amount of among-site-rate-variation, and the degree of sequence divergence on the resulting ancestral states. AVAILABILITY AND SUPPLEMENTARY INFORMATION http://evolu3.ism.ac.jp/~tal/ CONTACT tal@ism.ac.jp

[1]  Z. Yang,et al.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. , 1993, Molecular biology and evolution.

[2]  Z. Yang,et al.  Among-site rate variation and its impact on phylogenetic analyses. , 1996, Trends in ecology & evolution.

[3]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[4]  R. Khesin,et al.  Molecular Genetics , 1968, Springer Berlin Heidelberg.

[5]  J. Thompson,et al.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. , 1997, Nucleic acids research.

[6]  J. Adachi,et al.  MOLPHY, programs for molecular phylogenetics , 1992 .

[7]  Steven A. Benner,et al.  Reconstructing the evolutionary history of the artiodactyl ribonuclease superfamily , 1995, Nature.

[8]  S. Yokoyama,et al.  The molecular genetics and evolution of red and green color vision in vertebrates. , 2001, Genetics.

[9]  P. Lio’,et al.  Molecular phylogenetics: state-of-the-art methods for looking into the past. , 2001, Trends in genetics : TIG.

[10]  J. Adachi,et al.  MOLPHY version 2.3 : programs for molecular phylogenetics based on maximum likelihood , 1996 .

[11]  Dolph Schluter,et al.  Uncertainty in ancient phylogenies , 1995, Nature.

[12]  S. Yokoyama,et al.  The molecular genetics of red and green color vision in mammals. , 1999, Genetics.

[13]  G. Kitagawa,et al.  Akaike Information Criterion Statistics , 1988 .

[14]  S. O’Brien,et al.  Molecular phylogenetics and the origins of placental mammals , 2001, Nature.

[15]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[16]  G. Kitagawa,et al.  Akaike Information Criterion Statistics , 1988 .

[17]  J. W. Thornton Evolution of vertebrate steroid receptors from an ancestral estrogen receptor by ligand exploitation and serial genome expansions , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Ziheng Yang Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods , 1994, Journal of Molecular Evolution.

[19]  M. Nei,et al.  Positive Darwinian selection after gene duplication in primate ribonuclease genes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Masatoshi Nei,et al.  Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites , 1994, Journal of Molecular Evolution.

[21]  M. Nei,et al.  A new method of inference of ancestral nucleotide and amino acid sequences. , 1995, Genetics.

[22]  Simon Easteal,et al.  Evolutionary Rate Acceleration of Cytochrome c Oxidase Subunit I in Simian Primates , 2000, Journal of Molecular Evolution.

[23]  R. Shamir,et al.  A fast algorithm for joint reconstruction of ancestral amino acid sequences. , 2000, Molecular biology and evolution.

[24]  Z. Yang,et al.  Mixed model analysis of DNA sequence evolution. , 1995, Biometrics.

[25]  M. Nei,et al.  Color vision of ancestral organisms of higher primates. , 1997, Molecular biology and evolution.

[26]  Thomas Uzzell,et al.  Fitting Discrete Probability Distributions to Evolutionary Events , 1971, Science.

[27]  Andrey Rzhetsky,et al.  Unbiased estimates of the number of nucleotide substitutions when substitution rate varies among different sites , 1994, Journal of Molecular Evolution.