The Adaptive Evolution Database (TAED)

BackgroundThe Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, of which we present here The Adaptive Evolution Database (TAED). In TAED, potential examples of positive adaptation are identified by high values for the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (KA/KS values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences.ResultsEvolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only. Branches with high KA/KS values were identified. These represent candidate episodes in the history of the protein family when the protein may have undergone positive selection, where the mutant form conferred more fitness than the ancestral form. Such episodes are frequently associated with change in function. An unexpectedly large number of families (between 10% and 20% of those families examined) were found to have at least one branch with high KA/KS values above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED.ConclusionsTAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example changes in gene regulation or splicing) placed in a phylogenetic perspective.

[1]  S. Benner,et al.  Functional inferences from reconstructed evolutionary biology involving rectified databases--an evolutionarily grounded approach to functional genomics. , 2000, Research in microbiology.

[2]  T Gojobori,et al.  Large-scale search for genes on which positive selection may operate. , 1996, Molecular biology and evolution.

[3]  M. Boguski,et al.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  M. Nei,et al.  Evolution by the birth-and-death process in multigene families of the vertebrate immune system. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Edwards Molecular Evolution, Protein Polymorphism and the Neutral Theory , 1984 .

[6]  K. Crandall,et al.  Parallel evolution of drug resistance in HIV: failure of nonsynonymous/synonymous substitution rate ratio to detect selection. , 1999, Molecular biology and evolution.

[7]  M. Tristem Molecular Evolution — A Phylogenetic Approach. , 2000, Heredity.

[8]  M. Gouy,et al.  HOVERGEN: a database of homologous vertebrate genes. , 1994, Nucleic acids research.

[9]  N. Rank,et al.  Functional and physiological consequences of genetic variation at phosphoglucose isomerase: heat shock protein expression is related to enzyme genotype in a montane beetle. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[11]  G. Lu,et al.  The crystal structure of a high oxygen affinity species of haemoglobin (bar-headed goose haemoglobin in the oxy form). , 1996, Journal of molecular biology.

[12]  N. Bianchi,et al.  Evolution of the Zfx and Zfy genes: rates and interdependence between the genes. , 1993, Molecular biology and evolution.

[13]  S. Benner,et al.  Pseudogenes in ribonuclease evolution: a source of new biomacromolecular function? , 1996, FEBS letters.

[14]  S. Benner,et al.  Post-genomic science: converting primary structure into physiological function. , 1998, Advances in enzyme regulation.

[15]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[16]  M. Hattori,et al.  Accelerated evolution in the protein-coding regions is universal in crotalinae snake venom gland phospholipase A2 isozyme genes. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[17]  W. Messier,et al.  Episodic adaptive evolution of primate lysozymes , 1997, Nature.

[18]  W. Bode,et al.  Tissue-type plasminogen activator: variants and crystal/solution structures demarcate structural determinants of function. , 1997, Current opinion in structural biology.

[19]  S. Benner,et al.  Evolutionary history of the uterine serpins. , 2000, The Journal of experimental zoology.

[20]  C. Luo,et al.  A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. , 1985, Molecular biology and evolution.

[21]  Se-Jin Lee,et al.  Myostatin and the control of skeletal muscle mass. , 1999, Current opinion in genetics & development.

[22]  S. Benner,et al.  Interpreting the behavior of enzymes: purpose or pedigree? , 1988, CRC critical reviews in biochemistry.

[23]  M. Wigger Receptor-assisted combinatorial synthesis (RACS) - a new approach for combinatorial chemistry , 1998 .