Codon-substitution models for heterogeneous selection pressure at amino acid sites.

Comparison of relative fixation rates of synonymous (silent) and nonsynonymous (amino acid-altering) mutations provides a means for understanding the mechanisms of molecular sequence evolution. The nonsynonymous/synonymous rate ratio (omega = d(N)d(S)) is an important indicator of selective pressure at the protein level, with omega = 1 meaning neutral mutations, omega < 1 purifying selection, and omega > 1 diversifying positive selection. Amino acid sites in a protein are expected to be under different selective pressures and have different underlying omega ratios. We develop models that account for heterogeneous omega ratios among amino acid sites and apply them to phylogenetic analyses of protein-coding DNA sequences. These models are useful for testing for adaptive molecular evolution and identifying amino acid sites under diversifying selection. Ten data sets of genes from nuclear, mitochondrial, and viral genomes are analyzed to estimate the distributions of omega among sites. In all data sets analyzed, the selective pressure indicated by the omega ratio is found to be highly heterogeneous among sites. Previously unsuspected Darwinian selection is detected in several genes in which the average omega ratio across sites is <1, but in which some sites are clearly under diversifying selection with omega > 1. Genes undergoing positive selection include the beta-globin gene from vertebrates, mitochondrial protein-coding genes from hominoids, the hemagglutinin (HA) gene from human influenza virus A, and HIV-1 env, vif, and pol genes. Tests for the presence of positively selected sites and their subsequent identification appear quite robust to the specific distributional form assumed for omega and can be achieved using any of several models we implement. However, we encountered difficulties in estimating the precise distribution of omega among sites from real data sets.

[1]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[2]  E. Holmes,et al.  Genealogical evidence for positive selection in the nef gene of HIV-1. , 1999, Genetics.

[3]  H. Akashi,et al.  Within- and between-species DNA sequence variation and the 'footprint' of natural selection. , 1999, Gene.

[4]  Wen-Hsiung Li,et al.  Coalescing into the 21st century: An overview and prospects of coalescent theory. , 1999, Theoretical population biology.

[5]  K. Crandall,et al.  Parallel evolution of drug resistance in HIV: failure of nonsynonymous/synonymous substitution rate ratio to detect selection. , 1999, Molecular biology and evolution.

[6]  P. Lio’,et al.  Models of molecular evolution and phylogeny. , 1998, Genome research.

[7]  S. Pääbo,et al.  Conflict Among Individual Mitochondrial Proteins in Resolving the Phylogeny of Eutherian Orders , 1998, Journal of Molecular Evolution.

[8]  C. Wiuf,et al.  A codon-based model designed to describe lentiviral evolution. , 1998, Molecular biology and evolution.

[9]  M. Malim,et al.  HIV-1 regulatory/accessory genes: keys to unraveling viral and host cell biology. , 1998, Science.

[10]  K. Simonsen,et al.  Statistical tests of neutrality in the age of weak selection. , 1998, Trends in ecology & evolution.

[11]  R. Nielsen,et al.  Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. , 1998, Genetics.

[12]  G. Kuno,et al.  Phylogeny of the Genus Flavivirus , 1998, Journal of Virology.

[13]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[14]  W. Fitch,et al.  Long term trends in the evolution of H(3) HA1 human influenza type A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[15]  S Kumar,et al.  Tempo and mode of nucleotide substitutions in gag and env gene fragments in human immunodeficiency virus type 1 populations with a known transmission history , 1997 .

[16]  Rasmus Nielsen,et al.  The ratio of replacement to silent divergence and tests of neutrality , 1997 .

[17]  T. Gojobori,et al.  Evolutionary mechanisms and population dynamics of the third variable envelope region of HIV within single hosts. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[18]  W. Messier,et al.  Episodic adaptive evolution of primate lysozymes , 1997, Nature.

[19]  P. Sharp,et al.  In search of molecular darwinism , 1997, Nature.

[20]  T Gojobori,et al.  Large-scale search for genes on which positive selection may operate. , 1996, Molecular biology and evolution.

[21]  S. Kumar,et al.  Patterns of nucleotide substitution in mitochondrial protein coding genes of vertebrates. , 1996, Genetics.

[22]  D. Mindell,et al.  Positive selection and rates of evolution in immunodeficiency viruses from humans and chimpanzees. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[23]  E. Holmes,et al.  Population dynamics of flaviviruses revealed by molecular phylogenies. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[24]  A. Hughes,et al.  Natural selection on the gag, pol, and env genes of human immunodeficiency virus 1 (HIV-1). , 1995, Molecular biology and evolution.

[25]  H. Akashi,et al.  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. , 1995, Genetics.

[26]  E. Holmes,et al.  The molecular epidemiology of human immunodeficiency virus type 1 in Edinburgh. , 1995, The Journal of infectious diseases.

[27]  S. Muse,et al.  A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. , 1994, Molecular biology and evolution.

[28]  N. Goldman,et al.  A codon-based model of nucleotide substitution for protein-coding DNA sequences. , 1994, Molecular biology and evolution.

[29]  T. Ohta THE NEARLY NEUTRAL THEORY OF MOLECULAR EVOLUTION , 1992 .

[30]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[31]  K. Liang,et al.  Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions , 1987 .

[32]  R. Hudson,et al.  A test of neutral molecular evolution based on nucleotide data. , 1987, Genetics.

[33]  M. Nei,et al.  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. , 1986, Molecular biology and evolution.

[34]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[35]  G. P. Bhattacharjee,et al.  The Incomplete Gamma Integral , 1970 .