Pervasive Cryptic Epistasis in Molecular Evolution

The functional effects of most amino acid replacements accumulated during molecular evolution are unknown, because most are not observed naturally and the possible combinations are too numerous. We created 168 single mutations in wild-type Escherichia coli isopropymalate dehydrogenase (IMDH) that match the differences found in wild-type Pseudomonas aeruginosa IMDH. 104 mutant enzymes performed similarly to E. coli wild-type IMDH, one was functionally enhanced, and 63 were functionally compromised. The transition from E. coli IMDH, or an ancestral form, to the functional wild-type P. aeruginosa IMDH requires extensive epistasis to ameliorate the combined effects of the deleterious mutations. This result stands in marked contrast with a basic assumption of molecular phylogenetics, that sites in sequences evolve independently of each other. Residues that affect function are scattered haphazardly throughout the IMDH structure. We screened for compensatory mutations at three sites, all of which lie near the active site and all of which are among the least active mutants. No compensatory mutations were found at two sites indicating that a single site may engage in compound epistatic interactions. One complete and three partial compensatory mutations of the third site are remote and lie in a different domain. This demonstrates that epistatic interactions can occur between distant (>20Å) sites. Phylogenetic analysis shows that incompatible mutations were fixed in different lineages.

[1]  Fyodor A. Kondrashov,et al.  Sequence space and the ongoing expansion of the protein universe , 2010, Nature.

[2]  Gaurav Tyagi,et al.  Functionally compensating coevolving positions are neither homoplasic nor conserved in clades. , 2010, Molecular biology and evolution.

[3]  Bryan D. Kolaczkowski,et al.  Robustness of Ancestral Sequence Reconstruction to Phylogenetic Uncertainty , 2010, Molecular biology and evolution.

[4]  D. Presgraves,et al.  The molecular evolutionary basis of species formation , 2010, Nature Reviews Genetics.

[5]  M. Matz,et al.  Retracing evolution of red fluorescence in GFP-like proteins from Faviina corals. , 2010, Molecular biology and evolution.

[6]  Bryan Kolaczkowski,et al.  Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics , 2009, PloS one.

[7]  Edward Susko,et al.  PROCOV: maximum likelihood estimation of protein phylogeny under covarion models and site-specific covarion pattern analysis , 2009, BMC Evolutionary Biology.

[8]  E. Ortlund,et al.  An epistatic ratchet constrains the direction of glucocorticoid receptor evolution , 2009, Nature.

[9]  Hervé Philippe,et al.  Computational methods for evaluating phylogenetic models of coding sequence evolution with dependence between codons. , 2009, Molecular biology and evolution.

[10]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[11]  S. Whelan The genetic code can cause systematic bias in simple phylogenetic models , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[12]  Huan Zhang,et al.  Elucidation of phenotypic adaptations: Molecular analyses of dim-light vision proteins in vertebrates , 2008, Proceedings of the National Academy of Sciences.

[13]  M. Steel,et al.  Difficulties in testing for covarion-like properties of sequences under the confounding influence of changing proportions of variable sites. , 2008, Molecular biology and evolution.

[14]  Dan S. Tawfik,et al.  Intense neutral drifts yield robust and evolvable consensus proteins. , 2008, Journal of molecular biology.

[15]  Trevor Bedford,et al.  Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals , 2008, Genetics.

[16]  Bryan D. Kolaczkowski,et al.  A mixed branch length model of heterotachy improves phylogenetic accuracy. , 2008, Molecular biology and evolution.

[17]  François Stricher,et al.  How Protein Stability and New Functions Trade Off , 2008, PLoS Comput. Biol..

[18]  A. Oskooi Molecular Evolution and Phylogenetics , 2008 .

[19]  A. Dean,et al.  Mechanistic approaches to the study of evolution: the functional synthesis , 2007, Nature Reviews Genetics.

[20]  R. Korona,et al.  Epistatic buffering of fitness loss in yeast double deletion strains , 2007, Nature Genetics.

[21]  Edward Susko,et al.  Testing for covarion-like evolution in protein sequences. , 2007, Molecular biology and evolution.

[22]  S. Carroll,et al.  Bushes in the Tree of Life , 2006, PLoS biology.

[23]  Stephen P. Miller,et al.  Direct Demonstration of an Adaptive Constraint , 2006, Science.

[24]  Kai Wang,et al.  Incorporating background frequency improves entropy-based residue conservation measures , 2006, BMC Bioinform..

[25]  A. Eyre-Walker,et al.  The rate of adaptive evolution in enteric bacteria. , 2006, Molecular biology and evolution.

[26]  Paul D. Williams,et al.  Assessing the Accuracy of Ancestral Protein Reconstruction Methods , 2006, PLoS Comput. Biol..

[27]  Simon A. A. Travers,et al.  A Novel Method for Detecting Intramolecular Coevolution: Adding a Further Dimension to Selective Constraints Analyses , 2006, Genetics.

[28]  Nigel F. Delaney,et al.  Darwinian Evolution Can Follow Only Very Few Mutational Paths to Fitter Proteins , 2006, Science.

[29]  Pasch,et al.  References and Notes Supporting Online Material Evolution of Hormone-receptor Complexity by Molecular Exploitation , 2022 .

[30]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[31]  S. Carroll,et al.  Animal Evolution and the Molecular Signature of Radiations Compressed in Time , 2005, Science.

[32]  Stephen P. Miller,et al.  The Biochemical Architecture of an Ancient Adaptive Landscape , 2005, Science.

[33]  W. P. Russ,et al.  Natural-like function in artificial WW domains , 2005, Nature.

[34]  W. P. Russ,et al.  Evolutionary information for specifying a protein fold , 2005, Nature.

[35]  M. DePristo,et al.  Missense meanderings in sequence space: a biophysical view of protein evolution , 2005, Nature Reviews Genetics.

[36]  A. Jean-Marie,et al.  A model-based approach for detecting coevolving positions in a molecule. , 2005, Molecular biology and evolution.

[37]  G. Gloor,et al.  Mutual information in protein multiple sequence alignments reveals two classes of coevolving positions. , 2005, Biochemistry.

[38]  J. G. Burleigh,et al.  Covarion structure in plastid genome evolution: a new statistical test. , 2005, Molecular biology and evolution.

[39]  Charles H. Langley,et al.  Are evolutionary rates really variable? , 1979, Journal of Molecular Evolution.

[40]  T. Ohta,et al.  On the constancy of the evolutionary rate of cistrons , 2005, Journal of Molecular Evolution.

[41]  Charles H. Langley,et al.  An examination of the constancy of the rate of molecular evolution , 2005, Journal of Molecular Evolution.

[42]  D. Hartl,et al.  Compensated Deleterious Mutations in Insect Genomes , 2004, Science.

[43]  Bryan Kolaczkowski,et al.  Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous , 2004, Nature.

[44]  J. Shiloach,et al.  Functional Correlation in Amino Acid Residue Mutations of Yeast Iso-2-Cytochrome c that Is Consistent with the Prediction of the Concomitantly Variable Codon Theory in Cytochrome c Evolution , 2000, Biochemical Genetics.

[45]  W. Fitch,et al.  An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution , 1970, Biochemical Genetics.

[46]  Ohta Tomoko Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theory , 2004, Journal of Molecular Evolution.

[47]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[48]  Claes Gustafsson,et al.  Systematic variation of amino acid substitutions for stringent assessment of pairwise covariation. , 2003, Journal of molecular biology.

[49]  Gürol M. Süel,et al.  Evolutionarily conserved networks of residues mediate allosteric communication in proteins , 2003, Nature Structural Biology.

[50]  Carlos D. Bustamante,et al.  Bayesian Analysis Suggests that Most Amino Acid Replacements in Drosophila Are Driven by Positive Selection , 2003, Journal of Molecular Evolution.

[51]  Claudia Neuhauser,et al.  The Pattern of Amino Acid Replacements in α/β-Barrels , 2002 .

[52]  S. Sunyaev,et al.  Dobzhansky–Muller incompatibilities in protein evolution , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[53]  W. S. Valdar,et al.  Scoring residue conservation , 2002, Proteins.

[54]  Tal Pupko,et al.  A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families , 2002, Bioinform..

[55]  J. Huelsenbeck Testing a covariotide model of DNA substitution. , 2002, Molecular biology and evolution.

[56]  Adam Eyre-Walker,et al.  Adaptive protein evolution in Drosophila , 2002, Nature.

[57]  C. Neuhauser,et al.  The Pattern of Amino Acid Replacements in a / b-Barrels , 2002 .

[58]  L Pritchard,et al.  Evaluation of a novel method for the identification of coevolving protein residues. , 2001, Protein engineering.

[59]  N. Galtier,et al.  Maximum-likelihood phylogenetic analysis under a covarion-like model. , 2001, Molecular biology and evolution.

[60]  D. Cutler,et al.  Understanding the overdispersed molecular clock. , 2000, Genetics.

[61]  A. Dean,et al.  Enzyme evolution explained (sort of). , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[62]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[63]  A. Fersht,et al.  Mutually compensatory mutations during evolution of the tetramerization domain of tumor suppressor p53 lead to impaired hetero-oligomerization. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[64]  W R Taylor,et al.  Coevolving protein residues: maximum likelihood identification and relationship to structure. , 1999, Journal of molecular biology.

[65]  A. R. Fresht Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding , 1999 .

[66]  B. Charlesworth,et al.  Why sex and recombination? , 1998, Science.

[67]  M. Steel,et al.  A covariotide model explains apparent phylogenetic structure of oxygenic photosynthetic lineages. , 1998, Molecular biology and evolution.

[68]  K Namba,et al.  Structure of 3-isopropylmalate dehydrogenase in complex with 3-isopropylmalate at 2.0 A resolution: the role of Glu88 in the unique substrate-recognition mechanism. , 1998, Structure.

[69]  M. Steel,et al.  Modeling the covarion hypothesis of nucleotide substitution. , 1998, Mathematical biosciences.

[70]  D. Tsuchiya,et al.  Crystal structure of 3-isopropylmalate dehydrogenase from the moderate facultative thermophile, Bacillus coagulans: two strategies for thermostabilization of protein structures. , 1997, Journal of biochemistry.

[71]  H. Tachida,et al.  Bottleneck effect on evolutionary rate in the nearly neutral mutation model. , 1997, Genetics.

[72]  G. Petsko,et al.  Crystal structures of Escherichia coli and Salmonella typhimurium 3-isopropylmalate dehydrogenase and comparison with their thermophilic counterpart from Thermus thermophilus. , 1997, Journal of molecular biology.

[73]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[74]  H. A. Orr,et al.  The population genetics of speciation: the evolution of hybrid incompatibilities. , 1995, Genetics.

[75]  E. Tillier,et al.  Neighbor Joining and Maximum Likelihood with RNA Sequences: Addressing the Interdependence of Sites , 1995 .

[76]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[77]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[78]  T. P. Flores,et al.  Multiple protein structure alignment , 1994, Protein science : a publication of the Protein Society.

[79]  A. von Haeseler,et al.  A stochastic model for the evolution of autocorrelated DNA sequences. , 1994, Molecular phylogenetics and evolution.

[80]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[81]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[82]  C. Sander,et al.  Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? , 1994, Protein engineering.

[83]  Y. Iwasa Overdispersed molecular evolution in constant environments. , 1993, Journal of theoretical biology.

[84]  A. Lapedes,et al.  Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[85]  B. Matthews,et al.  Structural and thermodynamic analysis of compensating mutations within the core of chicken egg white lysozyme. , 1993, The Journal of biological chemistry.

[86]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[87]  T. Ohta THE NEARLY NEUTRAL THEORY OF MOLECULAR EVOLUTION , 1992 .

[88]  Jeffrey H. Miller,et al.  A short course in bacterial genetics , 1992 .

[89]  Y. Katsube,et al.  Three-dimensional structure of a highly thermostable enzyme, 3-isopropylmalate dehydrogenase of Thermus thermophilus at 2.2 A resolution. , 1991, Journal of molecular biology.

[90]  N. Takahata Statistical models of the overdispersed molecular clock. , 1991, Theoretical population biology.

[91]  H. Tachida A study on a nearly neutral mutation model in finite populations. , 1991, Genetics.

[92]  J. Gillespie The causes of molecular evolution , 1991 .

[93]  D. Hanahan,et al.  Plasmid transformation of Escherichia coli and other bacteria. , 1991, Methods in enzymology.

[94]  T. Ohta,et al.  Theoretical study of near neutrality. I. Heterozygosity and rate of mutant substitution. , 1990, Genetics.

[95]  Brian W. Matthews,et al.  Ancestral lysozymes reconstructed, neutrality tested, and thermostability linked to hydrocarbon packing , 1990, Nature.

[96]  A. Kondrashov Deleterious mutations and the evolution of sexual reproduction , 1988, Nature.

[97]  N. Takahata,et al.  On the overdispersed molecular clock. , 1987, Genetics.

[98]  D. Hartl,et al.  Limits of adaptation: the evolution of selective neutrality. , 1985, Genetics.

[99]  W. Ewens The neutral theory of molecular evolution , 1985 .

[100]  J H Gillespie,et al.  The molecular clock may be an episodic clock. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[101]  J. Gillespie MOLECULAR EVOLUTION OVER THE MUTATIONAL LANDSCAPE , 1984, Evolution; international journal of organic evolution.

[102]  R. Sokal,et al.  Biometry: The Principles and Practice of Statistics in Biological Research (2nd ed.). , 1982 .

[103]  H. Kacser,et al.  The molecular basis of dominance. , 1981, Genetics.

[104]  M. M. Bradford A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. , 1976, Analytical biochemistry.

[105]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[106]  C. M. Lyman,et al.  The availability of amino acids in some foods. , 1948, Federation proceedings.