Functionally compensating coevolving positions are neither homoplasic nor conserved in clades.

We demonstrated that a pair of positions in phosphoglycerate kinase that score highly by three nonparametric covariation measures are important for function even though the positions can be occupied by aliphatic, aromatic, or charged residues. Examination of these pairs suggested that the majority of the covariation scores could be explained by within-clade conservation. However, an analysis of diversity showed that the conservation within clades of covarying pairs was indistinguishable from pairs of positions that do not covary, thus ruling out both clade conservation and extensive homoplasy as means to identify covarying positions. Mutagenesis showed that the residues in the covarying pair were epistatic, with the type of epistasis being dependent on the initial pair. The results show that nonconserved covarying positions that affect protein function can be identified with high precision.

[1]  J. Brandts,et al.  Substitution of a proline for alanine 183 in the hinge region of phosphoglycerate kinase: Effects on catalysis, activation by sulfate, and thermal stability , 1990, Journal of protein chemistry.

[2]  Gregory B. Gloor,et al.  Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction , 2008, Bioinform..

[3]  G. Gloor,et al.  Mutual information in protein multiple sequence alignments reveals two classes of coevolving positions. , 2005, Biochemistry.

[4]  T. McPhillips,et al.  Structure of the R65Q mutant of yeast 3-phosphoglycerate kinase complexed with Mg-AMP-PNP and 3-phospho-D-glycerate. , 1996, Biochemistry.

[5]  Yanli Wang,et al.  MMDB: annotating protein sequences with Entrez's 3D-structure database , 2006, Nucleic Acids Res..

[6]  Beáta Flachner,et al.  Communication between the nucleotide site and the main molecular hinge of 3-phosphoglycerate kinase. , 2008, Biochemistry.

[7]  L Pritchard,et al.  Evaluation of a novel method for the identification of coevolving protein residues. , 2001, Protein engineering.

[8]  A. Dean,et al.  An empirical test of the concomitantly variable codon hypothesis , 2007, Proceedings of the National Academy of Sciences.

[9]  Alejandro A. Schäffer,et al.  A structure-based method for protein sequence alignment , 2005, Bioinform..

[10]  L. C. Martin,et al.  Using information theory to search for co-evolving residues in proteins , 2005, Bioinform..

[11]  Thomas W. H. Lui,et al.  Using multiple interdependency to separate functional from phylogenetic correlations in protein alignments , 2003, Bioinform..

[12]  S. Arold,et al.  Molecular basis for the lack of enantioselectivity of human 3-phosphoglycerate kinase , 2008, Nucleic acids research.

[13]  C. Yanofsky,et al.  Second-site revertants of Escherichia coli trp repressor mutants. , 1988, Genetics.

[14]  Nigel F. Delaney,et al.  Darwinian Evolution Can Follow Only Very Few Mutational Paths to Fitter Proteins , 2006, Science.

[15]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[16]  W. Fitch,et al.  An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution , 1970, Biochemical Genetics.

[17]  Geoffrey J. Barton,et al.  Jalview Version 2—a multiple sequence alignment editor and analysis workbench , 2009, Bioinform..

[18]  W. Atchley,et al.  Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis. , 2000, Molecular biology and evolution.

[19]  J. Yon,et al.  Unfolding-refolding of the domains in yeast phosphoglycerate kinase: comparison with the isolated engineered domains. , 1990, Biochemistry.

[20]  J F Brandts,et al.  A simple model for proteins with interacting domains. Applications to scanning calorimetry data. , 1989, Biochemistry.

[21]  R. Brooker,et al.  Suppressor analysis of mutations in the loop 2-3 motif of lactose permease: evidence that glycine-64 is an important residue for conformational changes , 1997, Journal of bacteriology.

[22]  R. Brooker,et al.  Evidence That Transmembrane Segment 2 of the Lactose Permease Is Part of a Conformationally Sensitive Interface between the Two Halves of the Protein (*) , 1996, The Journal of Biological Chemistry.

[23]  Rob Knight,et al.  Detecting coevolution without phylogenetic trees? Tree-ignorant metrics of coevolution perform as well as tree-aware metrics , 2008, BMC Evolutionary Biology.

[24]  S. Carroll,et al.  Frequent and widespread parallel evolution of protein sequences. , 2008, Molecular biology and evolution.

[25]  C. Yanofsky,et al.  Protein Structure Relationships Revealed by Mutational Analysis , 1964, Science.

[26]  K. P. Murphy,et al.  The molecular basis of cooperativity in protein folding. Thermodynamic dissection of interdomain interactions in phosphoglycerate kinase. , 1992, Biochemistry.

[27]  Alfonso Valencia,et al.  Protein co-evolution, co-adaptation and interactions , 2008, The EMBO journal.

[28]  Mario A. Fares,et al.  Why Should We Care About Molecular Coevolution? , 2008, Evolutionary bioinformatics online.

[29]  S. Eom,et al.  Crystal structure of Thermus caldophilus phosphoglycerate kinase in the open conformation. , 2006, Biochemical and biophysical research communications.

[30]  Dan S. Tawfik,et al.  Chaperonin overexpression promotes genetic variation and enzyme evolution , 2009, Nature.

[31]  J. Sturtevant,et al.  Thermodynamic study of yeast phosphoglycerate kinase. , 1987, Biochemistry.

[32]  Art Poon,et al.  The Rate of Compensatory Mutation in the DNA Bacteriophage φX174 , 2005, Genetics.

[33]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[34]  M. Vas,et al.  Correlation between conformational stability of the ternary enzyme–substrate complex and domain closure of 3‐phosphoglycerate kinase , 2005, The FEBS journal.

[35]  X. Mao,et al.  MET3 Promoter: A Tightly Regulated Promoter and Its Application in Construction of Conditional Lethal Strain , 2002, Current Microbiology.

[36]  Cristina Marino Buslje,et al.  Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information , 2009, Bioinform..

[37]  Narmada Thanki,et al.  CDD: specific functional annotation with the Conserved Domain Database , 2008, Nucleic Acids Res..

[38]  J. Marmur,et al.  Isolation and characterization of Saccharomyces cerevisiae glycolytic pathway mutants , 1977, Journal of bacteriology.

[39]  G J Davies,et al.  Structure of the ADP complex of the 3-phosphoglycerate kinase from Bacillus stearothermophilus at 1.65 A. , 1994, Acta crystallographica. Section D, Biological crystallography.

[40]  W. Atchley,et al.  Networks of coevolving sites in structural and functional domains of serpin proteins. , 2005, Molecular biology and evolution.

[41]  E. Ortlund,et al.  An epistatic ratchet constrains the direction of glucocorticoid receptor evolution , 2009, Nature.

[42]  R. Huber,et al.  Closed structure of phosphoglycerate kinase from Thermotoga maritima reveals the catalytic mechanism and determinants of thermal stability. , 1997, Structure.

[43]  An Analysis of Suppressor Mutations Suggests That the Two Halves of the Lactose Permease Function in a Symmetrical Manner* , 1997, The Journal of Biological Chemistry.

[44]  Shunsuke Kato,et al.  The screening of the second‐site suppressor mutations of the common p53 mutants , 2007, International journal of cancer.

[45]  E. Tillier,et al.  Regional covariation and its application for predicting protein contact patches , 2010, Proteins.

[46]  D. J. Kiviet,et al.  Empirical fitness landscapes reveal accessible evolutionary paths , 2007, Nature.

[47]  G A Petsko,et al.  Aromatic-aromatic interaction: a mechanism of protein structure stabilization. , 1985, Science.

[48]  Simon A. A. Travers,et al.  A Novel Method for Detecting Intramolecular Coevolution: Adding a Further Dimension to Selective Constraints Analyses , 2006, Genetics.

[49]  M. DePristo,et al.  Missense meanderings in sequence space: a biophysical view of protein evolution , 2005, Nature Reviews Genetics.

[50]  Daniel Y. Little,et al.  Identification of Coevolving Residues and Coevolution Potentials Emphasizing Structure, Bond Formation and Catalytic Coordination in Protein Evolution , 2009, PloS one.

[51]  M. Whitlock,et al.  Compensatory mutations are repeatable and clustered within proteins , 2009, Proceedings of the Royal Society B: Biological Sciences.

[52]  Mark Gerstein,et al.  An integrated system for studying residue coevolution in proteins , 2008, Bioinform..

[53]  R. Aldrich,et al.  Influence of conservation on calculations of amino acid covariance in multiple sequence alignments , 2004, Proteins.

[54]  B. Bernstein,et al.  A bisubstrate analog induces unexpected conformational changes in phosphoglycerate kinase from Trypanosoma brucei. , 1998, Journal of molecular biology.