Point mutations in protein globular domains: contributions from function, stability and misfolding.

Several contrasting hypotheses have been formulated about the influence of functional and conformational properties, like stability and avoidance of misfolding, on the evolution of protein globular domains. Selection at functional sites has been suggested to be detrimental to stability or coupled to it. Avoidance of misfolding may be achieved by discarding misfolding-prone sequences or by maintaining a stable native state and thus destabilizing partially or fully unfolded states from which misfolding can take place. We have performed a hierarchical analysis of a large database of point mutations to dissect the relative contributions of function, stability and misfolding in the evolution of natural sequences. We show that at catalytic sites, selection for function overrules selection for stability but find no evidence for an anticorrelation between function and stability. Selection for stability plays a secondary role at binding sites, but is not fully coupled to selection for function. Remarkably, we did not find a selective pressure against misfolding-prone sequences in globular proteins at the level of individual positions. We suggest that such a selection would compromise native-state stability due to a correlation between the stabilities of native and misfolded states. Stabilization of the native state is the most frequent way in which natural proteins avoid misfolding.

[1]  Juan Fernández-Recio,et al.  Efficient restraints for protein-protein docking by comparison of observed amino acid substitution patterns with those predicted from local environment. , 2006, Journal of molecular biology.

[2]  Andreas Hoenger,et al.  De novo designed peptide-based amyloid fibrils , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Frederic Rousseau,et al.  How evolutionary pressure against protein aggregation shaped chaperone specificity. , 2006, Journal of molecular biology.

[4]  B K Shoichet,et al.  A relationship between protein stability and protein function. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Richard A Friesner,et al.  Computational prediction of native protein ligand-binding and enzyme active site sequences. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  S. Steinbacher,et al.  Sequence statistics reliably predict stabilizing mutations in a protein domain. , 1994, Journal of molecular biology.

[7]  D. Baker,et al.  Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design , 2005, Nucleic acids research.

[8]  R. Sauer,et al.  The structural stability of a protein is an important determinant of its proteolytic susceptibility in Escherichia coli. , 1989, The Journal of biological chemistry.

[9]  A. Fersht,et al.  Contribution of residues in the reactive site loop of chymotrypsin inhibitor 2 to protein stability and activity. , 1994, Biochemistry.

[10]  Eugene I. Shakhnovich,et al.  Natural selection of more designable folds: A mechanism for thermophilic adaptation , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[11]  B. Rost,et al.  Conservation and prediction of solvent accessibility in protein families , 1994, Proteins.

[12]  Amedeo Caflisch,et al.  Organism complexity anti-correlates with proteomic beta-aggregation propensity. , 2005, Protein science : a publication of the Protein Society.

[13]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[14]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[15]  T. Blundell,et al.  Distinguishing structural and functional restraints in evolution in order to identify interaction sites. , 2004, Journal of molecular biology.

[16]  Kevin W Plaxco,et al.  On the precision of experimentally determined protein folding rates and ϕ‐values , 2006, Protein science : a publication of the Protein Society.

[17]  Stefan M. Larson,et al.  Analysis of covariation in an SH3 domain sequence alignment: applications in tertiary contact prediction and the design of compensating hydrophobic core substitutions. , 2000, Journal of molecular biology.

[18]  Miriam Barlow,et al.  Predicting evolutionary potential: in vitro evolution accurately reproduces natural evolution of the tem beta-lactamase. , 2002, Genetics.

[19]  K. Nishikawa,et al.  Prediction of catalytic residues in enzymes based on known tertiary structure, stability profile, and sequence conservation. , 2003, Journal of molecular biology.

[20]  G Schreiber,et al.  Stability and function: two constraints in the evolution of barstar and other proteins. , 1994, Structure.

[21]  J. Richardson,et al.  Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Oobatake,et al.  Investigating the role of conserved residue Asp134 in Escherichia coli ribonuclease HI by site-directed random mutagenesis. , 1994, European journal of biochemistry.

[23]  T. P. Flores,et al.  Comparison of conformational characteristics in structurally similar protein pairs , 1993, Protein science : a publication of the Protein Society.

[24]  D. Chasman,et al.  Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. , 2001, Journal of molecular biology.

[25]  Alexander V. Diemand,et al.  iMolTalk: an interactive, internet-based protein structure analysis server , 2004, Nucleic Acids Res..

[26]  Anthony D. Keefe,et al.  Functional proteins from a random-sequence library , 2001, Nature.

[27]  Volker Sieber,et al.  Surface‐exposed phenylalanines in the RNP1/RNP2 motif stabilize the cold‐shock protein CspB from Bacillus subtilis , 1998, Proteins.

[28]  M. Yu,et al.  Probing the native strain iin alpha1-antitrypsin. , 1996, Nature structural biology.

[29]  J M Thornton,et al.  LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions. , 1995, Protein engineering.

[30]  R. Sauer,et al.  Tolerance of Arc repressor to multiple-alanine substitutions. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[31]  R. Raines,et al.  Contribution of a tyrosine side chain to ribonuclease A catalysis and stability–Contribution of Tyr 97 to RNase A catalysis and stability , 1996, Protein science : a publication of the Protein Society.

[32]  P E Wright,et al.  Changes in the apomyoglobin folding pathway caused by mutation of the distal histidine residue. , 2000, Biochemistry.

[33]  Piero Fariselli,et al.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure , 2005, Nucleic Acids Res..

[34]  R. Raines,et al.  His...Asp catalytic dyad of ribonuclease A: conformational stability of the wild-type, D121N, D121A, and H119A enzymes. , 1998, Biochemistry.

[35]  S. Ghaemmaghami,et al.  A quantitative, high-throughput screen for protein stability. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[36]  M. Lehmann,et al.  From DNA sequence to improved functionality: using protein sequence comparisons to rapidly design a thermostable consensus phytase. , 2000, Protein engineering.

[37]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[38]  Amedeo Caflisch,et al.  Organism complexity anti‐correlates with proteomic β‐aggregation propensity , 2005, Protein science : a publication of the Protein Society.

[39]  L. Gierasch,et al.  Mutating the charged residues in the binding pocket of cellular retinoic acid‐binding protein simultaneously reduces its binding affinity to retinoic acid and increases its thermostability , 1992, Proteins.

[40]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[41]  L. Gregoret,et al.  Coupling protein stability and protein function in Escherichia coli CspA. , 1998, Folding & design.

[42]  L. Serrano,et al.  Protein aggregation and amyloidosis: confusion of the kinds? , 2006, Current opinion in structural biology.

[43]  P. Srere,et al.  Conformational stability of pig citrate synthase and some active-site mutants. , 1991, Biochemistry.

[44]  Akinori Sarai,et al.  ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions , 2005, Nucleic Acids Res..

[45]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Andrés Moya,et al.  Genomic determinants of protein folding thermodynamics in prokaryotic organisms. , 2004, Journal of molecular biology.

[47]  S. Henikoff,et al.  Position-based sequence weights. , 1994, Journal of molecular biology.

[48]  J. Clarke,et al.  Sequence conservation in Ig-like domains: the role of highly conserved proline residues in the fibronectin type III superfamily. , 2002, Journal of molecular biology.

[49]  K. Kristiansen,et al.  Conserved residues and their role in the structure, function, and stability of acyl-coenzyme A binding protein. , 1999, Biochemistry.

[50]  T. Kiefhaber,et al.  Origin of unusual phi-values in protein folding: evidence against specific nucleation sites. , 2003, Journal of molecular biology.

[51]  L Serrano,et al.  Effect of active site residues in barnase on activity and stability. , 1992, Journal of molecular biology.

[52]  Janet M. Thornton,et al.  The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data , 2004, Nucleic Acids Res..

[53]  S A Benner,et al.  Amino acid substitution during functionally constrained divergent evolution of protein sequences. , 1994, Protein engineering.

[54]  Dmitry M Korzhnev,et al.  Dramatic acceleration of protein folding by stabilization of a nonnative backbone conformation. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Gail J. Bartlett,et al.  Analysis of catalytic residues in enzyme active sites. , 2002, Journal of molecular biology.

[56]  C. Dobson,et al.  Glycine residues appear to be evolutionarily conserved for their ability to inhibit aggregation. , 2005, Structure.

[57]  K. Brew,et al.  Stability, activity and flexibility in α-lactalbumin , 1999 .

[58]  P. Lansbury,et al.  Amyloid fibrillogenesis: themes and variations. , 2000, Current opinion in structural biology.

[59]  Boris Steipe,et al.  Consensus-based engineering of protein stability: from intrabodies to thermostable enzymes. , 2004, Methods in enzymology.

[60]  A. Fersht,et al.  Design of highly stable functional GroEL minichaperones , 1999, Protein science : a publication of the Protein Society.

[61]  Stefan M. Larson,et al.  The relationship between conservation, thermodynamic stability, and function in the SH3 domain hydrophobic core. , 2003, Journal of molecular biology.

[62]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[63]  B. Matthews,et al.  A test of the "jigsaw puzzle" model for protein folding by multiple methionine substitutions within the core of T4 lysozyme. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Shaun M Lippow,et al.  Improved mutants from directed evolution are biased to orthologous substitutions. , 2006, Protein engineering, design & selection : PEDS.

[65]  L. Serrano,et al.  Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins , 2004, Nature Biotechnology.

[66]  M. Hecht,et al.  Nature disfavors sequences of alternating polar and non-polar amino acids: implications for amyloidogenesis. , 2000, Journal of molecular biology.

[67]  J. M. Sanchez-Ruiz,et al.  A stability pattern of protein hydrophobic mutations that reflects evolutionary structural optimization. , 2005, Biophysical journal.

[68]  L. Serrano,et al.  A comparative study of the relationship between protein structure and beta-aggregation in globular and intrinsically disordered proteins. , 2004, Journal of molecular biology.

[69]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[70]  J. King,et al.  Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues , 2001, Protein science : a publication of the Protein Society.

[71]  J. Thornton,et al.  NUCPLOT: a program to generate schematic diagrams of protein-nucleic acid interactions. , 1997, Nucleic acids research.

[72]  S. Hubbard,et al.  Conservation of orientation and sequence in protein domain--domain interactions. , 2005, Journal of molecular biology.

[73]  R. Jaenicke,et al.  Proteins under extreme physical conditions , 1990, FEBS letters.

[74]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.