Molecular Determinants of Mutant Phenotypes, Inferred from Saturation Mutagenesis Data

Understanding how mutations affect protein activity and organismal fitness is a major challenge. We used saturation mutagenesis combined with deep sequencing to determine mutational sensitivity scores for 1,664 single-site mutants of the 101 residue Escherichia coli cytotoxin, CcdB at seven different expression levels. Active-site residues could be distinguished from buried ones, based on their differential tolerance to aliphatic and charged amino acid substitutions. At nonactive-site positions, the average mutational tolerance correlated better with depth from the protein surface than with accessibility. Remarkably, similar results were observed for two other small proteins, PDZ domain (PSD95pdz3) and IgG-binding domain of protein G (GB1). Mutational sensitivity data obtained with CcdB were used to derive a procedure for predicting functional effects of mutations. Results compared favorably with those of two widely used computational predictors. In vitro characterization of 80 single, nonactive-site mutants of CcdB showed that activity in vivo correlates moderately with thermal stability and solubility. The inability to refold reversibly, as well as a decreased folding rate in vitro, is associated with decreased activity in vivo. Upon probing the effect of modulating expression of various proteases and chaperones on mutant phenotypes, most deleterious mutants showed an increased in vivo activity and solubility only upon over-expression of either Trigger factor or SecB ATP-independent chaperones. Collectively, these data suggest that folding kinetics rather than protein stability is the primary determinant of activity in vivo. This study enhances our understanding of how mutations affect phenotype, as well as the ability to predict fitness effects of point mutations.

[1]  A. Surolia,et al.  Thermodynamic characterization of monomeric and dimeric forms of CcdB (controller of cell division or death B protein). , 2004, The Biochemical journal.

[2]  Daniel W. A. Buchan,et al.  A large-scale evaluation of computational protein function prediction , 2013, Nature Methods.

[3]  R. Sun,et al.  Functional Constraint Profiling of a Viral Protein Reveals Discordance of Evolutionary Conservation and Functionality , 2015, PLoS genetics.

[4]  T. Palzkill,et al.  How Structural and Physicochemical Determinants Shape Sequence Constraints in a Functional Enzyme , 2015, PloS one.

[5]  D. A. Dougherty,et al.  Cation-π interactions in structural biology , 1999 .

[6]  R. Varadarajan,et al.  Residue specific contributions to stability and activity inferred from saturation mutagenesis and deep sequencing. , 2014, Current opinion in structural biology.

[7]  Amy E Keating,et al.  Peptide ligands for pro-survival protein Bfl-1 from computationally guided library screening. , 2013, ACS chemical biology.

[8]  Raghavan Varadarajan,et al.  A rapid, efficient, and economical inverse polymerase chain reaction-based method for generating a site saturation mutant library. , 2014, Analytical biochemistry.

[9]  S. Gottesman,et al.  Protein quality control: triage by chaperones and proteases. , 1997, Genes & development.

[10]  J. Moult,et al.  Loss of protein structure stability as a major causative factor in monogenic disease. , 2005, Journal of molecular biology.

[11]  F. Richards,et al.  Mobile unnatural amino acid side chains in the core of staphylococcal nuclease , 1996, Protein science : a publication of the Protein Society.

[12]  H. Yanagi,et al.  Overexpression of Trigger Factor Prevents Aggregation of Recombinant Proteins in Escherichia coli , 2000, Applied and Environmental Microbiology.

[13]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[14]  Jesse D Bloom,et al.  The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin , 2014, bioRxiv.

[15]  T. Andrews,et al.  Comparison of predicted and actual consequences of missense mutations , 2015, Proceedings of the National Academy of Sciences.

[16]  Brian K Shoichet,et al.  Evolution of an antibiotic resistance enzyme constrained by stability and activity trade-offs. , 2002, Journal of molecular biology.

[17]  D. Baker,et al.  Small molecule probes to quantify the functional fraction of a specific protein in a cell with minimal folding equilibrium shifts , 2014, Proceedings of the National Academy of Sciences.

[18]  F. J. Poelwijk,et al.  The spatial architecture of protein function and adaptation , 2012, Nature.

[19]  R. Liddington,et al.  Crystal structure of CcdB, a topoisomerase poison from E. coli. , 1999, Journal of molecular biology.

[20]  Raghavan Varadarajan,et al.  A Procedure for Detection and Quantitation of Cavity Volumes in Proteins , 2002, The Journal of Biological Chemistry.

[21]  R. Varadarajan,et al.  Mutagenesis-based definitions and probes of residue burial in proteins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Raghavan Varadarajan,et al.  Residue proximity information and protein model discrimination using saturation-suppressor mutagenesis , 2015, eLife.

[23]  John P. Overington,et al.  Environment‐specific amino acid substitution tables: Tertiary templates and prediction of protein folds , 1992, Protein science : a publication of the Protein Society.

[24]  R. Miller,et al.  One-step preparation of competent Escherichia coli: transformation and storage of bacterial cells in the same solution. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[25]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[26]  Kelly M. Thayer,et al.  Analyses of the effects of all ubiquitin point mutants on yeast growth rate. , 2013, Journal of molecular biology.

[27]  Jeffrey E. Lee,et al.  Measuring Protein‐Protein and Protein‐Nucleic Acid Interactions by Biolayer Interferometry , 2015, Current protocols in protein science.

[28]  Amy E Keating,et al.  Predictive Bcl-2 family binding models rooted in experiment or structure. , 2012, Journal of molecular biology.

[29]  D. Bolon,et al.  Experimental illumination of a fitness landscape , 2011, Proceedings of the National Academy of Sciences.

[30]  R. Gibbs,et al.  Deep sequencing of systematic combinatorial libraries reveals β-lactamase sequence constraints at high resolution. , 2012, Journal of molecular biology.

[31]  S. Fields,et al.  A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function , 2012, Proceedings of the National Academy of Sciences.

[32]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[33]  D. A. Dougherty,et al.  Cation-pi interactions in structural biology. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Douglas E. V. Pires,et al.  DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach , 2014, Nucleic Acids Res..

[35]  Yang Zhang,et al.  The I-TASSER Suite: protein structure and function prediction , 2014, Nature Methods.

[36]  J F Brandts,et al.  Study of strong to ultratight protein interactions using differential scanning calorimetry. , 1990, Biochemistry.

[37]  L. Wyns,et al.  Rejuvenation of CcdB-poisoned gyrase by an intrinsically disordered protein domain. , 2009, Molecular cell.

[38]  J M Sturtevant,et al.  Thermodynamics of the binding of L-arabinose and of D-galactose to the L-arabinose-binding protein of Escherichia coli. , 1983, The Journal of biological chemistry.

[39]  S. Kanaya,et al.  Protein core adaptability: crystal structures of the cavity-filling variants of Escherichia coli RNase HI. , 2010, Protein and peptide letters.

[40]  M. Sternberg,et al.  SuSPect: Enhanced Prediction of Single Amino Acid Variant (SAV) Phenotype Using Network Features , 2014, Journal of molecular biology.

[41]  B. Rost,et al.  Better prediction of functional effects for sequence variants , 2015, BMC Genomics.

[42]  M. DePristo,et al.  Missense meanderings in sequence space: a biophysical view of protein evolution , 2005, Nature Reviews Genetics.

[43]  Raghavan Varadarajan,et al.  A Procedure for Detection and Quantitation of Cavity Volumes in Proteins , 2002, The Journal of Biological Chemistry.

[44]  B. Bukau,et al.  Cooperation of Hsp70 and Hsp100 chaperone machines in protein disaggregation , 2015, Front. Mol. Biosci..

[45]  David L. Young,et al.  High-throughput Analysis of in vivo Protein Stability* , 2013, Molecular & Cellular Proteomics.

[46]  Joseph B Hiatt,et al.  Activity-enhancing mutations in an E3 ubiquitin ligase identified by high-throughput mutagenesis , 2013, Proceedings of the National Academy of Sciences.

[47]  Dmitry Chudakov,et al.  Local fitness landscape of the green fluorescent protein , 2016, Nature.

[48]  Akinori Sarai,et al.  ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions , 2005, Nucleic Acids Res..

[49]  Andreas Plückthun,et al.  Critical features for biosynthesis, stability, and functionality of a G protein-coupled receptor uncovered by all-versus-all mutations , 2012, Proceedings of the National Academy of Sciences.

[50]  R. Varadarajan,et al.  Contribution of cation-pi interactions to protein stability. , 2006, Biochemistry.

[51]  J. Moult,et al.  Identification and analysis of deleterious human SNPs. , 2006, Journal of molecular biology.

[52]  M. Fares,et al.  The Effect of Chaperonin Buffering on Protein Evolution , 2010, Genome biology and evolution.

[53]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[54]  Dan S. Tawfik,et al.  Chaperonin overexpression promotes genetic variation and enzyme evolution , 2009, Nature.

[55]  Jane Clarke,et al.  Using Model Proteins to Quantify the Effects of Pathogenic Mutations in Ig-like Proteins*♦ , 2006, Journal of Biological Chemistry.

[56]  M. Michael Gromiha,et al.  CUPSAT: prediction of protein stability upon point mutations , 2006, Nucleic Acids Res..

[57]  F. Niesen,et al.  The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability , 2007, Nature Protocols.

[58]  S. Gottesman,et al.  ATP-dependent Degradation of CcdA by Lon Protease , 1996, The Journal of Biological Chemistry.

[59]  G. Fidelio,et al.  Extremely high thermal stability of streptavidin and avidin upon biotin binding. , 1999, Biomolecular engineering.

[60]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[61]  E. Shakhnovich,et al.  Soluble oligomerization provides a beneficial fitness effect on destabilizing mutations , 2011, Proceedings of the National Academy of Sciences.

[62]  Dan S. Tawfik,et al.  Robustness–epistasis link shapes the fitness landscape of a randomly drifting protein , 2006, Nature.

[63]  Raghavan Varadarajan,et al.  Structural correlates of the temperature sensitive phenotype derived from saturation mutagenesis studies of CcdB. , 2008, Biochemistry.

[64]  Wei-Lun Hsu,et al.  Proportion of solvent-exposed amino acids in a protein and rate of protein evolution. , 2007, Molecular biology and evolution.

[65]  C. Obinger,et al.  Construction of a Stability Landscape of the CH3 Domain of Human IgG1 by Combining Directed Evolution with High Throughput Sequencing , 2012, Journal of molecular biology.

[66]  D. Hartl,et al.  Misfolded proteins impose a dosage-dependent fitness cost and trigger a cytosolic unfolded protein response in yeast , 2010, Proceedings of the National Academy of Sciences.

[67]  C. Georgopoulos,et al.  SecB is a bona fide generalized chaperone in Escherichia coli. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[68]  Julie C. Mitchell,et al.  Community‐wide evaluation of methods for predicting the effect of mutations on protein–protein interactions , 2013, Proteins.

[69]  J U Bowie,et al.  Identifying determinants of folding and activity for a protein of unknown structure. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[70]  S. Jackson,et al.  Context-dependent nature of destabilizing mutations on the stability of FKBP12. , 1998, Biochemistry.

[71]  Mallur S. Madhusudhan,et al.  Stereochemical Criteria for Prediction of the Effects of Proline Mutations on Protein Stability , 2007, PLoS Comput. Biol..

[72]  Robert T. Sauer,et al.  Protein stability effects of a complete set of alanine substitutions in Arc repressor , 1994, Nature Structural Biology.

[73]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[74]  George I Makhatadze,et al.  Thermodynamic consequences of burial of polar and non-polar amino acid residues in the protein interior. , 2002, Journal of molecular biology.

[75]  T. Mikkelsen,et al.  Comprehensive mutational scanning of a kinase in vivo reveals substrate-dependent fitness landscapes , 2014, Nucleic acids research.

[76]  Timothy A. Whitehead,et al.  High-Resolution Sequence-Function Mapping of Full-Length Proteins , 2015, PloS one.

[77]  T. Ogura,et al.  Effects of the ccd function of the F plasmid on bacterial growth , 1985, Journal of bacteriology.

[78]  Finbarr Hayes,et al.  Toxins-Antitoxins: Plasmid Maintenance, Programmed Cell Death, and Cell Cycle Arrest , 2003, Science.

[79]  Raghavan Varadarajan,et al.  Protein model discrimination using mutational sensitivity derived from deep sequencing. , 2012, Structure.

[80]  F. Richards,et al.  The crystal structure of a mutant protein with altered but improved hydrophobic core packing. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[81]  Philip A. Romero,et al.  Dissecting enzyme function with microfluidic-based deep mutational scanning , 2015, Proceedings of the National Academy of Sciences.

[82]  Timothy A. Whitehead,et al.  Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing , 2012, Nature Biotechnology.

[83]  Benjamin P. Roscoe,et al.  Fitness analyses of all possible point mutations for regions of genes in yeast , 2012, Nature Protocols.

[84]  Jingwen Zhou,et al.  Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. , 2013, Molecular cell.

[85]  D. Baker,et al.  High Resolution Mapping of Protein Sequence–Function Relationships , 2010, Nature Methods.

[86]  B. Matthews,et al.  The introduction of strain and its effects on the structure and stability of T4 lysozyme. , 1999, Journal of molecular biology.

[87]  Christoph Adami,et al.  Thermodynamic prediction of protein neutrality. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[88]  David L. Young,et al.  Massively Parallel Functional Analysis of BRCA1 RING Domain Variants , 2015, Genetics.

[89]  C. A. Lewis,et al.  Temperature dependence of amino acid hydrophobicities , 2015, Proceedings of the National Academy of Sciences.

[90]  F M Richards,et al.  The hydrophobic core of Escherichia coli thioredoxin shows a high tolerance to nonconservative single amino acid substitutions. , 1992, Biochemistry.

[91]  Nicholas C. Wu,et al.  A Comprehensive Biophysical Description of Pairwise Epistasis throughout an Entire Protein Domain , 2014, Current Biology.

[92]  Konstantin B. Zeldovich,et al.  Latent Effects of Hsp90 Mutants Revealed at Reduced Expression Levels , 2013, PLoS genetics.

[93]  N. Ban,et al.  A cradle for new proteins: trigger factor at the ribosome. , 2005, Current opinion in structural biology.

[94]  C Chothia,et al.  Surface, subunit interfaces and interior of oligomeric proteins. , 1988, Journal of molecular biology.