High‐throughput identification of protein mutant stability computed from a double mutant fitness landscape

The effect of a mutation on protein stability is traditionally measured by genetic construction, expression, purification, and physical analysis using low‐throughput methods. This process is tedious and limits the number of mutants able to be examined in a single study. In contrast, functional fitness effects can be measured in a high‐throughput manner by various deep mutational scanning tools. Using protein GB 1, we have recently demonstrated the feasibility of estimating the mutational stability effect ( ΔΔG) of single‐substitution based on the functional fitness profile of all double‐substitutions. The principle is to identify genetic backgrounds that have an exhausted stability margin. The functional effect of an additional substitution on these genetic backgrounds can then be used to compute the mutational ΔΔG based on the biophysical relationship between functional fitness and thermodynamic stability. However, to identify such genetic backgrounds, the approach described in our previous study required a benchmark dataset, which is a set of known mutational ΔΔG. In this study, a benchmark‐independent approach is developed. The genetic backgrounds of interest are identified using k‐means clustering with the integration of structural information. We further demonstrated that a reasonable approximation of ΔΔG can also be obtained without taking structural information into account. In summary, this study describes a novel method for computing ΔΔG from double‐substitution functional fitness profiles alone, without relying on any known mutational ΔΔG as a benchmark.

[1]  G L Gilliland,et al.  Two crystal structures of the B1 immunoglobulin-binding domain of streptococcal protein G and comparison with NMR. , 1994, Biochemistry.

[2]  Debora S. Marks,et al.  Inferring Pairwise Interactions from Biological Data Using Maximum-Entropy Probability Models , 2015, PLoS Comput. Biol..

[3]  S. Fields,et al.  Deep mutational scanning: a new style of protein science , 2014, Nature Methods.

[4]  Dan S. Tawfik,et al.  Robustness–epistasis link shapes the fitness landscape of a randomly drifting protein , 2006, Nature.

[5]  F. Arnold,et al.  Protein stability promotes evolvability. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Thomas J Magliery,et al.  Protein stability by number: high-throughput and statistical approaches to one of protein science's most difficult problems. , 2011, Current opinion in chemical biology.

[7]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[8]  D. Baker,et al.  Role of conformational sampling in computing mutation‐induced changes in protein structure and stability , 2011, Proteins.

[9]  Nicholas C. Wu,et al.  A Comprehensive Biophysical Description of Pairwise Epistasis throughout an Entire Protein Domain , 2014, Current Biology.

[10]  F. Arnold,et al.  Directed evolution of a thermostable esterase. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  S. Radford,et al.  Optimizing protein stability in vivo. , 2009, Molecular cell.

[12]  Marc A Suchard,et al.  Stability-mediated epistasis constrains the evolution of an influenza protein , 2013, eLife.

[13]  Joost Schymkowitz,et al.  The stability effects of protein mutations appear to be universally distributed. , 2007, Journal of molecular biology.

[14]  G. Schreiber,et al.  Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details. , 2009, Protein engineering, design & selection : PEDS.

[15]  C. Wilke,et al.  Predicting the tolerance of proteins to random amino acid substitution. , 2005, Biophysical journal.

[16]  K. R. Woods,et al.  Prediction of protein antigenic determinants from amino acid sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Christoph Adami,et al.  Thermodynamic prediction of protein neutrality. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.