Cooperative Changes in Solvent Exposure Identify Cryptic Pockets, Switches, and Allosteric Coupling.

Proteins are dynamic molecules that undergo conformational changes to a broad spectrum of different excited states. Unfortunately, the small populations of these states make it difficult to determine their structures or functional implications. Computer simulations are an increasingly powerful means to identify and characterize functionally relevant excited states. However, this advance has uncovered a further challenge: it can be extremely difficult to identify the most salient features of large simulation data sets. We reasoned that many functionally relevant conformational changes are likely to involve large, cooperative changes to the surfaces that are available to interact with potential binding partners. To examine this hypothesis, we introduce a method that returns a prioritized list of potentially functional conformational changes by segmenting protein structures into clusters of residues that undergo cooperative changes in their solvent exposure, along with the hierarchy of interactions between these groups. We term these groups exposons to distinguish them from other types of clusters that arise in this analysis and others. We demonstrate, using three different model systems, that this method identifies experimentally validated and functionally relevant conformational changes, including conformational switches, allosteric coupling, and cryptic pockets. Our results suggest that key functional sites are hubs in the network of exposons. As a further test of the predictive power of this approach, we apply it to discover cryptic allosteric sites in two different β-lactamase enzymes that are widespread sources of antibiotic resistance. Experimental tests confirm our predictions for both systems. Importantly, we provide the first evidence, to our knowledge, for a cryptic allosteric site in CTX-M-9 β-lactamase. Experimentally testing this prediction did not require any mutations and revealed that this site exerts the most potent allosteric control over activity of any pockets found in β-lactamases to date. Discovery of a similar pocket that was previously overlooked in the well-studied TEM-1 β-lactamase demonstrates the utility of exposons.

[1]  A. Berger,et al.  Deuterium exchange of poly-DL-alanine in aqueous solution. , 1957, Archives of biochemistry and biophysics.

[2]  R. Ruigrok,et al.  Nucleoproteins and nucleocapsids of negative-strand RNA viruses. , 2011, Current opinion in microbiology.

[3]  M. Vieth,et al.  Predicting the Conformational Variability of Abl Tyrosine Kinase using Molecular Dynamics Simulations and Markov State Models. , 2018, Journal of chemical theory and computation.

[4]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[5]  A. Shrake,et al.  Environment and exposure to solvent of protein atoms. Lysozyme and insulin. , 1973, Journal of molecular biology.

[6]  W. Team,et al.  West African Ebola Epidemic after One Year — Slowing but Not Yet under Control , 2015 .

[7]  Gregory R Bowman,et al.  Advanced Methods for Accessing Protein Shape-Shifting Present New Therapeutic Opportunities. , 2019, Trends in biochemical sciences.

[8]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[9]  H. Won,et al.  Structures of inactive CRP species reveal the atomic details of the allosteric transition that discriminates cyclic nucleotide second messengers. , 2014, Acta crystallographica. Section D, Biological crystallography.

[10]  R. Ebright,et al.  Dynamically driven protein allostery , 2006, Nature Structural &Molecular Biology.

[11]  X. Barril,et al.  Understanding and predicting druggability. A high-throughput method for detection of drug binding sites. , 2010, Journal of medicinal chemistry.

[12]  G. Bowman,et al.  Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites , 2012, Proceedings of the National Academy of Sciences.

[13]  Gerhard Hummer,et al.  Peptide folding kinetics from replica exchange molecular dynamics. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Kathryn M Hart,et al.  Discovery of multiple hidden allosteric sites by combining Markov state models and experiments , 2015, Proceedings of the National Academy of Sciences.

[15]  Ryan G. Coleman,et al.  Protein Pockets: Inventory, Shape, and Comparison , 2010, J. Chem. Inf. Model..

[16]  Justin R. Porter,et al.  Enspara: Modeling molecular ensembles with scalable data structures and parallel computing , 2018, bioRxiv.

[17]  Eric Vanden-Eijnden,et al.  Transition Path Theory for Markov Jump Processes , 2009, Multiscale Model. Simul..

[18]  W. Chiu,et al.  An Intrinsically Disordered Peptide from Ebola Virus VP35 Controls Viral RNA Synthesis by Modulating Nucleoprotein-RNA Interactions. , 2015, Cell reports.

[19]  Wah Chiu,et al.  Electron Cryo-microscopy Structure of Ebola Virus Nucleoprotein Reveals a Mechanism for Nucleocapsid-like Assembly , 2018, Cell.

[20]  Kevin L. Shaw,et al.  Linear extrapolation method of analyzing solvent denaturation curves , 2000, Proteins.

[21]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[22]  Diwakar Shukla,et al.  To milliseconds and beyond: challenges in the simulation of protein folding. , 2013, Current opinion in structural biology.

[23]  Frank Noé,et al.  Variational Approach to Molecular Kinetics. , 2014, Journal of chemical theory and computation.

[24]  Frank Noé,et al.  An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation , 2014, Advances in Experimental Medicine and Biology.

[25]  Vijay S. Pande,et al.  Screen Savers of the World Unite! , 2000, Science.

[26]  G. Bowman,et al.  Quantifying Allosteric Communication via Both Concerted Structural Changes and Conformational Disorder with CARDS. , 2017, Journal of chemical theory and computation.

[27]  Eric Vanden-Eijnden,et al.  Transition-path theory and path-finding algorithms for the study of rare events. , 2010, Annual review of physical chemistry.

[28]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[29]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 1. Theory , 2004 .

[30]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[31]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[32]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[33]  J. Wells,et al.  Searching for new allosteric sites in enzymes. , 2004, Current opinion in structural biology.

[34]  Kyle A. Beauchamp,et al.  Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1-39). , 2010, Journal of the American Chemical Society.

[35]  Mutagenesis of the cyclic AMP receptor protein of Escherichia coli: targeting positions 72 and 82 of the cyclic nucleotide binding pocket. , 1993, Nucleic acids research.

[36]  Sotaro Fuchigami,et al.  Slow dynamics in protein fluctuations revealed by time-structure based independent component analysis: the case of domain motions. , 2011, The Journal of chemical physics.

[37]  James R Horn,et al.  Allosteric inhibition through core disruption. , 2004, Journal of molecular biology.

[38]  F. Noé,et al.  Commute Maps: Separating Slowly Mixing Molecular Configurations for Kinetic Modeling. , 2016, Journal of chemical theory and computation.

[39]  Erik van Nimwegen,et al.  Disentangling Direct from Indirect Co-Evolution of Residues in Protein Alignments , 2010, PLoS Comput. Biol..

[40]  M. Schroeder,et al.  LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation , 2006, BMC Structural Biology.

[41]  David L Mobley,et al.  Quantifying Correlations Between Allosteric Sites in Thermodynamic Ensembles. , 2009, Journal of chemical theory and computation.

[42]  Jeremy C. Smith,et al.  Dynamical fingerprints for probing individual relaxation processes in biomolecular dynamics with simulations and kinetic experiments , 2011, Proceedings of the National Academy of Sciences.

[43]  Sergio Decherchi,et al.  A general and Robust Ray-Casting-Based Algorithm for Triangulating Surfaces at the Nanoscale , 2013, PloS one.

[44]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[45]  Maxwell I. Zimmerman,et al.  Designing small molecules to target cryptic pockets yields both positive and negative allosteric modulators , 2017, PloS one.

[46]  M Hendlich,et al.  LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins. , 1997, Journal of molecular graphics & modelling.

[47]  E. van Nimwegen,et al.  Accurate Prediction of Protein–protein Interactions from Sequence Alignments Using a Bayesian Method , 2022 .

[48]  T. Steitz,et al.  Crystal structure of a CAP-DNA complex: the DNA is bent by 90 degrees , 1991, Science.

[49]  Gregory R Bowman,et al.  Choice of Adaptive Sampling Strategy Impacts State Discovery, Transition Probabilities, and the Apparent Mechanism of Conformational Changes. , 2018, Journal of chemical theory and computation.

[50]  D. Crothers,et al.  Modulation of the stability of a gene-regulatory protein dimer by DNA and cAMP. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[51]  H. Matsuzawa,et al.  Site-directed mutants, at position 166, of RTEM-1 beta-lactamase that form a stable acyl-enzyme intermediate with penicillin. , 1991, The Journal of biological chemistry.

[52]  Amanda E. Wakefield,et al.  Cryptic binding sites on proteins: definition, detection, and druggability , 2018, Current Opinion in Chemical Biology.

[53]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[54]  Brian K Shoichet,et al.  Evolution of an antibiotic resistance enzyme constrained by stability and activity trade-offs. , 2002, Journal of molecular biology.

[55]  J. Lee,et al.  Intersubunit communications in Escherichia coli cyclic AMP receptor protein: studies of the ligand binding domain. , 1992, Biochemistry.

[56]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[57]  J G Harman,et al.  Allosteric regulation of the cAMP receptor protein. , 2001, Biochimica et biophysica acta.

[58]  Frank Noé,et al.  Markov state models of biomolecular conformational dynamics. , 2014, Current opinion in structural biology.

[59]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[60]  Luis Pedro Coelho Jug: Software for Parallel Reproducible Computation in Python , 2017 .

[61]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[62]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[63]  Kathryn M Hart,et al.  supplementary figures , 2018 .

[64]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[65]  Thomas J Lane,et al.  MDTraj: a modern, open library for the analysis of molecular dynamics trajectories , 2014, bioRxiv.