Bayesian inference of protein conformational ensembles from limited structural data

Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that correct number of ensemble members can be recovered up to high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble.

[1]  M. Gautel,et al.  Dissecting the N-terminal Myosin Binding Site of Human Cardiac Myosin-binding Protein C , 2007, Journal of Biological Chemistry.

[2]  P. V. Konarev,et al.  ATSAS 2.8: a comprehensive data analysis suite for small-angle scattering from macromolecular solutions , 2017, Journal of applied crystallography.

[3]  H. Vogel,et al.  A molecular dynamics study of Ca(2+)-calmodulin: evidence of interdomain coupling and structural collapse on the nanosecond timescale. , 2004, Biophysical journal.

[4]  S. Sadayappan,et al.  Cardiac myosin binding protein-C as a central target of cardiac sarcomere signaling: a special mini review series , 2013, Pflügers Archiv - European Journal of Physiology.

[5]  B. Vestergaard Analysis of biostructural changes, dynamics, and interactions - Small-angle X-ray scattering to the rescue. , 2016, Archives of biochemistry and biophysics.

[6]  F A Quiocho,et al.  Calmodulin structure refined at 1.7 A resolution. , 1992, Journal of molecular biology.

[7]  J. Trewhella,et al.  Ligand-induced conformational changes and conformational dynamics in the solution structure of the lactose repressor protein. , 2008, Journal of molecular biology.

[8]  John Skilling,et al.  Maximum Entropy and Bayesian Methods , 1989 .

[9]  A. Means,et al.  Calmodulin: a prototypical calcium sensor. , 2000, Trends in cell biology.

[10]  D. Baker,et al.  Alternate states of proteins revealed by detailed energy landscape mapping. , 2011, Journal of molecular biology.

[11]  Dmitri I. Svergun,et al.  Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering , 2015, IUCrJ.

[12]  J. Trewhella,et al.  Calmodulin disrupts the structure of the HIV-1 MA protein. , 2010, Journal of molecular biology.

[13]  Nobuhiro Nakamura,et al.  Ubiquitin System , 2018, International journal of molecular sciences.

[14]  D. Uttenweiler,et al.  Myosin binding protein C, a phosphorylation-dependent force regulator in muscle that controls the attachment of myosin heads by its interaction with myosin S2. , 2000, Circulation research.

[15]  Temple F. Smith Occam's razor , 1980, Nature.

[16]  Susan S. Taylor,et al.  Dysfunctional conformational dynamics of protein kinase A induced by a lethal mutant of phospholamban hinder phosphorylation , 2015, Proceedings of the National Academy of Sciences.

[17]  Sara Linse,et al.  The role of electrostatic interactions in calmodulin-peptide complex formation. , 2004, Biophysical journal.

[18]  Christopher E. Berndsen,et al.  New insights into ubiquitin E3 ligase mechanism , 2014, Nature Structural &Molecular Biology.

[19]  A. Bonvin,et al.  On the usefulness of ion-mobility mass spectrometry and SAXS data in scoring docking decoys. , 2013, Acta crystallographica. Section D, Biological crystallography.

[20]  C. Tung,et al.  A Highly Conserved Yet Flexible Linker Is Part of a Polymorphic Protein-Binding Domain in Myosin-Binding Protein C. , 2016, Structure.

[21]  J. Trewhella,et al.  Calmodulin binds a highly extended HIV-1 MA protein that refolds upon its release. , 2012, Biophysical journal.

[22]  J. Lefèvre,et al.  The assembly of immunoglobulin-like modules in titin: implications for muscle elasticity. , 1998, Journal of molecular biology.

[23]  E. Homsher,et al.  Regulation of contraction in striated muscle. , 2000, Physiological reviews.

[24]  H. P. Lu,et al.  Molecular mechanism of multispecific recognition of Calmodulin through conformational changes , 2017, Proceedings of the National Academy of Sciences.

[25]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[26]  H. Kawasaki,et al.  Conformational landscape mapping the difference between N-lobes and C-lobes of calmodulin. , 2017, Journal of inorganic biochemistry.

[27]  Bente Vestergaard,et al.  Application of Bayesian analysis to indirect Fourier transformation in small-angle scattering , 2006 .

[28]  J. Tainer,et al.  Structural dynamics in DNA damage signaling and repair. , 2010, Current opinion in structural biology.

[29]  L. D. Antonov,et al.  Bayesian inference of protein ensembles from SAXS data. , 2016, Physical chemistry chemical physics : PCCP.

[30]  Collin M. Stultz,et al.  Efficient Construction of Disordered Protein Ensembles in a Bayesian Framework with Optimal Selection of Conformations , 2011, Pacific Symposium on Biocomputing.

[31]  C. Chothia,et al.  Structure, function and evolution of multidomain proteins. , 2004, Current opinion in structural biology.

[32]  L. Kay,et al.  A novel approach for sequential assignment of 1H, 13C, and 15N spectra of proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin. , 1990, Biochemistry.

[33]  Yutaka Ueno,et al.  Molecular dynamics simulations revealed Ca2+‐dependent conformational change of Calmodulin , 2002, FEBS letters.

[34]  Hua Lee,et al.  Maximum Entropy and Bayesian Methods. , 1996 .

[35]  Andrej Sali,et al.  FoXS, FoXSDock and MultiFoXS: Single-state and multi-state structural modeling of proteins and their complexes based on SAXS profiles , 2016, Nucleic Acids Res..

[36]  J. Stull,et al.  Activation of Myosin Light Chain Kinase Requires Translocation of Bound Calmodulin* , 2001, The Journal of Biological Chemistry.

[37]  Poul Nissen,et al.  Structural diversity of calmodulin binding to its target sites , 2013, The FEBS journal.

[38]  P. Rosevear,et al.  Structural Insight into Unique Cardiac Myosin-binding Protein-C Motif , 2012, The Journal of Biological Chemistry.

[39]  T. Chouard,et al.  Structural biology: Breaking the protein rules , 2011, Nature.

[40]  Andrej Sali,et al.  Recovering a representative conformational ensemble from underdetermined macromolecular structural data. , 2013, Journal of the American Chemical Society.

[41]  M. Komajda,et al.  Organization and sequence of human cardiac myosin binding protein C gene (MYBPC3) and identification of mutations predicted to produce truncated proteins in familial hypertrophic cardiomyopathy. , 1997, Circulation research.

[42]  Dmitri I. Svergun,et al.  2017 publication guidelines for structural modelling of small-angle scattering data from biomolecules in solution: an update , 2017, Acta crystallographica. Section D, Structural biology.

[43]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[44]  Joseph Hilbe,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2009 .

[45]  J. Trewhella,et al.  Comparison of the crystal and solution structures of calmodulin and troponin C. , 1988, Biochemistry.

[46]  John A Tainer,et al.  Bridging the solution divide: comprehensive structural analyses of dynamic RNA, DNA, and protein assemblies by small-angle X-ray scattering. , 2010, Current opinion in structural biology.

[47]  Dmitri I Svergun,et al.  Correlation Map, a goodness-of-fit test for one-dimensional X-ray scattering spectra , 2015, Nature Methods.

[48]  Maxim V. Petoukhov,et al.  Conformational space of flexible biological macromolecules from average data. , 2010, Journal of the American Chemical Society.

[49]  L. Kay,et al.  A novel approach for sequential assignment of proton, carbon-13, and nitrogen-15 spectra of larger proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin , 1990 .

[50]  H. Watkins,et al.  Cardiac Myosin Binding Protein C: Its Role in Physiology and Disease , 2004, Circulation research.

[51]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[52]  Martina Krüger,et al.  Titin, a Central Mediator for Hypertrophic Signaling, Exercise-Induced Mechanosignaling and Skeletal Muscle Remodeling , 2016, Front. Physiol..

[53]  Carl E. Rasmussen,et al.  Occam's Razor , 2000, NIPS.

[54]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[55]  D. Svergun,et al.  A practical guide to small angle X‐ray scattering (SAXS) of flexible and intrinsically disordered proteins , 2015, FEBS letters.

[56]  M Ikura,et al.  Backbone dynamics of calmodulin studied by 15N relaxation using inverse detected two-dimensional NMR spectroscopy: the central helix is flexible. , 1992, Biochemistry.

[57]  Dmitri I. Svergun,et al.  A posteriori determination of the useful data range for small-angle scattering experiments on dilute monodisperse systems , 2015, IUCrJ.

[58]  Ali Rana Atilgan,et al.  Designing Molecular Dynamics Simulations to Shift Populations of the Conformational States of Calmodulin , 2013, PLoS Comput. Biol..

[59]  L. Kay,et al.  Heteronuclear 3D NMR and isotopic labeling of calmodulin. Towards the complete assignment of the 1H NMR spectrum. , 1990, Biochemical pharmacology.

[60]  G. Clore,et al.  Contrast-matched small-angle X-ray scattering from a heavy-atom-labeled protein in structure determination: application to a lead-substituted calmodulin-peptide complex. , 2012, Journal of the American Chemical Society.

[61]  Jeff Wereszczynski,et al.  Determining Atomistic SAXS Models of Tri-Ubiquitin Chains from Bayesian Analysis of Accelerated Molecular Dynamics Simulations. , 2017, Journal of chemical theory and computation.

[62]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[63]  T. Pollard,et al.  Annual review of biophysics and biomolecular structure , 1992 .

[64]  Michal Hammel,et al.  Validation of macromolecular flexibility in solution by small-angle X-ray scattering (SAXS) , 2012, European Biophysics Journal.

[65]  M. Gautel,et al.  cAPK‐phosphorylation controls the interaction of the regulatory domain of cardiac myosin binding protein C with myosin‐S2 in an on‐off fashion , 1999, FEBS letters.

[66]  M. Karplus,et al.  A hierarchy of timescales in protein dynamics is linked to enzyme catalysis , 2007, Nature.

[67]  Massimiliano Bonomi,et al.  Principles of protein structural ensemble determination. , 2017, Current opinion in structural biology.

[68]  D. Svergun,et al.  CRYSOL : a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates , 1995 .

[69]  M Ikura,et al.  Molecular and structural basis of target recognition by calmodulin. , 1995, Annual review of biophysics and biomolecular structure.

[70]  M. Blackledge,et al.  Structural characterization of flexible proteins using small-angle X-ray scattering. , 2007, Journal of the American Chemical Society.

[71]  K Schulten,et al.  Structure and dynamics of calmodulin in solution. , 1998, Biophysical journal.

[72]  Dynamics and entropy of a calmodulin-peptide complex studied by NMR and molecular dynamics. , 2003, Biochemistry.

[73]  D. Svergun,et al.  Small-angle scattering: a view on the properties, structures and structural changes of biological macromolecules in solution , 2003, Quarterly Reviews of Biophysics.

[74]  Jill Trewhella,et al.  Small-angle scattering and 3D structure interpretation. , 2016, Current opinion in structural biology.

[75]  Aki Vehtari,et al.  Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC , 2015, Statistics and Computing.

[76]  D. Wishart,et al.  Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts , 2003, Journal of Biomolecular NMR.