BEES: Bayesian Ensemble Estimation from SAS, a SASSIE-web Module

Many biomolecular complexes exist in a flexible ensemble of states in solution which are necessary to perform their biological function. Small angle scattering (SAS) measurements are a popular method for characterizing these flexible molecules due to their relative ease of use and ability to simultaneously probe the full ensemble of states. However, SAS data is typically low-dimensional and difficult to interpret without the assistance of additional structural models. In theory, experimental SAS curves can be reconstituted from a linear combination of theoretical models, although this procedure carries significant risk of overfitting the inherently low-dimensional SAS data. Previously, we developed a Bayesian-based method for fitting ensembles of model structures to experimental SAS data that rigorously avoids overfitting. However, we have found that these methods can be difficult to incorporate into typical SAS modeling workflows, especially for users that are not experts in computational modeling. To this end, we present the “Bayesian Ensemble Estimator from SAS” (BEES) program. Two forks of BEES are available, the primary one existing as module for the SASSIE webserver, and a developmental version that is a standalone python program. BEES allows users to exhaustively sample ensemble models constructed from a library of theoretical states and to interactively analyze and compare each model’s performance. The fitting routine also allows for a secondary data sets to be supplied, thereby simultaneously fitting models to both SAS data as well as orthogonal information. The flexible ensemble of K63-linked ubiquitin trimers is presented as an example of BEES’ capabilities.

[1]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[2]  Greg L. Hura,et al.  Structure and flexibility within proteins as identified through small angle X-ray scattering. , 2009, General physiology and biophysics.

[3]  Jesper Ferkinghoff-Borg,et al.  Calculation of accurate small angle X-ray scattering curves from coarse-grained protein models , 2010, BMC Bioinformatics.

[4]  Jill Trewhella,et al.  Bayesian inference of protein conformational ensembles from limited structural data , 2018, PLoS Comput. Biol..

[5]  Keegan E. Hines,et al.  A primer on Bayesian inference for biophysical systems. , 2015, Biophysical journal.

[6]  Guangfeng Zhou,et al.  Bayesian inference of conformational state populations from computational models and sparse experimental observables , 2014, J. Comput. Chem..

[7]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[8]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[9]  Dmitri I. Svergun,et al.  A posteriori determination of the useful data range for small-angle scattering experiments on dilute monodisperse systems , 2015, IUCrJ.

[10]  Lauren Boldon,et al.  Review of the fundamental theories behind small angle X-ray scattering, molecular dynamics simulations, and relevant integrated application , 2015, Nano reviews.

[11]  Bartosz Różycki,et al.  Large, dynamic, multi-protein complexes: a challenge for structural biology , 2014, Journal of physics. Condensed matter : an Institute of Physics journal.

[12]  D. Svergun,et al.  CRYSOL : a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates , 1995 .

[13]  Jill Trewhella,et al.  Small-angle scattering and 3D structure interpretation. , 2016, Current opinion in structural biology.

[14]  Lee Makowski,et al.  Modeling the hydration layer around proteins: applications to small- and wide-angle x-ray scattering. , 2011, Biophysical journal.

[15]  Nancy Wilkins-Diehr,et al.  XSEDE: Accelerating Scientific Discovery , 2014, Computing in Science & Engineering.

[16]  George Bosilca,et al.  Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation , 2004, PVM/MPI.

[17]  Wei Huang,et al.  Fast-SAXS-pro: a unified approach to computing SAXS profiles of DNA, RNA, protein, and their complexes. , 2013, The Journal of chemical physics.

[18]  J. Hub,et al.  Validating solution ensembles from molecular dynamics simulation by wide-angle X-ray scattering data. , 2014, Biophysical journal.

[19]  A. Menzel,et al.  Deconvoluting Protein (Un)folding Structural Ensembles Using X-Ray Scattering, Nuclear Magnetic Resonance Spectroscopy and Molecular Dynamics Simulation , 2015, PloS one.

[20]  Andrej Sali,et al.  FoXS, FoXSDock and MultiFoXS: Single-state and multi-state structural modeling of proteins and their complexes based on SAXS profiles , 2016, Nucleic Acids Res..

[21]  J. Hub,et al.  Interpreting solution X-ray scattering data using molecular simulations. , 2018, Current opinion in structural biology.

[22]  Dmitri I. Svergun,et al.  Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering , 2015, IUCrJ.

[23]  Ron Elber,et al.  Revealing the distinct folding phases of an RNA three-helix junction , 2018, Nucleic acids research.

[24]  Ieee Xplore Computing in science & engineering , 1999 .

[25]  Jeff Wereszczynski,et al.  Determining Atomistic SAXS Models of Tri-Ubiquitin Chains from Bayesian Analysis of Accelerated Molecular Dynamics Simulations. , 2017, Journal of chemical theory and computation.

[26]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[27]  Max C. Watson,et al.  Rapid and accurate calculation of small‐angle scattering profiles using the golden ratio , 2013 .

[28]  P. V. Konarev,et al.  ATSAS 2.8: a comprehensive data analysis suite for small-angle scattering from macromolecular solutions , 2017, Journal of applied crystallography.

[29]  Nicholas J. Terrill,et al.  Atomistic modelling of scattering data in the Collaborative Computational Project for Small Angle Scattering (CCP-SAS) , 2016, Journal of applied crystallography.

[30]  Renwick C J Dobson,et al.  Tyrosine Latching of a Regulatory Gate Affords Allosteric Control of Aromatic Amino Acid Biosynthesis* , 2011, The Journal of Biological Chemistry.

[31]  Patrice Vachette,et al.  Direct observation in solution of a preexisting structural equilibrium for a mutant of the allosteric aspartate transcarbamoylase , 2007, Proceedings of the National Academy of Sciences.

[32]  Lee Makowski,et al.  Multidomain assembled states of Hck tyrosine kinase in solution , 2010, Proceedings of the National Academy of Sciences.

[33]  Joseph E Curtis,et al.  Conformation of the HIV-1 Gag protein in solution. , 2007, Journal of molecular biology.

[34]  Marlon E. Pierce,et al.  The GenApp framework integrated with Airavata for managed compute resource submissions , 2015, Concurr. Comput. Pract. Exp..

[35]  J. Hub,et al.  Interpretation of solution x-ray scattering by explicit-solvent molecular dynamics. , 2015, Biophysical journal.

[36]  Kresten Lindorff-Larsen,et al.  On the Calculation of SAXS Profiles of Folded and Intrinsically Disordered Proteins from Computer Simulations. , 2018, Journal of molecular biology.

[37]  Collin M. Stultz,et al.  Modeling Intrinsically Disordered Proteins with Bayesian Statistics , 2010, Journal of the American Chemical Society.

[38]  Osman Bilsel,et al.  Sub-millisecond time-resolved SAXS using a continuous-flow mixer and X-ray microbeam , 2013, Journal of synchrotron radiation.

[39]  Andrej Sali,et al.  FoXS: a web server for rapid computation and fitting of SAXS profiles , 2010, Nucleic Acids Res..

[40]  Joseph E. Curtis,et al.  Monte Carlo simulation algorithm for B‐DNA , 2016, J. Comput. Chem..

[41]  Vincent A Voelz,et al.  Model Selection Using BICePs: A Bayesian Approach for Force Field Validation and Parameterization. , 2018, The journal of physical chemistry. B.