A fragment based method for modeling of protein segments into cryo-EM density maps

BackgroundSingle-particle analysis of electron cryo-microscopy (cryo-EM) is a key technology for elucidation of macromolecular structures. Recent technical advances in hardware and software developments significantly enhanced the resolution of cryo-EM density maps and broadened the applicability and the circle of users. To facilitate modeling of macromolecules into cryo-EM density maps, fast and easy to use methods for modeling are now demanded.ResultsHere we investigated and benchmarked the suitability of a classical and well established fragment-based approach for modeling of segments into cryo-EM density maps (termed FragFit). FragFit uses a hierarchical strategy to select fragments from a pre-calculated set of billions of fragments derived from structures deposited in the Protein Data Bank, based on sequence similarly, fit of stem atoms and fit to a cryo-EM density map. The user only has to specify the sequence of the segment and the number of the N- and C-terminal stem-residues in the protein. Using a representative data set of protein structures, we show that protein segments can be accurately modeled into cryo-EM density maps of different resolution by FragFit. Prediction quality depends on segment length, the type of secondary structure of the segment and local quality of the map.ConclusionFast and automated calculation of FragFit renders it applicable for implementation of interactive web-applications e.g. to model missing segments, flexible protein parts or hinge-regions into cryo-EM density maps.

[1]  F. Tama,et al.  Flexible multi-scale fitting of atomic structures into low-resolution electron density maps with elastic network normal mode analysis. , 2004, Journal of molecular biology.

[2]  Richard A Friesner,et al.  Progress in super long loop prediction , 2011, Proteins.

[3]  Yoonjoo Choi,et al.  FREAD revisited: Accurate loop structure prediction using a database search algorithm , 2010, Proteins.

[4]  Dong Si,et al.  A machine learning approach for the identification of protein secondary structure elements from electron cryo-microscopy density maps. , 2012, Biopolymers.

[5]  Florence Tama,et al.  Excited states of ribosome translocation revealed through integrative molecular modeling , 2011, Proceedings of the National Academy of Sciences.

[6]  Leonardo G. Trabuco,et al.  Molecular dynamics flexible fitting: a practical guide to combine cryo-electron microscopy and X-ray crystallography. , 2009, Methods.

[7]  Ram Samudrala,et al.  A novel method for predicting and using distance constraints of high accuracy for refining protein structure prediction , 2009, Proteins.

[8]  András Fiser,et al.  Saturating representation of loop conformational fragments in structure databanks , 2006, BMC Structural Biology.

[9]  T. Mielke,et al.  Structures of ribosome-bound initiation factor 2 reveal the mechanism of subunit association , 2016, Science Advances.

[10]  Alan Brown,et al.  Tools for macromolecular model building and refinement into electron cryo-microscopy reconstructions , 2015, Acta crystallographica. Section D, Biological crystallography.

[11]  Keren Lasker,et al.  Finding the right fit: chiseling structures out of cryo-electron microscopy maps. , 2014, Current opinion in structural biology.

[12]  David Baker,et al.  De novo protein structure determination from near-atomic resolution cryo-EM maps , 2015, Nature Methods.

[13]  Matthew L. Baker,et al.  Gorgon and pathwalking: macromolecular modeling tools for subnanometer resolution density maps. , 2012, Biopolymers.

[14]  Alexander S. Rose,et al.  SL2: an interactive webtool for modeling of missing segments in proteins , 2016, Nucleic Acids Res..

[15]  C. Sanders,et al.  Involvement of distinct arrestin-1 elements in binding to different functional forms of rhodopsin , 2012, Proceedings of the National Academy of Sciences.

[16]  Andrej Sali,et al.  MultiFit: a web server for fitting multiple protein structures into their electron microscopy density map , 2011, Nucleic Acids Res..

[17]  S. Pongor,et al.  A normalized root‐mean‐spuare distance for comparing protein three‐dimensional structures , 2001, Protein science : a publication of the Protein Society.

[18]  Frank DiMaio,et al.  Automated structure refinement of macromolecular assemblies from cryo-EM maps using Rosetta , 2016, bioRxiv.

[19]  Sebastian Kelm,et al.  Fragment‐based modeling of membrane protein loops: Successes, failures, and prospects for the future , 2014, Proteins.

[20]  Yang Zhang,et al.  How significant is a protein structure similarity with TM-score = 0.5? , 2010, Bioinform..

[21]  Jesper Ferkinghoff-Borg,et al.  A generative, probabilistic model of local protein structure , 2008, Proceedings of the National Academy of Sciences.

[22]  John D. Westbrook,et al.  EMDataBank.org: unified data resource for CryoEM , 2010, Nucleic Acids Res..

[23]  Alan Brown,et al.  Structure of the Yeast Mitochondrial Large Ribosomal Subunit , 2014, Science.

[24]  Ivo Atanasov,et al.  Atomic model of CPV reveals the mechanism used by this single-shelled virus to economically carry out functions conserved in multishelled reoviruses. , 2011, Structure.

[25]  Ronald M Levy,et al.  Have we seen all structures corresponding to short protein fragments in the Protein Data Bank? An update. , 2003, Protein engineering.

[26]  H. Grubmüller,et al.  Energy barriers and driving forces in tRNA translocation through the ribosome , 2013, Nature Structural &Molecular Biology.

[27]  M. Baker,et al.  Identification of secondary structure elements in intermediate-resolution density maps. , 2007, Structure.

[28]  David S. Goodsell,et al.  The RCSB Protein Data Bank: new resources for research and education , 2012, Nucleic Acids Res..

[29]  Jens Meiler,et al.  BCL::EM-Fit: rigid body fitting of atomic structures into density maps using geometric hashing and real space refinement. , 2011, Journal of structural biology.

[30]  E. Coutsias,et al.  Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling , 2009, Nature Methods.

[31]  Robert Preissner,et al.  SuperLooper—a prediction server for the modeling of loops in globular and membrane proteins , 2009, Nucleic Acids Res..

[32]  Helmut Grubmüller,et al.  Position of transmembrane helix 6 determines receptor G protein coupling specificity. , 2014, Journal of the American Chemical Society.

[33]  Ke Tang,et al.  Fast Protein Loop Sampling and Structure Prediction Using Distance-Guided Sequential Chain-Growth Monte Carlo Method , 2014, PLoS Comput. Biol..

[34]  M. S. Chapman,et al.  Parsimony in Protein Conformational Change. , 2015, Structure.

[35]  Alexander S. Rose,et al.  Precision vs flexibility in GPCR signaling. , 2013, Journal of the American Chemical Society.

[36]  Ruedi Aebersold,et al.  Architecture of the large subunit of the mammalian mitochondrial ribosome , 2013, Nature.

[37]  Dong Si,et al.  Tracing beta strands using StrandTwister from cryo-EM density maps at medium resolutions. , 2014, Structure.

[38]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[39]  Yifan Cheng,et al.  Mechanism of gate opening in the 20S proteasome by the proteasomal ATPases. , 2008, Molecular cell.

[40]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[41]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[42]  A. Goede,et al.  Loops In Proteins (LIP)--a comprehensive loop database for homology modelling. , 2003, Protein engineering.

[43]  S. Rasmussen,et al.  Structure of a nanobody-stabilized active state of the β2 adrenoceptor , 2010, Nature.

[44]  A Leith,et al.  SPIDER and WEB: processing and visualization of images in 3D electron microscopy and related fields. , 1996, Journal of structural biology.

[45]  Terrence Frey,et al.  Faculty Opinions recommendation of TRPV1 structures in distinct conformations reveal activation mechanisms. , 2014 .

[46]  T. Kawabata Multiple Subunit Fitting into a Low-Resolution Density Map of a Macromolecular Complex Using a Gaussian Mixture Model , 2008, Biophysical journal.

[47]  Zhe Wang,et al.  Real-space refinement with DireX: from global fitting to side-chain improvements. , 2012, Biopolymers.

[48]  Ali Masoudi-Nejad,et al.  omputational structure analysis of biomacromolecule complexes by nterface geometry , 2013 .