An Effective Computational Method Incorporating Multiple Secondary Structure Predictions in Topology Determination for Cryo-EM Images

A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from α-proteins and two Cryo-EM images of α-β proteins. We observed that the rank of the true topologies is consistently improved by using multiple secondary structure predictions instead of a single prediction. The results show that the algorithm is robust and works well even when errors/misses in the predicted secondary structures are present in the image or the sequence. The results also show that the algorithm is efficient and is able to handle proteins with as many as 33 helices.

[1]  Matthew L. Baker,et al.  Backbone structure of the infectious Epsilon15 virus capsid revealed by electron cryomicroscopy , 2008 .

[2]  Desh Ranjan,et al.  A Novel Computational Method for Deriving Protein Secondary Structure Topologies Using Cryo-EM Density Maps and Multiple Secondary Structure Predictions , 2015, ISBRA.

[3]  Aoife McLysaght,et al.  Porter: a new, accurate server for protein secondary structure prediction , 2005, Bioinform..

[4]  Dong Si,et al.  Beta-sheet Detection and Representation from Medium Resolution Cryo-EM Density Maps , 2013, BCB.

[5]  M. Baker,et al.  Electron cryomicroscopy of biological machines at subnanometer resolution. , 2005, Structure.

[6]  Matthew L. Baker,et al.  Shape modeling and matching in identifying 3D protein structures , 2008, Comput. Aided Des..

[7]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[8]  Wah Chiu,et al.  Near-atomic-resolution cryo-EM for molecular virology. , 2011, Current opinion in virology.

[9]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[10]  Desh Ranjan,et al.  A Dynamic Programming Algorithm for Finding the Optimal Placement of a Secondary Structure Topology in Cryo-EM Data , 2015, J. Comput. Biol..

[11]  Dong Si,et al.  A machine learning approach for the identification of protein secondary structure elements from electron cryo-microscopy density maps. , 2012, Biopolymers.

[12]  P. Stewart,et al.  EM-fold: De novo folding of alpha-helical proteins guided by intermediate-resolution electron microscopy density maps. , 2009, Structure.

[13]  Desh Ranjan,et al.  Ranking Valid Topologies of the Secondary Structure Elements Using a Constraint Graph , 2011, J. Bioinform. Comput. Biol..

[14]  Mirabela Rusu,et al.  Evolutionary bidirectional expansion for the tracing of alpha helices in cryo-electron microscopy reconstructions. , 2012, Journal of structural biology.

[15]  R. Aebersold,et al.  Molecular architecture of the 26S proteasome holocomplex determined by an integrative approach , 2012, Proceedings of the National Academy of Sciences.

[16]  M. Baker,et al.  Identification of secondary structure elements in intermediate-resolution density maps. , 2007, Structure.

[17]  Jing He,et al.  IDENTIFICATION OF α-HELICES FROM LOW RESOLUTION PROTEIN DENSITY MAPS , 2006 .

[18]  Matthew L. Baker,et al.  Backbone structure of the infectious ε15 virus capsid revealed by electron cryomicroscopy , 2008, Nature.

[19]  Bernard F. Buxton,et al.  Secondary structure prediction with support vector machines , 2003, Bioinform..

[20]  M. Baker,et al.  Bridging the information gap: computational tools for intermediate resolution structure interpretation. , 2001, Journal of molecular biology.

[21]  B. Rost,et al.  Alignments grow, secondary structure prediction improves , 2002, Proteins.

[22]  John D. Westbrook,et al.  EMDataBank.org: unified data resource for CryoEM , 2010, Nucleic Acids Res..

[23]  W Chiu,et al.  EMAN: semiautomated software for high-resolution single-particle reconstructions. , 1999, Journal of structural biology.

[24]  Desh Ranjan,et al.  Improved Efficiency in Cryo-EM Secondary Structure Topology Determination from Inaccurate Data , 2012, J. Bioinform. Comput. Biol..

[25]  Jaap Heringa,et al.  The influence of gapped positions in multiple sequence alignments on secondary structure prediction methods , 2004, Comput. Biol. Chem..

[26]  M. Baker,et al.  Modeling protein structure at near atomic resolutions with Gorgon. , 2011, Journal of structural biology.

[27]  Legand Burge,et al.  Intensity-Based Skeletonization of CryoEM Gray-Scale Images Using a True Segmentation-Free Algorithm , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Enrico Pontelli,et al.  Identification of alpha-helices from low resolution protein density maps. , 2006, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[29]  Dong Si,et al.  Tracing beta strands using StrandTwister from cryo-EM density maps at medium resolutions. , 2014, Structure.

[30]  Andrey N. Chernikov,et al.  Estimating loop length from CryoEM images at medium resolutions , 2013, BMC Structural Biology.

[31]  Qinfen Zhang,et al.  CryoEM structure of the mature dengue virus at 3.5-Å resolution , 2012, Nature Structural &Molecular Biology.

[32]  Cinque S. Soto,et al.  Evaluating conformational free energies: The colony energy and its application to the problem of loop prediction , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Klaus Schulten,et al.  Cryo-electron microscopy modeling by the molecular dynamics flexible fitting method. , 2012, Biopolymers.

[34]  Michael Levitt,et al.  Combining efficient conformational sampling with a deformable elastic network model facilitates structure refinement at low resolution. , 2007, Structure.

[35]  Daniel N. Wilson,et al.  Structures of the human and Drosophila 80S ribosome , 2013, Nature.

[36]  Aleksey A. Porollo,et al.  Combining prediction of secondary structure and solvent accessibility in proteins , 2005, Proteins.

[37]  P. Argos,et al.  Seventy‐five percent accuracy in protein secondary structure prediction , 1997, Proteins.

[38]  Jianpeng Ma,et al.  A structural-informatics approach for mining beta-sheets: locating sheets in intermediate-resolution density maps. , 2003, Journal of molecular biology.

[39]  Geoffrey J. Barton,et al.  JPred : a consensus secondary structure prediction server , 1999 .

[40]  S Birmanns,et al.  Using situs for flexible and rigid-body fitting of multiresolution single-molecule data. , 2001, Journal of structural biology.

[41]  M. Levitt,et al.  Mechanism of Folding Chamber Closure in a Group II Chaperonin , 2010, Nature.

[42]  Dong Si,et al.  Orientations of beta-strand traces and near maximum twist , 2014, BCB.

[43]  Desh Ranjan,et al.  Solving the Secondary Structure Matching Problem in Cryo-EM De Novo Modeling Using a Constrained $K$-Shortest Path Graph Algorithm , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.