Protein conformational search with geometric projections

Protein structure prediction remains a central challenge in computational structural biology. Even at the coarse-grained level of detail, the protein conformational space is vast, and available energy functions contain many false local minima. In order to effectively characterize this space, a conformational search must sample a geometrically-diverse set of low-energy conformations. Our recently published FeLTr framework achieves this goal by employing a low-dimensional geometric projection layer to bias conformational sampling towards unexplored regions of the search space. In this work we present a new geometric projection layer based on the effective connectivity measure, which encapsulates interatomic distances within a conformation. Extensive analysis indicates that effective connectivity allows equipping the high-dimensional conformational search with an effective projection layer. On several target proteins, this layer improves significantly over our previous work, resulting in sampling of conformations with significantly lower lRMSDs to the known native structure.

[1]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[2]  Shuangye Yin,et al.  Eris: an automated estimator of protein stability , 2007, Nature Methods.

[3]  Harold A. Scheraga,et al.  Protein structure prediction using a combination of sequence homology and global energy minimization I. Global energy minimization of surface loops , 1990 .

[4]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[5]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[6]  Oliver Brock,et al.  Guiding conformation space search with an all‐atom energy potential , 2008, Proteins.

[7]  Kevin Molloy,et al.  Variable-length Fragment Assembly within a Probabilistic Protein Structure Prediction Framework Variable-length Fragment Assembly within a Probabilistic Protein Structure Prediction Framework , 2011 .

[8]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[9]  A. Mathiowetz,et al.  De novo prediction of polypeptide conformations using dihedral probability grid Monte Carlo methodology , 1995, Protein Science.

[10]  Richard Bonneau,et al.  De novo prediction of three-dimensional structures for major protein families. , 2002, Journal of molecular biology.

[11]  Michele Vendruscolo,et al.  Reconstruction of protein structures from a vectorial representation. , 2004, Physical review letters.

[12]  Harold A. Scheraga,et al.  Pattern recognition in the prediction of protein structure. II. Chain conformation from a probability‐directed search procedure , 1989 .

[13]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[14]  Amarda Shehu,et al.  An Ab-initio tree-based exploration to enhance sampling of low-energy protein conformations , 2009, Robotics: Science and Systems.

[15]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[16]  Zaida Luthey-Schulten,et al.  Evaluating protein structure-prediction schemes using energy landscape theory , 2001, IBM J. Res. Dev..

[17]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[18]  Amarda Shehu,et al.  In Search of the protein Native State with a Probabilistic Sampling Approach , 2011, J. Bioinform. Comput. Biol..

[19]  Jaime Prilusky,et al.  Assessment of CASP8 structure predictions for template free targets , 2009, Proteins.

[20]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[21]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[22]  Tanja Kortemme,et al.  Computational design of protein-protein interactions. , 2004, Current opinion in chemical biology.

[23]  Wilfred F. van Gunsteren,et al.  Biomolecular Modeling: Goals, Problems, Perspectives , 2006 .

[24]  Harold A. Scheraga,et al.  Pattern recognition in the prediction of protein structure. III. An importance‐sampling minimization procedure , 1989 .

[25]  Ruth Nussinov,et al.  fragment folding and assembly Reducing the computational complexity of protein folding via , 2002 .

[26]  Cecilia Clementi,et al.  Unfolding the fold of cyclic cysteine‐rich peptides , 2008, Protein science : a publication of the Protein Society.

[27]  Yuko Okamoto,et al.  Generalized-ensemble algorithms: enhanced sampling techniques for Monte Carlo and molecular dynamics simulations. , 2003, Journal of molecular graphics & modelling.

[28]  James E. Fitzgerald,et al.  Mimicking the folding pathway to improve homology-free protein structure prediction , 2009, Proceedings of the National Academy of Sciences.

[29]  Amarda Shehu,et al.  Enhancing Sampling of the Conformational Space Near the Protein Native State , 2010, BIONETICS.

[30]  Amarda Shehu,et al.  Conformational Search for the Protein Native State , 2010 .

[31]  L. Kavraki,et al.  Multiscale characterization of protein conformational ensembles , 2009, Proteins.

[32]  Sitao Wu,et al.  Ab Initio Protein Structure Prediction , 2009 .

[33]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[34]  Michele Vendruscolo,et al.  A stochastic method for the reconstruction of protein structures from one-dimensional structural profiles. , 2008, Gene.

[35]  Michele Vendruscolo,et al.  Stochastic reconstruction of protein structures from effective connectivity profiles , 2008, PMC biophysics.

[36]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[37]  P. Wolynes,et al.  Water in protein structure prediction. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Samuel L. DeLuca,et al.  Practically Useful: What the Rosetta Protein Modeling Suite Can Do for You , 2010, Biochemistry.

[39]  Pedro J Ballester,et al.  Prospective virtual screening with Ultrafast Shape Recognition: the identification of novel inhibitors of arylamine N-acetyltransferases , 2010, Journal of The Royal Society Interface.

[40]  Cecilia Clementi,et al.  Coarse-grained models of protein folding: toy models or predictive tools? , 2008, Current opinion in structural biology.

[41]  Michele Vendruscolo,et al.  Efficient identification of near‐native conformations in ab initio protein structure prediction using structural profiles , 2010, Proteins.

[42]  W. Graham Richards,et al.  Ultrafast shape recognition to search compound databases for similar molecular shapes , 2007, J. Comput. Chem..