Evolutionary-inspired probabilistic search for enhancing sampling of local minima in the protein energy surface

BackgroundDespite computational challenges, elucidating conformations that a protein system assumes under physiologic conditions for the purpose of biological activity is a central problem in computational structural biology. While these conformations are associated with low energies in the energy surface that underlies the protein conformational space, few existing conformational search algorithms focus on explicitly sampling low-energy local minima in the protein energy surface.MethodsThis work proposes a novel probabilistic search framework, PLOW, that explicitly samples low-energy local minima in the protein energy surface. The framework combines algorithmic ingredients from evolutionary computation and computational structural biology to effectively explore the subspace of local minima. A greedy local search maps a conformation sampled in conformational space to a nearby local minimum. A perturbation move jumps out of a local minimum to obtain a new starting conformation for the greedy local search. The process repeats in an iterative fashion, resulting in a trajectory-based exploration of the subspace of local minima.Results and conclusionsThe analysis of PLOW's performance shows that, by navigating only the subspace of local minima, PLOW is able to sample conformations near a protein's native structure, either more effectively or as well as state-of-the-art methods that focus on reproducing the native structure for a protein system. Analysis of the actual subspace of local minima shows that PLOW samples this subspace more effectively that a naive sampling approach. Additional theoretical analysis reveals that the perturbation function employed by PLOW is key to its ability to sample a diverse set of low-energy conformations. This analysis also suggests directions for further research and novel applications for the proposed framework.

[1]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[2]  D. Baker,et al.  Coupled prediction of protein secondary and tertiary structure , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Shuangye Yin,et al.  Eris: an automated estimator of protein stability , 2007, Nature Methods.

[4]  Eugene Santos,et al.  Local minima-based exploration for off-lattice protein folding , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[5]  J. Onuchic,et al.  Theory of protein folding: the energy landscape perspective. , 1997, Annual review of physical chemistry.

[6]  S. Goedecker,et al.  A minima hopping study of all-atom protein folding and structure prediction. , 2009, The journal of physical chemistry. B.

[7]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[8]  Dusan P Djurdjevic,et al.  Ab initio protein fold prediction using evolutionary algorithms: Influence of design and control parameters on performance , 2006, J. Comput. Chem..

[9]  Gaetano T. Montelione,et al.  3.11 News & Views 031 CDS , 2005 .

[10]  J. Doye,et al.  Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms , 1997, cond-mat/9803344.

[11]  D Baker,et al.  Global properties of the mapping between local amino acid sequence and local structure in proteins. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Andrea Tettamanzi,et al.  A Memetic Algorithm for Protein Structure Prediction in a 3D-Lattice HP Model , 2004, EvoWorkshops.

[13]  P. Wolynes,et al.  Restriction versus guidance in protein structure prediction , 2009, Proceedings of the National Academy of Sciences.

[14]  L. Kavraki,et al.  On the characterization of protein native state ensembles. , 2007, Biophysical journal.

[15]  A. Schug,et al.  Basin hopping simulations for all-atom protein folding. , 2006, The Journal of chemical physics.

[16]  M. Karplus,et al.  Molecular dynamics and protein function. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  L. Kay,et al.  NMR studies of protein structure and dynamics. , 2005, Journal of magnetic resonance.

[18]  Peter G Wolynes,et al.  Protein structure prediction: do hydrogen bonding and water-mediated interactions suffice? , 2010, Methods.

[19]  Cecilia Clementi,et al.  Unfolding the fold of cyclic cysteine‐rich peptides , 2008, Protein science : a publication of the Protein Society.

[20]  Peter G Wolynes,et al.  Localizing frustration in native proteins and protein assemblies , 2007, Proceedings of the National Academy of Sciences.

[21]  James E. Fitzgerald,et al.  Mimicking the folding pathway to improve homology-free protein structure prediction , 2009, Proceedings of the National Academy of Sciences.

[22]  Tanja Kortemme,et al.  Computational design of protein-protein interactions. , 2004, Current opinion in chemical biology.

[23]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[24]  Peter G Wolynes,et al.  Consequences of localized frustration for the folding mechanism of the IM7 protein , 2007, Proceedings of the National Academy of Sciences.

[25]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[26]  Peter G Wolynes,et al.  Protein structure prediction using basin-hopping. , 2008, The Journal of chemical physics.

[27]  Cecilia Clementi,et al.  Coarse-grained models of protein folding: toy models or predictive tools? , 2008, Current opinion in structural biology.

[28]  Christian Blum,et al.  Proceedings of the 10th European conference on Evolutionary Computation in Combinatorial Optimization , 2007 .

[29]  K. Misura,et al.  PROTEINS: Structure, Function, and Bioinformatics 59:15–29 (2005) Progress and Challenges in High-Resolution Refinement of Protein Structure Models , 2022 .

[30]  Jin-Kao Hao,et al.  A Critical Element-Guided Perturbation Strategy for Iterated Local Search , 2009, EvoCOP.

[31]  P G Wolynes,et al.  Protein folding mechanisms and the multidimensional folding funnel , 1998, Proteins.

[32]  Amarda Shehu,et al.  Enhancing Sampling of the Conformational Space Near the Protein Native State , 2010, BIONETICS.

[33]  P. Wolynes,et al.  Water in protein structure prediction. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[34]  L. Kavraki,et al.  Modeling protein conformational ensembles: From missing loops to equilibrium fluctuations , 2006, Proteins.

[35]  Richard Bonneau,et al.  De novo prediction of three-dimensional structures for major protein families. , 2002, Journal of molecular biology.

[36]  Lydia E. Kavraki,et al.  Sampling Conformation Space to Model Equilibrium Fluctuations in Proteins , 2007, Algorithmica.

[37]  Oliver Brock,et al.  Guiding conformation space search with an all‐atom energy potential , 2008, Proteins.

[38]  Madhu Chetty,et al.  Novel Memetic Algorithm for Protein Structure Prediction , 2009, Australasian Conference on Artificial Intelligence.

[39]  Peter E Wright,et al.  Structure, dynamics, and catalytic function of dihydrofolate reductase. , 2004, Annual review of biophysics and biomolecular structure.

[40]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[41]  L. Kavraki,et al.  Multiscale characterization of protein conformational ensembles , 2009, Proteins.

[42]  Amarda Shehu,et al.  An Ab-initio tree-based exploration to enhance sampling of low-energy protein conformations , 2009, Robotics: Science and Systems.

[43]  Gustavo Camps-Valls,et al.  Bioinformatics and Computational Biology , 2015, Encyclopedia of Data Warehousing and Mining.

[44]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[45]  Amarda Shehu,et al.  In Search of the protein Native State with a Probabilistic Sampling Approach , 2011, J. Bioinform. Comput. Biol..

[46]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[47]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[48]  M. Sternberg,et al.  The relationship between the flexibility of proteins and their conformational states on forming protein-protein complexes with an application to protein-protein docking. , 2005, Journal of molecular biology.