Conformational Sampling in Template-Free Protein Loop Structure Modeling: An Overview

Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a “mini protein folding problem” under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.

[1]  C Sander,et al.  On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Chaok Seok,et al.  Protein loop modeling by using fragment assembly and analytical loop closure , 2010, Proteins.

[3]  Mark A. Olson,et al.  Comparison between self‐guided Langevin dynamics and molecular dynamics simulations for structure refinement of protein loop conformations , 2011, J. Comput. Chem..

[4]  Lisa Yan,et al.  LOOPER: a molecular mechanics-based algorithm for protein loop prediction. , 2008, Protein engineering, design & selection : PEDS.

[5]  Yaohang Li MOMCMC: An efficient Monte Carlo method for multi-objective sampling over real parameter space , 2012, Comput. Math. Appl..

[6]  Richard A Friesner,et al.  Prediction of Protein Loop Conformations using the AGBNP Implicit Solvent Model and Torsion Angle Sampling. , 2008, Journal of chemical theory and computation.

[7]  G R Marshall,et al.  Ab initio modeling of small, medium, and large loops in proteins. , 2001, Biopolymers.

[8]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[9]  Yaohang Li,et al.  Backbone statistical potential from local sequence-structure interactions in protein loops. , 2010, The journal of physical chemistry. B.

[10]  W F van Gunsteren,et al.  A structure refinement method based on molecular dynamics in four spatial dimensions. , 1993, Journal of molecular biology.

[11]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[12]  N. Go,et al.  Ring Closure and Local Conformational Deformations of Chain Molecules , 1970 .

[13]  E. Querol,et al.  Identification of function-associated loop motifs and application to protein function prediction , 2006, Bioinform..

[14]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[15]  William L. Jorgensen,et al.  OPLS all‐atom force field for carbohydrates , 1997 .

[16]  P. R. Sibbald,et al.  The P-loop--a common motif in ATP- and GTP-binding proteins. , 1990, Trends in biochemical sciences.

[17]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[18]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[19]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[20]  Gloria Fuentes,et al.  Prediction of protein loop geometries in solution , 2007, Proteins.

[21]  Yaohang Li,et al.  Improving predicted protein loop structure ranking using a Pareto-optimality consensus method , 2010, BMC Structural Biology.

[22]  Andrzej Kolinski,et al.  Modeling of loops in proteins: a multi-method approach , 2010, BMC Structural Biology.

[23]  Michael W. Deem,et al.  Efficient Monte Carlo methods for cyclic peptides , 1999 .

[24]  Wouter Boomsma,et al.  Full cyclic coordinate descent: solving the protein loop closure problem in Cα space , 2005, BMC Bioinformatics.

[25]  J. Garnier,et al.  Modeling of protein loops by simulated annealing , 1993, Protein science : a publication of the Protein Society.

[26]  Jerome Nilmeier,et al.  Assessing protein loop flexibility by hierarchical Monte Carlo sampling. , 2011, Journal of chemical theory and computation.

[27]  A. Lesk,et al.  Common features of the conformations of antigen‐binding loops in immunoglobulins and application to modeling loop conformations , 1992, Proteins.

[28]  S. Tosatto,et al.  Application of MM/PBSA colony free energy to loop decoy discrimination: Toward correlation between energy and root mean square deviation , 2005, Protein science : a publication of the Protein Society.

[29]  Stefano Piana,et al.  Refinement of protein structure homology models via long, all‐atom molecular dynamics simulations , 2012, Proteins.

[30]  Viktor Hornak,et al.  Generation of accurate protein loop conformations through low‐barrier molecular dynamics , 2003, Proteins.

[31]  Nicholas C Fitzkee,et al.  The Protein Coil Library: A structural database of nonhelix, nonstrand fragments derived from the PDB , 2005, Proteins.

[32]  Simona Golic Grdadolnik,et al.  Determination of conformational preferences of dipeptides using vibrational spectroscopy. , 2008, The journal of physical chemistry. B.

[33]  Oleg Y Dmitriev,et al.  The rigid connecting loop stabilizes hairpin folding of the two helices of the ATP synthase subunit c , 2007, Protein science : a publication of the Protein Society.

[34]  Osmar Norberto de Souza,et al.  An expert protein loop refinement protocol by molecular dynamics simulations with restraints , 2013, Expert Syst. Appl..

[35]  Leonidas J. Guibas,et al.  Inverse Kinematics in Biology: The Protein Loop Closure Problem , 2005, Int. J. Robotics Res..

[36]  J. Kwasigroch,et al.  A global taxonomy of loops in globular proteins. , 1996, Journal of molecular biology.

[37]  Barry Honig,et al.  Loop modeling: Sampling, filtering, and scoring , 2007, Proteins.

[38]  Chi Zhang,et al.  Protein Loop Modeling with Optimized Backbone Potential Functions. , 2012, Journal of chemical theory and computation.

[39]  Michael J. Cahill,et al.  On the kinematics of protein folding , 2002, J. Comput. Chem..

[40]  Gianni Cesareni,et al.  Molecular Recognition in Helix-Loop-Helix and Helix-Loop-Helix-Leucine Zipper Domains , 2003, The Journal of Biological Chemistry.

[41]  Pu Liu,et al.  A Self-Organizing Algorithm for Modeling Protein Loops , 2009, PLoS Comput. Biol..

[42]  Richard A Friesner,et al.  Progress in super long loop prediction , 2011, Proteins.

[43]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..

[44]  Anne-Claude Camproux,et al.  Mining protein loops using a structural alphabet and statistical exceptionality , 2010, BMC Bioinformatics.

[45]  Scott R. Presnell,et al.  Origins of structural diversity within sequentially identical hexapeptides , 1993, Protein science : a publication of the Protein Society.

[46]  Harold A. Scheraga,et al.  Exact analytical loop closure in proteins using polynomial equations , 1999, J. Comput. Chem..

[47]  R. Bruccoleri,et al.  Ab initio loop modeling and its application to homology modeling. , 2000, Methods in molecular biology.

[48]  Yaoqi Zhou,et al.  Specific interactions for ab initio folding of protein terminal regions with secondary structures , 2008, Proteins.

[49]  A. Levey,et al.  RGS2 Binds Directly and Selectively to the M1 Muscarinic Acetylcholine Receptor Third Intracellular Loop to Modulate Gq/11α Signaling* , 2004, Journal of Biological Chemistry.

[50]  Ron Unger The Genetic Algorithm Approach to Protein Structure Prediction , 2004 .

[51]  Fred E. Cohen,et al.  Conformational Sampling of Loop Structures Using Genetic Algorithms , 1994 .

[52]  D. Baker,et al.  Modeling structurally variable regions in homologous proteins with rosetta , 2004, Proteins.

[53]  David Baker,et al.  Voltage sensor conformations in the open and closed states in ROSETTA structural models of K(+) channels. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Kai Zhu,et al.  Toward better refinement of comparative models: Predicting loops in inexact environments , 2008, Proteins.

[55]  C. Deane,et al.  A novel exhaustive search algorithm for predicting the conformation of polypeptide segments in proteins , 2000, Proteins.

[56]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[57]  R A Friesner,et al.  Prediction of loop geometries using a generalized born model of solvation effects , 1999, Proteins.

[58]  L. Aravind,et al.  Identification of the prokaryotic ligand-gated ion channels and their implications for the mechanisms and origins of animal Cys-loop ion channels , 2004, Genome Biology.

[59]  D. I. Stuart,et al.  α-Lactalbumin possesses a novel calcium binding loop , 1986, Nature.

[60]  Brian Kuhlman,et al.  High-resolution design of a protein loop , 2007, Proceedings of the National Academy of Sciences.

[61]  Yaohang Li,et al.  Sampling Multiple Scoring Functions Can Improve Protein Loop Structure Prediction Accuracy , 2011, J. Chem. Inf. Model..

[62]  F. Cohen,et al.  Taxonomy and conformational analysis of loops in proteins. , 1992, Journal of molecular biology.

[63]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[64]  C M Deane,et al.  Improved protein loop prediction from sequence alone. , 2001, Protein engineering.

[65]  Charles L. Brooks,et al.  Prediction of protein loop conformations using multiscale modeling methods with physical energy scoring functions , 2008, J. Comput. Chem..

[66]  J. Greer,et al.  Model for haptoglobin heavy chain based upon structural homology. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[67]  E. Coutsias,et al.  Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling , 2009, Nature Methods.

[68]  C. Deane,et al.  CODA: A combined algorithm for predicting the structurally variable regions of protein models , 2001, Protein science : a publication of the Protein Society.

[69]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[70]  Thomas Madej,et al.  Structural similarity of loops in protein families: toward the understanding of protein evolution , 2005, BMC Evolutionary Biology.

[71]  Joseph A. Adams,et al.  Structural Basis for the Regulation of Protein Kinase A by Activation Loop Phosphorylation* , 2012, The Journal of Biological Chemistry.

[72]  M. Mezei,et al.  Prediction of protein loop structures using a local move Monte Carlo approach and a grid-based force field. , 2008, Protein engineering, design & selection : PEDS.

[73]  Markus A Lill,et al.  Predicting flexible loop regions that interact with ligands: The challenge of accurate scoring , 2012, Proteins.

[74]  B. Honig,et al.  A hierarchical approach to all‐atom protein loop prediction , 2004, Proteins.

[75]  C. Levinthal,et al.  Predicting antibody hypervariable loop conformation. I. Ensembles of random conformations for ringlike structures , 1987, Biopolymers.

[76]  R. Friesner,et al.  Long loop prediction using the protein local optimization program , 2006, Proteins.

[77]  M. DePristo,et al.  Ab initio construction of polypeptide fragments: Accuracy of loop decoy discrimination by an all‐atom statistical potential and the AMBER force field with the Generalized Born solvation model , 2003, Proteins.

[78]  Cinque S. Soto,et al.  Evaluating conformational free energies: The colony energy and its application to the problem of loop prediction , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[79]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[80]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[81]  M. Lisa Phipps,et al.  Antibody binding loop insertions as diversity elements , 2006, Nucleic acids research.

[82]  Yuh Nung Jan,et al.  The S4–S5 loop contributes to the ion-selective pore of potassium channels , 1993, Neuron.

[83]  Mihaly Mezei,et al.  Prediction of protein loop structures using a local move Monte Carlo approach and a grid-based force field , 2009 .

[84]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[85]  Ronald M. Levy,et al.  AGBNP: An analytic implicit solvent model suitable for molecular dynamics simulations and high‐resolution modeling , 2004, J. Comput. Chem..

[86]  Luhua Lai,et al.  A fast and efficient program for modeling protein loops , 1997 .

[87]  A C Martin,et al.  Modeling antibody hypervariable loops: a combined algorithm. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[88]  M. DePristo,et al.  Ab initio construction of polypeptide fragments: Efficient generation of accurate, representative ensembles , 2003, Proteins.