Analysis of loop boundaries using different local structure assignment methods

Loops connect regular secondary structures. In many instances, they are known to play important biological roles. Analysis and prediction of loop conformations depend directly on the definition of repetitive structures. Nonetheless, the secondary structure assignment methods (SSAMs) often lead to divergent assignments. In this study, we analyzed, both structure and sequence point of views, how the divergence between different SSAMs affect boundary definitions of loops connecting regular secondary structures. The analysis of SSAMs underlines that no clear consensus between the different SSAMs can be easily found. Because these latter greatly influence the loop boundary definitions, important variations are indeed observed, that is, capping positions are shifted between different SSAMs. On the other hand, our results show that the sequence information in these capping regions are more stable than expected, and, classical and equivalent sequence patterns were found for most of the SSAMs. This is, to our knowledge, the most exhaustive survey in this field as (i) various databank have been used leading to similar results without implication of protein redundancy and (ii) the first time various SSAMs have been used. This work hence gives new insights into the difficult question of assignment of repetitive structures and addresses the issue of loop boundaries definition. Although SSAMs give very different local structure assignments capping sequence patterns remain efficiently stable.

[1]  Catherine Etchebest,et al.  A new prediction strategy for long local protein structures using an original description , 2009, Proteins.

[2]  A. Bornot,et al.  Analysis of protein contacts into Protein Units. , 2009, Biochimie.

[3]  Cristina Benros,et al.  Analyzing the sequence-structure relationship of a library of local structural prototypes. , 2009, Journal of theoretical biology.

[4]  Changiz Eslahchi,et al.  PROSIGN: A method for protein secondary structure assignment based on three-dimensional coordinates of consecutive Calpha atoms , 2008, Comput. Biol. Chem..

[5]  Oxana V. Galzitskaya,et al.  Prediction of Loop Regions in protein Sequence , 2008, J. Bioinform. Comput. Biol..

[6]  Charles L. Brooks,et al.  Prediction of protein loop conformations using multiscale modeling methods with physical energy scoring functions , 2008, J. Comput. Chem..

[7]  A Keith Dunker,et al.  Assessing secondary structure assignment of protein structures by using pairwise sequence‐alignment benchmarks , 2008, Proteins.

[8]  Chung F Wong,et al.  Flexible protein–flexible ligand docking with disrupted velocity simulated annealing , 2008, Proteins.

[9]  Matthew P Jacobson,et al.  Conformational selection in silico: Loop latching motions and ligand binding in enzymes , 2008, Proteins.

[10]  Barry Honig,et al.  Loop modeling: Sampling, filtering, and scoring , 2007, Proteins.

[11]  A. Efimov Structural trees for proteins containing phi-motifs. , 2008, Biochemistry. Biokhimiia.

[12]  A. Efimov Structural trees for proteins containing φ-motifs , 2008, Biochemistry (Moscow).

[13]  Markus Wagener,et al.  A flexible approach to induced fit docking. , 2007, Journal of medicinal chemistry.

[14]  Haiyan Jiang,et al.  Insertions and the emergence of novel protein structure: a structure-based phylogenetic study of insertions , 2007, BMC Bioinformatics.

[15]  Brian Kuhlman,et al.  High-resolution design of a protein loop , 2007, Proceedings of the National Academy of Sciences.

[16]  N. Gautham,et al.  Exploring conformational space using a mean field technique with MOLS sampling , 2007, Journal of Biosciences.

[17]  Gloria Fuentes,et al.  Prediction of protein loop geometries in solution , 2007, Proteins.

[18]  Lauren L. Perskie,et al.  Physical‐chemical determinants of turn conformations in globular proteins , 2007, Protein science : a publication of the Protein Society.

[19]  M. Tyagi,et al.  Local Protein Structures , 2007 .

[20]  A. G. Brevern,et al.  A reduced amino acid alphabet for understanding and designing protein adaptation to mutation , 2007, European Biophysics Journal.

[21]  N. Gautham,et al.  Exploring the conformational space of protein loops using a mean field technique with MOLS sampling , 2007, Proteins.

[22]  A. G. Brevern,et al.  “Pinning strategy”: a novel approach for predicting the backbone structure in terms of protein blocks from sequence , 2007, Journal of Biosciences.

[23]  Benjamin A. Shoemaker,et al.  Long-term trends in evolution of indels in protein sequences , 2007, BMC Evolutionary Biology.

[24]  Protein Anatomy,et al.  The Anatomy and Taxonomy of Protein Structure , 2007 .

[25]  Thomas Madej,et al.  Protein homologous cores and loops: important clues to evolutionary relationships between structurally similar proteins , 2007, BMC Structural Biology.

[26]  Andreas Vogel,et al.  Iterative saturation mutagenesis on the basis of B factors as a strategy for increasing protein thermostability. , 2006, Angewandte Chemie.

[27]  R. Friesner,et al.  Long loop prediction using the protein local optimization program , 2006, Proteins.

[28]  E. Querol,et al.  Identification of function-associated loop motifs and application to protein function prediction , 2006, Bioinform..

[29]  András Fiser,et al.  Saturating representation of loop conformational fragments in structure databanks , 2006, BMC Structural Biology.

[30]  Jun Zhai,et al.  ArchPRED: a template based loop structure prediction server , 2006, Nucleic Acids Res..

[31]  Narayanaswamy Srinivasan,et al.  Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet , 2006, Nucleic Acids Res..

[32]  Aurélie Bornot,et al.  Protein beta-turn assignments , 2006, Bioinformation.

[33]  Baldomero Oliva,et al.  A supersecondary structure library and search algorithm for modeling loops in protein structures , 2006, Nucleic acids research.

[34]  G. Rose Lifting the lid on helix-capping , 2006, Nature chemical biology.

[35]  Duhee Bang,et al.  Dissecting the energetics of protein α-helix C-cap termination through chemical protein synthesis , 2006, Nature chemical biology.

[36]  Cristina Benros,et al.  Assessing a novel approach for predicting local 3D protein structures from sequence , 2005, Proteins.

[37]  C A Floudas,et al.  Protein loop structure prediction with flexible stem geometries , 2005, Proteins.

[38]  Fabien Cailliez,et al.  Secondary structure assignment that accurately reflects physical and evolutionary characteristics , 2005, BMC Bioinformatics.

[39]  F. Major,et al.  A new catalog of protein β‐sheets , 2005 .

[40]  Thomas Madej,et al.  Evolutionary plasticity of protein families: Coupling between sequence and structure variation , 2005, Proteins.

[41]  E. Kruus,et al.  Gibbs sampling and helix-cap motifs , 2005, Nucleic acids research.

[42]  J. Gibrat,et al.  Protein secondary structure assignment revisited: a detailed analysis of different assignment methods , 2005, BMC Structural Biology.

[43]  Iosif I Vaisman,et al.  New method for protein secondary structure assignment based on a simple topological descriptor , 2005, Proteins.

[44]  Nick V. Grishin,et al.  PALSSE: A program to delineate linear secondary structural elements from protein structures , 2005, BMC Bioinformatics.

[45]  A. G. Brevern,et al.  A structural model of a seven-transmembrane helix receptor: the Duffy antigen/receptor for chemokine (DARC). , 2005, Biochimica et biophysica acta.

[46]  Wouter Boomsma,et al.  Full cyclic coordinate descent: solving the protein loop closure problem in Cα space , 2005, BMC Bioinformatics.

[47]  Guoli Wang,et al.  PISCES: recent improvements to a PDB sequence culling server , 2005, Nucleic Acids Res..

[48]  A. Alix,et al.  High accuracy prediction of β‐turns and their types using propensities and multiple alignments , 2005 .

[49]  Jean-François Sadoc,et al.  Voro3D: 3D Voronoi tessellations applied to protein structures , 2005, Bioinform..

[50]  Nicholas C Fitzkee,et al.  The Protein Coil Library: A structural database of nonhelix, nonstrand fragments derived from the PDB , 2005, Proteins.

[51]  David A. Fenstermacher,et al.  Introduction to bioinformatics , 2005, J. Assoc. Inf. Sci. Technol..

[52]  G. Rose,et al.  Are proteins made from a limited parts list? , 2005, Trends in biochemical sciences.

[53]  François Major,et al.  A new catalog of protein beta-sheets. , 2005, Proteins.

[54]  Alexandre G. de Brevern,et al.  New assessment of a structural alphabet , 2005, Silico Biol..

[55]  Thomas Madej,et al.  Structural similarity of loops in protein families: toward the understanding of protein evolution , 2005, BMC Evolutionary Biology.

[56]  Thomas Madej,et al.  Analysis of protein homology by assessing the (dis)similarity in protein loop regions , 2004, Proteins.

[57]  Baldomero Oliva,et al.  Classification of common functional loops of kinase super‐families , 2004, Proteins.

[58]  Jean-François Sadoc,et al.  Protein secondary structure assignment through Voronoï tessellation , 2004, Proteins.

[59]  D. Baker,et al.  Modeling structurally variable regions in homologous proteins with rosetta , 2004, Proteins.

[60]  Alexandre G. de Brevern,et al.  Use of a structural alphabet for analysis of short loops connecting repetitive structures , 2004, BMC Bioinformatics.

[61]  Yutaka Kuroda,et al.  Characterization and prediction of linker sequences of multi-domain proteins by a neural network , 2004, Journal of Structural and Functional Genomics.

[62]  Serge A. Hazout,et al.  Local backbone structure prediction of proteins , 2004, Silico Biol..

[63]  David R. Gilbert,et al.  TOPS: an enhanced database of protein structural topology , 2004, Nucleic Acids Res..

[64]  A. Goede,et al.  Loops In Proteins (LIP)--a comprehensive loop database for homology modelling. , 2003, Protein engineering.

[65]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[66]  N. Srinivasan,et al.  Bmc Structural Biology Structural Basis of Regulation and Substrate Specificity of Protein Kinase Ck2 Deduced from the Modeling of Protein-protein Interactions Casein Kinase 2molecular Modelling Phosphorylationprotein-protein Interactionsprotein Kinasesregulation of Activitysubstrate Specificity , 2022 .

[67]  A. G. Brevern,et al.  'Hybrid Protein Model' for optimally defining 3D protein structure fragments , 2003, Bioinform..

[68]  H. Valadié,et al.  Extension of a local backbone description using a structural alphabet: A new approach to the sequence‐structure relationship , 2002, Protein science : a publication of the Protein Society.

[69]  Yael Mandel-Gutfreund,et al.  On the significance of alternating patterns of polar and non-polar residues in beta-strands. , 2002, Journal of molecular biology.

[70]  D. Schomburg,et al.  Positioning of anchor groups in protein loop prediction: The importance of solvent accessibility and secondary structure elements , 2002, Proteins.

[71]  S. Al-Karadaghi,et al.  Occurrence, conformational features and amino acid propensities for the pi-helix. , 2002, Protein engineering.

[72]  C. A. Andersen,et al.  Continuum secondary structure captures protein flexibility. , 2002, Structure.

[73]  Yael Mandel-Gutfreund,et al.  Contributions of residue pairing to β-sheet formation:conservation and covariation of amino acid residue pairs on antiparallel β-strands 1 1 Edited by J. Thornton , 2001 .

[74]  S. G. Pandalai,et al.  Recent Research Developments in Protein Engineering , 2001 .

[75]  Pierre Tufféry,et al.  Protein structural alphabets: beyond the secondary structure description , 2001 .

[76]  Y. Mandel-Gutfreund,et al.  Contributions of residue pairing to beta-sheet formation: conservation and covariation of amino acid residue pairs on antiparallel beta-strands. , 2001, Journal of molecular biology.

[77]  C. Etchebest,et al.  Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks , 2000, Proteins.

[78]  M. Bansal,et al.  HELANAL: A Program to Characterize Helix Geometry in Proteins , 2000, Journal of biomolecular structure & dynamics.

[79]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[80]  R. Srinivasan,et al.  A physical basis for protein secondary structure. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[81]  U. Lessel,et al.  Importance of anchor group positioning in protein loop prediction , 1999, Proteins.

[82]  J. Wójcik,et al.  New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification. , 1999, Journal of molecular biology.

[83]  S M King,et al.  Assigning secondary structure from protein coordinate data , 1999, Proteins.

[84]  G J Barton,et al.  Evaluation and improvement of multiple sequence methods for protein secondary structure prediction , 1999, Proteins.

[85]  J. Christman,et al.  Mechanism of inhibition of DNA (cytosine C5)-methyltransferases by oligodeoxyribonucleotides containing 5,6-dihydro-5-azacytosine. , 1999, Journal of molecular biology.

[86]  Manju Bansal,et al.  Geometrical and Sequence Characteristics of α-Helices in Globular Proteins , 1998 .

[87]  S. Wodak,et al.  Typical interaction patterns in alphabeta and betaalpha turn motifs. , 1998, Protein engineering.

[88]  Marianne Rooman,et al.  Structural classification of αββ and ββα supersecondary structure units in proteins , 1998 .

[89]  R. Aurora,et al.  Helix capping , 1998, Protein science : a publication of the Protein Society.

[90]  N. Boutonnet,et al.  Structural classification of alphabetabeta and betabetaalpha supersecondary structure units in proteins. , 1998, Proteins.

[91]  S. Kumar,et al.  Geometrical and sequence characteristics of alpha-helices in globular proteins. , 1998, Biophysical journal.

[92]  Joël Pothier,et al.  P-SEA: a new efficient assignment of secondary structure from C alpha trace of proteins , 1997, Comput. Appl. Biosci..

[93]  T. Blundell,et al.  Predicting the conformational class of short and medium size loops connecting regular secondary structures: application to comparative modelling. , 1997, Journal of molecular biology.

[94]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[95]  T. Blundell,et al.  Structural aspects of the functional modules in human protein kinase‐Cα deduced from comparative analyses , 1996, Proteins.

[96]  S. Kumar,et al.  Structural and sequence characteristics of long alpha helices in globular proteins. , 1996, Biophysical journal.

[97]  J. Thornton,et al.  PROMOTIF—A program to identify and analyze structural motifs in proteins , 1996, Protein science : a publication of the Protein Society.

[98]  S. Wodak,et al.  Automatic classification and analysis of alpha alpha-turn motifs in proteins. , 1996, Journal of molecular biology.

[99]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[100]  J. Thornton,et al.  A revised set of potentials for β‐turn formation in proteins , 1994 .

[101]  J. Thornton,et al.  A revised set of potentials for beta-turn formation in proteins. , 1994, Protein science : a publication of the Protein Society.

[102]  N. Colloc'h,et al.  Comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment. , 1993, Protein engineering.

[103]  E G Hutchinson,et al.  The Greek key motif: extraction, classification and analysis. , 1993, Protein engineering.

[104]  B. Henrissat,et al.  Detection of secondary structure elements in proteins by hydrophobic cluster analysis. , 1992, Protein engineering.

[105]  F. Cohen,et al.  Taxonomy and conformational analysis of loops in proteins. , 1992, Journal of molecular biology.

[106]  A. Efimov,et al.  Structure of coiled β‐β‐hairpins and β‐β‐corners , 1991 .

[107]  A. V. Efimov,et al.  Structure of α-α-hairpins with short connections , 1991 .

[108]  A. Efimov,et al.  Structure of alpha-alpha-hairpins with short connections. , 1991, Protein engineering.

[109]  A. Efimov Structure of coiled beta-beta-hairpins and beta-beta-corners. , 1991, FEBS letters.

[110]  E G Hutchinson,et al.  HERA—A program to draw schematic diagrams of protein secondary structures , 1990, Proteins.

[111]  R. Lavery,et al.  Describing protein structure: A general algorithm yielding complete helicoidal parameters and a unique overall axis , 1989, Proteins.

[112]  G. Rose,et al.  Helix signals in proteins. , 1988, Science.

[113]  B. L. Sibanda,et al.  Analysis, design and modification of loop regions in proteins , 1988, BioEssays : news and reviews in molecular, cellular and developmental biology.

[114]  F. Richards,et al.  Identification of structural motifs from protein coordinate data: Secondary structure and first‐level supersecondary structure * , 1988, Proteins.

[115]  Janet M. Thornton,et al.  Structural and sequence patterns in the loops of βαβ units , 1987 .

[116]  J M Thornton,et al.  Structural and sequence patterns in the loops of beta alpha beta units. , 1987, Protein engineering.

[117]  Barry Robson,et al.  Introduction to proteins and protein engineering , 1986 .

[118]  G. Rose,et al.  Turns in peptides and proteins. , 1985, Advances in protein chemistry.

[119]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[120]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[121]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[122]  M. Levitt,et al.  Automatic identification of secondary structure in globular proteins. , 1977, Journal of molecular biology.

[123]  G. Rose,et al.  A new algorithm for finding the peptide chain turns in a globular protein. , 1977, Journal of molecular biology.

[124]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[125]  L. Pauling,et al.  The pleated sheet, a new layer configuration of polypeptide chains. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[126]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.