Protein secondary structure assignment through Voronoï tessellation

We present a new automatic algorithm, named VoTAP (Voronoı̈ Tessellation Assignment Procedure), which assigns secondary structures of a polypeptide chain using the list of -carbon coordinates. This program uses three-dimensional Voronoı̈ tessellation. This geometrical tool associates with each amino acid a Voronoı̈ polyhedron, the faces of which unambiguously define contacts between residues. Thanks to the face area, for the contacts close together along the primary structure (low-order contacts) a distinction is made between strong and normal ones. This new definition yields new contact matrices, which are analyzed and used to assign secondary structures. This assignment is performed in two stages. The first one uses contacts between residues close together along the primary structure and is based on data collected on a bank of 282 well-refined nonredundant structures. In this bank, associations were made between the prints defined by these low-order contacts and the assignments performed by different automatic methods. The second step focuses on the strand assignment and uses contacts between distant residues. Comparison with several other automatic assignment methods are presented, and the influence of resolution on the assignment is investigated. Proteins 2004;55:519–528. © 2004 Wiley-Liss, Inc.

[1]  B. Matthews,et al.  Accurate calculation of the density of proteins. , 2000, Acta crystallographica. Section D, Biological crystallography.

[2]  A Tropsha,et al.  A new approach to protein fold recognition based on Delaunay tessellation of protein structure. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[3]  D. Phillips The development of crystallographic enzymology. , 1970, Biochemical Society symposium.

[4]  F. Cohen,et al.  Beta-breakers: an aperiodic secondary structure. , 1991, Journal of molecular biology.

[5]  Joël Pothier,et al.  P-SEA: a new efficient assignment of secondary structure from C alpha trace of proteins , 1997, Comput. Appl. Biosci..

[6]  K Namba,et al.  Structure of 3-isopropylmalate dehydrogenase in complex with 3-isopropylmalate at 2.0 A resolution: the role of Glu88 in the unique substrate-recognition mechanism. , 1998, Structure.

[7]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[8]  M. Levitt,et al.  Automatic identification of secondary structure in globular proteins. , 1977, Journal of molecular biology.

[9]  H. Scheraga,et al.  Folding of polypeptide chains in proteins: a proposed mechanism for folding. , 1971, Proceedings of the National Academy of Sciences of the United States of America.

[10]  N. Colloc'h,et al.  Comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment. , 1993, Protein engineering.

[11]  P. Y. Chou,et al.  Beta-turns in proteins. , 1977, Journal of molecular biology.

[12]  I D Kuntz Protein folding. , 1972, Journal of the American Chemical Society.

[13]  K. Soman,et al.  Identification of secondary structures in globular proteins--a new algorithm. , 2009, International journal of peptide and protein research.

[14]  J. L. Crawford,et al.  The reverse turn as a polypeptide conformation in globular proteins. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[15]  J L Finney,et al.  Volume occupation, environment and accessibility in proteins. The problem of the protein surface. , 1975, Journal of molecular biology.

[16]  Scott R. Presnell,et al.  A segment-based approach to protein secondary structure prediction. , 1991, Biochemistry.

[17]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[18]  G. Labesse,et al.  Deciphering protein sequence information through hydrophobic cluster analysis (HCA): current status and perspectives , 1997, Cellular and Molecular Life Sciences CMLS.

[19]  Adel Said Elmaghraby,et al.  Is it better to combine predictions? , 2000, Protein engineering.

[20]  Reversals of polypeptide chain in globular proteins. , 2009, International journal of peptide and protein research.

[21]  J Chomilier,et al.  Voronoï tessellation reveals the condensed matter character of folded proteins. , 2000, Physical review letters.

[22]  G R Marshall,et al.  Ab initio modeling of small, medium, and large loops in proteins. , 2001, Biopolymers.

[23]  H. Wako,et al.  Novel method to detect a motif of local structures in different protein conformations. , 1998, Protein engineering.

[24]  F. S. Mathews,et al.  Refinement and structural analysis of bovine cytochrome b5 at 1.5 A resolution. , 1996, Acta crystallographica. Section D, Biological crystallography.

[25]  R. Lavery,et al.  Describing protein structure: A general algorithm yielding complete helicoidal parameters and a unique overall axis , 1989, Proteins.

[26]  K Nishikawa,et al.  Comparison of homologous tertiary structures of proteins. , 1974, Journal of theoretical biology.

[27]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[28]  Ralf Zimmer,et al.  New scoring Schemes for Protein fold recognition based on Voronoi contacts , 1997, German Conference on Bioinformatics.

[29]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[30]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[31]  G. Powis,et al.  Crystal structures of reduced, oxidized, and mutated human thioredoxins: evidence for a regulatory homodimer. , 1996, Structure.

[32]  L. Pauling,et al.  Configurations of Polypeptide Chains With Favored Orientations Around Single Bonds: Two New Pleated Sheets. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[33]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[34]  Iosif I. Vaisman,et al.  Delaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues , 1996, J. Comput. Biol..

[35]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[36]  J Selbig Contact pattern-induced pair potentials for protein fold recognition. , 1995, Protein engineering.

[37]  M Gerstein,et al.  Packing at the protein-water interface. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[38]  F. Richards,et al.  Identification of structural motifs from protein coordinate data: Secondary structure and first‐level supersecondary structure * , 1988, Proteins.

[39]  B. Rupp,et al.  Structure of bovine pancreatic trypsin inhibitor at 125 K definition of carboxyl-terminal residues Gly57 and Ala58. , 1996, Acta crystallographica. Section D, Biological crystallography.

[40]  M. Go Correlation of DNA exonic regions with protein structural units in haemoglobin , 1981, Nature.

[41]  Mark Gerstein,et al.  Determining the minimum number of types necessary to represent the sizes of protein atoms , 2001, Bioinform..

[42]  Jacques Chomilier,et al.  Nonatomic solvent‐driven voronoi tessellation of proteins: An open tool to analyze protein folds , 2002, Proteins.

[43]  G. Vriend,et al.  Prediction of protein residue contacts with a PDB-derived likelihood matrix. , 2002, Protein engineering.

[44]  P. Munson,et al.  Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and their application in sequence‐structure alignment , 1997, Protein science : a publication of the Protein Society.

[45]  B. Stec,et al.  Refinement of purothionins reveals solute particles important for lattice formation and toxicity. Part 2: structure of beta-purothionin at 1.7 A resolution. , 1995, Acta crystallographica. Section D, Biological crystallography.

[46]  G. Rose,et al.  A new algorithm for finding the peptide chain turns in a globular protein. , 1977, Journal of molecular biology.

[47]  J. Thornton,et al.  PROMOTIF—A program to identify and analyze structural motifs in proteins , 1996, Protein science : a publication of the Protein Society.

[48]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[49]  G. Chirikjian,et al.  Efficient generation of feasible pathways for protein conformational transitions. , 2002, Biophysical journal.