Simplicial neighborhood analysis of protein packing (SNAPP): a computational geometry approach to studying proteins.

Publisher Summary This chapter describes the application of computational geometry methodology to protein-structure analysis and comparison. The Simplicial Neighborhood Analysis of Protein Packing (SNAPP) method is applied to the problem of automatically identifying recurrent substructures in a large database of diverse protein structures. The chapter presents a novel approach to mapping protein cores with application to fold recognition via structural templates. It also describes the use of the SNAPP methodology for recognizing functional patterns characteristic of three unique protein families. SNAPP employs Delaunay tessellation to identify recurrent tertiary packing motifs that may be characteristic of protein structural and functional families. This method includes automatic identification of elementary tertiary packing motifs recurring in a large database of protein structures, automatic identification of global patterns of protein structure organization by recognizing the protein's hydrophobic core, and identification of functional signature motifs in a family of proteins that can assist structure-based annotation.

[1]  A. Fersht,et al.  Transition-state structure as a unifying basis in protein-folding mechanisms: contact order, chain topology, stability, and the extended nucleus mechanism. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[2]  T Schlick,et al.  Lattice protein folding with two and four‐body statistical potentials , 2001, Proteins.

[3]  D. F. Watson Computing the n-Dimensional Delaunay Tesselation with Application to Voronoi Polytopes , 1981, Comput. J..

[4]  J. Skolnick,et al.  Monte Carlo studies of the thermodynamics and kinetics of reduced protein models: Application to small helical, β, and α/β proteins , 1998 .

[5]  J L Finney,et al.  Volume occupation, environment and accessibility in proteins. The problem of the protein surface. , 1975, Journal of molecular biology.

[6]  J. L. Finney,et al.  Random packings and the structure of simple liquids. I. The geometry of random close packing , 1970, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences.

[7]  C. Chothia,et al.  Helix to helix packing in proteins. , 1981, Journal of molecular biology.

[8]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[9]  Raghavan Varadarajan,et al.  A Procedure for Detection and Quantitation of Cavity Volumes in Proteins , 2002, The Journal of Biological Chemistry.

[10]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[11]  Amos Bairoch,et al.  The PROSITE database, its status in 1999 , 1999, Nucleic Acids Res..

[12]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[13]  Brendan J. McConkey,et al.  Quantification of protein surfaces, volumes and atom-atom contacts using a constrained Voronoi procedure , 2002, Bioinform..

[14]  David C. Jones,et al.  Potential energy functions for threading. , 1996, Current opinion in structural biology.

[15]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[16]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[17]  D. Eisenberg,et al.  Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure. , 1983, Journal of molecular biology.

[18]  Nikolai N. Medvedev,et al.  Can Various Classes of Atomic Configurations (Delaunay Simplices) be Distinguished in Random Dense Packings of Spherical Particles , 1989 .

[19]  Nikolai N. Medvedev,et al.  Geometrical analysis of the structure of simple liquids : percolation approach , 1991 .

[20]  Franz Aurenhammer,et al.  Voronoi diagrams—a survey of a fundamental geometric data structure , 1991, CSUR.

[21]  G. Crippen Prediction of protein folding from amino acid sequence over discrete conformation spaces. , 1991, Biochemistry.

[22]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[23]  H Edelsbrunner,et al.  Analytical shape computation of macromolecules: II. Inaccessible cavities in proteins , 1998, Proteins.

[24]  Cyrus Chothia,et al.  Packing of α-Helices onto β-Pleated sheets and the anatomy of αβ proteins☆ , 1980 .

[25]  N. Go,et al.  A method to search for similar protein local structures at ligand-binding sites and its application to adenine recognition , 1997, European Biophysics Journal.

[26]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[27]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[28]  Shmuel Pietrokovski,et al.  Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations , 1999, Bioinform..

[29]  H. Edelsbrunner,et al.  Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design , 1998, Protein science : a publication of the Protein Society.

[30]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[31]  Fabrice Leclerc,et al.  Effective atom volumes for implicit solvent models: comparison between Voronoi volumes and minimum fluctuation volumes , 2001, J. Comput. Chem..

[32]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[33]  C. Sander,et al.  Verification of protein structures : Side-chain planarity , 1996 .

[34]  Dirk Bakowies,et al.  Water in protein cavities: A procedure to identify internal water and exchange pathways and application to fatty acid‐binding protein , 2002, Proteins.

[35]  M Gerstein,et al.  Volume changes on protein folding. , 1994, Structure.

[36]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[37]  Olivier Poch,et al.  Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs , 1990, Nature.

[38]  Richard Wolfenden,et al.  Comparing the polarities of the amino acids: side-chain distribution coefficients between the vapor phase, cyclohexane, 1-octanol, and neutral aqueous solution , 1988 .

[39]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[40]  J. D. Bernal,et al.  A Geometrical Approach to the Structure Of Liquids , 1959, Nature.

[41]  A. Tropsha,et al.  Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations. , 2001, Journal of molecular biology.

[42]  Jacques Chomilier,et al.  Nonatomic solvent‐driven voronoi tessellation of proteins: An open tool to analyze protein folds , 2002, Proteins.

[43]  A. Godzik,et al.  Sequence-structure matching in globular proteins: application to supersecondary and tertiary structure determination. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[44]  H. Scheraga,et al.  Model of protein folding: inclusion of short-, medium-, and long-range interactions. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[45]  G J Pielak,et al.  Patterned library analysis: a method for the quantitative assessment of hypotheses concerning the determinants of protein structure. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Iosif I. Vaisman,et al.  Mobility of stretched water , 1993 .

[47]  Mark Gerstein,et al.  Calculations of protein volumes: sensitivity analysis and parameter database , 2002, Bioinform..

[48]  J. Thornton,et al.  Determinants of strand register in antiparallel β‐sheets of proteins , 1998, Protein science : a publication of the Protein Society.

[49]  Chris Sander,et al.  Touring protein fold space with Dali/FSSP , 1998, Nucleic Acids Res..

[50]  M. Sippl,et al.  Detection of native‐like models for amino acid sequences of unknown three‐dimensional structure in a data base of known protein conformations , 1992, Proteins.

[51]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[52]  J L Finney,et al.  Calculation of protein volumes: an alternative to the Voronoi procedure. , 1982, Journal of molecular biology.

[53]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[54]  S Subramaniam,et al.  Analytical shape computation of macromolecules: I. molecular area and volume through alpha shape , 1998, Proteins.

[55]  C. Chothia,et al.  Structure of proteins: packing of alpha-helices and pleated sheets. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[56]  K Yue,et al.  Folding proteins with a simple energy function and extensive conformational searching , 1996, Protein science : a publication of the Protein Society.

[57]  Iosif I. Vaisman,et al.  Distance Dependence of Water Structure around Model Solutes , 1994 .

[58]  J Chomilier,et al.  Voronoï tessellation reveals the condensed matter character of folded proteins. , 2000, Physical review letters.

[59]  P. Munson,et al.  Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and their application in sequence‐structure alignment , 1997, Protein science : a publication of the Protein Society.

[60]  F. M. Richards,et al.  Calculation of molecular volumes and areas for structures of known geometry. , 1985, Methods in enzymology.

[61]  E. Shakhnovich,et al.  Pseudodihedrals: Simplified protein backbone representation with knowledge‐based energy , 1994, Protein science : a publication of the Protein Society.

[62]  N Go,et al.  Mechanical property of a TIM‐barrel protein , 1997, Proteins.

[63]  Alexander Tropsha,et al.  Development of a four-body statistical pseudo-potential to discriminate native from non-native protein conformations , 2003, Bioinform..

[64]  Iosif I. Vaisman,et al.  Delaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues , 1996, J. Comput. Biol..

[65]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[66]  Eytan Domany,et al.  Protein fold recognition and dynamics in the space of contact maps , 1996, Proteins.

[67]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[68]  S. Wodak,et al.  Factors influencing the ability of knowledge-based potentials to identify native sequence-structure matches. , 1994, Journal of molecular biology.