Prediction of Functional Sites Based on the Fuzzy Oil Drop Model

Describing many biological processes requires knowledge of the 3-D structure of proteins and, in particular, of the active site responsible for biological function. Many proteins whose genes were identified through human genome sequencing, and which have been synthesized experimentally, still await identification of their biological activity. Currently used methods do not always yield satisfactory results, so new algorithms are needed to locate active sites in proteins. This paper describes a computational model for identifying regions of a protein that can potentially interact with other molecules (ligands, substrates, inhibitors, etc.). The model for active site recognition is based on analysis of the hydrophobicity distribution in protein molecules. Analyses of proteins with known biological activity and of proteins of unknown function show that regions of significantly irregular hydrophobicity distribution appear to be function related.
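The abstract gives no formulas, so the following is only a minimal sketch of how "irregular hydrophobicity distribution" might be detected. It assumes that the expected distribution is a 3-D Gaussian centered on the molecule, that a per-residue observed hydrophobicity is supplied by the caller, and that a residue counts as irregular when its expected-minus-observed difference deviates from the mean by more than one standard deviation. The function names, the sigma parameters, and the threshold are hypothetical and are not taken from the paper.

```python
import math

def center(coords):
    """Shift coordinates so that their geometric center is at the origin."""
    n = len(coords)
    cx = sum(p[0] for p in coords) / n
    cy = sum(p[1] for p in coords) / n
    cz = sum(p[2] for p in coords) / n
    return [(x - cx, y - cy, z - cz) for x, y, z in coords]

def expected_profile(coords, sigmas):
    """Assumed idealized hydrophobicity: 3-D Gaussian, normalized to sum to 1."""
    sx, sy, sz = sigmas
    raw = [
        math.exp(-(x * x / (2 * sx * sx) + y * y / (2 * sy * sy) + z * z / (2 * sz * sz)))
        for x, y, z in center(coords)
    ]
    total = sum(raw)
    return [v / total for v in raw]

def irregularity(expected, observed):
    """Per-residue difference between expected and normalized observed values."""
    total = sum(observed)
    return [e - o / total for e, o in zip(expected, observed)]

def flag_residues(delta, n_sd=1.0):
    """Indices whose difference lies more than n_sd standard deviations from the mean."""
    mean = sum(delta) / len(delta)
    sd = math.sqrt(sum((d - mean) ** 2 for d in delta) / len(delta))
    return [i for i, d in enumerate(delta) if abs(d - mean) > n_sd * sd]

# Toy input (hypothetical): one central residue and six neighbours at unit distance.
coords = [(0, 0, 0), (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
expected = expected_profile(coords, sigmas=(1.0, 1.0, 1.0))
# The central residue is given zero observed hydrophobicity, i.e. a local deficiency.
observed = [0.0] + [math.exp(-0.5)] * 6
delta = irregularity(expected, observed)
flagged = flag_residues(delta)
```

In this toy case only the central residue, where the observed value falls well below the Gaussian expectation, is flagged. A real application would need an actual hydrophobicity scale, real atomic coordinates, and a threshold validated against proteins of known function.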
