Comparison of substructural epitopes in enzyme active sites using self-organizing maps

SummaryThis paper presents a new algorithm to compare substructural epitopes in protein binding cavities. Through the comparison of binding cavities accommodating well characterized ligands with cavities whose actual guests are yet unknown, it is possible to draw some conclusions on the required shape of a putative ligand likely to bind to the latter cavities. To detect functional relationships among proteins, their binding-site exposed physicochemical characteristics are described by assigning generic pseudocenters to the functional groups of the amino acids flanking the particular active site. The cavities are divided into small local regions of four pseudocenters having the shape of a pyramid with triangular basis. To find similar local regions, an emergent self-organizing map is used for clustering. Two local regions within the same cluster are similar and form the basis for the superpositioning of the corresponding cavities to score this match. First results show that the similarities between enzymes with the same EC number can be found correctly. Enzymes with different EC numbers are detected to have no common substructures. These results indicate the benefit of this method and motivate further studies.

[1]  Amos Bairoch,et al.  The ENZYME database in 2000 , 2000, Nucleic Acids Res..

[2]  G J Kleywegt,et al.  Recognition of spatial motifs in protein structures. , 1999, Journal of molecular biology.

[3]  J. Thornton,et al.  Tess: A geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites , 1997, Protein science : a publication of the Protein Society.

[4]  P Willett,et al.  Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm. , 1993, Journal of molecular biology.

[5]  Gerhard Klebe,et al.  Relibase: design and development of a database for comprehensive analysis of protein-ligand interactions. , 2003, Journal of molecular biology.

[6]  K. Kinoshita,et al.  Identification of protein biochemical functions by similarity search using the molecular surface database eF‐site , 2003, Protein science : a publication of the Protein Society.

[7]  Robert B. Russell,et al.  Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures , 2003, Nucleic Acids Res..

[8]  R. Russell,et al.  Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. , 1998, Journal of molecular biology.

[9]  G Schneider,et al.  Mapping of protein surface cavities and prediction of enzyme class by a self-organizing neural network. , 2000, Protein engineering.

[10]  Ashish V. Tendulkar,et al.  Functional sites in protein families uncovered via an objective and automated graph theoretic approach. , 2003, Journal of molecular biology.

[11]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[12]  J M Thornton,et al.  Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipases , 1996, Protein science : a publication of the Protein Society.

[13]  Robert B Russell,et al.  A model for statistical significance of local similarities in structure. , 2003, Journal of molecular biology.

[14]  R. Nussinov,et al.  Molecular shape comparisons in searches for active sites and functional similarity. , 1998, Protein engineering.

[15]  Peter Willett,et al.  Searching for Patterns of Amino Acids in 3D Protein Structures , 2003, J. Chem. Inf. Comput. Sci..

[16]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[17]  G. Klebe,et al.  A new method to detect related function among proteins independent of sequence and fold homology. , 2002, Journal of molecular biology.

[18]  Johann Gasteiger,et al.  Prediction of Aqueous Solubility of Organic Compounds Based on a 3D Structure Representation , 2003, J. Chem. Inf. Comput. Sci..

[19]  Peter Willett,et al.  Comparison of protein surfaces using a genetic algorithm , 1997, J. Comput. Aided Mol. Des..

[20]  Thomas Hamelryck,et al.  Efficient identification of side‐chain patterns using a multidimensional index tree , 2003, Proteins.

[21]  D Fischer,et al.  A computer vision based technique for 3-D sequence-independent structural comparison of proteins. , 1993, Protein engineering.

[22]  M Hendlich,et al.  LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins. , 1997, Journal of molecular graphics & modelling.

[23]  Jukka V. Lehtonen,et al.  Finding local structural similarities among families of unrelated protein structures: A generic non‐linear alignment algorithm , 1999, Proteins.