Three‐dimensional, sequence order‐independent structural comparison of a serine protease against the crystallographic database reveals active site similarities: Potential implications to evolution and to protein folding

We have recently developed a fast approach to comparisons of 3‐dimensional structures. Our method is unique, treating protein structures as collections of unconnected points (atoms) in space. It is completely independent of the amino acid sequence order. It is unconstrained by insertions, deletions, and chain directionality. It matches single, isolated amino acids between 2 different structures strictly by their spatial positioning regardless of their relative sequential position in the amino acid chain. It automatically detects a recurring 3D motif in protein molecules. No predefinition of the motif is required. The motif can be either in the interior of the proteins or on their surfaces. In this work, we describe an enhancement over our previously developed technique, which considerably reduces the complexity of the algorithm. This results in an extremely fast technique. A typical pairwise comparison of 2 protein molecules requires less than 3 s on a workstation. We have scanned the structural database with dozens of probes, successfully detecting structures that are similar to the probe. To illustrate the power of this method, we compare the structure of a trypsin‐like serine protease against the structural database. Besides detecting homologous trypsin‐like proteases, we automatically obtain 3D, sequence order‐independent, active‐site similarities with subtilisin‐like and sulfhydryl proteases. These similarities equivalence isolated residues, not conserving the linear order of the amino acids in the chains. The active‐site similarities are well known and have been detected by manually inspecting the structures in a time‐consuming, laborious procedure. This is the first time such equivalences are obtained automatically from the comparison of full structures. The far‐reaching advantages and the implications of our novel algorithm to studies of protein folding, to evolution, and to searches for pharmacophoric patterns are discussed.

[1]  J. Kraut,et al.  The aromatic substrate binding site in subtilisin BPN' and its resemblance to chymotrypsin. , 1972, Cold Spring Harbor symposia on quantitative biology.

[2]  P Argos,et al.  Convergence of active center geometries. , 1977, Biochemistry.

[3]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[4]  E. Baker,et al.  Crystallographic refinement of the structure of actinidin at 1.7 Å resolution by fast Fourier least‐squares methods , 1980 .

[5]  I. G. Kamphuis,et al.  Structure of papain refined at 1.65 A resolution. , 1984, Journal of molecular biology.

[6]  M G Rossmann,et al.  Comparison of protein structures. , 1985, Methods in enzymology.

[7]  Randy J. Read,et al.  Crystal and molecular structures of the complex of α-chymotrypsin with its inhibitor Turkey ovomucoid third domain at 1.8 Å resolution , 1987 .

[8]  C. Betzel,et al.  Synchrotron X-ray data collection and restrained least-squares refinement of the crystal structure of proteinase K at 1.5 A resolution. , 1988, Acta Crystallographica Section B Structural Science.

[9]  W R Taylor,et al.  Protein structure alignment. , 1989, Journal of molecular biology.

[10]  P Willett,et al.  Use of techniques derived from graph theory to compare secondary structure motifs in proteins. , 1990, Journal of molecular biology.

[11]  H. Wolfson,et al.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[12]  C. Sander,et al.  Detection of common three‐dimensional substructures in proteins , 1991, Proteins.

[13]  H. Wolfson,et al.  An efficient automated computer vision based technique for detection of three dimensional structural motifs in proteins. , 1992, Journal of biomolecular structure & dynamics.

[14]  N. Go,et al.  Common spatial arrangements of backbone fragments in homologous and non-homologous proteins. , 1992, Journal of molecular biology.

[15]  M. Rossmann,et al.  Refined structure of Sindbis virus core protein and comparison with other chymotrypsin-like serine proteinase structures. , 1993, Journal of molecular biology.

[16]  P Willett,et al.  Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm. , 1993, Journal of molecular biology.

[17]  D Fischer,et al.  Spatial, sequence-order-independent structural comparison of alpha/beta proteins: evolutionary implications. , 1993, Journal of biomolecular structure & dynamics.

[18]  D Fischer,et al.  A computer vision based technique for 3-D sequence-independent structural comparison of proteins. , 1993, Protein engineering.

[19]  D Fischer,et al.  Surface motifs by a computer vision technique: Searches, detection, and implications for protein–ligand recognition , 1993, Proteins.

[20]  D Fischer,et al.  Molecular surface representations by sparse critical points , 1994, Proteins.

[21]  H. Wolfson,et al.  Molecular surface recognition by a computer vision-based technique. , 1994, Protein engineering.