A fast unbiased comparison of protein structures by means of the Needleman-Wunsch algorithm

Summary

A fast dynamic programming algorithm has been developed for the spatial superposition of protein structures without prior knowledge of an initial alignment. The program was applied to serine proteases, hemoglobins, cytochromes c, small copper-binding proteins, and lysozymes. In most cases the existing structural homology could be detected in a completely unbiased way. The results of the method are in general agreement with other studies. With this method, the differing alignments obtained by other authors for serine proteases and cytochromes c can be classified in terms of alignment parameters such as gap penalties or cut-off length. Limitations of the method are discussed.
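The dynamic programming step named in the title can be illustrated with a minimal Needleman-Wunsch sketch over a residue-by-residue score matrix. The summary does not give the scoring function or the superposition procedure, so everything specific here is an assumption: the helper `distance_score`, the linear `gap` penalty, the way `cutoff` enters the score, and the premise that both coordinate sets already share a common frame. It is an illustration of the general technique, not the published program.

```python
import math

def distance_score(coords_a, coords_b, cutoff):
    """Hypothetical scoring: pairs closer than `cutoff` score positive,
    pairs farther apart score negative. Not taken from the paper."""
    return [[cutoff - math.dist(a, b) for b in coords_b] for a in coords_a]

def needleman_wunsch(score, gap):
    """Global alignment over a precomputed score matrix with a linear gap
    penalty. Returns (total_score, list of matched (i, j) index pairs)."""
    n = len(score)
    m = len(score[0]) if n else 0
    # dp[i][j] = best score aligning the first i residues of A
    # with the first j residues of B.
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = -gap * i
    for j in range(1, m + 1):
        dp[0][j] = -gap * j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dp[i][j] = max(
                dp[i - 1][j - 1] + score[i - 1][j - 1],  # match i with j
                dp[i - 1][j] - gap,                      # residue i unmatched
                dp[i][j - 1] - gap,                      # residue j unmatched
            )
    # Trace back from the bottom-right corner to recover matched pairs.
    pairs = []
    i, j = n, m
    while i > 0 and j > 0:
        if dp[i][j] == dp[i - 1][j - 1] + score[i - 1][j - 1]:
            pairs.append((i - 1, j - 1))
            i -= 1
            j -= 1
        elif dp[i][j] == dp[i - 1][j] - gap:
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return dp[n][m], pairs

# Toy example: chain B lacks the third residue of chain A.
chain_a = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0)]
chain_b = [(0, 0, 0), (1, 0, 0), (3, 0, 0)]
total, pairs = needleman_wunsch(distance_score(chain_a, chain_b, 1.0), 0.5)
print(total, pairs)
```

In this toy case the alignment skips the third residue of the first chain rather than pairing it with a distant partner. Raising the gap penalty or changing the cut-off shifts that trade-off, which is the kind of parameter dependence the summary refers to.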
