High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry

We have used intramolecular cross-linking, MS, and sequence threading to rapidly identify the fold of a model protein, bovine basic fibroblast growth factor (FGF)-2. Its tertiary structure was probed with a lysine-specific cross-linking agent, bis(sulfosuccinimidyl) suberate (BS(3)). Sites of cross-linking were determined by tryptic peptide mapping by using time-of-flight MS. Eighteen unique intramolecular lysine (Lys-Lys) cross-links were identified. The assignments for eight cross-linked peptides were confirmed by using post source decay MS. The interatomic distance constraints were all consistent with the tertiary structure of FGF-2. These relatively few constraints, in conjunction with threading, correctly identified FGF-2 as a member of the beta-trefoil fold family. To further demonstrate utility, we used the top-scoring homolog, IL-1beta, to build an FGF-2 homology model with a backbone error of 4.8 A (rms deviation). This method is fast, is general, uses small amounts of material, and is amenable to automation.

[1]  Evelyn Camon,et al.  The EMBL Nucleotide Sequence Database , 2000, Nucleic Acids Res..

[2]  B. Chait,et al.  Matrix-assisted laser desorption ion trap mass spectrometry: efficient isolation and effective fragmentation of peptide ions. , 1996, Analytical chemistry.

[3]  Irwin D. Kuntz,et al.  Effects of distance constraints on macromolecular conformation. II. Simulation of experimental results and theoretical predictions , 1979 .

[4]  F J Moy,et al.  High-resolution solution structure of basic fibroblast growth factor determined by multidimensional heteronuclear magnetic resonance spectroscopy. , 1996, Biochemistry.

[5]  Timothy F. Havel,et al.  An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. , 1985, Journal of molecular biology.

[6]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[7]  G J Arlaud,et al.  Structure of the catalytic region of human complement protease C1s: study by chemical cross-linking and three-dimensional homology modeling. , 1995, Biochemistry.

[8]  William A. Goddard,et al.  PROTEIN FOLD DETERMINATION FROM SPARSE DISTANCE RESTRAINTS : THE RESTRAINED GENERIC PROTEIN DIRECT MONTE CARLO METHOD , 1999 .

[9]  R Nussinov,et al.  Fast protein fold recognition via sequence to structure alignment and contact capacity potentials. , 1996, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[10]  R. Levy,et al.  Global folding of proteins using a limited number of distance constraints. , 1993, Protein engineering.

[11]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[12]  M J Sternberg,et al.  On the use of chemically derived distance constraints in the prediction of protein structure with myoglobin as an example. , 1980, Journal of molecular biology.

[13]  G J Arlaud,et al.  Structure and assembly of the catalytic region of human complement protease C1r: a three-dimensional model based on chemical cross-linking and homology modeling. , 1997, Biochemistry.

[14]  Brian W. Matthews,et al.  Refinement of the structure of human basic fibroblast growth factor at 1.6 Å resolution and analysis of presumed heparin binding sites by selenate substitution , 1993, Protein science : a publication of the Protein Society.

[15]  B. Chait,et al.  High-accuracy mass measurement as a tool for studying proteins. , 1994, Current Opinion in Biotechnology.

[16]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[17]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[18]  G. Böhm,et al.  Structural relationships of homologous proteins as a fundamental principle in homology modeling , 1993, Proteins.

[19]  F W McLafferty,et al.  Biomolecule Mass Spectrometry , 1999, Science.

[20]  B. Matthews,et al.  Three-dimensional structure of human basic fibroblast growth factor. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[21]  R Kaufmann,et al.  Mass spectrometric sequencing of linear peptides by product-ion analysis in a reflectron time-of-flight mass spectrometer using matrix-assisted laser desorption ionization. , 1993, Rapid communications in mass spectrometry : RCM.

[22]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[23]  M. Mann,et al.  Electrospray ionization for mass spectrometry of large biomolecules. , 1989, Science.

[24]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[25]  T. Arakawa,et al.  Recombinant human erythropoietin (rHuEPO): Cross‐linking with disuccinimidyl esters and identification of the interfacing domains in EPO , 1993, Protein science : a publication of the Protein Society.

[26]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[27]  S Karlin,et al.  Measures of residue density in protein structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.