Shotgun Protein Sequencing

Despite significant advances in the identification of known proteins, the analysis of unknown proteins by MS/MS still remains a challenging open problem. Although Klaus Biemann recognized the potential of MS/MS for sequencing of unknown proteins in the 1980s, low throughput Edman degradation followed by cloning still remains the main method to sequence unknown proteins. The automated interpretation of MS/MS spectra has been limited by a focus on individual spectra and has not capitalized on the information contained in spectra of overlapping peptides. Indeed the powerful shotgun DNA sequencing strategies have not been extended to automated protein sequencing. We demonstrate, for the first time, the feasibility of automated shotgun protein sequencing of protein mixtures by utilizing MS/MS spectra of overlapping and possibly modified peptides generated via multiple proteases of different specificities. We validate this approach by generating highly accurate de novo reconstructions of multiple regions of various proteins in western diamondback rattlesnake venom. We further argue that shotgun protein sequencing has the potential to overcome the limitations of current protein sequencing approaches and thus catalyze the otherwise impractical applications of proteomics methodologies in studies of unknown proteins.

[1]  Haixu Tang,et al.  De novo repeat classification and fragment assembly , 2004, RECOMB.

[2]  G. Bulaj,et al.  Conotoxins and the posttranslational modification of secreted gene products , 2005, Cellular and Molecular Life Sciences CMLS.

[3]  Dekel Tsur,et al.  Identification of post-translational modifications by blind search of mass spectra , 2005, Nature Biotechnology.

[4]  Dekel Tsur,et al.  A New Approach to Protein Identification , 2006, RECOMB.

[5]  Aaron A. Klammer,et al.  Effects of modified digestion schemes on the identification of proteins from complex mixtures. , 2006, Journal of proteome research.

[6]  M. Savitski,et al.  Proteomics-grade de novo sequencing approach. , 2005, Journal of proteome research.

[7]  Richard Sposto,et al.  A Novel Snake Venom Disintegrin That Inhibits Human Ovarian Cancer Dissemination and Angiogenesis in an Orthotopic Nude Mouse Model , 2001, Pathophysiology of Haemostasis and Thrombosis.

[8]  Ming-Yang Kao,et al.  A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry , 2000, SODA '00.

[9]  Ulrich Kuch,et al.  Complete amino acid sequence and phylogenetic analysis of a long-chain neurotoxin from the venom of the African banded water cobra, Boulengerina annulata. , 2004, Toxicon : official journal of the International Society on Toxinology.

[10]  E. Zandi,et al.  Complete reconstitution of human IkappaB kinase (IKK) complex in yeast. Assessment of its stoichiometry and the role of IKKgamma on the complex activity in the absence of stimulation. , 2001, The Journal of biological chemistry.

[11]  Bin Ma,et al.  SPIDER: software for protein identification from sequence tags with de novo sequencing error. , 2004, Proceedings. IEEE Computational Systems Bioinformatics Conference.

[12]  P. Pevzner,et al.  PepNovo: de novo peptide sequencing via probabilistic network modeling. , 2005, Analytical chemistry.

[13]  M. Santoro,et al.  Electrospray ionization quadrupole time-of-flight and matrix-assisted laser desorption/ionization tandem time-of-flight mass spectrometric analyses to solve micro-heterogeneity in post-translationally modified peptides from Phoneutria nigriventer (Aranea, Ctenidae) venom. , 2005, Rapid communications in mass spectrometry : RCM.

[14]  J. Joseph,et al.  Snake venom prothrombin activators similar to blood coagulation factor Xa. , 2004, Current drug targets. Cardiovascular & haematological disorders.

[15]  Virgil L. Woods,et al.  Rapid refinement of crystallographic protein construct definition employing enhanced hydrogen/deuterium exchange MS. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Jill P Mesirov,et al.  Assembly of polymorphic genomes: algorithms and application to Ciona savignyi. , 2005, Genome research.

[17]  J. Joseph,et al.  Procoagulant Proteins from Snake Venoms , 2001, Pathophysiology of Haemostasis and Thrombosis.

[18]  Pavel A. Pevzner,et al.  Mutation-tolerant protein identification by mass-spectrometry , 2000, RECOMB '00.

[19]  Bin Ma,et al.  SPIDER: software for protein identification from sequence tags with de novo sequencing error , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[20]  Pavel A. Pevzner,et al.  Protein identification by spectral networks analysis , 2007, Proceedings of the National Academy of Sciences.

[21]  P. Ho,et al.  Identification of novel bradykinin-potentiating peptides and C-type natriuretic peptide from Lachesis muta venom. , 2005, Toxicon : official journal of the International Society on Toxinology.

[22]  Pavel A. Pevzner,et al.  De Novo Peptide Sequencing via Tandem Mass Spectrometry , 1999, J. Comput. Biol..

[23]  L. S. Wermelinger,et al.  Fast analysis of low molecular mass compounds present in snake venom: identification of ten new pyroglutamate-containing peptides. , 2005, Rapid communications in mass spectrometry : RCM.

[24]  P. Escoubas Mass spectrometry in toxinology: a 21st-century technology for the study of biopolymers from venoms. , 2006, Toxicon : official journal of the International Society on Toxinology.

[25]  Q. Zhou,et al.  Molecular cloning and expression of catrocollastatin, a snake-venom protein from Crotalus atrox (western diamondback rattlesnake) which inhibits platelet adhesion to collagen. , 1995, The Biochemical journal.

[26]  I. Guerrera,et al.  Application of Mass Spectrometry in Proteomics , 2005, Bioscience reports.

[27]  B. Olivera,et al.  Amino acid sequence and biological activity of a γ-conotoxin-like peptide from the worm-hunting snail Conus austini , 2006, Peptides.

[28]  John S Haurum,et al.  Recombinant polyclonal antibodies: the next generation of antibody therapeutics? , 2006, Drug discovery today.

[29]  Adriano M C Pimenta,et al.  Small peptides, big world: biotechnological potential in neglected bioactive peptides from arthropod venoms , 2005, Journal of peptide science : an official publication of the European Peptide Society.

[30]  P. Pevzner,et al.  InsPecT: identification of posttranslationally modified peptides from tandem mass spectra. , 2005, Analytical chemistry.

[31]  A. Gomes,et al.  Snake venom as therapeutic agents: from toxin to drug development. , 2002, Indian journal of experimental biology.

[32]  P. Pevzner,et al.  Shotgun protein sequencing by tandem mass spectra assembly. , 2004, Analytical chemistry.

[33]  Jennie R Lill,et al.  De novo proteomic sequencing of a monoclonal antibody raised against OX40 ligand. , 2006, Analytical biochemistry.

[34]  M. C. Dos-Santos,et al.  Individual venom variability in Crotalus durissus ruruima snakes, a subspecies of Crotalus durissus from the Amazonian region. , 2005, Toxicon : official journal of the International Society on Toxinology.

[35]  Alexander Leitner,et al.  Current chemical tagging strategies for proteome analysis by mass spectrometry. , 2004, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[36]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[37]  J. Fox,et al.  Comparison of indirect and direct approaches using ion-trap and Fourier transform ion cyclotron resonance mass spectrometry for exploring viperid venom proteomes. , 2006, Toxicon : official journal of the International Society on Toxinology.

[38]  Eugene W. Myers,et al.  Toward Simplifying and Accurately Formulating Fragment Assembly , 1995, J. Comput. Biol..

[39]  John I. Clark,et al.  Shotgun identification of protein modifications from protein complexes and lens tissue , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[40]  R. Sherwin,et al.  Intravenous liposomal delivery of the snake venom disintegrin contortrostatin limits breast cancer progression. , 2004, Molecular cancer therapeutics.

[41]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[42]  S. Serrano,et al.  Sex-based individual variation of snake venom proteome among eighteen Bothrops jararaca siblings. , 2006, Toxicon : official journal of the International Society on Toxinology.

[43]  J. Shabanowitz,et al.  Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  R S Johnson,et al.  The primary structure of thioredoxin from Chromatium vinosum determined by high-performance tandem mass spectrometry. , 1987, Biochemistry.

[45]  J. Daltry,et al.  Diet and snake venom evolution , 1996, Nature.

[46]  J. Boyd,et al.  Cyclization of N-terminal S-carbamoylmethylcysteine causing loss of 17 Da from peptides and extra peaks in peptide maps. , 2002, Journal of proteome research.

[47]  Virgil L. Woods,et al.  Protein structure change studied by hydrogen-deuterium exchange, functional labeling, and mass spectrometry , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[48]  G. Damonte,et al.  How to discriminate between leucine and isoleucine by low energy ESI-TRAP MSn , 2007, Journal of the American Society for Mass Spectrometry.

[49]  M. C. Hu,et al.  Role of IκB kinase in tumorigenesis , 2005 .

[50]  R. Lewis,et al.  Therapeutic potential of venom peptides , 2003, Nature Reviews Drug Discovery.

[51]  E. Zandi,et al.  Complete Reconstitution of Human IκB Kinase (IKK) Complex in Yeast , 2001, The Journal of Biological Chemistry.

[52]  C. Bartels Fast algorithm for peptide sequencing by mass spectroscopy. , 1990, Biomedical & environmental mass spectrometry.

[53]  P. Gearhart,et al.  Immunology: The roots of antibody diversity , 2002, Nature.