On peptide de novo sequencing: a new approach

A procedure is presented for the automatic determination of the amino acid sequence of peptides by processing data obtained from mass spectrometry analysis. This is a basic and relevant problem in the field of proteomics. Furthermore, it has an even higher conceptual and applicative interest in peptide research, as well as in other connected fields. The analysis does not rely on known protein databases, but on the computation of all amino acid sequences compatible with the given spectral data. By formulating a mathematical model for such combinatorial problems, the structural limitations of known methods are overcome, and efficient solution algorithms can be developed. The results are very encouraging both from the accuracy and computational points of view. Copyright © 2004 European Peptide Society and John Wiley & Sons, Ltd.

[1]  A. Podtelejnikov,et al.  Identification of the components of simple protein mixtures by high-accuracy peptide mass mapping and database searching. , 1997, Analytical chemistry.

[2]  Pavel A. Pevzner,et al.  De Novo Peptide Sequencing via Tandem Mass Spectrometry , 1999, J. Comput. Biol..

[3]  W. Geraerts,et al.  Processing and targeting of a molluscan egg-laying peptide prohormone as revealed by mass spectrometric peptide fingerprinting and peptide sequencing. , 1994, Endocrinology.

[4]  D. Williams,et al.  Peptide sequencing using the combination of edman degradation, carboxypeptidase digestion and fast atom bombardment mass spectrometry. , 1982, Biochemical and biophysical research communications.

[5]  J. A. Taylor,et al.  Sequence database searches via de novo peptide sequencing by tandem mass spectrometry. , 1997, Rapid communications in mass spectrometry : RCM.

[6]  J. A. Taylor,et al.  Searching sequence databases via De novo peptide sequencing by tandem mass spectrometry , 2002, Molecular biotechnology.

[7]  G. Gianfranceschi,et al.  Molecular Models of Acidic PePtides From Pea Bud Chromatin And Seminal Plasma. Divalent Cations-Mediated Interaction With Dna , 1999, Biological chemistry.

[8]  Terry D. Lee Fast Atom Bombardment and Secondary Ion Mass Spectrometry of Peptides and Proteins , 1986 .

[9]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[10]  P. Edman,et al.  A method for the determination of amino acid sequence in peptides. , 1949, Archives of biochemistry.

[11]  John N. Hooker,et al.  Logic-Based Methods for Optimization , 1994, PPCP.

[12]  G. Glish,et al.  C-terminal peptide sequencing via multistage mass spectrometry. , 1998, Analytical chemistry.

[13]  Gary Siuzdak,et al.  Mass spectrometry for biotechnology , 1996 .

[14]  K. Murayama,et al.  Rapid determination of parvalbumin amino acid sequence from Rana catesbeiana (pI 4.78) by combination of ESI mass spectrometry, protein sequencing, and amino acid analysis. , 2000, Journal of biochemistry.

[15]  William R. Pearson,et al.  Rapid Identification of Protein Phosphatase 1-binding Proteins by Mixed Peptide Sequencing and Data Base Searching , 1998, The Journal of Biological Chemistry.

[16]  M Levitt,et al.  Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins , 1998, Protein science : a publication of the Protein Society.

[17]  K. Wood,et al.  Detection of tyrosine phosphorylated peptides via skimmer collision-induced dissociation/ion trap mass spectrometry. , 2003, Journal of mass spectrometry : JMS.

[18]  J. Yates,et al.  DTASelect and Contrast: tools for assembling and comparing protein identifications from shotgun proteomics. , 2002, Journal of proteome research.

[19]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988, Wiley interscience series in discrete mathematics and optimization.

[20]  J. Stults Peptide sequencing by mass spectrometry. , 2006, Methods of biochemical analysis.

[21]  J. Evans,et al.  Identity of nuclear high‐mobility‐group protein, HMG‐1, and sulfoglucuronyl carbohydrate‐binding protein, SBP‐1, in brain , 2001, Journal of neurochemistry.

[22]  G. Gianfranceschi,et al.  Mass spectral and electrophoretic characterization of acidic peptides bound to chromatin of Pea Bud , 1998, Molecular Biology Reports.

[23]  W. Lehmann,et al.  Patchwork peptide sequencing: Extraction of sequence information from accurate mass data of peptide tandem mass spectra recorded at high resolution* , 2002, Proteomics.

[24]  N. Anderson,et al.  Proteomics: applications in basic and applied biology. , 2000, Current opinion in biotechnology.

[25]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[26]  J. A. Taylor,et al.  Implementation and uses of automated de novo peptide sequencing by tandem mass spectrometry. , 2001, Analytical chemistry.