An accurate and efficient algorithm for peptide and PTM identification by tandem mass spectrometry.

Peptide identification by tandem mass spectrometry (MS/MS) is one of the most important problems in proteomics. Recent advances in high-throughput MS/MS experiments produce huge numbers of spectra. Unfortunately, identifying these spectra is relatively slow, and the accuracy of current algorithms is limited in the presence of noise and post-translational modifications (PTMs). In this paper, we aim for both high accuracy and high efficiency in peptide identification, with particular attention to peptides with PTMs. We extend our previous work on PepSOM by introducing two modified scoring functions: Sλ for peptide identification and Sλ* for identification of peptides with PTMs. Experiments show that our algorithm is both fast and accurate for peptide identification. Experiments on spectra with simulated and real PTMs confirm that it also identifies PTMs accurately.
