SwissPIT: A workflow‐based platform for analyzing tandem‐MS spectra using the Grid

The identification and characterization of peptides from MS/MS data is a critical aspect of proteomics. It has been the subject of extensive research in bioinformatics, which has produced a fair number of identification software tools. Most often, only one program with a specific and unvarying set of parameters is selected for identifying proteins. As a result, a significant proportion of the experimental spectra do not match the peptide sequences in the screened database because of inappropriate parameters or scoring schemes. The Swiss protein identification toolbox (swissPIT) project provides the scientific community with an expandable multitool platform for automated in‐depth analysis of MS data that can also handle data from high‐throughput experiments. swissPIT addresses four problems: (A) the lack of standard input and output formats, (B) the creation of analysis workflows, (C) unified result visualization, and (D) simplicity of the user interface. Currently, swissPIT supports four different programs implementing two different search strategies to identify MS/MS spectra. Designed to meet the calculation‐intensive needs of each of these programs, swissPIT uses the distributed resources of a Swiss‐wide computer Grid (http://www.swing‐grid.ch).
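The idea of unifying the output of several identification programs can be illustrated with a minimal Python sketch. It is not the swissPIT implementation: the `Identification` record, the function names, the tool labels (`tool_a`, `tool_b`), and the example spectra and peptides are all hypothetical, chosen only to show how results in a common format might be merged and how spectra left unmatched by every tool might be singled out for a further search.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Identification:
    """One peptide-spectrum match in a tool-independent form (hypothetical schema)."""
    spectrum_id: str
    peptide: str
    tool: str


def merge_identifications(results):
    """Group matches by (spectrum, peptide) and list the tools that reported each."""
    merged = defaultdict(set)
    for ident in results:
        merged[(ident.spectrum_id, ident.peptide)].add(ident.tool)
    return {key: sorted(tools) for key, tools in merged.items()}


def unmatched_spectra(all_spectrum_ids, results):
    """Return the spectra to which no tool assigned a peptide."""
    matched = {ident.spectrum_id for ident in results}
    return sorted(set(all_spectrum_ids) - matched)


# Made-up example data: three spectra searched by two hypothetical tools.
spectra = ["s1", "s2", "s3"]
results = [
    Identification("s1", "PEPTIDEK", "tool_a"),
    Identification("s1", "PEPTIDEK", "tool_b"),
    Identification("s2", "SAMPLER", "tool_b"),
]

merged = merge_identifications(results)
leftover = unmatched_spectra(spectra, results)
```

In this sketch, a match reported by more than one tool appears once with both tool labels, and `leftover` holds the spectra that could be passed to a second search step with different parameters or a different strategy.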