Protein identification using MS/MS data.

The subject of this tutorial is protein identification and characterisation by database searching of MS/MS Data. Peptide Mass Fingerprinting is excluded because it is covered in a separate tutorial. Practical aspects of database searching are emphasised, such as choice of sequence database, effect of mass tolerance, and how to identify post-translational modifications. The relationship between sensitivity and specificity is discussed, as is the challenge of using peptide match information to infer which proteins were present in the sample. Since these tutorials are introductory in nature, most references are to reviews, rather than primary research papers. Some familiarity with mass spectrometry and protein chemistry is assumed. There is an accompanying slide presentation, including speaker notes, and a collection of web-based, practical exercises, designed to reinforce key points. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 6).

[1]  Samuel H. Payne,et al.  Accurate annotation of peptide modifications through unrestrictive database search. , 2008, Journal of proteome research.

[2]  W. Blackstock,et al.  Matching peptide mass spectra to EST and genomic DNA databases. , 2001, Trends in biotechnology.

[3]  John R Yates,et al.  Top down proteomics. , 2013, Analytical chemistry.

[4]  O. Jensen Interpreting the protein language using proteomics , 2006, Nature Reviews Molecular Cell Biology.

[5]  J. Yates,et al.  GutenTag: high-throughput sequence tagging via an empirically derived fragmentation model. , 2003, Analytical chemistry.

[6]  I. Papayannopoulos,et al.  The interpretation of collision‐induced dissociation tandem mass spectra of peptides , 1996 .

[7]  Rune Matthiesen,et al.  Methods, algorithms and tools in computational proteomics: A practical point of view , 2007, Proteomics.

[8]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[9]  P. Pevzner,et al.  InsPecT: identification of posttranslationally modified peptides from tandem mass spectra. , 2005, Analytical chemistry.

[10]  B. Webb-Robertson Computational methods for mass spectrometry proteomics , 2011, Journal of the American Society for Mass Spectrometry.

[11]  A. Masselot,et al.  OLAV: Towards high‐throughput tandem mass spectrometry data identification , 2003, Proteomics.

[12]  B. Searle,et al.  High-throughput identification of proteins and unanticipated sequence modifications using a mass-based alignment algorithm for MS/MS de novo sequencing results. , 2004, Analytical chemistry.

[13]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[14]  Bingwen Lu,et al.  Automatic validation of phosphopeptide identifications from tandem mass spectra. , 2007, Analytical chemistry.

[15]  John C Whittaker,et al.  Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors. , 2009, Mass spectrometry reviews.

[16]  S. Carr,et al.  Reporting Protein Identification Data , 2006, Molecular & Cellular Proteomics.

[17]  B. Kuster,et al.  Confident Phosphorylation Site Localization Using the Mascot Delta Score , 2010, Molecular & Cellular Proteomics.

[18]  Rovshan G Sadygov,et al.  Large-scale database searching using tandem mass spectra: Looking up the answer in the back of the book , 2004, Nature Methods.

[19]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[20]  Brian E. Ruttenberg,et al.  PhosphoScore: an open-source phosphorylation site assignment tool for MSn data. , 2008, Journal of proteome research.

[21]  Martin Zeller,et al.  SLoMo: automated site localization of modifications from ETD/ECD mass spectra. , 2009, Journal of proteome research.

[22]  Leo C. McHugh,et al.  Computational Methods for Protein Identification from Mass Spectrometry Data , 2008, PLoS Comput. Biol..

[23]  Mikhail M Savitski,et al.  ModifiComb, a New Proteomic Tool for Mapping Substoichiometric Post-translational Modifications, Finding Novel Types of Modifications, and Fingerprinting Complex Protein Mixtures* , 2006, Molecular & Cellular Proteomics.

[24]  L. Fricker,et al.  Peptidomics: identification and quantification of endogenous peptides in neuroendocrine tissues. , 2006, Mass spectrometry reviews.

[25]  Ruedi Aebersold,et al.  The pros and cons of peptide-centric proteomics , 2010, Nature Biotechnology.

[26]  Dekel Tsur,et al.  Identification of post-translational modifications by blind search of mass spectra , 2005, Nature Biotechnology.

[27]  A. Shevchenko,et al.  MultiTag: multiple error-tolerant sequence tag search for the sequence-similarity identification of proteins by mass spectrometry. , 2003, Analytical chemistry.

[28]  Alexey I Nesvizhskii,et al.  Analysis and validation of proteomic data generated by tandem mass spectrometry , 2007, Nature Methods.

[29]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[30]  M. MacCoss,et al.  A fast SEQUEST cross correlation algorithm. , 2008, Journal of proteome research.

[31]  David Fenyö,et al.  RADARS, a bioinformatics solution that automates proteome mass spectral analysis, optimises protein identification, and archives data in a relational database , 2002, Proteomics.

[32]  马斌 Challenges in Computational Analysis of Mass Spectrometry Data for Proteomics , 2010 .

[33]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[34]  Markus Müller,et al.  Unrestricted identification of modified proteins using MS/MS , 2010, Proteomics.

[35]  P. Bork,et al.  Charting the proteomes of organisms with unsequenced genomes by MALDI-quadrupole time-of-flight mass spectrometry and BLAST homology searching. , 2001, Analytical chemistry.

[36]  M. Wilm,et al.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags. , 1994, Analytical chemistry.

[37]  Neil L. Kelleher,et al.  Peer Reviewed: Top-Down Proteomics , 2004 .

[38]  W. Lehmann,et al.  De novo sequencing of peptides by MS/MS , 2010, Proteomics.

[39]  Chris F. Taylor,et al.  Guidelines for reporting the use of mass spectrometry informatics in proteomics , 2008, Nature Biotechnology.

[40]  Sándor Suhai,et al.  Fragmentation Pathways of Protonated Peptides , 2006 .

[41]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[42]  Matthias Mann,et al.  Bioinformatics analysis of mass spectrometry‐based proteomics data sets , 2009, FEBS letters.

[43]  William Stafford Noble,et al.  Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets. , 2009, Journal of proteome research.

[44]  Robert A. Grothe,et al.  Precursor-ion mass re-estimation improves peptide identification on hybrid instruments. , 2008, Journal of proteome research.

[45]  P. Roepstorff,et al.  Proposal for a common nomenclature for sequence ions in mass spectra of peptides. , 1984, Biomedical mass spectrometry.

[46]  G. McAlister,et al.  A proteomics grade electron transfer dissociation-enabled hybrid linear ion trap-orbitrap mass spectrometer. , 2008, Journal of proteome research.

[47]  M. Mann,et al.  Global, In Vivo, and Site-Specific Phosphorylation Dynamics in Signaling Networks , 2006, Cell.

[48]  Vineet Bafna,et al.  SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide database , 2001, ISMB.

[49]  Markus Müller,et al.  Automated protein identification by tandem mass spectrometry: issues and strategies. , 2006, Mass spectrometry reviews.

[50]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[51]  Jürgen Cox,et al.  Computational principles of determining and improving mass precision and accuracy for proteome measurements in an Orbitrap , 2009, Journal of the American Society for Mass Spectrometry.

[52]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[53]  Alexey I Nesvizhskii,et al.  Interpretation of Shotgun Proteomic Data , 2005, Molecular & Cellular Proteomics.