The Mass Distance Fingerprint: a statistical framework for de novo detection of predominant modifications using high-accuracy mass spectrometry.

We describe a statistical measure, Mass Distance Fingerprint, for automatic de novo detection of predominant peptide mass distances, i.e., putative protein modifications. The method's focus is to globally detect mass differences, not to assign peptide sequences or modifications to individual spectra. The Mass Distance Fingerprint is calculated from high accuracy measured peptide masses. For the data sets used in this study, known mass differences are detected at electron mass accuracy or better. The proposed method is novel because it works independently of protein sequence databases and without any prior knowledge about modifications. Both modified and unmodified peptides have to be present in the sample to be detected. The method can be used for automated detection of chemical/post-translational modifications, quality control of experiments and labeling approaches, and to control the modification settings of protein identification tools. The algorithm is implemented as a web application and is distributed as open source software.

[1]  A. Bruins Proceedings of the 46th ASMS Conference on Mass Spectrometry and Allied Topics , 1998 .

[2]  A Bairoch,et al.  High-throughput mass spectrometric discovery of protein post-translational modifications. , 1999, Journal of molecular biology.

[3]  Mikhail M Savitski,et al.  ModifiComb, a New Proteomic Tool for Mapping Substoichiometric Post-translational Modifications, Finding Novel Types of Modifications, and Fingerprinting Complex Protein Mixtures* , 2006, Molecular & Cellular Proteomics.

[4]  John S. Garavelli,et al.  The RESID Database of Protein Modifications: 2003 developments , 2003, Nucleic Acids Res..

[5]  Martin Pelikan,et al.  Database independent detection of isotopically labeled MS/MS spectrum peptide pairs. , 2005, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[6]  P. Sonderegger,et al.  Neurotrypsin, a Novel Multidomain Serine Protease Expressed in the Nervous System , 1997, Molecular and Cellular Neuroscience.

[7]  S. Clarke,et al.  RNA and protein interactions modulated by protein arginine methylation. , 1998, Progress in nucleic acid research and molecular biology.

[8]  Thomas A Neubert,et al.  ABRF-PRG03: phosphorylation site determination. , 2003, Journal of biomolecular techniques : JBT.

[9]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[10]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[11]  A. Munnich,et al.  Truncating Neurotrypsin Mutation in Autosomal Recessive Nonsyndromic Mental Retardation , 2002, Science.

[12]  Hanno Steen,et al.  Protein Profiling with Cleavable Isotope-coded Affinity Tag (cICAT) Reagents , 2003, Molecular & Cellular Proteomics.

[13]  J. Yates,et al.  GutenTag: high-throughput sequence tagging via an empirically derived fragmentation model. , 2003, Analytical chemistry.

[14]  D. Creasy,et al.  Unimod: Protein modifications for mass spectrometry , 2004, Proteomics.

[15]  J. Yates,et al.  Automated identification of amino acid sequence variations in proteins by HPLC/microspray tandem mass spectrometry. , 2000, Analytical chemistry.

[16]  Chris L. Tang,et al.  Efficiency of database search for identification of mutated and modified proteins via mass spectrometry. , 2001, Genome research.

[17]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[18]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[19]  P Siekevitz,et al.  Isolation and characterization of postsynaptic densities from various brain regions: enrichment of different types of postsynaptic densities , 1980, The Journal of cell biology.

[20]  Bernhard Spengler,et al.  Isotopic Deconvolution of Matrix-Assisted Laser Desorption/Ionization Mass Spectra for Substance-Class Specific Analysis of Complex Samples , 2001 .

[21]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[22]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[23]  J. Yates Mass spectrometry and the age of the proteome. , 1998, Journal of mass spectrometry : JMS.

[24]  D. Creasy,et al.  Error tolerant searching of uninterpreted tandem mass spectrometry data , 2002, Proteomics.

[25]  D. Liebler,et al.  P-Mod: an algorithm and software to map modifications to peptide sequences using tandem MS data. , 2005, Journal of proteome research.