MassSorter: a tool for administrating and analyzing data from mass spectrometry experiments on proteins with known amino acid sequences

BackgroundProteomics is the study of the proteome, and is critical to the understanding of cellular processes. Two central and related tasks of proteomics are protein identification and protein characterization. Many small laboratories are interested in the characterization of a small number of proteins, e.g., how posttranslational modifications change under different conditions.ResultsWe have developed a software tool called MassSorter for administrating and analyzing data from peptide mass fingerprinting experiments on proteins with known amino acid sequences. It is meant for small scale mass spectrometry laboratories that are interested in posttranslational modifications of known proteins. Several experiments can be compared simultaneously, and the matched and unmatched peak values are clearly indicated. The hits can be sorted according to m/z values (default) or according to the sequence of the protein. Filters defined by the user can mark autolytic protease peaks and other contaminating peaks (keratins, proteins co-migrating with the protein of interest, etc.). Unmatched peaks can be further analyzed for unexpected modifications by searches against a local version of the UniMod database. They can also be analyzed for unexpected cleavages, a highly useful feature for proteins that undergo maturation by proteolytic cleavage, creating new N- or C-terminals. Additional tools exist for visualization of the results, like sequence coverage, accuracy plots, different types of statistics, 3D models, etc. The program and a tutorial are freely available for academic users at http://www.bioinfo.no/software/massSorter.ConclusionMassSorter has a number of useful features that can promote the analysis and administration of MS-data.

[1]  Frank Schmidt,et al.  Iterative data analysis is the key for exhaustive analysis of peptide mass fingerprints from proteins separated by two-dimensional electrophoresis , 2003, Journal of the American Society for Mass Spectrometry.

[2]  M. Mann,et al.  The abc's (and xyz's) of peptide sequencing , 2004, Nature Reviews Molecular Cell Biology.

[3]  Ingebrigt Sylte,et al.  Pancreatic trypsin activates human promatrix metalloproteinase-2. , 2005, Journal of molecular biology.

[4]  L. Liotta,et al.  The activation of human type IV collagenase proenzyme. Sequence identification of the major conversion product following organomercurial activation. , 1989, The Journal of biological chemistry.

[5]  B. Chait,et al.  ProFound: an expert system for protein identification using mass spectrometric peptide mapping information. , 2000, Analytical chemistry.

[6]  Ronald Kühne,et al.  The contributions of specific amino acid side chains to signal intensities of peptides in matrix-assisted laser desorption/ionization mass spectrometry. , 2004, Rapid communications in mass spectrometry : RCM.

[7]  D. Creasy,et al.  Unimod: Protein modifications for mass spectrometry , 2004, Proteomics.

[8]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[9]  Albert Sickmann,et al.  Challenges in mass spectrometry‐based proteomics , 2004, Proteomics.

[10]  H. Steen,et al.  GPMAW--a software tool for analyzing proteins and peptides. , 2001, Trends in biochemical sciences.

[11]  D. Hochstrasser,et al.  Peptide mass fingerprinting peak intensity prediction: Extracting knowledge from spectra , 2002, Proteomics.

[12]  Luigi Rossi Bernardi,et al.  Bioinformatics in mass spectrometry data analysis for proteomics studies , 2004, Expert review of proteomics.

[13]  Amos Bairoch,et al.  FindPept, a tool to identify unmatched masses in peptide mass fingerprinting protein identification , 2002, Proteomics.

[14]  Yuan-Fang Wang,et al.  FPV: fast protein visualization using Java 3D™ , 2003, SAC '03.

[15]  Ben Shneiderman,et al.  Designing The User Interface , 2013 .

[16]  Qing Zhang,et al.  The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema , 2004, Nucleic Acids Res..

[17]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[18]  Assaf Wool,et al.  Precalibration of matrix‐assisted laser desorption/ionization‐time of flight spectra for peptide mass fingerprinting , 2002, Proteomics.

[19]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[20]  Yuan-Fang Wang,et al.  FPV: Fast Protein Visualization Using Java 3DTM , 2003, Bioinform..

[21]  Ernest Giralt,et al.  An investigation of residue-specific contributions to peptide desorption in MALDI-TOF mass spectrometry* , 2004, Letters in Peptide Science.