Workflow for analysis of high mass accuracy salivary data set using MaxQuant and ProteinPilot search algorithm

LTQ Orbitrap data analyzed with ProteinPilot can be further improved by MaxQuant raw data processing, which utilizes precursor‐level high mass accuracy data for peak processing and MGF creation. In particular, ProteinPilot results from MaxQuant‐processed peaklists for Orbitrap data sets resulted in improved spectral utilization due to an improved peaklist quality with higher precision and high precursor mass accuracy (HPMA). The output and postsearch analysis tools of both workflows were utilized for previously unexplored features of a three‐dimensional fractionated and hexapeptide library (ProteoMiner) treated whole saliva data set comprising 200 fractions. ProteinPilot's ability to simultaneously predict multiple modifications showed an advantage from ProteoMiner treatment for modified peptide identification. We demonstrate that complementary approaches in the analysis pipeline provide comprehensive results for the whole saliva data set acquired on an LTQ Orbitrap. Overall our results establish a workflow for improved protein identification from high mass accuracy data.

[1]  Gilbert S. Omenn,et al.  Alternative Splice Variants, a New Class of Protein Cancer Biomarker Candidates: Findings in Pancreatic Cancer and Breast Cancer with Systems Biology Implications , 2010, Disease markers.

[2]  Judith A J Steen,et al.  When less can yield more – Computational preprocessing of MS/MS spectra for peptide identification , 2009, Proteomics.

[3]  Leonid Zamdborg,et al.  Tandem mass spectrometry with ultrahigh mass accuracy clarifies peptide identification by database retrieval. , 2009, Journal of proteome research.

[4]  John D. Venable,et al.  Improving protein identification sensitivity by combining MS and MS/MS information for shotgun proteomics using LTQ-Orbitrap high mass accuracy data. , 2008, Analytical chemistry.

[5]  Timothy J Griffin,et al.  Deep metaproteomic analysis of human salivary supernatant , 2012, Proteomics.

[6]  Jürgen Cox,et al.  A practical guide to the MaxQuant computational platform for SILAC-based quantitative proteomics , 2009, Nature Protocols.

[7]  M. Mann,et al.  Precision proteomics: The case for high resolution and high mass accuracy , 2008, Proceedings of the National Academy of Sciences.

[8]  John R Yates,et al.  Proteomics by mass spectrometry: approaches, advances, and applications. , 2009, Annual review of biomedical engineering.

[9]  Gordon A. Anderson,et al.  DtaRefinery, a Software Tool for Elimination of Systematic Errors from Parent Ion Mass Measurements in Tandem Mass Spectra Data Sets* , 2009, Molecular & Cellular Proteomics.

[10]  Jürgen Cox,et al.  Computational principles of determining and improving mass precision and accuracy for proteome measurements in an Orbitrap , 2009, Journal of the American Society for Mass Spectrometry.

[11]  Sean L Seymour,et al.  Nonlinear fitting method for determining local false discovery rates from decoy database searches. , 2008, Journal of proteome research.

[12]  A. Makarov,et al.  Orbitrap Mass Analyzer – Overview and Applications in Proteomics , 2006, Proteomics.

[13]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[14]  M. Mann,et al.  On the Proper Use of Mass Accuracy in Proteomics* , 2007, Molecular & Cellular Proteomics.

[15]  David L. Tabb,et al.  Performance Metrics for Liquid Chromatography-Tandem Mass Spectrometry Systems in Proteomics Analyses* , 2009, Molecular & Cellular Proteomics.

[16]  Alexander Makarov,et al.  Orbitrap Mass Analyzer - Overview and Applications , 2006 .

[17]  T. Griffin,et al.  A dynamic range compression and three-dimensional peptide fractionation analysis platform expands proteome coverage and the diagnostic potential of whole saliva. , 2009, Journal of proteome research.

[18]  Sean L Seymour,et al.  The Paragon Algorithm, a Next Generation Search Engine That Uses Sequence Temperature Values and Feature Probabilities to Identify Peptides from Tandem Mass Spectra*S , 2007, Molecular & Cellular Proteomics.