DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics

As a result of recent improvements in mass spectrometry (MS), there is increased interest in data-independent acquisition (DIA) strategies in which all peptides are systematically fragmented using wide mass-isolation windows ('multiplex fragmentation'). DIA-Umpire (http://diaumpire.sourceforge.net/), a comprehensive computational workflow and open-source software for DIA data, detects precursor and fragment chromatographic features and assembles them into pseudo–tandem MS spectra. These spectra can be identified with conventional database-searching and protein-inference tools, allowing sensitive, untargeted analysis of DIA data without the need for a spectral library. Quantification is done with both precursor- and fragment-ion intensities. Furthermore, DIA-Umpire enables targeted extraction of quantitative information based on peptides initially identified in only a subset of the samples, resulting in more consistent quantification across multiple samples. We demonstrated the performance of the method with control samples of varying complexity and publicly available glycoproteomics and affinity purification–MS data.
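The assembly of pseudo-tandem MS spectra described above can be illustrated with a minimal sketch. It assumes that fragment features are grouped with a precursor feature when their chromatographic elution profiles correlate strongly. The abstract does not specify the grouping criterion, so the Pearson correlation, the `min_corr` threshold, the dictionary layout, and the function names are all hypothetical choices made for illustration, not DIA-Umpire's actual implementation.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length intensity profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no co-elution information
    return cov / sqrt(var_x * var_y)


def build_pseudo_spectrum(precursor, fragments, min_corr=0.9):
    """Collect (m/z, apex intensity) peaks for fragments that co-elute
    with the precursor. min_corr is an illustrative threshold (assumption)."""
    peaks = []
    for frag in fragments:
        if pearson(precursor["profile"], frag["profile"]) >= min_corr:
            peaks.append((frag["mz"], max(frag["profile"])))
    return sorted(peaks)


# Toy data: intensities sampled at the same retention-time points.
precursor = {"mz": 500.25, "profile": [0, 2, 8, 10, 7, 2, 0]}
fragments = [
    {"mz": 300.1, "profile": [0, 1, 4, 5, 3.5, 1, 0]},   # same elution shape
    {"mz": 410.2, "profile": [5, 4, 3, 2, 1, 0, 0]},     # elutes earlier
    {"mz": 250.3, "profile": [0, 0, 1, 3, 8, 10, 6]},    # elutes later
]

spectrum = build_pseudo_spectrum(precursor, fragments)
print(spectrum)
```

In this toy case only the fragment at m/z 300.1 shares the precursor's elution shape, so it is the only peak retained. A peak list built this way could then be written to a standard spectrum format and passed to a conventional database search engine, which is what makes a library-free analysis possible.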
