SWATH Mass Spectrometry Performance Using Extended Peptide MS/MS Assay Libraries

Data-independent acquisition methods such as SWATH for mass spectrometry-based proteomics are usually used with peptide MS/MS assay libraries, which enable identification and quantitation of peptide peak areas. Reference assay libraries can be generated locally through information-dependent acquisition, or obtained from community data repositories for commonly studied organisms. However, no studies have systematically evaluated how locally generated or repository-based assay libraries affect SWATH performance in proteomic studies. To undertake this analysis, we developed a software workflow, SwathXtend, which generates extended peptide assay libraries by integration with a local seed library and delivers statistical analysis of SWATH-quantitative comparisons. We designed test samples using peptides from a yeast extract spiked into peptides from human K562 cell lysates at three different ratios to simulate protein abundance change comparisons. SWATH-MS performance was assessed using local and external assay libraries of varying complexities and proteome compositions. These experiments demonstrated that local seed libraries integrated with external assay libraries achieve better performance than local assay libraries alone, in terms of the number of identified peptides and proteins and the specificity in detecting differentially abundant proteins. Our findings show that the performance of extended assay libraries is influenced by the MS/MS feature similarity of the seed and external libraries, while statistical analysis with multiple testing corrections adds the statistical rigor needed when searching against large extended assay libraries.
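
To illustrate the kind of multiple testing correction referred to above, the following is a minimal sketch of the Benjamini-Hochberg false discovery rate adjustment. It is an assumption that this particular procedure is a suitable stand-in; the abstract does not state which correction SwathXtend applies. The function name `benjamini_hochberg` and the example p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order.

    Illustrative sketch only: the function name and inputs are assumptions,
    not part of SwathXtend.
    """
    m = len(p_values)
    if m == 0:
        return []
    # Indices of the p-values sorted in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the
    # adjusted values stay monotone (never exceed a later one).
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical per-protein p-values from a differential abundance test.
    raw = [0.01, 0.04, 0.03, 0.005]
    for p, q in zip(raw, benjamini_hochberg(raw)):
        print(f"p = {p:.3f}  adjusted = {q:.3f}")
```

As an extended library grows, more proteins are tested, so the same raw p-value is scaled by a larger number of tests; this is why an uncorrected threshold would tend to admit more false positives with large libraries.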
