Automated selected reaction monitoring data analysis workflow for large-scale targeted proteomic studies

Targeted proteomics based on selected reaction monitoring (SRM) mass spectrometry is commonly used for accurate and reproducible quantification of protein analytes in complex biological mixtures. Strictly hypothesis-driven, SRM assays quantify each targeted protein by collecting measurements on its peptide fragment ions, called transitions. To achieve sensitive and accurate quantitative results, experimental design and data analysis must consistently account for the variability of the quantified transitions. This consistency is especially important in large experiments, which increasingly require profiling up to hundreds of proteins over hundreds of samples. Here we describe a robust and automated workflow for the analysis of large quantitative SRM data sets that integrates data processing, statistical protein identification and quantification, and dissemination of the results. The integrated workflow combines three software tools: mProphet for peptide identification via probabilistic scoring; SRMstats for protein significance analysis with linear mixed-effect models; and PASSEL, a public repository for storage, retrieval and query of SRM data. The input requirements for the protocol are files with SRM traces in mzXML format, and a file with a list of transitions in a text tab-separated format. The protocol is especially suited for data with heavy isotope–labeled peptide internal standards. We demonstrate the protocol on a clinical data set in which the abundances of 35 biomarker candidates were profiled in 83 blood plasma samples of subjects with ovarian cancer or benign ovarian tumors. The time frame to realize the protocol is 1–2 weeks, depending on the number of replicates used in the experiment.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  R. Aebersold,et al.  High Sensitivity Detection of Plasma Proteins by Multiple Reaction Monitoring of N-Glycosites*S , 2007, Molecular & Cellular Proteomics.

[3]  Ruedi Aebersold,et al.  Reproducible Quantification of Cancer-Associated Proteins in Body Fluids Using Targeted Proteomics , 2012, Science Translational Medicine.

[4]  R. Aebersold,et al.  Selected reaction monitoring–based proteomics: workflows, potential, pitfalls and future directions , 2012, Nature Methods.

[5]  M. Bissell,et al.  Cancer genetics-guided discovery of serum biomarker signatures for diagnosis and prognosis of prostate cancer , 2012 .

[6]  John Chilton,et al.  Using iRT, a normalized retention time for more targeted measurement of peptides , 2012, Proteomics.

[7]  Olga Vitek,et al.  Statistical design of quantitative mass spectrometry-based proteomic experiments. , 2009, Journal of proteome research.

[8]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Peter Willett,et al.  What is a tutorial , 2013 .

[10]  Christoph H Borchers,et al.  Multi-site assessment of the precision and reproducibility of multiple reaction monitoring–based measurements of proteins in plasma , 2009, Nature Biotechnology.

[11]  R. Aebersold,et al.  Selected reaction monitoring for quantitative proteomics: a tutorial , 2008, Molecular systems biology.

[12]  Ruedi Aebersold,et al.  On the development of plasma protein biomarkers. , 2011, Journal of proteome research.

[13]  Lukas N. Mueller,et al.  Full Dynamic Range Proteome Analysis of S. cerevisiae by Targeted Proteomics , 2009, Cell.

[14]  Dominique Cansell,et al.  B Method , 2006, The Seventeen Provers of the World.

[15]  Gunther Schadow,et al.  Protein quantification in label-free LC-MS experiments. , 2009, Journal of proteome research.

[16]  R. Aebersold,et al.  mProphet: automated data processing and statistical validation for large-scale SRM experiments , 2011, Nature Methods.

[17]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[18]  Leigh Anderson,et al.  Quantitative Mass Spectrometric Multiple Reaction Monitoring Assays for Major Plasma Proteins* , 2006, Molecular & Cellular Proteomics.

[19]  Ruedi Aebersold,et al.  Protein Significance Analysis in Selected Reaction Monitoring (SRM) Measurements* , 2011, Molecular & Cellular Proteomics.

[20]  Luis Mendoza,et al.  PASSEL: The PeptideAtlas SRMexperiment library , 2012, Proteomics.