An Optimized Informatics Pipeline for Mass Spectrometry-Based Peptidomics

AbstractThe comprehensive MS analysis of the peptidome, the intracellular and intercellular products of protein degradation, has the potential to provide novel insights on endogenous proteolytic processing and its utility in disease diagnosis and prognosis. Along with the advances in MS instrumentation and related platforms, a plethora of proteomics data analysis tools have been applied for direct use in peptidomics; however, an evaluation of the currently available informatics pipelines for peptidomics data analysis has yet to be reported. In this study, we began by evaluating the results of several popular MS/MS database search engines, including MS-GF+, SEQUEST, and MS-Align+, for peptidomics data analysis, followed by identification and label-free quantification using the well-established accurate mass and time (AMT) tag and newly developed informed quantification (IQ) approaches, both based on direct LC-MS analysis. Our results demonstrated that MS-GF+ outperformed both SEQUEST and MS-Align+ in identifying peptidome peptides. Using a database established from MS-GF+ peptide identifications, both the AMT tag and IQ approaches provided significantly deeper peptidome coverage and less missing data for each individual data set than the MS/MS methods, while achieving robust label-free quantification. Besides having an excellent correlation with the AMT tag quantification results, IQ also provided slightly higher peptidome coverage. Taken together, we propose an optimized informatics pipeline combining MS-GF+ for initial database searching with IQ (or AMT tag) approaches for identification and label-free quantification for high-throughput, comprehensive, and quantitative peptidomics analysis. Graphical Abstractᅟ

[1]  Jonathan V Sweedler,et al.  Peptides in the brain: mass spectrometry-based measurement approaches and challenges. , 2008, Annual review of analytical chemistry.

[2]  Gordon W. Slysz,et al.  Comprehensive Quantitative Analysis of Ovarian and Breast Cancer Tumor Peptidomes , 2014, Journal of proteome research.

[3]  Lloyd D. Fricker,et al.  Proteasome Inhibitors Alter Levels of Intracellular Peptides in HEK293T and SH-SY5Y Cells , 2014, PloS one.

[4]  Ronald J Moore,et al.  Global analysis of the Deinococcus radiodurans proteome by using accurate mass tags , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Gordon A. Anderson,et al.  DtaRefinery, a Software Tool for Elimination of Systematic Errors from Parent Ion Mass Measurements in Tandem Mass Spectra Data Sets* , 2009, Molecular & Cellular Proteomics.

[6]  Ying S. Ting,et al.  Protein Identification Using Top-Down Spectra* , 2012, Molecular & Cellular Proteomics.

[7]  Jerry D. Holman,et al.  Identifying Proteomic LC‐MS/MS Data Sets with Bumbershoot and IDPicker , 2012, Current protocols in bioinformatics.

[8]  Lloyd D. Fricker,et al.  Hemopressin and Other Bioactive Peptides from Cytosolic Proteins: Are These Non-Classical Neuropeptides? , 2010, The AAPS Journal.

[9]  Navdeep Jaitly,et al.  VIPER: an advanced software package to support high-throughput LC-MS peptide identification , 2007, Bioinform..

[10]  Inez Finoulst,et al.  Sample Preparation Techniques for the Untargeted LC-MS-Based Discovery of Peptides in Complex Biological Matrices , 2011, Journal of biomedicine & biotechnology.

[11]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[12]  J. Sironi,et al.  Peptidomic analysis of human cell lines. , 2011, Journal of proteome research.

[13]  Albert J R Heck,et al.  Expanding the detectable HLA peptide repertoire using electron-transfer/higher-energy collision dissociation (EThcD) , 2014, Proceedings of the National Academy of Sciences.

[14]  Eve Marder,et al.  Mass spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab, Cancer borealis , 2003, Journal of neurochemistry.

[15]  Jonathan V Sweedler,et al.  Discovering new invertebrate neuropeptides using mass spectrometry. , 2006, Mass spectrometry reviews.

[16]  Ljiljana Paša-Tolić,et al.  An accurate mass tag strategy for quantitative and high‐throughput proteome measurements , 2002, Proteomics.

[17]  Stefan Pieper,et al.  Liquid chromatography-mass spectrometry-based quantitative proteomics. , 2009, Methods in molecular biology.

[18]  W. Van Criekinge,et al.  Peptidomics coming of age: a review of contributions from a bioinformatics angle. , 2010, Journal of proteome research.

[19]  Ying Ge,et al.  Combining bottom-up and top-down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic hormone from Cancer borealis. , 2009, Analytical chemistry.

[20]  A. Saghatelian,et al.  Investigating endogenous peptides and peptidases using peptidomics. , 2011, Biochemistry.

[21]  Navdeep Jaitly,et al.  Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data , 2009, BMC Bioinformatics.

[22]  Albert J R Heck,et al.  High-sensitivity Orbitrap mass analysis of intact macromolecular assemblies , 2012, Nature Methods.

[23]  M. Schrader,et al.  Peptidomics technologies for human body fluids. , 2001, Trends in biotechnology.

[24]  Pedro R Cutillas,et al.  Application of Label-free Quantitative Peptidomics for the Identification of Urinary Biomarkers of Kidney Chronic Allograft Dysfunction* , 2009, Molecular & Cellular Proteomics.

[25]  Liliane Schoofs,et al.  Comparison of extraction methods for peptidomics analysis of mouse brain tissue , 2011, Journal of Neuroscience Methods.

[26]  Richard D. Smith,et al.  DanteR: an extensible R-based tool for quantitative analysis of -omics data , 2012, Bioinform..

[27]  Oskar Karlsson,et al.  Uncovering effects of ex vivo protease activity during proteomics and peptidomics sample extraction in rat brain tissue by oxygen-18 labeling. , 2014, Journal of proteome research.

[28]  Richard D. Smith,et al.  Advances in proteomics data analysis and display using an accurate mass and time tag approach. , 2006, Mass spectrometry reviews.

[29]  A. Olshen,et al.  Differential exoprotease activities confer tumor-specific serum peptidome patterns. , 2005, The Journal of clinical investigation.

[30]  Samuel H Payne,et al.  Automated data extraction from in situ protein-stable isotope probing studies. , 2014, Journal of proteome research.

[31]  Lloyd D. Fricker,et al.  Peptidomic analysis of HEK293T cells: effect of the proteasome inhibitor epoxomicin on intracellular peptides. , 2012, Journal of proteome research.

[32]  S. Baumann,et al.  Standardized peptidome profiling of human urine by magnetic bead separation and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. , 2007, Clinical chemistry.

[33]  Alan R. Dabney,et al.  A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics. , 2011, Analytical chemistry.