IonStar enables high-precision, low-missing-data proteomics quantification in large biological cohorts

Significance Reliable proteome-wide quantification in large biological cohorts is highly valuable for clinical and pharmaceutical research yet remains extremely challenging despite recent technical advancements. Specifically, elevated missing data levels and compromised quantitative quality are common issues for prevalent methods. Here, we describe an IonStar technique taking advantage of sensitive and selective MS1 ion current-base quantification via innovations in effective and reproducible quantitative feature generation. Compared with several label-free strategies, IonStar showed superior performance in large-cohort analysis, manifested by excellent accuracy/precision, extremely low missing data, and confident discovery of subtle protein changes. In a proof-of-concept study, we demonstrated that IonStar quantified >7,000 unique proteins in 100 brain samples with no missing data and excellent quantitative quality, which has not been achievable by existing methods. Reproducible quantification of large biological cohorts is critical for clinical/pharmaceutical proteomics yet remains challenging because most prevalent methods suffer from drastically declined commonly quantified proteins and substantially deteriorated quantitative quality as cohort size expands. MS2-based data-independent acquisition approaches represent tremendous advancements in reproducible protein measurement, but often with limited depth. We developed IonStar, an MS1-based quantitative approach enabling in-depth, high-quality quantification of large cohorts by combining efficient/reproducible experimental procedures with unique data-processing components, such as efficient 3D chromatographic alignment, sensitive and selective direct ion current extraction, and stringent postfeature generation quality control. Compared with several popular label-free methods, IonStar exhibited far lower missing data (0.1%), superior quantitative accuracy/precision [∼5% intragroup coefficient of variation (CV)], the widest protein abundance range, and the highest sensitivity/specificity for identifying protein changes (<5% false altered-protein discovery) in a benchmark sample set (n = 20). We demonstrated the usage of IonStar by a large-scale investigation of traumatic injuries and pharmacological treatments in rat brains (n = 100), quantifying >7,000 unique protein groups (>99.8% without missing data across the 100 samples) with a low false discovery rate (FDR), two or more unique peptides per protein, and high quantitative precision. IonStar represents a reliable and robust solution for precise and reproducible protein measurement in large cohorts.

[1]  Fernando M. Maroto,et al.  ChromAlign: A two-step algorithmic procedure for time alignment of three-dimensional LC-MS chromatographic surfaces. , 2006, Analytical chemistry.

[2]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[3]  J. Hatton,et al.  A Review of Neuroprotection Pharmacology and Therapies in Patients with Acute Traumatic Brain Injury , 2012, CNS Drugs.

[4]  José A. Dianes,et al.  2016 update of the PRIDE database and its related tools , 2016, Nucleic Acids Res..

[5]  Knut Reinert,et al.  Tools for Label-free Peptide Quantification , 2012, Molecular & Cellular Proteomics.

[6]  D. DuBois,et al.  Highly Multiplexed and Reproducible Ion-Current-Based Strategy for Large-Scale Quantitative Proteomics and the Application to Protein Expression Dynamics Induced by Methylprednisolone in 60 Rats , 2014, Analytical chemistry.

[7]  Laurent Gatto,et al.  Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies. , 2016, Journal of proteome research.

[8]  Quanhu Sheng,et al.  ICan: An Optimized Ion-Current-Based Quantification Procedure with Enhanced Quantitative Accuracy and Sensitivity in Biomarker Discovery , 2014, Journal of proteome research.

[9]  Marianne Sandin,et al.  Data processing has major impact on the outcome of quantitative label-free LC-MS analysis. , 2015, Journal of proteome research.

[10]  B. Searle Scaffold: A bioinformatic tool for validating MS/MS‐based proteomic studies , 2010, Proteomics.

[11]  Ruedi Aebersold,et al.  Options and considerations when selecting a quantitative proteomics strategy , 2010, Nature Biotechnology.

[12]  Richard D Smith,et al.  Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics. , 2015, Journal of proteome research.

[13]  J. Yates,et al.  A model for random sampling and estimation of relative protein abundance in shotgun proteomics. , 2004, Analytical chemistry.

[14]  Lukas N. Mueller,et al.  SuperHirn – a novel tool for high resolution LC‐MS‐based peptide/protein profiling , 2007, Proteomics.

[15]  Hao Wang,et al.  Global proteomic analysis in trypanosomes reveals unique proteins and conserved cellular processes impacted by arginine methylation. , 2013, Journal of proteomics.

[16]  Sung Kyu Park,et al.  A quantitative analysis software tool for mass spectrometry–based proteomics , 2008, Nature Methods.

[17]  A. C. Thompson,et al.  Large-Scale, Ion-Current-Based Proteomic Investigation of the Rat Striatal Proteome in a Model of Short- and Long-Term Cocaine Withdrawal. , 2016, Journal of proteome research.

[18]  Quanhu Sheng,et al.  Systematic Assessment of Survey Scan and MS2-Based Abundance Strategies for Label-Free Quantitative Proteomics Using High-Resolution MS Data , 2014, Journal of proteome research.

[19]  Q. Hu,et al.  Experimental Null Method to Guide the Development of Technical Procedures and to Control False-Positive Discovery in Quantitative Proteomics. , 2015, Journal of proteome research.

[20]  Samuel H Payne,et al.  PECAN: Library Free Peptide Detection for Data-Independent Acquisition Tandem Mass Spectrometry Data , 2017, Nature Methods.

[21]  Thomas F. Rau,et al.  Phenoxybenzamine Is Neuroprotective in a Rat Model of Severe Traumatic Brain Injury , 2014, International journal of molecular sciences.

[22]  Thomas F. Rau,et al.  Administration of low dose methamphetamine 12h after a severe traumatic brain injury prevents neurological dysfunction and cognitive impairment in rats , 2014, Experimental Neurology.

[23]  Robert E. Kearney,et al.  Methods for combining peptide intensities to estimate relative protein abundance , 2010, Bioinform..

[24]  M. Girolami,et al.  Clinical proteomics: A need to define the field and to begin to set adequate standards , 2007, Proteomics. Clinical applications.

[25]  Lukas Käll,et al.  DeMix-Q: Quantification-Centered Data Processing Workflow* , 2016, Molecular & Cellular Proteomics.

[26]  Michael J. MacCoss,et al.  Platform-independent and Label-free Quantitation of Proteomic Data Using MS1 Extracted Ion Chromatograms in Skyline , 2012, Molecular & Cellular Proteomics.

[27]  Oliver M. Bernhardt,et al.  Extending the Limits of Quantitative Proteome Profiling with Data-Independent Acquisition and Application to Acetaminophen-Treated Three-Dimensional Liver Microtissues* , 2015, Molecular & Cellular Proteomics.

[28]  M. Ueffing,et al.  Direct comparison of MS‐based label‐free and SILAC quantitative proteome profiling strategies in primary retinal Müller cells , 2012, Proteomics.

[29]  David L Tabb,et al.  IDPQuantify: combining precursor intensity with spectral counts for protein and peptide quantification. , 2013, Journal of proteome research.

[30]  Michael D. Litton,et al.  IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. , 2009, Journal of proteome research.

[31]  Richard D. Smith,et al.  Normalization and missing value imputation for label-free LC-MS analysis , 2012, BMC Bioinformatics.

[32]  Ljiljana Paša-Tolić,et al.  An accurate mass tag strategy for quantitative and high‐throughput proteome measurements , 2002, Proteomics.

[33]  Richard E Higgs,et al.  Label-free LC-MS method for the identification of biomarkers. , 2008, Methods in molecular biology.

[34]  Birgit Schilling,et al.  Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. , 2010, Journal of proteome research.

[35]  Ben C. Collins,et al.  OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data , 2014, Nature Biotechnology.

[36]  Andreas Quandt,et al.  An automated pipeline for high-throughput label-free quantitative proteomics. , 2013, Journal of proteome research.

[37]  Timothy D. Veenstra,et al.  AN ACCURATE MASS TAG STRATEGY FOR QUANTITATIVE AND HIGH THROUGHPUT PROTEOME MEASUREMENTS , 2002 .

[38]  W. Zhou,et al.  Proteomic Analyses for the Global S-Nitrosylated Proteins in the Brain Tissues of Different Human Prion Diseases , 2016, Molecular Neurobiology.

[39]  Jean-Charles Sanchez,et al.  Proteomic analysis of human substantia nigra identifies novel candidates involved in Parkinson's disease pathogenesis , 2014, Proteomics.

[40]  Steven A Carr,et al.  Protein biomarker discovery and validation: the long and uncertain path to clinical utility , 2006, Nature Biotechnology.

[41]  Ludovic C. Gillet,et al.  Rapid mass spectrometric conversion of tissue biopsy samples into permanent quantitative digital proteome maps , 2015, Nature Medicine.

[42]  Chih-Chiang Tsou,et al.  DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics , 2015, Nature Methods.

[43]  Scott Peterman,et al.  Mass spectrometric discovery and selective reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21 maternal serum. , 2011, Journal of proteome research.

[44]  Marco Y. Hein,et al.  Accurate Proteome-wide Label-free Quantification by Delayed Normalization and Maximal Peptide Ratio Extraction, Termed MaxLFQ * , 2014, Molecular & Cellular Proteomics.

[45]  Oliver M. Bernhardt,et al.  Reproducible and Consistent Quantification of the Saccharomyces cerevisiae Proteome by SWATH-mass spectrometry* , 2015, Molecular & Cellular Proteomics.

[46]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[47]  Q. Hu,et al.  An IonStar Experimental Strategy for MS1 Ion Current-Based Quantification Using Ultrahigh-Field Orbitrap: Reproducible, In-Depth, and Accurate Protein Measurement in Large Cohorts. , 2017, Journal of proteome research.

[48]  Peter Filzmoser,et al.  Outlier identification in high dimensions , 2008, Comput. Stat. Data Anal..

[49]  Hendrik Weisser,et al.  Targeted Feature Detection for Data-Dependent Shotgun Proteomics , 2017, Journal of proteome research.

[50]  Adam A. Margolin,et al.  Empirical Bayes Analysis of Quantitative Proteomics Experiments , 2009, PloS one.

[51]  Richard D. Smith,et al.  Advances and Challenges in Liquid Chromatography-Mass Spectrometry-based Proteomics Profiling for Clinical Applications* , 2006, Molecular & Cellular Proteomics.

[52]  M. Mann,et al.  More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. , 2011, Journal of proteome research.

[53]  Michael K. Coleman,et al.  Correlation of relative abundance ratios derived from peptide ion chromatograms and spectrum counting for quantitative proteomic analysis using stable isotope labeling. , 2005, Analytical chemistry.

[54]  M. Mann,et al.  Analysis of proteins and proteomes by mass spectrometry. , 2001, Annual review of biochemistry.

[55]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[56]  K. Kultima,et al.  Quantification of the brain proteome in Alzheimer's disease using multiplexed mass spectrometry. , 2014, Journal of proteome research.

[57]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[58]  Fredrik Levander,et al.  Data processing methods and quality control strategies for label-free LC-MS protein quantification. , 2014, Biochimica et biophysica acta.