Threshold-avoiding proteomics pipeline.

We present a new proteomics analysis pipeline focused on maximizing the dynamic range of detected molecules in liquid chromatography-mass spectrometry (LC-MS) data and accurately quantifying low-abundance peaks to identify those with biological relevance. Although there has been much work to improve the quality of data derived from LC-MS instruments, the goal of this study was to extend the dynamic range of analyzed compounds by making full use of the information available within each data set and across multiple related chromatograms in an experiment. Our aim was to distinguish low-abundance signal peaks from noise by noting their coherent behavior across multiple data sets, and central to this is the need to delay the culling of noise peaks until the final peak-matching stage of the pipeline, when peaks from a single sample appear in the context of all others. The application of thresholds that might discard signal peaks early is thereby avoided, hence the name TAPP: threshold-avoiding proteomics pipeline. TAPP focuses on quantitative low-level processing of raw LC-MS data and includes novel preprocessing, peak detection, time alignment, and cluster-based matching. We demonstrate the performance of TAPP on biologically relevant sample data consisting of porcine cerebrospinal fluid spiked over a wide range of concentrations with horse heart cytochrome c.

[1]  Ruedi Aebersold,et al.  Options and considerations when selecting a quantitative proteomics strategy , 2010, Nature Biotechnology.

[2]  A. Marshall,et al.  Fourier transform ion cyclotron resonance mass spectrometry: a primer. , 1998, Mass spectrometry reviews.

[3]  M. Guilhaus,et al.  Special feature: Tutorial. Principles and instrumentation in time‐of‐flight mass spectrometry. Physical and instrumental concepts , 1995 .

[4]  Piera Versura,et al.  Improved Label-Free LC-MS Analysis by Wavelet-Based Noise Rejection , 2010, Journal of biomedicine & biotechnology.

[5]  Frank Suits,et al.  Two-dimensional method for time aligning liquid chromatography-mass spectrometry data. , 2008, Analytical chemistry.

[6]  Jeffrey S. Morris,et al.  Signal in noise: evaluating reported reproducibility of serum proteomic tests for ovarian cancer. , 2005, Journal of the National Cancer Institute.

[7]  Rainer Breitling,et al.  Increasing the mass accuracy of high‐resolution LC‐MS data using background ions – a case study on the LTQ‐Orbitrap , 2008, Proteomics.

[8]  Steffen Neumann,et al.  Highly sensitive feature detection for high resolution LC/MS , 2008, BMC Bioinformatics.

[9]  John R. Yates,et al.  The biological impact of mass-spectrometry-based proteomics , 2007, Nature.

[10]  D. Alotto,et al.  Preparation and Characterization of a Novel Skin Substitute , 2010, Journal of biomedicine & biotechnology.

[11]  T. Hankemeier,et al.  The effect of preanalytical factors on stability of the proteome and selected metabolites in cerebrospinal fluid (CSF). , 2009, Journal of proteome research.

[12]  Fredrik Levander,et al.  Wavelet-based method for noise characterization and rejection in high-performance liquid chromatography coupled to mass spectrometry. , 2008, Analytical chemistry.

[13]  Brendan K Faherty,et al.  Optimization and Use of Peptide Mass Measurement Accuracy in Shotgun Proteomics*S , 2006, Molecular & Cellular Proteomics.

[14]  Yufei Huang,et al.  Review of Peak Detection Algorithms in Liquid-Chromatography-Mass Spectrometry , 2009, Current genomics.

[15]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[16]  Alan H. Fielding,et al.  Cluster and Classification Techniques for the Biosciences , 2006 .

[17]  M. Mann,et al.  More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. , 2011, Journal of proteome research.

[18]  N. Anderson,et al.  The Human Plasma Proteome , 2002, Molecular & Cellular Proteomics.

[19]  Ruedi Aebersold,et al.  Quantitative interaction proteomics using mass spectrometry , 2009, Nature Methods.

[20]  John R Yates,et al.  Mass spectrometry in high-throughput proteomics: ready for the big time , 2010, Nature Methods.

[21]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[22]  J. Trottier,et al.  Book Review: Mass Spectrometry: Principles and Applications. E. de Hoffman, J. Charette and W. Stroobant. Wiley, Chichester 1996. ISBN 0 471 96697 5 , 1997 .

[23]  Frank Suits,et al.  Chip‐LC‐MS for label‐free profiling of human serum , 2007, Electrophoresis.

[24]  M. Girolami,et al.  Recommendations for Biomarker Identification and Qualification in Clinical Proteomics , 2010, Science Translational Medicine.

[25]  A. McCormack,et al.  Direct Analysis of Protein Mixtures by Tandem Mass Spectrometry , 1997, Journal of protein chemistry.

[26]  D. Goodlett,et al.  Faster, quantitative, and accurate precursor acquisition independent from ion count. , 2011, Analytical chemistry.