Proteomics Quality Control: Quality Control Software for MaxQuant Results.

Mass spectrometry-based proteomics coupled to liquid chromatography has matured into an automated, high-throughput technology that produces multiple gigabytes of data per instrument per day. Consequently, automated quality control (QC) and quality analysis (QA) capable of detecting measurement bias, verifying consistency, and avoiding propagation of error are paramount for instrument operators and for scientists in charge of downstream analysis. We have developed an R-based QC pipeline called Proteomics Quality Control (PTXQC) for bottom-up LC-MS data generated by the MaxQuant software pipeline. PTXQC creates a QC report containing a comprehensive set of QC metrics, augmented with automated scoring functions. The automated scores are collated into an overview heatmap at the beginning of the report, which gives valuable guidance to nonspecialists as well. Our software supports a wide range of experimental designs, including stable isotope labeling by amino acids in cell culture (SILAC), tandem mass tags (TMT), and label-free data. Furthermore, we introduce new metrics to score MaxQuant's Match-between-runs (MBR) functionality, by which peptide identifications can be transferred across Raw files based on accurate retention time and m/z. Finally, PTXQC is easy to install and use and represents the first QC software capable of processing MaxQuant result tables. PTXQC is freely available at https://github.com/cbielow/PTXQC.
