The mzQuantML Data Standard for Mass Spectrometry–based Quantitative Studies in Proteomics

The range of heterogeneous approaches available for quantifying protein abundance via mass spectrometry (MS) leads to considerable challenges in modeling, archiving, and exchanging experimental data sets, and in submitting them as supplemental material to journals. To date, there has been no widely accepted format for capturing the evidence trail of how quantitative analysis has been performed by software, for transferring data between software packages, or for submitting to public databases. In the context of the Proteomics Standards Initiative, we have developed the mzQuantML data standard. The standard can represent quantitative data about regions in two-dimensional retention time versus mass/charge space (called features), peptides, proteins, and protein groups (where there is ambiguity regarding peptide-to-protein inference), and it offers limited support for small molecule (metabolomic) data. The format has structures for representing replicate MS runs, for grouping replicates (for example, as study variables), and for capturing the parameters used by software packages to arrive at the reported values. The format can reference other standards such as mzML and mzIdentML, and thus the evidence trail for the MS workflow as a whole can now be described. Several software implementations are available, and we encourage other bioinformatics groups to use mzQuantML as an input, internal, or output format for quantitative software and for structuring local repositories. All project resources are available in the public domain from the HUPO Proteomics Standards Initiative at http://www.psidev.info/mzquantml.
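To make the relationship between features, replicate runs, and study variables concrete, the following is a minimal in-memory sketch in Python. It is not the mzQuantML schema: the class names (`Assay`, `StudyVariable`, `Feature`), their fields, the file names, and the helper `mean_abundance` are all hypothetical and chosen only for illustration. It assumes that each feature carries one quantitative value per run and that a study variable is simply a named group of runs.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Assay:
    """One replicate MS run (hypothetical name, not taken from the schema)."""
    id: str
    raw_file: str  # assumed reference to an external file such as mzML


@dataclass
class StudyVariable:
    """A named grouping of replicate runs, e.g. an experimental condition."""
    id: str
    assay_ids: List[str]


@dataclass
class Feature:
    """A region in retention time versus mass/charge space."""
    id: str
    rt_seconds: float
    mz: float
    charge: int
    # Quantitative value per run, keyed by assay id (illustrative layout).
    abundance: Dict[str, float] = field(default_factory=dict)


def mean_abundance(feature: Feature, group: StudyVariable) -> Optional[float]:
    """Average a feature's values over the runs grouped by a study variable."""
    values = [feature.abundance[a] for a in group.assay_ids
              if a in feature.abundance]
    return mean(values) if values else None


# Three replicate runs; two of them belong to the "control" study variable.
assays = [Assay("a1", "run1.mzML"), Assay("a2", "run2.mzML"),
          Assay("a3", "run3.mzML")]
control = StudyVariable("control", ["a1", "a2"])
f = Feature("f1", rt_seconds=1200.5, mz=523.77, charge=2,
            abundance={"a1": 100.0, "a2": 300.0, "a3": 50.0})

print(mean_abundance(f, control))  # 200.0
```

Keying values by run identifier rather than by position keeps the grouping of replicates separate from the measurements themselves, so the same feature can be summarized under different study variables without being copied.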
