This item is the archived peer-reviewed author-version of: CONSTANd : a normalization method for isobaric labeled spectra by constrained optimization

In quantitative proteomics applications, the use of isobaric labels is a very popular concept as they allow for multiplexing, such that peptides from multiple biological samples are quantified simultaneously in one mass spectrometry experiment. Although this multiplexing allows that peptide intensities are affected by the same amount of instrument variability, systematic effects during sample preparation can also introduce a bias in the quantitation measurements. Therefore, normalization methods are required to remove this systematic error. At present, a few dedicated normalization methods for isobaric labeled data are at hand. Most of these normalization methods include a framework for statistical data analysis and rely on ANOVA or linear mixed models. However, for swift quality control of the samples or data visualization a simple normalization technique is sufficient. To this aim, we present a new and easy-to-use data-driven normalization method, named CONSTANd. The CONSTANd method employs constrained optimization and prior information about the labeling strategy to normalize the peptide intensities. Further, it allows maintaining the connection to any biological effect while reducing the systematic and technical errors. As a result, peptides can not only be compared directly within a multiplexed experiment, but are also comparable between other isobaric labeled datasets from multiple experimental designs that are normalized by the CONSTANd method, without the need to include a reference sample in every experimental setup. The latter property is especially useful when more than six, eight or ten (TMT/iTRAQ) biological samples are required to detect differential peptides with sufficient statistical power and to optimally make use of the multiplexing capacity of isobaric labels.

[1]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[2]  Richard Stone,et al.  The Precision of National Income Estimates , 1942 .

[3]  K. E. Boulding,et al.  The structure of American economy, 1919-1929 : an empirical application of equilibrium analysis , 1942 .

[4]  Karl Mechtler,et al.  High precision quantitative proteomics using iTRAQ on an LTQ Orbitrap: a new mass spectrometric method combining the benefits of all. , 2009, Journal of proteome research.

[5]  Liliane Schoofs,et al.  Determination of Variation Parameters as a Crucial Step in Designing TMT-Based Clinical Proteomics Experiments , 2015, PloS one.

[6]  Karl Mechtler,et al.  Improved Precision of iTRAQ and TMT Quantification by an Axial Extraction Field in an Orbitrap HCD Cell , 2011, Analytical chemistry.

[7]  Friedrich Lottspeich,et al.  ICPL labeling strategies for proteome research. , 2011, Methods in molecular biology.

[8]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[9]  Yudi Pawitan,et al.  Normalization of oligonucleotide arrays based on the least-variant set of genes , 2008, BMC Bioinformatics.

[10]  J M Bland,et al.  Statistical methods for assessing agreement between two methods of clinical measurement , 1986 .

[11]  Louis de Mesnard,et al.  Biproportional Techniques in Input-Output Analysis: Table Updating and Structural Analysis , 2004 .

[12]  Tommi S. Jaakkola,et al.  Fast optimal leaf ordering for hierarchical clustering , 2001, ISMB.

[13]  S. Fienberg An Iterative Procedure for Estimation in Contingency Tables , 1970 .

[14]  J. Yates,et al.  Protein analysis by shotgun/bottom-up proteomics. , 2013, Chemical reviews.

[15]  Olga Vitek,et al.  Computational Mass Spectrometry–Based Proteomics , 2011, PLoS Comput. Biol..

[16]  Ingo Ruczinski,et al.  Detecting Significant Changes in Protein Abundance. , 2015, EuPA open proteomics.

[17]  W. Deming,et al.  On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known , 1940 .

[18]  Rachel M. Adams,et al.  Systematic comparison of label-free, metabolic labeling, and isobaric chemical labeling for quantitative proteomics on LTQ Orbitrap Velos. , 2012, Journal of proteome research.

[19]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[20]  W. Leontief,et al.  The structure of American economy, 1919-1929 : an empirical application of equilibrium analysis , 1942 .

[21]  S. Kullback,et al.  Contingency tables with given marginals. , 1968, Biometrika.

[22]  Karl Mechtler,et al.  General statistical modeling of data from protein relative expression isobaric tags. , 2011, Journal of proteome research.

[23]  P. Jagtap,et al.  Early events of Bacillus anthracis germination identified by time‐course quantitative proteomics , 2006, Proteomics.

[24]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[25]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[26]  T. Therneau,et al.  A statistical model for iTRAQ data analysis. , 2008, Journal of proteome research.

[27]  Bhavinkumar B. Patel,et al.  Isobaric labeling and data normalization without requiring protein quantitation. , 2012, Journal of biomolecular techniques : JBT.

[28]  Yi Zhang,et al.  A Robust Error Model for iTRAQ Quantification Reveals Divergent Signaling between Oncogenic FLT3 Mutants in Acute Myeloid Leukemia* , 2009, Molecular & Cellular Proteomics.

[29]  Geert Molenberghs,et al.  Likelihood Based Frequentist Inference When Data Are Missing at Random , 1998 .

[30]  J. Yates,et al.  Isobaric Labeling-Based Relative Quantification in Shotgun Proteomics , 2014, Journal of proteome research.

[31]  Michael Bacharach,et al.  Biproportional matrices & input-output change , 1970 .

[32]  Thomas Burger,et al.  DAPAR & ProStaR: software to perform statistical analyses in quantitative discovery proteomics , 2016, Bioinform..

[33]  Andrew H. Thompson,et al.  Tandem mass tags: a novel quantification strategy for comparative analysis of complex protein mixtures by MS/MS. , 2003, Analytical chemistry.

[34]  C. Bessant,et al.  i-Tracker: For quantitative proteomics using iTRAQ™ , 2005, BMC Genomics.

[35]  Ingo Ruczinski,et al.  Statistical inference from multiple iTRAQ experiments without using common reference standards. , 2013, Journal of proteome research.

[36]  H. Robinson,et al.  Sink populations in carnivore management: cougar demography and immigration in a hunted population. , 2008, Ecological applications : a publication of the Ecological Society of America.

[37]  Jeffrey S. Miller,et al.  iTRAQ is a useful method to screen for membrane-bound proteins differentially expressed in human natural killer cell types. , 2007, Journal of proteome research.

[38]  L. Bregman Proof of the convergence of Sheleikhovskii's method for a problem with transportation constraints , 1967 .

[39]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[40]  Terry M Therneau,et al.  Statistical analysis of relative labeled mass spectrometry data from complex samples using ANOVA. , 2008, Journal of proteome research.

[41]  Ann L. Oberg,et al.  Statistical methods for quantitative mass spectrometry proteomic experiments with labeling , 2012, BMC Bioinformatics.

[42]  M. Arntzen,et al.  IsobariQ: software for isobaric quantitative proteomics using IPTL, iTRAQ, and TMT. , 2011, Journal of proteome research.

[43]  Edward L. Huttlin,et al.  Increasing the multiplexing capacity of TMTs using reporter ion isotopologues with isobaric masses. , 2012, Analytical chemistry.

[44]  Javier Cabrera,et al.  Analysis of Data From Viral DNA Microchips , 2001 .

[45]  M. Mann,et al.  A practical recipe for stable isotope labeling by amino acids in cell culture (SILAC) , 2006, Nature Protocols.

[46]  D. Hochstrasser,et al.  Relative quantification of proteins in human cerebrospinal fluids by MS/MS using 6-plex isobaric tags. , 2008, Analytical chemistry.

[47]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[48]  Albert Sickmann,et al.  Precise protein quantification based on peptide quantification using iTRAQ™ , 2007, BMC Bioinformatics.

[49]  Gilbert S Omenn,et al.  Differential protein expression profiling by iTRAQ-2DLC-MS/MS of lung cancer cells undergoing epithelial-mesenchymal transition reveals a migratory/invasive phenotype. , 2006, Journal of proteome research.

[50]  Tomasz Burzykowski,et al.  Evaluation of normalization methods to pave the way towards large-scale LC-MS-based metabolomics profiling experiments. , 2013, Omics : a journal of integrative biology.

[51]  Steven P Gygi,et al.  Hyperplexing: A Method for Higher-Order Multiplexed Quantitative Proteomics Provides a Map of the Dynamic Response to Rapamycin in Yeast , 2012, Science Signaling.

[52]  Michael Green Biproportional Matrices and Input‐Output Change , 1971 .