A hierarchical statistical modeling approach to analyze proteomic isobaric tag for relative and absolute quantitation data

MOTIVATION Isobaric tag for relative and absolute quantitation (iTRAQ) is a widely used method in quantitative proteomics. A robust data analysis strategy is required to determine protein quantification reliability, i.e. changes due to biological regulation rather than technical variation, so that proteins that are differentially expressed can be identified. METHODS Samples were created by mixing 5, 10, 15 and 20 μg Escherichia coli cell lysate with 100 μg of cell lysate from mouse, corresponding to expected relative fold changes of one for mouse proteins and from 0.25 to 4 for E.coli proteins. Relative quantification was carried out using eight channel isobaric tagging with iTRAQ reagent, and proteins were identified using a TripleTOF 5600 mass spectrometer. Technical variation inherent in this iTRAQ dataset was systematically investigated. RESULTS A hierarchical statistical model was developed to use quantitative information at peptide level and protein level simultaneously to estimate variation present in each individual peptide and protein. A novel data analysis strategy for iTRAQ, denoted in short as WHATraq, was subsequently proposed with its performance evaluated by the proportion of E.coli proteins that are successfully identified as differentially expressed. Compared with two benchmark data analysis strategies WHATraq was able to identify at least 62.8% more true positive proteins that are differentially expressed. Further validated using a biological iTRAQ dataset including multiple biological replicates from varied murine cell lines, WHATraq performed consistently and identified 375% more proteins as being differentially expressed among different cell lines than the other data analysis strategies.

[1]  Crispin Miller,et al.  Quantitative Proteomics Analysis Demonstrates Post-transcriptional Regulation of Embryonic Stem Cell Differentiation to Hematopoiesis*S , 2008, Molecular & Cellular Proteomics.

[2]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[3]  Trong Khoa Pham,et al.  Technical, experimental, and biological variations in isobaric tags for relative and absolute quantitation (iTRAQ). , 2007, Journal of proteome research.

[4]  References , 1971 .

[5]  R. Aebersold,et al.  Analysis of protein complexes using mass spectrometry , 2007, Nature Reviews Molecular Cell Biology.

[6]  Jenny Forshed,et al.  Defining, Comparing, and Improving iTRAQ Quantification in Mass Spectrometry Proteomics Data* , 2013, Molecular & Cellular Proteomics.

[7]  Michael J. Walker,et al.  Identification of Nuclear Protein Targets for Six Leukemogenic Tyrosine Kinases Governed by Post-Translational Regulation , 2012, PloS one.

[8]  John Chilton,et al.  LTQ‐iQuant: A freely available software pipeline for automated and accurate protein quantification of isobaric tagged peptide data from LTQ instruments , 2010, Proteomics.

[9]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[10]  Ishtiaq Rehman,et al.  iTRAQ underestimation in simple and complex mixtures: "the good, the bad and the ugly". , 2009, Journal of proteome research.

[11]  Wen-Lian Hsu,et al.  Multi-Q: a fully automated tool for multiplexed protein quantitation. , 2006, Journal of proteome research.

[12]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Terry M Therneau,et al.  Relative quantification: characterization of bias, variability and fold changes in mass spectrometry data from iTRAQ-labeled peptides. , 2011, Journal of proteome research.

[14]  Yi Zhang,et al.  A Robust Error Model for iTRAQ Quantification Reveals Divergent Signaling between Oncogenic FLT3 Mutants in Acute Myeloid Leukemia* , 2009, Molecular & Cellular Proteomics.

[15]  Andrew H. Thompson,et al.  Tandem mass tags: a novel quantification strategy for comparative analysis of complex protein mixtures by MS/MS. , 2003, Analytical chemistry.

[16]  R. Aebersold,et al.  Mass Spectrometry and Protein Analysis , 2006, Science.

[17]  Y. Vassetzky,et al.  Antagonistic functional duality of cancer genes. , 2013, Gene.

[18]  J. Licht,et al.  Leukemic IDH1 and IDH2 mutations result in a hypermethylation phenotype, disrupt TET2 function, and impair hematopoietic differentiation. , 2010, Cancer cell.

[19]  L. Liau,et al.  Cancer-associated IDH1 mutations produce 2-hydroxyglutarate , 2009, Nature.

[20]  Terry M Therneau,et al.  Statistical analysis of relative labeled mass spectrometry data from complex samples using ANOVA. , 2008, Journal of proteome research.

[21]  Caroline Dive,et al.  Statistical Considerations of Optimal Study Design for Human Plasma Proteomics and Biomarker Discovery , 2012, Journal of proteome research.

[22]  Josselin Noirel,et al.  Minimising iTRAQ ratio compression through understanding LC‐MS elution dependence and high‐resolution HILIC fractionation , 2011, Proteomics.

[23]  Crispin J. Miller,et al.  Eight-channel iTRAQ Enables Comparison of the Activity of Six Leukemogenic Tyrosine Kinases*S , 2008, Molecular & Cellular Proteomics.

[24]  Michael J. Walker,et al.  A caspase-3 ‘death-switch' in colorectal cancer cells for induced and synchronous tumor apoptosis in vitro and in vivo facilitates the development of minimally invasive cell death biomarkers , 2013, Cell Death and Disease.

[25]  T. Therneau,et al.  A statistical model for iTRAQ data analysis. , 2008, Journal of proteome research.

[26]  Frank Klawonn,et al.  MS-specific noise model reveals the potential of iTRAQ in quantitative proteomics , 2009, Bioinform..

[27]  S. Gygi,et al.  ms3 eliminates ratio distortion in isobaric multiplexed quantitative , 2011 .

[28]  Karl Mechtler,et al.  General statistical modeling of data from protein relative expression isobaric tags. , 2011, Journal of proteome research.

[29]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[30]  J. Tavernier,et al.  Ectopic interleukin-5 receptor expression promotes proliferation without development in a multipotent hematopoietic cell line. , 1998, Journal of cell science.

[31]  Crispin J. Miller,et al.  Quantitative proteomics reveals posttranslational control as a regulatory factor in primary hematopoietic stem cells. , 2006, Blood.

[32]  P. Manow ‚The Good, the Bad, and the Ugly‘ , 2002 .

[33]  J. Martínková,et al.  Relative quantitation of proteins fractionated by the ProteomeLab™ PF 2D system using isobaric tags for relative and absolute quantitation (iTRAQ) , 2007, Analytical and bioanalytical chemistry.

[34]  N. Karp,et al.  Addressing Accuracy and Precision Issues in iTRAQ Quantitation* , 2010, Molecular & Cellular Proteomics.

[35]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[36]  Yan Li,et al.  Optimized proteomic analysis of a mouse model of cerebellar dysfunction using amine‐specific isobaric tags , 2006, Proteomics.

[37]  Bernhard Kuster,et al.  Robust and Sensitive iTRAQ Quantification on an LTQ Orbitrap Mass Spectrometer*S , 2008, Molecular & Cellular Proteomics.