Significance analysis of microarray for relative quantitation of LC/MS data in proteomics

BackgroundAlthough fold change is a commonly used criterion in quantitative proteomics for differentiating regulated proteins, it does not provide an estimation of false positive and false negative rates that is often desirable in a large-scale quantitative proteomic analysis. We explore the possibility of applying the Significance Analysis of Microarray (SAM) method (PNAS 98:5116-5121) to a differential proteomics problem of two samples with replicates. The quantitative proteomic analysis was carried out with nanoliquid chromatography/linear iron trap-Fourier transform mass spectrometry. The biological sample model included two Mycobacterium smegmatis unlabeled cell cultures grown at pH 5 and pH 7. The objective was to compare the protein relative abundance between the two unlabeled cell cultures, with an emphasis on significance analysis of protein differential expression using the SAM method. Results using the SAM method are compared with those obtained by fold change and the conventional t-test.ResultsWe have applied the SAM method to solve the two-sample significance analysis problem in liquid chromatography/mass spectrometry (LC/MS) based quantitative proteomics. We grew the pH5 and pH7 unlabelled cell cultures in triplicate resulting in 6 biological replicates. Each biological replicate was mixed with a common 15N-labeled reference culture cells for normalization prior to SDS/PAGE fractionation and LC/MS analysis. For each biological replicate, one center SDS/PAGE gel fraction was selected for triplicate LC/MS analysis. There were 121 proteins quantified in at least 5 of the 6 biological replicates. Of these 121 proteins, 106 were significant in differential expression by the t-test (p < 0.05) based on peptide-level replicates, 54 were significant in differential expression by SAM with Δ = 0.68 cutoff and false positive rate at 5%, and 29 were significant in differential expression by the t-test (p < 0.05) based on protein-level replicates. The results indicate that SAM appears to overcome the false positives one encounters using the peptide-based t-test while allowing for identification of a greater number of differentially expressed proteins than the protein-based t-test.ConclusionWe demonstrate that the SAM method can be adapted for effective significance analysis of proteomic data. It provides much richer information about the protein differential expression profiles and is particularly useful in the estimation of false discovery rates and miss rates.

[1]  Samuel Kaplan,et al.  Application of the accurate mass and time tag approach to the proteome analysis of sub-cellular fractions obtained from Rhodobacter sphaeroides 2.4.1. Aerobic and photosynthetic cell cultures. , 2006, Journal of proteome research.

[2]  J. Slonczewski,et al.  pH-Dependent Catabolic Protein Expression during Anaerobic Growth of Escherichia coli K-12 , 2004, Journal of bacteriology.

[3]  Wei Sun,et al.  Proteomic analysis of individual variation in normal livers of human beings using difference gel electrophoresis , 2006, Proteomics.

[4]  T. Miyamoto,et al.  Identification of DNA-binding proteins changed after induction of sporulation in Bacillus cereus. , 1995, Bioscience, biotechnology, and biochemistry.

[5]  S. Ruben,et al.  Reproducibility assessment of relative quantitation strategies for LC-MS based proteomics. , 2007, Analytical chemistry.

[6]  L. Rohlin,et al.  Quantitative proteomic and microarray analysis of the archaeon Methanosarcina acetivorans grown with acetate versus methanol. , 2007, Journal of proteome research.

[7]  Bryan A. P. Roxas,et al.  Determination of global protein turnover in stressed mycobacterium cells using hybrid-linear ion trap-fourier transform mass spectrometry. , 2008, Analytical chemistry.

[8]  W. Whitman,et al.  Quantitative Proteomics of the Archaeon Methanococcus maripaludis Validated by Microarray Analysis and Real Time PCR *S , 2006, Molecular & Cellular Proteomics.

[9]  Yang Liu,et al.  Transcriptional Adaptation of Mycobacterium tuberculosis within Macrophages , 2003, The Journal of experimental medicine.

[10]  Katherine H. Huang,et al.  Transcriptome Profiling of Shewanella oneidensis Gene Expression following Exposure to Acidic and Alkaline pH , 2006, Journal of bacteriology.

[11]  J. Yates,et al.  A model for random sampling and estimation of relative protein abundance in shotgun proteomics. , 2004, Analytical chemistry.

[12]  Qingbo Li,et al.  New algorithm for 15N/14N quantitation with LC-ESI-MS using an LTQ-FT mass spectrometer. , 2006, Journal of proteome research.

[13]  V. Wendisch,et al.  Gene Expression Analysis of Corynebacterium glutamicum Subjected to Long-Term Lactic Acid Adaptation , 2007, Journal of bacteriology.

[14]  T. Veenstra Global and targeted quantitative proteomics for biomarker discovery. , 2007, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[15]  Jeffrey Whiteaker,et al.  Quality control metrics for LC-MS feature detection tools demonstrated on Saccharomyces cerevisiae proteomic profiles. , 2006, Journal of proteome research.

[16]  Matthew C Wiener,et al.  Quantitative analysis of complex peptide mixtures using FTMS and differential mass spectrometry , 2007, Journal of the American Society for Mass Spectrometry.

[17]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .

[18]  Cheng Li,et al.  Simultaneous and exact interval estimates for the contrast of two groups based on an extremely high dimensional variable: application to mass spec data , 2007, Bioinform..

[19]  Guanghui Wang,et al.  Label-free protein quantification using LC-coupled ion trap or FT mass spectrometry: Reproducibility, linearity, and application with complex proteomes. , 2006, Journal of proteome research.

[20]  Rajesh S. Gokhale,et al.  Enzymic activation and transfer of fatty acids as acyl-adenylates in mycobacteria , 2004, Nature.

[21]  Matthias Abend,et al.  Tuberculosis: Pathogenesis, Protection and Control , 1996, Nature Medicine.

[22]  Thomas M. Shinnick,et al.  Microarray Analysis of the Mycobacterium tuberculosis Transcriptional Response to the Acidic Conditions Found in Phagosomes , 2002, Journal of bacteriology.

[23]  P. Wheeler,et al.  Metabolism of Mycobacterium tuberculosis , 1994 .

[24]  J. Listgarten,et al.  Statistical and Computational Methods for Comparative Proteomic Profiling Using Liquid Chromatography-Tandem Mass Spectrometry , 2005, Molecular & Cellular Proteomics.

[25]  W. Jacobs,et al.  Sulfite Reduction in Mycobacteria , 2007, Journal of bacteriology.

[26]  D. Nandi,et al.  PepN is the major aminopeptidase in Escherichia coli: insights on substrate specificity and role during sodium-salicylate-induced stress. , 2003, Microbiology.

[27]  M. Mann,et al.  Status of complete proteome analysis by mass spectrometry: SILAC labeled yeast as a model system , 2006, Genome Biology.

[28]  Giovanni Parmigiani,et al.  Assessing reproducibility of a protein dynamics study using in vivo labeling and liquid chromatography tandem mass spectrometry. , 2005, Analytical chemistry.

[29]  Qingbo Li,et al.  Electron Transport in the Pathway of Acetate Conversion to Methane in the Marine Archaeon Methanosarcina acetivorans , 2006, Journal of bacteriology.

[30]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  J. Leigh,et al.  Comparison of spectral counting and metabolic stable isotope labeling for use with quantitative microbial proteomics. , 2006, The Analyst.

[32]  D. Goodlett,et al.  ICAT-based comparative proteomic analysis of non-replicating persistent Mycobacterium tuberculosis. , 2006, Tuberculosis.

[33]  J. Yates,et al.  Metabolic labeling of mammalian organisms with stable isotopes for quantitative proteomic analysis. , 2004, Analytical chemistry.

[34]  T. Rejtar,et al.  A new algorithm using cross-assignment for label-free quantitation with LC-LTQ-FT MS. , 2007, Journal of proteome research.

[35]  John D. Storey,et al.  Significance analysis of time course microarray experiments. , 2005, Proceedings of the National Academy of Sciences of the United States of America.