From proteomics toward systems biology: integration of different types of proteomics data into network models.

Living organisms are comprised of various systems at different levels, i.e., organs, tissues, and cells. Each system carries out its diverse functions in response to environmental and genetic perturbations, by utilizing biological networks, in which nodal components, such as, DNA, mRNAs, proteins, and metabolites, closely interact with each other. Systems biology investigates such systems by producing comprehensive global data that represent different levels of biological information, i.e., at the DNA, mRNA, protein, or metabolite levels, and by integrating this data into network models that generate coherent hypotheses for given biological situations. This review presents a systems biology framework, called the 'Integrative Proteomics Data Analysis Pipeline' (IPDAP), which generates mechanistic hypotheses from network models reconstructed by integrating diverse types of proteomic data generated by mass spectrometry-based proteomic analyses. The devised framework includes a serial set of computational and network analysis tools. Here, we demonstrate its functionalities by applying these tools to several conceptual examples.

[1]  J. Yates Mass spectral analysis in proteomics. , 2004, Annual review of biophysics and biomolecular structure.

[2]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[3]  Y. Yamauchi,et al.  Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site–specific stable isotope tagging , 2006, Nature Protocols.

[4]  J. Yates,et al.  GutenTag: high-throughput sequence tagging via an empirically derived fragmentation model. , 2003, Analytical chemistry.

[5]  Jing-lan Wang,et al.  Phosphoproteome profile of human liver Chang's cell based on 2‐DE with fluorescence staining and MALDI‐TOF/TOF‐MS , 2007, Electrophoresis.

[6]  X. Cui,et al.  Statistical tests for differential expression in cDNA microarray experiments , 2003, Genome Biology.

[7]  T. Ideker,et al.  A new approach to decoding life: systems biology. , 2001, Annual review of genomics and human genetics.

[8]  L. Liotta,et al.  Proteomic profiling of the cancer microenvironment by antibody arrays , 2001, Proteomics.

[9]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[10]  R. Aebersold,et al.  Automated statistical analysis of protein abundance ratios from data generated by stable-isotope dilution and tandem mass spectrometry. , 2003, Analytical chemistry.

[11]  Serhiy Souchelnytskyi,et al.  Bridging proteomics and systems biology: What are the roads to be traveled? , 2005, Proteomics.

[12]  Bin Liu,et al.  Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together , 2006, Nucleic Acids Res..

[13]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[14]  Christian von Mering,et al.  STRING 7—recent developments in the integration and prediction of protein interactions , 2006, Nucleic Acids Res..

[15]  David J Studholme,et al.  Multidimensional Protein Identification Technology (MudPIT) Analysis of Ubiquitinated Proteins in Plants*S , 2007, Molecular & Cellular Proteomics.

[16]  A. J. Gandolfi,et al.  Proteomic identification of ubiquitinated proteins from human cells expressing His‐tagged ubiquitin , 2005, Proteomics.

[17]  R. Aebersold,et al.  A statistical model for identifying proteins by tandem mass spectrometry. , 2003, Analytical chemistry.

[18]  C. Bessant,et al.  i-Tracker: For quantitative proteomics using iTRAQ™ , 2005, BMC Genomics.

[19]  Jacob D. Jaffe,et al.  PEPPeR, a Platform for Experimental Proteomic Pattern Recognition*S , 2006, Molecular & Cellular Proteomics.

[20]  Brad T. Sherman,et al.  The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists , 2007, Genome Biology.

[21]  Roman A Zubarev,et al.  Electron-capture dissociation tandem mass spectrometry. , 2004, Current opinion in biotechnology.

[22]  Martin Kuiper,et al.  BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks , 2005, Bioinform..

[23]  A. Bauch,et al.  An efficient tandem affinity purification procedure for interaction proteomics in mammalian cells , 2006, Nature Methods.

[24]  S. Nishimura,et al.  Molecular basis of guanine nucleotide dissociation inhibitor activity of human neuroglobin by chemical cross-linking and mass spectrometry. , 2007, Journal of molecular biology.

[25]  Hokeun Kim,et al.  MODi : a powerful and convenient web server for identifying multiple post-translational peptide modifications from tandem mass spectra , 2006, Nucleic Acids Res..

[26]  Robert E. Kearney,et al.  Quantitative Proteomics Analysis of the Secretory Pathway , 2006, Cell.

[27]  M. Mann,et al.  Protein interaction screening by quantitative immunoprecipitation combined with knockdown (QUICK) , 2006, Nature Methods.

[28]  Adam Rauch,et al.  Computational Proteomics Analysis System (CPAS): an extensible, open-source analytic system for evaluating and publishing proteomic data and high throughput biological experiments. , 2006, Journal of proteome research.

[29]  Ruedi Aebersold,et al.  Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry , 2003, Nature Biotechnology.

[30]  Leroy Hood,et al.  Systems biology, proteomics, and the future of health care: toward predictive, preventative, and personalized medicine. , 2004, Journal of proteome research.

[31]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[32]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[33]  Ruedi Aebersold,et al.  Reproducible isolation of distinct, overlapping segments of the phosphoproteome , 2007, Nature Methods.

[34]  David J. Reiss,et al.  The Gaggle: An open-source software system for integrating bioinformatics software and data sources , 2006, BMC Bioinformatics.

[35]  Ruedi Aebersold,et al.  Quantitative phosphoproteome analysis using a dendrimer conjugation chemistry and tandem mass spectrometry , 2005, Nature Methods.

[36]  Setsuo Hirohashi,et al.  Label-free Quantitative Proteomics Using Large Peptide Data Sets Generated by Nanoflow Liquid Chromatography and Mass Spectrometry* , 2006, Molecular & Cellular Proteomics.

[37]  R. Aebersold,et al.  Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry , 2001, Nature Biotechnology.

[38]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[39]  Richard E Higgs,et al.  Comprehensive label-free method for the relative quantification of proteins from biological samples. , 2005, Journal of proteome research.

[40]  M. Mann,et al.  Proteomic analysis of post-translational modifications , 2003, Nature Biotechnology.

[41]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[42]  O. Jensen Modification-specific proteomics: characterization of post-translational modifications by mass spectrometry. , 2004, Current opinion in chemical biology.

[43]  Lewis Y. Geer,et al.  Analysis of phosphorylation sites on proteins from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry , 2007, Proceedings of the National Academy of Sciences.

[44]  J. Yates,et al.  A correlation algorithm for the automated quantitative analysis of shotgun proteomics data. , 2003, Analytical chemistry.

[45]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[46]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[47]  R. Aebersold,et al.  Mass spectrometry in proteomics. , 2001, Chemical reviews.

[48]  Gary D Bader,et al.  BIND--The Biomolecular Interaction Network Database. , 2001, Nucleic acids research.

[49]  Luonan Chen,et al.  Discovering functions and revealing mechanisms at molecular level from biological networks , 2007, Proteomics.

[50]  Sergei Egorov,et al.  Pathway studio - the analysis and navigation of molecular networks , 2003, Bioinform..

[51]  R. Aebersold,et al.  Mass Spectrometry and Protein Analysis , 2006, Science.

[52]  K. Resing,et al.  Mapping protein post-translational modifications with mass spectrometry , 2007, Nature Methods.

[53]  Christoph H Borchers,et al.  Isotopically Coded Cleavable Cross-linker for Studying Protein-Protein Interaction and Protein Complexes* , 2005, Molecular & Cellular Proteomics.

[54]  Alexey I Nesvizhskii,et al.  Analysis and validation of proteomic data generated by tandem mass spectrometry , 2007, Nature Methods.

[55]  M. Miyasaka,et al.  Chemokines in tumor progression and metastasis , 2005, Cancer science.

[56]  Yoshiya Oda,et al.  Quantitative chemical proteomics for identifying candidate drug targets. , 2003, Analytical chemistry.