Exploiting Interdata Relationships in Next-generation Proteomics Analysis*

Mass spectrometry-based proteomics and other technologies have matured to enable routine acquisition of system-wide data sets that describe concentrations, modifications, and interactions of proteins, mRNAs, and other molecules. Productive integrative studies differ from parallel data analysis by quantitative modeling of the relationships between data. We outline steps and considerations towards integromic studies to exploit the synergy between data sets. Graphical Abstract Highlights Technological advances produce many system-wide 'omics data sets. Integrating proteomics with other data needs quantitative modeling of inter-data relationships. Some tools have emerged for considering specific properties of 'omics data. Productive integration of proteomic and other data can provide exciting new insights. Mass spectrometry based proteomics and other technologies have matured to enable routine quantitative, system-wide analysis of concentrations, modifications, and interactions of proteins, mRNAs, and other molecules. These studies have allowed us to move toward a new field concerned with mining information from the combination of these orthogonal data sets, perhaps called “integromics.” We highlight examples of recent studies and tools that aim at relating proteomic information to mRNAs, genetic associations, and changes in small molecules and lipids. We argue that productive data integration differs from parallel acquisition and interpretation and should move toward quantitative modeling of the relationships between the data. These relationships might be expressed by temporal information retrieved from time series experiments, rate equations to model synthesis and degradation, or networks of causal, evolutionary, physical, and other interactions. We outline steps and considerations toward such integromic studies to exploit the synergy between data sets.

[1]  Bing Zhang,et al.  Proteomics Is Not an Island: Multi-omics Integration Is the Key to Understanding Biological Systems* , 2019, Molecular & Cellular Proteomics.

[2]  Christopher P. Lapointe,et al.  Multi-omics Reveal Specific Targets of the RNA-Binding Protein Puf3p and Its Orchestration of Mitochondrial Biogenesis. , 2017, Cell systems.

[3]  Paola Picotti,et al.  Measuring protein structural changes on a proteome-wide scale using limited proteolysis-coupled mass spectrometry , 2017, Nature Protocols.

[4]  M. Tress,et al.  Alternative Splicing May Not Be the Key to Proteome Complexity. , 2017, Trends in biochemical sciences.

[5]  Avi Ma'ayan,et al.  Integration of protein phosphorylation, acetylation, and methylation data sets to outline lung cancer signaling networks , 2018, Science Signaling.

[6]  Ze Cheng,et al.  Pervasive, Coordinated Protein-Level Changes Driven by Transcript Isoform Switching during Meiosis , 2018, Cell.

[7]  Henry Rodriguez,et al.  Revolutionizing Precision Oncology through Collaborative Proteogenomics and Data Sharing , 2018, Cell.

[8]  Na Liu,et al.  Sequential posttranslational modifications program FEN1 degradation during cell-cycle progression. , 2012, Molecular cell.

[9]  Rozbeh Jafari,et al.  Cellular Thermal Shift Assay Monitoring Drug Target Engagement in Cells and Tissues Using the , 2014 .

[10]  J. Marioni,et al.  Multi‐Omics Factor Analysis—a framework for unsupervised integration of multi‐omics data sets , 2018, Molecular systems biology.

[11]  Maxwell R. Mumbach,et al.  Dynamic profiling of the protein life cycle in response to pathogens , 2015, Science.

[12]  Christopher S. Poultney,et al.  One third of dynamic protein expression profiles can be predicted by a simple rate equation. , 2014, Molecular bioSystems.

[13]  N. Slavov,et al.  SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation , 2017, Genome Biology.

[14]  E. Marcotte,et al.  Insights into the regulation of protein abundance from proteomic and transcriptomic analyses , 2012, Nature Reviews Genetics.

[15]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[16]  M. Selbach,et al.  Global quantification of mammalian gene expression control , 2011, Nature.

[17]  Lana X. Garmire,et al.  More Is Better: Recent Progress in Multi-Omics Data Integration Methods , 2017, Front. Genet..

[18]  Leonid Kruglyak,et al.  Genetics of single-cell protein abundance variation in large yeast populations , 2013 .

[19]  M. Jovanović,et al.  Global Proteome Remodeling during ER Stress Involves Hac1-Driven Expression of Long Undecoded Transcript Isoforms. , 2018, Developmental cell.

[20]  R. Britten,et al.  High-Resolution Density Gradient Sedimentation Analysis , 1960, Science.

[21]  E. Regan,et al.  Common Genetic Polymorphisms Influence Blood Biomarker Measurements in COPD , 2016, PLoS genetics.

[22]  R. Aebersold,et al.  Impact of Alternative Splicing on the Human Proteome , 2017, Cell Reports.

[23]  J. Fox,et al.  Elliptical Insights: Understanding Statistical Methods through Elliptical Geometry , 2013, 1302.4881.

[24]  Yufeng J. Tseng,et al.  3Omics: a web-based systems biology tool for analysis, integration and visualization of human transcriptomic, proteomic and metabolomic data , 2013, BMC Systems Biology.

[25]  Paolo Cifani,et al.  ProteomeGenerator: A framework for comprehensive proteomics based on de novo transcriptome assembly and high-accuracy peptide mass spectral matching , 2017, bioRxiv.

[26]  R. Pal,et al.  Send Orders of Reprints at Reprints@benthamscience.net Integrated Analysis of Transcriptomic and Proteomic Data , 2022 .

[27]  N. Slavov,et al.  SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation , 2017, Genome Biology.

[28]  Scott A. Shaffer,et al.  Genetic Variation Shapes Protein Networks Mainly through Non-transcriptional Mechanisms , 2011, PLoS biology.

[29]  Evan G. Williams,et al.  Systems proteomics of liver mitochondria function , 2016, Science.

[30]  O. Elroy-Stein,et al.  Proteomic analysis of polyribosomes identifies splicing factors as potential regulators of translation during mitosis , 2017, Nucleic acids research.

[31]  Eric W. Deutsch,et al.  A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis , 2013, Nature.

[32]  Steven A Carr,et al.  Integrated proteomic analysis of post-translational modifications by serial enrichment , 2013, Nature Methods.

[33]  E. Airoldi,et al.  Differential Stoichiometry among Core Ribosomal Proteins , 2014, bioRxiv.

[34]  Bo Li,et al.  Characterization of Proteome Variation During Modern Maize Breeding , 2018, Molecular & Cellular Proteomics.

[35]  Matej Oresic,et al.  A computational framework to integrate high-throughput ‘-omics’ datasets for the identification of potential mechanistic links , 2018, Nature Protocols.

[36]  R. Aebersold,et al.  Comparative Functional Analysis of the Caenorhabditis elegans and Drosophila melanogaster Proteomes , 2009, PLoS biology.

[37]  Michael L. Gatza,et al.  Proteogenomics connects somatic mutations to signaling in breast cancer , 2016, Nature.

[38]  Sandipan Ray,et al.  Metabolic oscillations on the circadian time scale in Drosophila cells lacking clock genes , 2018, Molecular systems biology.

[39]  Paul J. Choi,et al.  Quantifying E. coli Proteome and Transcriptome with Single-Molecule Sensitivity in Single Cells , 2010, Science.

[40]  M. Mann,et al.  Organellar Proteomics and Phospho-Proteomics Reveal Subcellular Reorganization in Diet-Induced Hepatic Steatosis. , 2018, Developmental cell.

[41]  Sean C. Bendall,et al.  Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum , 2011, Science.

[42]  F. Edfors,et al.  Gene‐specific correlation of RNA and protein levels in human cells and tissues , 2016, Molecular systems biology.

[43]  Gregory W. Schwartz,et al.  Differential Integration of Transcriptome and Proteome Identifies Pan-Cancer Prognostic Biomarkers , 2018, bioRxiv.

[44]  Hector H. Huang,et al.  Time-Resolved Proteomics Extends Ribosome Profiling-Based Measurements of Protein Synthesis Dynamics. , 2017, Cell systems.

[45]  COPDGene Investigators Common Genetic Polymorphisms Influence Blood Biomarker Measurements in COPD , 2016 .

[46]  Peter Nemes,et al.  Label-free Quantification of Proteins in Single Embryonic Cells with Neural Fate in the Cleavage-Stage Frog (Xenopus laevis) Embryo using Capillary Electrophoresis Electrospray Ionization High-Resolution Mass Spectrometry (CE-ESI-HRMS)* , 2016, Molecular & Cellular Proteomics.

[47]  P. Bickel,et al.  System wide analyses have underestimated protein abundances and the importance of transcription in mammals , 2012, PeerJ.

[48]  R. Aebersold,et al.  On the Dependency of Cellular Protein Levels on mRNA Abundance , 2016, Cell.

[49]  Jake Siegel,et al.  Genetics of trans-regulatory variation in gene expression , 2017, bioRxiv.

[50]  M. Selbach,et al.  Phosphorylation of the Ribosomal Protein RPL12/uL11 Affects Translation during Mitosis. , 2018, Molecular cell.

[51]  Andrew J. Sedgewick,et al.  Learning subgroup-specific regulatory interactions and regulator independence with PARADIGM , 2013, Bioinform..

[52]  Evan G. Williams,et al.  Systems Analyses Reveal Physiological Roles and Genetic Regulators of Liver Lipid Species. , 2018, Cell systems.

[53]  Christian von Mering,et al.  Cell-wide analysis of protein thermal unfolding reveals determinants of thermostability , 2017, Science.

[54]  Hyungwon Choi,et al.  New insights into the cellular temporal response to proteostatic stress , 2018, eLife.

[55]  D. Hwang,et al.  Integrated analysis of global proteome, phosphoproteome, and glycoproteome enables complementary interpretation of disease-related protein networks , 2015, Scientific Reports.

[56]  Hiromi W L Koh,et al.  Differential dynamics of the mammalian mRNA and protein expression response to misfolding stress , 2015, bioRxiv.

[57]  E. Marcotte,et al.  Global signatures of protein and mRNA expression levelsw , 2009 .

[58]  David R Goodlett,et al.  Genetic basis of proteome variation in yeast , 2007, Nature Genetics.

[59]  U. Sauer,et al.  A Map of Protein-Metabolite Interactions Reveals Principles of Chemical Communication , 2018, Cell.

[60]  P. Pavlidis,et al.  Can we predict protein from mRNA levels? , 2017, Nature.

[61]  Shuqiang Li,et al.  Multiplexed, targeted profiling of single-cell proteomes and transcriptomes in a single reaction , 2016, Genome Biology.

[62]  Pär Nordlund,et al.  Modulation of Protein-Interaction States through the Cell Cycle , 2018, Cell.

[63]  Thomas Lumley,et al.  Review of Statistical Learning Methods in Integrated Omics Studies (An Integrated Information Science) , 2018, Bioinformatics and biology insights.

[64]  A. Oliveira,et al.  Global analysis of protein structural changes in complex proteomes , 2014, Nature Biotechnology.

[65]  Jianguo Xia,et al.  OmicsNet: a web-based tool for creation and visual analysis of biological networks in 3D space , 2018, Nucleic Acids Res..

[66]  A. Gasch,et al.  Decoupling Yeast Cell Division and Stress Defense Implicates mRNA Repression in Translational Reallocation during Stress , 2018, Current Biology.

[67]  Marco Y. Hein,et al.  The Perseus computational platform for comprehensive analysis of (prote)omics data , 2016, Nature Methods.

[68]  Evan G. Williams,et al.  Systematic proteome and proteostasis profiling in human Trisomy 21 fibroblast cells , 2017, Nature Communications.

[69]  Marylyn D. Ritchie,et al.  Using knowledge-driven genomic interactions for multi-omics data analysis: metadimensional models for predicting clinical outcomes in ovarian carcinoma , 2017, J. Am. Medical Informatics Assoc..

[70]  Nicholas W. Kwiecien,et al.  Mitochondrial protein functions elucidated by multi-omic mass spectrometry profiling , 2016, Nature Biotechnology.

[71]  G. Drewes,et al.  Thermal proteome profiling monitors ligand interactions with cellular membrane proteins , 2015, Nature Methods.

[72]  Samuel H Payne,et al.  Methods, Tools and Current Perspectives in Proteogenomics * , 2017, Molecular & Cellular Proteomics.

[73]  C. Vogel,et al.  Next-generation analysis of gene expression regulation--comparing the roles of synthesis and degradation. , 2015, Molecular bioSystems.

[74]  Marc R Wilkins,et al.  Proteomic Validation of Transcript Isoforms, Including Those Assembled from RNA-Seq Data. , 2015, Journal of proteome research.

[75]  M. Selbach,et al.  Global analysis of cellular protein translation by pulsed SILAC , 2009, Proteomics.

[76]  S. Moody,et al.  Single-cell mass spectrometry reveals small molecules that affect cell fates in the 16-cell embryo , 2015, Proceedings of the National Academy of Sciences.

[77]  Joel Dudley,et al.  High-Throughput Characterization of Blood Serum Proteomics of IBD Patients with Respect to Aging and Genetic Factors , 2017, PLoS genetics.

[78]  Edoardo M. Airoldi,et al.  Post-transcriptional regulation across human tissues , 2015, bioRxiv.

[79]  Peer Bork,et al.  Pervasive Protein Thermal Stability Variation during the Cell Cycle , 2018, Cell.

[80]  Christian Gieger,et al.  Connecting genetic risk to disease end points through the human blood plasma proteome , 2016, Nature Communications.

[81]  Ronald J. Moore,et al.  Nanodroplet processing platform for deep and quantitative proteome profiling of 10–100 mammalian cells , 2018, Nature Communications.

[82]  Lydie Lane,et al.  Progress on the HUPO Draft Human Proteome: 2017 Metrics of the Human Proteome Project. , 2017, Journal of proteome research.

[83]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[84]  Bin Zhang,et al.  Multiscale network modeling of oligodendrocytes reveals molecular components of myelin dysregulation in Alzheimer’s disease , 2017, Molecular Neurodegeneration.

[85]  Jonathan K. Pritchard,et al.  Primate Transcript and Protein Expression Levels Evolve Under Compensatory Selection Pressures , 2013, Science.

[86]  Harkamal Walia,et al.  Protein abundances are more conserved than mRNA abundances across diverse taxa , 2010, Proteomics.

[87]  B. Blencowe The Relationship between Alternative Splicing and Proteomic Complexity. , 2017, Trends in biochemical sciences.

[88]  B. Kuster,et al.  Mass-spectrometry-based draft of the human proteome , 2014, Nature.

[89]  S. Le,et al.  Sequence signatures and mRNA concentration can explain two-thirds of protein abundance variation in a human cell line , 2010, Molecular systems biology.

[90]  Hyungwon Choi,et al.  PECAplus: statistical analysis of time-dependent regulatory changes in dynamic single-omics and dual-omics experiments , 2017, npj Systems Biology and Applications.