Development of a computational framework for the analysis of protein correlation profiling and spatial proteomics experiments.

UNLABELLED Standard approaches to studying an interactome do not easily allow conditional experiments but in recent years numerous groups have demonstrated the potential for co-fractionation/co-migration based approaches to assess an interactome at a similar sensitivity and specificity yet significantly lower cost and higher speed than traditional approaches. Unfortunately, there is as yet no implementation of the bioinformatics tools required to robustly analyze co-fractionation data in a way that can also integrate the valuable information contained in biological replicates. Here we have developed a freely available, integrated bioinformatics solution for the analysis of protein correlation profiling SILAC data. This modular solution allows the deconvolution of protein chromatograms into individual Gaussian curves enabling the use of these chromatography features to align replicates and assemble a consensus map of features observed across replicates; the chromatograms and individual curves are then used to quantify changes in protein interactions and construct the interactome. We have applied this workflow to the analysis of HeLa cells infected with a Salmonella enterica serovar Typhimurium infection model where we can identify specific interactions that are affected by the infection. These bioinformatics tools simplify the analysis of co-fractionation/co-migration data to the point where there is no specialized knowledge required to measure an interactome in this way. BIOLOGICAL SIGNIFICANCE We describe a set of software tools for the bioinformatics analysis of co-migration/co-fractionation data that integrates the results from multiple replicates to generate an interactome, including the impact on individual interactions of any external perturbation. This article is part of a Special Issue entitled: Protein dynamics in health and disease. Guest Editors: Pierre Thibault and Anne-Claude Gingras.

[1]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[2]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[3]  Antonio Carvajal-Rodríguez,et al.  Multiple Hypothesis Testing in Proteomics: A Strategy for Experimental Work* , 2010, Molecular & Cellular Proteomics.

[4]  L. Foster,et al.  Global Impact of Salmonella Pathogenicity Island 2-secreted Effectors on the Host Phosphoproteome* , 2013, Molecular & Cellular Proteomics.

[5]  Franco J. Vizeacoumar,et al.  Interaction landscape of membrane-protein complexes in Saccharomyces cerevisiae , 2012, Nature.

[6]  S. Pu,et al.  Up-to-date catalogues of yeast protein complexes , 2008, Nucleic acids research.

[7]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[8]  A. Reichert,et al.  Complexome profiling identifies TMEM126B as a component of the mitochondrial complex I assembly complex. , 2012, Cell metabolism.

[9]  Hans-Werner Mewes,et al.  CORUM: the comprehensive resource of mammalian protein complexes , 2007, Nucleic Acids Res..

[10]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[11]  Thomas Burger,et al.  Mass-spectrometry-based spatial proteomics data analysis using pRoloc and pRolocdata , 2014, Bioinform..

[12]  P. Bork,et al.  Proteome Organization in a Genome-Reduced Bacterium , 2009, Science.

[13]  Leonard J Foster,et al.  Phosphoproteomic Analysis of Salmonella-Infected Cells Identifies Key Kinase Regulators and SopB-Dependent Host Phosphorylation Events , 2011, Science Signaling.

[14]  Derek J. Bailey,et al.  Intelligent Data Acquisition Blends Targeted and Discovery Methods , 2014, Journal of proteome research.

[15]  S. Teichmann,et al.  Protein Complexes Are under Evolutionary Selection to Assemble via Ordered Pathways , 2013, Cell.

[16]  G. Meister,et al.  A multiprotein complex mediates the ATP-dependent assembly of spliceosomal U snRNPs , 2001, Nature Cell Biology.

[17]  B. Coombes,et al.  Active modification of host inflammation by Salmonella , 2013, Gut microbes.

[18]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[19]  Andrei L. Turinsky,et al.  A Census of Human Soluble Protein Complexes , 2012, Cell.

[20]  Matthias Mann,et al.  Fractionation profiling: a fast and versatile approach for mapping vesicle proteomes and protein–protein interactions , 2014, Molecular biology of the cell.

[21]  A. Goldberg,et al.  Properties of the hybrid form of the 26S proteasome containing both 19S and PA28 complexes , 2002, The EMBO journal.

[22]  A. Emili,et al.  Sequential Peptide Affinity (SPA) system for the identification of mammalian and bacterial protein complexes. , 2004, Journal of proteome research.

[23]  Sarah A Teichmann,et al.  The origins and evolution of functional modules: lessons from protein complexes , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[24]  Lucas Pelkmans,et al.  Kinase-regulated quantal assemblies and kiss-and-run recycling of caveolae , 2005, Nature.

[25]  A. Emili,et al.  Interaction network containing conserved and essential protein complexes in Escherichia coli , 2005, Nature.

[26]  Rod B. Watson,et al.  Mapping the Arabidopsis organelle proteome. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Xiaohui S. Xie,et al.  A Mammalian Organelle Map by Protein Correlation Profiling , 2006, Cell.

[28]  A J Olson,et al.  Structural symmetry and protein function. , 2000, Annual review of biophysics and biomolecular structure.

[29]  S. Teichmann,et al.  Assembly reflects evolution of protein complexes , 2008, Nature.

[30]  D. Holden,et al.  Functions of the Salmonella pathogenicity island 2 (SPI-2) type III secretion system effectors. , 2012, Microbiology.

[31]  Y. Hiraoka,et al.  ORFeome cloning and global analysis of protein localization in the fission yeast Schizosaccharomyces pombe , 2006, Nature Biotechnology.

[32]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[33]  M. Trotter,et al.  Improved sub‐cellular resolution via simultaneous analysis of organelle proteomics data across varied experimental conditions , 2010, Proteomics.

[34]  M. MacCoss,et al.  Maximizing Peptide Identification Events in Proteomic Workflows Using Data-Dependent Acquisition (DDA)* , 2013, Molecular & Cellular Proteomics.

[35]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[36]  M. Trotter,et al.  The effect of organelle discovery upon sub-cellular protein localisation. , 2013, Journal of proteomics.

[37]  Nevan J. Krogan,et al.  From systems to structure: bridging networks and mechanism. , 2013, Molecular cell.

[38]  S. Kanaya,et al.  Large-scale identification of protein-protein interaction of Escherichia coli K-12. , 2006, Genome research.

[39]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[40]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[41]  Matthias Mann,et al.  Analysis of High Accuracy, Quantitative Proteomics Data in the MaxQB Database , 2012, Molecular & Cellular Proteomics.

[42]  Friedrich Kopp,et al.  Reconstitution of hybrid proteasomes from purified PA700–20 S complexes and PA28αβ activator: ultrastructure and peptidase activities , 2001 .

[43]  H. Choy,et al.  Caveolae-mediated entry of Salmonella typhimurium into senescent nonphagocytotic host cells , 2010, Aging cell.

[44]  L. Foster,et al.  A high-throughput approach for measuring temporal changes in the interactome , 2012, Nature Methods.

[45]  Jürgen Cox,et al.  1D and 2D annotation enrichment: a statistical method integrating quantitative proteomics with complementary high-throughput data , 2012, BMC Bioinformatics.

[46]  L. Mirny,et al.  Protein complexes and functional modules in molecular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[47]  M. Mann,et al.  Proteomic characterization of the human centrosome by protein correlation profiling , 2003, Nature.

[48]  Laurent Gatto,et al.  A Foundation for Reliable Spatial Proteomics Data Analysis* , 2014, Molecular & Cellular Proteomics.

[49]  Lily Ting,et al.  Normalization and Statistical Analysis of Quantitative Proteomics Data Generated by Metabolic Labeling* , 2009, Molecular & Cellular Proteomics.

[50]  A. Emili,et al.  Protein-protein interaction networks: probing disease mechanisms using model systems , 2013, Genome Medicine.

[51]  L. Foster,et al.  An integrated global strategy for cell lysis, fractionation, enrichment and mass spectrometric analysis of phosphorylated peptides. , 2010, Molecular bioSystems.

[52]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[53]  A. Hershko,et al.  ATP-dependent incorporation of 20S protease into the 26S complex that degrades proteins conjugated to ubiquitin. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[54]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[55]  Ran Kafri,et al.  Preferential protection of protein interaction network hubs in yeast: Evolved functionality of genetic redundancy , 2008, Proceedings of the National Academy of Sciences.

[56]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[57]  B. Raymond,et al.  Subversion of trafficking, apoptosis, and innate immunity by type III secretion system effectors. , 2013, Trends in microbiology.

[58]  Patrick Cramer,et al.  Review Conservation between the Rna Polymerase I, Ii, and Iii Transcription Initiation Machineries , 2022 .

[59]  Rod B. Watson,et al.  Localization of Organelle Proteins by Isotope Tagging (LOPIT)*S , 2004, Molecular & Cellular Proteomics.

[60]  S. Oliver Proteomics: Guilt-by-association goes global , 2000, Nature.

[61]  L. Foster,et al.  Protein synthesis rate is the predominant regulator of protein expression during differentiation , 2013, Molecular systems biology.

[62]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[63]  M. Mann,et al.  Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips , 2007, Nature Protocols.

[64]  P. Uetz,et al.  The binary protein-protein interaction landscape of Escherichia coli , 2014, Nature Biotechnology.

[65]  A. Goldberg,et al.  The Sizes of Peptides Generated from Protein by Mammalian 26 and 20 S Proteasomes , 1999, The Journal of Biological Chemistry.

[66]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[67]  M. Gerstein,et al.  Relating whole-genome expression data with protein-protein interactions. , 2002, Genome research.