ProteoViz: a tool for the analysis and interactive visualization of phosphoproteomics data.

Quantitative proteomics generates large datasets with increasing depth and quantitative information. With the advance of mass spectrometry and increasingly larger data sets, streamlined methodologies and tools for analysis and visualization of phosphoproteomics are needed both at the protein and modified peptide levels. To assist in addressing this need, we developed ProteoViz, which includes a set of R scripts that perform normalization and differential expression analysis of both the proteins and enriched phosphorylated peptides, and identify sequence motifs, kinases, and gene set enrichment pathways. The tool generates interactive visualization plots that allow users to interact with the phosphoproteomics results and quickly identify proteins and phosphorylated peptides of interest for their biological study. The tool also links significant phosphosites with sequence motifs and pathways that will help explain the experimental conditions and guide future experiments. Here, we present the workflow and demonstrate its functionality by analyzing a phosphoproteomic data set from two lymphoma cell lines treated with kinase inhibitors. The scripts and data are freely available at https://github.com/ByrumLab/ProteoViz and via the ProteomeXchange with identifier PXD015606.

[1]  Jill P. Mesirov,et al.  A Curated Resource for Phosphosite-specific Signature Analysis* , 2018, Molecular & Cellular Proteomics.

[2]  M. Mann,et al.  Universal sample preparation method for proteome analysis , 2009, Nature Methods.

[3]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[4]  M. Senko,et al.  Novel parallelized quadrupole/linear ion trap/Orbitrap tribrid mass spectrometer improving proteome coverage and peptide identification rates. , 2013, Analytical chemistry.

[5]  Jesper V Olsen,et al.  Benchmarking common quantification strategies for large-scale phosphoproteomics , 2018, Nature Communications.

[6]  P. Beltrão,et al.  Uncovering Phosphorylation-Based Specificities through Functional Interaction Networks* , 2015, Molecular & Cellular Proteomics.

[7]  P. Grossmann,et al.  Integration of TP53, DREAM, MMB-FOXM1 and RB-E2F target gene analyses identifies cell cycle gene regulatory networks , 2016, Nucleic acids research.

[8]  Edward L. Huttlin,et al.  MultiNotch MS3 Enables Accurate, Sensitive, and Multiplexed Detection of Differential Expression across Cancer Cell Line Proteomes , 2014, Analytical chemistry.

[9]  Julian Mintseris,et al.  Active Instrument Engagement Combined with a Real-Time Database Search for Improved Performance of Sample Multiplexing Workflows. , 2019, Journal of proteome research.

[10]  S. Gygi,et al.  An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets , 2005, Nature Biotechnology.

[11]  A. LaCasce,et al.  Adolescent and young adult lymphoma: collaborative efforts toward optimizing care and improving outcomes. , 2017, Blood advances.

[12]  Noah Dephoure,et al.  Deep Coverage of Global Protein Expression and Phosphorylation in Breast Tumor Cell Lines Using TMT 10-plex Isobaric Labeling. , 2017, Journal of proteome research.

[13]  Monther Alhamdoosh,et al.  Easy and efficient ensemble gene set testing with EGSEA , 2017, F1000Research.

[14]  Giuseppe Troiano,et al.  The crucial role of protein phosphorylation in cell signaling and its use as targeted therapy (Review) , 2017, International journal of molecular medicine.

[15]  William Stafford Noble,et al.  MoMo: Discovery of statistically significant post-translational modification motifs , 2018, bioRxiv.

[16]  Yanbao Yu,et al.  S-Trap, an Ultrafast Sample-Preparation Approach for Shotgun Proteomics. , 2018, Journal of proteome research.

[17]  D. Wessel,et al.  A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. , 1984, Analytical biochemistry.

[18]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[19]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.