PloGO: Plotting gene ontology annotation and abundance in multi‐condition proteomics experiments

We describe the PloGO R package, a simple open‐source tool for plotting gene ontology (GO) annotation and abundance information, which was developed to aid with the bioinformatics analysis of multi‐condition label‐free proteomics experiments using quantitation based on spectral counting. PloGO can incorporate abundance (raw spectral counts) or normalized spectral abundance factors (NSAF) data in addition to the GO annotation, as well as handle multiple files and allow for a targeted collection of GO categories of interest. Our main aims were to help identify interesting subsets of proteins for further analysis such as those arising from a protein data set partition based on the presence and absence or multiple pair‐wise comparisons, as well as provide GO summaries that can be easily used in subsequent analyses. Though developed with label‐free proteomics experiments in mind it is not specific to that approach and can be used for any multi‐condition experiment for which GO information has been generated.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[3]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[4]  Hans Lehrach,et al.  GOblet: a platform for Gene Ontology annotation of anonymous sequence data , 2004, Nucleic Acids Res..

[5]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[6]  John N. Weinstein,et al.  High-Throughput GoMiner, an 'industrial-strength' integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID) , 2005, BMC Bioinformatics.

[7]  Juan Miguel García-Gómez,et al.  BIOINFORMATICS APPLICATIONS NOTE Sequence analysis Manipulation of FASTQ data with Galaxy , 2005 .

[8]  Nan Wang,et al.  AgBase: a functional genomics resource for agriculture , 2006, BMC Genomics.

[9]  Lin Fang,et al.  WEGO: a web tool for plotting GO annotations , 2006, Nucleic Acids Res..

[10]  Michael K. Coleman,et al.  Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae. , 2006, Journal of proteome research.

[11]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[12]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[13]  John R Yates,et al.  GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data , 2008, Proteome Science.

[14]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[15]  Dana Pascovici,et al.  Differential metabolic response of cultured rice (Oryza sativa) cells exposed to high‐ and low‐temperature stress , 2010, Proteomics.

[16]  Zhou Du,et al.  agriGO: a GO analysis toolkit for the agricultural community , 2010, Nucleic Acids Res..

[17]  W. Majeran,et al.  Reconstruction of Metabolic Pathways, Protein Expression, and Homeostasis Machineries across Maize Bundle Sheath and Mesophyll Chloroplasts: Large-Scale Quantitative Proteomics Using the First Maize Genome Assembly1[W][OA] , 2010, Plant Physiology.

[18]  Dana Pascovici,et al.  Shotgun proteomic profiling of five species of New Zealand Pachycladon , 2011, Proteomics.

[19]  Dana Pascovici,et al.  Differential regulation of aquaporins, small GTPases and V‐ATPases proteins in rice leaves subjected to drought stress and recovery , 2012, Proteomics.