Pathway-GPS and SIGORA: identifying relevant pathways based on the over-representation of their gene-pair signatures

Motivation. Predominant pathway analysis approaches treat pathways as collections of individual genes and consider all pathway members as equally informative. As a result, at times spurious and misleading pathways are inappropriately identified as statistically significant, solely due to components that they share with the more relevant pathways. Results. We introduce the concept of Pathway Gene-Pair Signatures (Pathway-GPS) as pairs of genes that, as a combination, are specific to a single pathway. We devised and implemented a novel approach to pathway analysis, Signature Over-representation Analysis (SIGORA), which focuses on the statistically significant enrichment of Pathway-GPS in a user-specified gene list of interest. In a comparative evaluation of several published datasets, SIGORA outperformed traditional methods by delivering biologically more plausible and relevant results. Availability. An efficient implementation of SIGORA, as an R package with precompiled GPS data for several human and mouse pathway repositories is available for download from http://sigora.googlecode.com/svn/.

[1]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[2]  A. Casadevall,et al.  Fcγ Receptors Regulate Immune Activation and Susceptibility during Mycobacterium tuberculosis Infection1 , 2008, The Journal of Immunology.

[3]  Brad T. Sherman,et al.  DAVID-WS: a stateful web service to facilitate gene/protein list analysis , 2012, Bioinform..

[4]  A. Casadevall,et al.  Fc gamma receptors regulate immune activation and susceptibility during Mycobacterium tuberculosis infection. , 2008, Journal of immunology.

[5]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Niall J. Lennon,et al.  The Early Whole-Blood Transcriptional Signature of Dengue Virus and Features Associated with Progression to Dengue Shock Syndrome in Vietnamese Children and Young Adults , 2010, Journal of Virology.

[7]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[8]  A. Katyal,et al.  Peroxisome proliferator activating receptor (PPAR) in cerebral malaria (CM): a novel target for an additional therapy , 2011, European Journal of Clinical Microbiology & Infectious Diseases.

[9]  N. Copeland,et al.  Deciphering the genetic landscape of cancer--from genes to pathways. , 2009, Trends in genetics : TIG.

[10]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[11]  Karin Breuer,et al.  InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation , 2012, Nucleic Acids Res..

[12]  L. Serghides The Case for the Use of PPARγ Agonists as an Adjunctive Therapy for Cerebral Malaria , 2011, PPAR research.

[13]  S. Barnum,et al.  The C5 Convertase Is Not Required for Activation of the Terminal Complement Pathway in Murine Experimental Cerebral Malaria* , 2012, The Journal of Biological Chemistry.

[14]  K. Seydel,et al.  Blood Coagulation, Inflammation, and Malaria , 2008, Microcirculation.

[15]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[16]  Samir N. Patel,et al.  Rosiglitazone modulates the innate immune response to Plasmodium falciparum infection and improves outcome in experimental cerebral malaria. , 2009, The Journal of infectious diseases.

[17]  N. Heaton,et al.  Dengue virus-induced autophagy regulates lipid metabolism. , 2010, Cell host & microbe.

[18]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..

[19]  S. Shresta Role of Complement in Dengue Virus Infection: Protection or Pathogenesis? , 2012, mBio.

[20]  J. Ernst,et al.  Macrophage Receptors for Mycobacterium tuberculosis , 1998, Infection and Immunity.

[21]  Qi Liu,et al.  Gene-set analysis and reduction , 2008, Briefings Bioinform..

[22]  H. Hakonarson,et al.  Analysing biological pathways in genome-wide association studies , 2010, Nature Reviews Genetics.

[23]  Jun Ma,et al.  Appearance frequency modulated gene set enrichment testing , 2011, BMC Bioinformatics.

[24]  Martin Vingron,et al.  Improved detection of overrepresentation of Gene-Ontology annotations with parent-child analysis , 2007, Bioinform..

[25]  Samir N. Patel,et al.  Expression microarray analysis implicates apoptosis and interferon-responsive mechanisms in susceptibility to experimental cerebral malaria. , 2007, The American journal of pathology.

[26]  Matthew R. Laird,et al.  Protein Protein Interaction Network Evaluation for Identifying Potential Drug Targets , 2009 .

[27]  J. Nolan,et al.  A unified hypothesis for the genesis of cerebral malaria: sequestration, inflammation and hemostasis leading to microcirculatory dysfunction. , 2006, Trends in parasitology.

[28]  Hedi Peterson,et al.  g:Profiler—a web-based toolset for functional profiling of gene lists from large-scale experiments , 2007, Nucleic Acids Res..

[29]  P. Khatri,et al.  A systems biology approach for pathway level analysis. , 2007, Genome research.

[30]  Satoko Yamamoto,et al.  INOH: ontology-based highly structured database of signal transduction pathways , 2011, Database J. Biol. Databases Curation.

[31]  Purvesh Khatri,et al.  Ontological analysis of gene expression data: current tools, limitations, and open problems , 2005, Bioinform..

[32]  Daniel Jupiter,et al.  TreeHugger: A New Test for Enrichment of Gene Ontology Terms , 2010, INFORMS J. Comput..

[33]  V. Perry,et al.  OVERRIDING THE BRAIN'S INTRINSIC RESISTANCE TO LEUKOCYTE RECRUITMENT WITH INTRAPARENCHYMAL INJECTIONS OF RECOMBINANT CHEMOKINES , 1996, Neuroscience.

[34]  Thorsten Schmidt,et al.  ProfCom: a web tool for profiling the complex functionality of gene groups identified from high-throughput data , 2008, Nucleic Acids Res..

[35]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[36]  D Repsilber,et al.  Human gene expression profiles of susceptibility and resistance in tuberculosis , 2011, Genes and Immunity.

[37]  Cengizhan Ozturk,et al.  Pathway analysis of high-throughput biological data within a Bayesian network framework , 2011, Bioinform..

[38]  G. Senaldi,et al.  Role of polymorphonuclear neutrophil leukocytes and their integrin CD11a (LFA-1) in the pathogenesis of severe murine malaria , 1994, Infection and immunity.

[39]  Duncan R. Smith,et al.  A role for autophagolysosomes in dengue virus 3 production in HepG2 cells. , 2009, The Journal of general virology.

[40]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[41]  E. Wherry,et al.  Molecular signature of CD8+ T cell exhaustion during chronic viral infection. , 2007, Immunity.

[42]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[43]  Yingyao Zhou,et al.  Genome Wide Analysis of Inbred Mouse Lines Identifies a Locus Containing Ppar-γ as Contributing to Enhanced Malaria Survival , 2010, PloS one.

[44]  Lincoln Stein,et al.  Reactome knowledgebase of human biological pathways and processes , 2008, Nucleic Acids Res..

[45]  Jesse Gillis,et al.  The Impact of Multifunctional Genes on "Guilt by Association" Analysis , 2011, PloS one.

[46]  Frank Emmert-Streib,et al.  Pathway Analysis of Expression Data: Deciphering Functional Building Blocks of Complex Diseases , 2011, PLoS Comput. Biol..

[47]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[48]  Vesteinn Thorsson,et al.  Identification of Tuberculosis Susceptibility Genes with Human Macrophage Gene Expression Profiles , 2008, PLoS pathogens.