Analyzing and interpreting genome data at the network level with ConsensusPathDB

ConsensusPathDB consists of a comprehensive collection of human (as well as mouse and yeast) molecular interaction data integrated from 32 different public repositories, together with a web interface that provides computational methods and visualization tools for exploring these data. This protocol describes the use of ConsensusPathDB (http://consensuspathdb.org) for the functional and network-based characterization of biomolecules (genes, proteins and metabolites) that are submitted to the system either as a priority list or together with associated experimental data such as RNA-seq. The tool reports interaction network modules, biochemical pathways and functional information that are significantly enriched in the user's input, applying computational methods for statistical over-representation, enrichment and graph analysis. Results are available within a few minutes, even with genome-wide data. The resulting network associations can be used to interpret high-throughput data mechanistically, to characterize and prioritize biomarkers, to integrate different omics levels, to design follow-up functional assay experiments and to generate topology for kinetic models at different scales.
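To make the idea of statistical over-representation concrete, the following is a minimal sketch of one common approach: a one-sided hypergeometric test per pathway, followed by Benjamini-Hochberg correction for multiple testing. It illustrates the general technique only and is not taken from the ConsensusPathDB implementation; the function names, the choice of background set and the toy gene identifiers are all assumptions made for this example.

```python
from math import comb


def hypergeom_pvalue(k, n, K, N):
    """P(X >= k) for X ~ Hypergeometric: N background genes, K of them
    annotated to the pathway, n genes in the input list."""
    upper = min(n, K)
    total = comb(N, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values (q-values) in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def over_representation(input_genes, pathways, background):
    """Test each pathway for over-representation in the input list.

    input_genes: set of gene identifiers (hypothetical IDs in this sketch)
    pathways:    dict mapping pathway name -> set of member genes
    background:  set of all genes considered (assumed reference universe)
    Returns a list of (name, overlap_size, p_value, q_value), sorted by q.
    """
    genes = input_genes & background
    names = list(pathways)
    overlaps, pvals = [], []
    for name in names:
        members = pathways[name] & background
        k = len(genes & members)
        overlaps.append(k)
        pvals.append(hypergeom_pvalue(k, len(genes), len(members), len(background)))
    qvals = benjamini_hochberg(pvals)
    rows = list(zip(names, overlaps, pvals, qvals))
    return sorted(rows, key=lambda row: row[3])


if __name__ == "__main__":
    # Toy data: 20 background genes, a 4-gene input list, two made-up pathways.
    background = {f"g{i}" for i in range(20)}
    query = {"g0", "g1", "g2", "g3"}
    toy_pathways = {
        "pathway_A": {"g0", "g1", "g2", "g3", "g4"},
        "pathway_B": {"g10", "g11", "g12"},
    }
    for row in over_representation(query, toy_pathways, background):
        print(row)
```

The choice of background matters in practice: testing against all annotated genes rather than the genes actually measurable in the experiment can inflate significance, so the background is passed in explicitly here rather than fixed.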
