Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks

BackgroundUnderstanding the information-processing capabilities of signal transduction networks, how those networks are disrupted in disease, and rationally designing therapies to manipulate diseased states require systematic and accurate reconstruction of network topology. Data on networks central to human physiology, such as the inflammatory signalling networks analyzed here, are found in a multiplicity of on-line resources of pathway and interactome databases (Cancer CellMap, GeneGo, KEGG, NCI-Pathway Interactome Database (NCI-PID), PANTHER, Reactome, I2D, and STRING). We sought to determine whether these databases contain overlapping information and whether they can be used to construct high reliability prior knowledge networks for subsequent modeling of experimental data.ResultsWe have assembled an ensemble network from multiple on-line sources representing a significant portion of all machine-readable and reconcilable human knowledge on proteins and protein interactions involved in inflammation. This ensemble network has many features expected of complex signalling networks assembled from high-throughput data: a power law distribution of both node degree and edge annotations, and topological features of a “bow tie” architecture in which diverse pathways converge on a highly conserved set of enzymatic cascades focused around PI3K/AKT, MAPK/ERK, JAK/STAT, NFκB, and apoptotic signaling. Individual pathways exhibit “fuzzy” modularity that is statistically significant but still involving a majority of “cross-talk” interactions. However, we find that the most widely used pathway databases are highly inconsistent with respect to the actual constituents and interactions in this network. Using a set of growth factor signalling networks as examples (epidermal growth factor, transforming growth factor-beta, tumor necrosis factor, and wingless), we find a multiplicity of network topologies in which receptors couple to downstream components through myriad alternate paths. Many of these paths are inconsistent with well-established mechanistic features of signalling networks, such as a requirement for a transmembrane receptor in sensing extracellular ligands.ConclusionsWide inconsistencies among interaction databases, pathway annotations, and the numbers and identities of nodes associated with a given pathway pose a major challenge for deriving causal and mechanistic insight from network graphs. We speculate that these inconsistencies are at least partially attributable to cell, and context-specificity of cellular signal transduction, which is largely unaccounted for in available databases, but the absence of standardized vocabularies is an additional confounding factor. As a result of discrepant annotations, it is very difficult to identify biologically meaningful pathways from interactome networks a priori. However, by incorporating prior knowledge, it is possible to successively build out network complexity with high confidence from a simple linear signal transduction scaffold. Such reduced complexity networks appear suitable for use in mechanistic models while being richer and better justified than the simple linear pathways usually depicted in diagrams of signal transduction.

[1]  D. Bray Protein molecules as computational elements in living cells , 1995, Nature.

[2]  Chi-Ying F. Huang,et al.  Ultrasensitivity in the mitogen-activated protein kinase cascade. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[3]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[4]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[5]  Y. Yarden,et al.  Untangling the ErbB signalling network , 2001, Nature Reviews Molecular Cell Biology.

[6]  Steven C. Lawlor,et al.  MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data , 2003, Genome Biology.

[7]  E. Gilles,et al.  Computational modeling of the dynamics of the MAP kinase cascade activated by surface and internalized EGF receptors , 2002, Nature Biotechnology.

[8]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[9]  Trey Ideker,et al.  Building with a scaffold: emerging strategies for high- to low-level cellular modeling. , 2003, Trends in biotechnology.

[10]  G. Casari,et al.  A physical and functional map of the human TNF-alpha/NF-kappa B signal transduction pathway. , 2004, Nature cell biology.

[11]  Hiroaki Kitano,et al.  Biological robustness , 2008, Nature Reviews Genetics.

[12]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[13]  R. Jove,et al.  Roles of Gab1 and SHP2 in Paxillin Tyrosine Dephosphorylation and Src Activation in Response to Epidermal Growth Factor* , 2004, Journal of Biological Chemistry.

[14]  Fang Liu,et al.  Identification and characterization of ERK MAP kinase phosphorylation sites in Smad3. , 2005, Biochemistry.

[15]  Natasa Przulj,et al.  High-Throughput Mapping of a Dynamic Signaling Network in Mammalian Cells , 2005, Science.

[16]  U. Alon,et al.  Spontaneous evolution of modularity and network motifs. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Igor Jurisica,et al.  Online Predicted Human Interaction Database , 2005, Bioinform..

[18]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[19]  H. Kitano,et al.  A comprehensive map of the toll-like receptor signaling network , 2006, Molecular systems biology.

[20]  John G. Albeck,et al.  Collecting and organizing systematic sets of protein data , 2006, Nature Reviews Molecular Cell Biology.

[21]  D. Lauffenburger,et al.  The Response of Human Epithelial Cells to TNF Involves an Inducible Autocrine Cascade , 2006, Cell.

[22]  Erich E. Wanker,et al.  Comparison of Human Protein-Protein Interaction Maps , 2007, German Conference on Bioinformatics.

[23]  D. Lauffenburger,et al.  Physicochemical modelling of cell signalling pathways , 2006, Nature Cell Biology.

[24]  Gary D. Bader,et al.  Pathguide: a Pathway Resource List , 2005, Nucleic Acids Res..

[25]  G. Wagner,et al.  The road to modularity , 2007, Nature Reviews Genetics.

[26]  A Kremling,et al.  Systems biology--an engineering perspective. , 2007, Journal of biotechnology.

[27]  Thomas Lengauer,et al.  Computational analysis of human protein interaction networks , 2007, Proteomics.

[28]  T. Pawson,et al.  Oncogenic re-wiring of cellular signaling pathways , 2007, Oncogene.

[29]  C. Daub,et al.  BMC Systems Biology , 2007 .

[30]  Feng Luo,et al.  Modular organization of protein interaction networks , 2007, Bioinform..

[31]  B. Kholodenko,et al.  Ligand-dependent responses of the ErbB signaling network: experimental and modeling analyses , 2007, Molecular systems biology.

[32]  D. Busam,et al.  An Integrated Genomic Analysis of Human Glioblastoma Multiforme , 2008, Science.

[33]  H. Hirt,et al.  Protein networking: insights into global functional organization of proteomes , 2008, Proteomics.

[34]  T. Helikar,et al.  Emergent decision-making in biological signal transduction networks , 2008, Proceedings of the National Academy of Sciences.

[35]  Arend Hintze,et al.  Evolution of Complex Modular Biological Networks , 2007, PLoS Comput. Biol..

[36]  P. Allavena,et al.  Pathways connecting inflammation and cancer. , 2008, Current opinion in genetics & development.

[37]  M. Vidal,et al.  Literature-curated protein interaction datasets , 2009, Nature Methods.

[38]  Pankaj Agarwal,et al.  A global pathway crosstalk network , 2008, Bioinform..

[39]  Andreas Zell,et al.  BowTieBuilder: modeling signal transduction pathways , 2009, BMC Systems Biology.

[40]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[41]  B. Corominas-Murtra,et al.  On the basic computational structure of gene regulatory networks. , 2009, Molecular bioSystems.

[42]  E. A. Sykes,et al.  Cell–cell interaction networks regulate blood stem and progenitor cell fate , 2009, Molecular systems biology.

[43]  E. Fraenkel,et al.  Integrating Proteomic, Transcriptional, and Interactome Data Reveals Hidden Components of Signaling and Regulatory Networks , 2009, Science Signaling.

[44]  A. Bauer-Mehren,et al.  Pathway databases and tools for their exploitation: benefits, current limitations and challenges , 2009, Molecular systems biology.

[45]  D. Lauffenburger,et al.  Discrete logic modelling as a means to link protein signalling networks with functional analysis of mammalian signal transduction , 2009, Molecular systems biology.

[46]  M. Vidal,et al.  Literature-curated protein interaction , 2009 .

[47]  S. Agelaki,et al.  Caveolin-1 regulates EGFR signalling in MCF-7 breast cancer cells and enhances gefitinib-induced tumor cell inhibition , 2009, Cancer biology & therapy.

[48]  H. Kitano,et al.  G-Protein Coupled Receptor Signaling Architecture of Mammalian Immune Cells , 2009, PloS one.

[49]  Steffen Klamt,et al.  The Logic of EGFR/ErbB Signaling: Theoretical Properties and Analysis of High-Throughput Data , 2009, PLoS Comput. Biol..

[50]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[51]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[52]  Bor-Sen Chen,et al.  Integrated cellular network of transcription regulations and protein-protein interactions , 2010, BMC Systems Biology.

[53]  D. Lauffenburger,et al.  Input–output behavior of ErbB signaling pathways as revealed by a mass action model trained against dynamic data , 2009, Molecular systems biology.

[54]  W. Lim,et al.  Defining Network Topologies that Can Achieve Biochemical Adaptation , 2009, Cell.

[55]  A. Sonnenberg,et al.  EGF-induced MAPK Signaling Inhibits Hemidesmosome Formation through Phosphorylation of the Integrin β4* , 2010, The Journal of Biological Chemistry.

[56]  Alfonso Valencia,et al.  Extending pathways and processes using molecular interaction networks to analyse cancer genome data , 2010, BMC Bioinformatics.

[57]  Kei-Hoi Cheung,et al.  BioPAX – A community standard for pathway data sharing , 2010, Nature Biotechnology.

[58]  Ying Chen,et al.  Construction of a large scale integrated map of macrophage pathogen recognition and effector systems , 2010, BMC Systems Biology.

[59]  B. Palsson,et al.  Towards genome-scale signalling-network reconstructions , 2010, Nature Reviews Genetics.

[60]  Gary D Bader,et al.  Dynamic interaction networks in a hierarchically organized tissue , 2010, Molecular systems biology.

[61]  Réka Albert,et al.  Elementary signaling modes predict the essentiality of signal transduction network components , 2011, BMC Systems Biology.

[62]  D. Lauffenburger,et al.  Networks Inferred from Biochemical Data Reveal Profound Differences in Toll-like Receptor and Inflammatory Signaling between Normal and Transformed Hepatocytes* , 2010, Molecular & Cellular Proteomics.

[63]  L. Bonetta Protein–protein interactions: Interactome under construction , 2010, Nature.

[64]  Chih-yuan Chiang,et al.  A Human MAP Kinase Interactome , 2010, Nature Methods.

[65]  Peter K. Sorger,et al.  Logic-Based Models for the Analysis of Cell Signaling Networks† , 2010, Biochemistry.

[66]  Junichi Tsujii,et al.  Event extraction for systems biology by text mining the literature. , 2010, Trends in biotechnology.

[67]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.

[68]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[69]  M. Nelson,et al.  A Bow-Tie Genetic Architecture for Morphogenesis Suggested by a Genome-Wide RNAi Screen in Caenorhabditis elegans , 2011, PLoS genetics.

[70]  Julio Saez-Rodriguez,et al.  Training Signaling Pathway Maps to Biochemical Data with Constrained Fuzzy Logic: Quantitative Analysis of Liver Cell Responses to Inflammatory Stimuli , 2011, PLoS Comput. Biol..