Integrating literature-constrained and data-driven inference of signalling networks

Motivation: Recent developments in experimental methods facilitate increasingly larger signal transduction datasets. Two main approaches can be taken to derive a mathematical model from these data: training a network (obtained, e.g., from literature) to the data, or inferring the network from the data alone. Purely data-driven methods scale up poorly and have limited interpretability, whereas literature-constrained methods cannot deal with incomplete networks. Results: We present an efficient approach, implemented in the R package CNORfeeder, to integrate literature-constrained and data-driven methods to infer signalling networks from perturbation experiments. Our method extends a given network with links derived from the data via various inference methods, and uses information on physical interactions of proteins to guide and validate the integration of links. We apply CNORfeeder to a network of growth and inflammatory signalling. We obtain a model with superior data fit in the human liver cancer HepG2 and propose potential missing pathways. Availability: CNORfeeder is in the process of being submitted to Bioconductor and in the meantime available at www.cellnopt.org. Contact: saezrodriguez@ebi.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Julio Saez-Rodriguez,et al.  CellNOptR: a flexible toolkit to train protein signaling networks to data using multiple logic formalisms , 2012, BMC Systems Biology.

[2]  Xin Liu,et al.  Dynamical and Structural Analysis of a T Cell Survival Network Identifies Novel Candidate Therapeutic Targets for Large Granular Lymphocyte Leukemia , 2011, PLoS Comput. Biol..

[3]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[4]  Peter K. Sorger,et al.  Logic-Based Models for the Analysis of Cell Signaling Networks† , 2010, Biochemistry.

[5]  P. Ghazal,et al.  Logic models of pathway biology. , 2008, Drug discovery today.

[6]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[7]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[8]  Denis Thieffry,et al.  Mathematical Modelling of Cell-Fate Decision in Response to Death Receptor Engagement , 2010, PLoS Comput. Biol..

[9]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[10]  Mingsheng Zhang,et al.  Comparing signaling networks between normal and transformed hepatocytes using discrete logical models. , 2011, Cancer research.

[11]  Christopher A. Penfold,et al.  How to infer gene networks from expression profiles, revisited , 2011, Interface Focus.

[12]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[13]  Alfonso Valencia,et al.  Extending pathways and processes using molecular interaction networks to analyse cancer genome data , 2010, BMC Bioinformatics.

[14]  A. Vinayagam,et al.  A Directed Protein Interaction Network for Investigating Intracellular Signal Transduction , 2011, Science Signaling.

[15]  Prahlad T. Ram,et al.  Formation of Regulatory Patterns During Signal Propagation in a Mammalian Cellular Network , 2005, Science.

[16]  Julio Saez-Rodriguez,et al.  Training Signaling Pathway Maps to Biochemical Data with Constrained Fuzzy Logic: Quantitative Analysis of Liver Cell Responses to Inflammatory Stimuli , 2011, PLoS Comput. Biol..

[17]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[18]  Florian Markowetz,et al.  How to Understand the Cell by Breaking It: Network Analysis of Gene Perturbation Screens , 2009, PLoS Comput. Biol..

[19]  Lincoln Stein,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Res..

[20]  Steffen Klamt,et al.  The Logic of EGFR/ErbB Signaling: Theoretical Properties and Analysis of High-Throughput Data , 2009, PLoS Comput. Biol..

[21]  Beatriz Peñalver Bernabé,et al.  State–time spectrum of signal transduction logic models , 2012, Physical biology.

[22]  Tim Beißbarth,et al.  Inferring signalling networks from longitudinal data using sampling based approaches in the R-package 'ddepn' , 2010, BMC Bioinformatics.

[23]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[24]  D. Lauffenburger,et al.  Physicochemical modelling of cell signalling pathways , 2006, Nature Cell Biology.

[25]  Gianluca Bontempi,et al.  minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information , 2008, BMC Bioinformatics.

[26]  Carlos Prieto,et al.  APID: Agile Protein Interaction DataAnalyzer , 2006, Nucleic Acids Res..

[27]  H. Hirt,et al.  Protein networking: insights into global functional organization of proteomes , 2008, Proteomics.

[28]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[29]  Julio Saez-Rodriguez,et al.  Modeling signaling networks using high-throughput phospho-proteomics. , 2012, Advances in experimental medicine and biology.

[30]  B. Di Camillo,et al.  A Boolean Approach to Linear Prediction for Signaling Network Modeling , 2010, PloS one.

[31]  D. Lauffenburger,et al.  Networks Inferred from Biochemical Data Reveal Profound Differences in Toll-like Receptor and Inflammatory Signaling between Normal and Transformed Hepatocytes* , 2010, Molecular & Cellular Proteomics.

[32]  D. Lauffenburger,et al.  Discrete logic modelling as a means to link protein signalling networks with functional analysis of mammalian signal transduction , 2009, Molecular systems biology.

[33]  Javier De Las Rivas,et al.  Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks , 2010, PLoS Comput. Biol..

[34]  D. Lauffenburger,et al.  Systems Analysis of EGF Receptor Signaling Dynamics with Micro-Western Arrays , 2010, Nature Methods.

[35]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[36]  Sach Mukherjee,et al.  Network inference using informative priors , 2008, Proceedings of the National Academy of Sciences.

[37]  Julio Saez-Rodriguez,et al.  Crowdsourcing Network Inference: The DREAM Predictive Signaling Network Challenge , 2011, Science Signaling.

[38]  D. Pe’er Bayesian Network Analysis of Signaling Networks: A Primer , 2005, Science's STKE.

[39]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.