Dynamic probabilistic threshold networks to infer signaling pathways from time-course perturbation data

Background: Network inference deals with the reconstruction of molecular networks from experimental data. Given N molecular species, the challenge is to find the underlying network. Due to data limitations, this is typically an ill-posed problem that requires the integration of prior biological knowledge or strong regularization. Here we focus on the situation in which time-resolved measurements of a system’s response after systematic perturbations are available.

Results: We present a novel method to infer signaling networks from time-course perturbation data. We use dynamic Bayesian networks with probabilistic Boolean threshold functions to describe protein activation. The model posterior distribution is analyzed using evolutionary MCMC sampling and subsequent clustering, resulting in probability distributions over alternative networks. We evaluate our method on simulated data and study its performance with respect to data set size and noise level. We then use our method to study EGF-mediated signaling in the ERBB pathway.

Conclusions: Dynamic Probabilistic Threshold Networks is a new method to infer signaling networks from time-series perturbation data. It exploits the dynamic response of a system after external perturbation for network reconstruction. On simulated data, we show that the approach outperforms current state-of-the-art methods. On the ERBB data, our approach recovers a significant fraction of the known interactions and predicts novel mechanisms in the ERBB pathway.
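To make the idea of a probabilistic Boolean threshold function concrete, the following minimal Python sketch simulates a small network in which each node's activation probability at the next time point depends on a weighted sum of its parents' current states. The abstract does not give the functional form used by the method, so the logistic link, the `steepness` parameter, the function names, and the way knockdowns and stimulation are clamped are all assumptions made for illustration only.

```python
import math
import random


def activation_probability(weights, parent_states, threshold, steepness=4.0):
    """Probability that a node is active at t+1 given parent states at t.

    Assumed form: a logistic (soft threshold) function of the weighted
    parent input. This is an illustrative choice, not the paper's model.
    """
    total = sum(w * s for w, s in zip(weights, parent_states))
    return 1.0 / (1.0 + math.exp(-steepness * (total - threshold)))


def simulate(weight_matrix, thresholds, initial, steps,
             knocked_down=(), stimulated=(), seed=0):
    """Simulate binary node states over discrete time steps.

    weight_matrix[j][i] is the (hypothetical) weight of edge i -> j.
    Nodes in `knocked_down` are clamped to 0, nodes in `stimulated` to 1.
    """
    rng = random.Random(seed)
    state = list(initial)
    for i in knocked_down:
        state[i] = 0
    for i in stimulated:
        state[i] = 1
    trajectory = [tuple(state)]
    for _ in range(steps):
        nxt = []
        for j in range(len(state)):
            if j in knocked_down:
                nxt.append(0)
            elif j in stimulated:
                nxt.append(1)
            else:
                p = activation_probability(weight_matrix[j], state, thresholds[j])
                nxt.append(1 if rng.random() < p else 0)
        state = nxt
        trajectory.append(tuple(state))
    return trajectory


if __name__ == "__main__":
    # Toy chain 0 -> 1 -> 2 with node 0 stimulated and node 1 knocked down.
    weights = [[0.0, 0.0, 0.0],
               [1.0, 0.0, 0.0],
               [0.0, 1.0, 0.0]]
    thresholds = [0.5, 0.5, 0.5]
    for row in simulate(weights, thresholds, [0, 0, 0], steps=5,
                        knocked_down={1}, stimulated={0}):
        print(row)
```

Comparing trajectories with and without a knockdown shows, in this toy setting, how a perturbation changes the downstream time course; differences of this kind are the signal that a perturbation-based inference method can exploit.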
