Discrete Logic Modelling Optimization to Contextualize Prior Knowledge Networks Using PRUNET

High-throughput technologies have led to the generation of an increasing amount of data in different areas of biology. Datasets capturing the cell’s response to its intra- and extra-cellular microenvironment allow such data to be incorporated as signed and directed graphs or influence networks. These prior knowledge networks (PKNs) represent our current knowledge of the causality of cellular signal transduction. New signalling data is often examined and interpreted in conjunction with PKNs. However, different biological contexts, such as cell type or disease states, may have distinct variants of signalling pathways, resulting in the misinterpretation of new data. Identifying inconsistencies between measured data and signalling topologies, and training PKNs on context-specific datasets (PKN contextualization), are necessary for constructing reliable, predictive models, and both remain current challenges in the systems biology of cell signalling. Here we present PRUNET, a user-friendly software tool designed to address the contextualization of a PKN to specific experimental conditions. As input, the algorithm takes a PKN and the expression profiles of two given stable steady states or cellular phenotypes. The PKN is iteratively pruned by an evolutionary algorithm that performs an optimization process. This optimization rests on matching the attractors predicted by a discrete logic (Boolean) model to a Booleanized representation of the phenotypes, within a population of alternative subnetworks that evolves iteratively. We validated the algorithm by applying PRUNET to four biological examples and using the resulting contextualized networks to predict missing expression values and to simulate well-characterized perturbations. PRUNET constitutes a tool for the automatic curation of a PKN to make it suitable for describing biological processes under particular experimental conditions. The general applicability of the implemented algorithm makes PRUNET suitable for a variety of biological processes, for instance cellular reprogramming or transitions between healthy and disease states.
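
The pruning idea described above can be illustrated with a minimal Python sketch. It is not PRUNET's implementation: the toy network, the two Booleanized phenotypes, the update rule (a node is on when no inhibitor is on and, if it has activators, at least one of them is on), the fixed-point check used in place of a full attractor search, the tie-break that favours keeping more edges, and the plain mutation-and-selection loop are all assumptions made for illustration.

```python
import random

# Hypothetical toy PKN: (source, target, sign); +1 activates, -1 inhibits.
PKN = [
    ("A", "B", -1),
    ("B", "A", -1),
    ("A", "C", +1),
    ("C", "A", -1),  # inconsistent with phenotype 1 in this toy example
    ("B", "C", +1),  # inconsistent with phenotype 2 in this toy example
]

# Hypothetical Booleanized expression profiles of the two stable phenotypes.
PHENOTYPES = [
    {"A": 1, "B": 0, "C": 1},
    {"A": 0, "B": 1, "C": 0},
]


def next_state(state, edges):
    """One synchronous update under the assumed rule described in the lead-in.
    Nodes without any regulator keep their current value."""
    new = {}
    for node, value in state.items():
        activators = [state[s] for s, t, sign in edges if t == node and sign > 0]
        inhibitors = [state[s] for s, t, sign in edges if t == node and sign < 0]
        if not activators and not inhibitors:
            new[node] = value
        elif any(inhibitors):
            new[node] = 0
        else:
            new[node] = 1 if (not activators or any(activators)) else 0
    return new


def fitness(mask):
    """Score a subnetwork as (node values left unchanged by one update, edges kept)."""
    edges = [edge for edge, keep in zip(PKN, mask) if keep]
    matches = 0
    for phenotype in PHENOTYPES:
        updated = next_state(phenotype, edges)
        matches += sum(updated[n] == phenotype[n] for n in phenotype)
    return matches, sum(mask)


def prune(generations=50, population_size=20, mutation_rate=0.2, seed=0):
    """Evolve edge masks (1 = keep, 0 = prune), starting from the full PKN."""
    rng = random.Random(seed)
    full = tuple([1] * len(PKN))
    population = [full] * population_size
    for _ in range(generations):
        # Each child flips every bit of its parent with probability mutation_rate.
        children = [
            tuple(bit ^ (rng.random() < mutation_rate) for bit in parent)
            for parent in population
        ]
        # Elitist selection over parents and children, so the best score never drops.
        pool = sorted(set(population + children), key=fitness, reverse=True)
        population = pool[:population_size]
    return population[0]


if __name__ == "__main__":
    best = prune()
    kept = [edge for edge, keep in zip(PKN, best) if keep]
    print("fitness:", fitness(best))
    print("edges kept:", kept)
```

In this toy case the full network leaves four of the six node values unchanged, while the subnetwork that drops the two edges marked as inconsistent leaves all six unchanged. Ranking by the pair (matches, edges kept) is one simple way to stop the search from collapsing to an empty network, which would trivially keep every node at its current value.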
