Directed random walks and constraint programming reveal active pathways in hepatocyte growth factor signaling

An effective means to analyze mRNA expression data is to take advantage of established knowledge from pathway databases, using methods such as pathway‐enrichment analyses. However, pathway databases are not case‐specific and expression data could be used to infer gene‐regulation patterns in the context of specific pathways. In addition, canonical pathways may not always describe the signaling mechanisms properly, because interactions can frequently occur between genes in different pathways. Relatively few methods have been proposed to date for generating and analyzing such networks, preserving the causality between gene interactions and reasoning over the qualitative logic of regulatory effects. We present an algorithm (MCWalk) integrated with a logic programming approach, to discover subgraphs in large‐scale signaling networks by random walks in a fully automated pipeline. As an exemplary application, we uncover the signal transduction mechanisms in a gene interaction network describing hepatocyte growth factor‐stimulated cell migration and proliferation from gene‐expression measured with microarray and RT‐qPCR using in‐house perturbation experiments in a keratinocyte‐fibroblast co‐culture. The resulting subgraphs illustrate possible associations of hepatocyte growth factor receptor c‐Met nodes, differentially expressed genes and cellular states. Using perturbation experiments and Answer Set programming, we are able to select those which are more consistent with the experimental data. We discover key regulator nodes by measuring the frequency with which they are traversed when connecting signaling between receptors and significantly regulated genes and predict their expression‐shift consistently with the measured data. The Java implementation of MCWalk is publicly available under the MIT license at: https://bitbucket.org/akittas/biosubg.

[1]  Jagdish Chandra Patra,et al.  Integration of multiple data sources to prioritize candidate genes using discounted rating system , 2010, BMC Bioinformatics.

[2]  Fan Zhang,et al.  Topologically inferring risk-active pathways toward precise cancer classification by directed random walk , 2013, Bioinform..

[3]  Yang Xiang,et al.  Quantification of biological network perturbations for mechanistic insight and diagnostics using two-layer causal models , 2014, BMC Bioinformatics.

[4]  Steffen Klamt,et al.  Detecting and Removing Inconsistencies between Experimental Data and Signaling Network Topologies Using Integer Linear Programming on Interaction Graphs , 2013, PLoS Comput. Biol..

[5]  Michael A. White,et al.  Use of Data-Biased Random Walks on Graphs for the Retrieval of Context-Specific Networks from Genomic Data , 2010, PLoS Comput. Biol..

[6]  K. Goh,et al.  Betweenness centrality correlation in social networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Andreas Krämer,et al.  Causal analysis approaches in Ingenuity Pathway Analysis , 2013, Bioinform..

[9]  F. Schreiber,et al.  Centrality Analysis Methods for Biological Networks and Their Application to Gene Regulatory Networks , 2008, Gene regulation and systems biology.

[10]  Crispin J. Miller,et al.  Simpleaffy: a BioConductor package for Affymetrix Quality Control and data analysis , 2005, Bioinform..

[11]  W. Birchmeier,et al.  Met, metastasis, motility and more , 2003, Nature Reviews Molecular Cell Biology.

[12]  F. Moreews,et al.  BioQuali Cytoscape plugin: analysing the global consistency of regulatory networks , 2009, BMC Genomics.

[13]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[14]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[15]  Z. Darżynkiewicz,et al.  Downregulation of uPAR inhibits migration, invasion, proliferation, FAK/PI3K/Akt signaling and induces senescence in papillary thyroid carcinoma cells , 2011, Cell cycle.

[16]  Natalie L. Catlett,et al.  Reverse causal reasoning: applying qualitative causal knowledge to the interpretation of high-throughput data , 2013, BMC Bioinformatics.

[17]  Florian Dittmann,et al.  Automatic generation of causal networks linking growth factor stimuli to functional cell state changes , 2012, The FEBS journal.

[18]  J. Foidart,et al.  Laminin--a glycoprotein from basement membranes. , 1979, The Journal of biological chemistry.

[19]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[20]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[21]  A. Reynolds,et al.  Emerging roles for p120-catenin in cell adhesion and cancer , 2004, Oncogene.

[22]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[23]  K. Rohr,et al.  Stathmin Regulates Keratinocyte Proliferation and Migration during Cutaneous Regeneration , 2013, PloS one.

[24]  Pierre Dupont,et al.  Systems biology Advance Access publication March 12, 2010 , 2009 .

[25]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..

[26]  E. Pilozzi,et al.  COX‐2 is induced by HGF stimulation in Met‐positive thyroid papillary carcinoma cells and is involved in tumour invasiveness , 2009, The Journal of pathology.

[27]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[28]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[29]  Heiko Rieger,et al.  Random walks on complex networks. , 2004, Physical review letters.

[30]  E. Långström,et al.  Polyamine starvation prolongs the S and G2 phases of polyamine-dependent (arginase-deficient) CHO cells , 1984, Molecular and cellular biology.

[31]  Takashi Kobayashi,et al.  Lamininα3 LG4 Module Induces Keratinocyte Migration: Involvement of Matrix Metalloproteinase-9 , 2005 .

[32]  S. Werner,et al.  c-Met is essential for wound healing in the skin , 2007, The Journal of cell biology.

[33]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[34]  R. Eils,et al.  Gene network dynamics controlling keratinocyte migration , 2008, Molecular systems biology.

[35]  Prahlad T. Ram,et al.  NetWalker: a contextual network analysis tool for functional genomics , 2012, BMC Genomics.