Identification of active signaling pathways by integrating gene expression and protein interaction data

BackgroundSignaling pathways are the key biological mechanisms that transduce extracellular signals to affect transcription factor mediated gene regulation within cells. A number of computational methods have been developed to identify the topological structure of a specific signaling pathway using protein-protein interaction data, but they are not designed for identifying active signaling pathways in an unbiased manner. On the other hand, there are statistical methods based on gene sets or pathway data that can prioritize likely active signaling pathways, but they do not make full use of active pathway structure that link receptor, kinases and downstream transcription factors.ResultsHere, we present a method to simultaneously predict the set of active signaling pathways, together with their pathway structure, by integrating protein-protein interaction network and gene expression data. We evaluated the capacity for our method to predict active signaling pathways for dental epithelial cells, ocular lens epithelial cells, human pluripotent stem cell-derived lens epithelial cells, and lens fiber cells. This analysis showed our approach could identify all the known active pathways that are associated with tooth formation and lens development.ConclusionsThe results suggest that SPAGI can be a useful approach to identify the potential active signaling pathways given a gene expression profile. Our method is implemented as an open source R package, available via https://github.com/VCCRI/SPAGI/.

[1]  T. Hunter,et al.  Signaling—2000 and Beyond , 2000, Cell.

[2]  Martin Steffen,et al.  Automated modelling of signal transduction networks , 2002, BMC Bioinformatics.

[3]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[4]  Yin Liu,et al.  A computational approach for ordering signal transduction pathway components from genomics and proteomics Data , 2004, BMC Bioinformatics.

[5]  C. Niehrs,et al.  Function and biological roles of the Dickkopf family of Wnt modulators , 2006, Oncogene.

[6]  Roded Sharan,et al.  Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks , 2006, J. Comput. Biol..

[7]  Akiko Takahashi,et al.  Irreversibility of cellular senescence: dual roles of p16INK4a/Rb-pathway in cell cycle control , 2007, Cell Division.

[8]  Jiong Yang,et al.  PathFinder: mining signal transduction pathway segments from protein-protein interaction networks , 2007, BMC Bioinformatics.

[9]  K. Aihara,et al.  Uncovering signal transduction networks from high-throughput data by integer linear programming , 2008, Nucleic acids research.

[10]  X. Jiao,et al.  The EPHA2 gene is associated with cataracts linked to chromosome 1p , 2008, Molecular vision.

[11]  Jing Li,et al.  CASCADE_SCAN: mining signal transduction network from high-throughput data based on steepest descent method , 2011, BMC Bioinformatics.

[12]  D. Koller,et al.  Automated identification of pathways from quantitative genetic interaction data , 2010, Molecular systems biology.

[13]  Jun Miyoshi,et al.  The cell adhesion gene PVRL3 is associated with congenital ocular defects , 2011, Human Genetics.

[14]  Xing-Ming Zhao,et al.  Identifying dysregulated pathways in cancers from pathway interaction networks , 2012, BMC Bioinformatics.

[15]  Anupam Gupta,et al.  Discovering pathways by orienting edges in protein interaction networks , 2010, Nucleic acids research.

[16]  F. Lovicu,et al.  Understanding the role of growth factors in embryonic development: insights from the lens , 2011, Philosophical Transactions of the Royal Society B: Biological Sciences.

[17]  Sanjay Mishra,et al.  High-Affinity Dkk1 Receptor Kremen1 Is Internalized by Clathrin-Mediated Endocytosis , 2012, PloS one.

[18]  Philip Cayting,et al.  An encyclopedia of mouse DNA elements (Mouse ENCODE) , 2012, Genome Biology.

[19]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[20]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[21]  Donald E Ingber,et al.  A Wnt-Bmp Feedback Circuit Controls Intertissue Signaling Dynamics in Tooth Organogenesis , 2012, Science Signaling.

[22]  Rachael P. Huntley,et al.  Gene Ontology annotation of sequence-specific DNA binding transcription factors: setting the stage for a large-scale curation effort , 2013, Database J. Biol. Databases Curation.

[23]  P. Tsonis,et al.  Comparative transcriptome analysis of epithelial and fiber cells in newborn mouse lenses with RNA sequencing , 2014, Molecular vision.

[24]  Hao Zhu,et al.  Multi-label multi-instance transfer learning for simultaneous reconstruction and cross-talk modeling of multiple human signaling pathways , 2015, BMC Bioinformatics.

[25]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[26]  R. Jiang Walking on multiple disease-gene networks to prioritize candidate genes. , 2015, Journal of molecular cell biology.

[27]  Zexian Liu,et al.  Reconfiguring phosphorylation signaling by genetic polymorphisms affects cancer susceptibility. , 2015, Journal of molecular cell biology.

[28]  S. Andreadis,et al.  Capture of endothelial cells under flow using immobilized vascular endothelial growth factor , 2015, Biomaterials.

[29]  Xiaoping Liu,et al.  Diagnosing phenotypes of single-sample individuals by edge biomarkers. , 2015, Journal of molecular cell biology.

[30]  Ilan Y. Smoly,et al.  MyProteinNet: build up-to-date protein interaction networks for organisms, tissues and user-defined contexts , 2015, Nucleic Acids Res..

[31]  Piero Carninci,et al.  A draft network of ligand–receptor-mediated multicellular signalling in human , 2015, Nature Communications.

[32]  Andrew D. Rouillard,et al.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update , 2016, Nucleic Acids Res..

[33]  Anna Ritz,et al.  Pathways on demand: automated reconstruction of human signaling networks , 2016, npj Systems Biology and Applications.

[34]  Xingming Zhao,et al.  HISP: a hybrid intelligent approach for identifying directed signaling pathways , 2017, Journal of molecular cell biology.

[35]  A. Cvekl,et al.  Signaling and Gene Regulatory Networks in Mammalian Lens Development. , 2017, Trends in genetics : TIG.

[36]  Daniel Trejo-Baños,et al.  Integrating transcriptional activity in genome-scale models of metabolism , 2017, BMC Systems Biology.

[37]  T. M. Murali,et al.  The PathLinker app: Connect the dots in protein interaction networks , 2017, F1000Research.

[38]  Su Deng,et al.  Bayesian network model for identification of pathways by integrating protein interaction with genetic interaction data , 2017, BMC Systems Biology.

[39]  Djordje Djordjevic,et al.  Light-focusing human micro-lenses generated from pluripotent stem cells model lens development and drug-induced cataract in vitro , 2018, Development.