Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks

The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.

[1]  Barbara J. Evans,et al.  Genomic Data Commons , 2017 .

[2]  Julie C. Sapp,et al.  Characterization of thrombosis in patients with Proteus syndrome , 2017, American journal of medical genetics. Part A.

[3]  Evan O. Paull,et al.  Inferring causal molecular networks: empirical assessment through a community-based effort , 2016, Nature Methods.

[4]  L. Stein,et al.  The Reactome pathway Knowledgebase , 2015, Nucleic Acids Res..

[5]  R. Bernards,et al.  An integrated genomic approach identifies that the PI3K/AKT/FOXO pathway is involved in breast cancer tumor initiation , 2015, Oncotarget.

[6]  Minoru Kanehisa,et al.  KEGG as a reference resource for gene and protein annotation , 2015, Nucleic Acids Res..

[7]  G. Gillet,et al.  Bcl-2 proteins, cell migration and embryonic development: lessons from zebrafish , 2015, Cell Death and Disease.

[8]  Yan Chen,et al.  Apoptosis of HL-60 human leukemia cells induced by Asiatic acid through modulation of B-cell lymphoma 2 family proteins and the mitogen-activated protein kinase signaling pathway. , 2015, Molecular medicine reports.

[9]  Yuka Kanno,et al.  Mechanisms of Jak/STAT Signaling in Immunity and Disease , 2015, The Journal of Immunology.

[10]  Concha Bielza,et al.  Bayesian networks in neuroscience: a survey , 2014, Front. Comput. Neurosci..

[11]  P. Juin,et al.  Protect and serve: Bcl-2 proteins as guardians and rulers of cancer cell survival , 2013, Cell cycle.

[12]  D. Calvisi,et al.  Functional crosstalk between AKT/mTOR and Ras/MAPK pathways in hepatocarcinogenesis , 2013, Cell cycle.

[13]  S. Pierre,et al.  Different roles of the cardiac Na+/Ca2+-exchanger in ouabain-induced inotropy, cell signaling, and hypertrophy. , 2013, American journal of physiology. Heart and circulatory physiology.

[14]  C. Eng,et al.  Germline PIK3CA and AKT1 mutations in Cowden and Cowden-like syndromes. , 2013, American journal of human genetics.

[15]  S. Pervaiz,et al.  Crosstalk between Bcl-2 family and Ras family small GTPases: potential cell fate regulation? , 2013, Front. Oncol..

[16]  Clifford Liongue,et al.  Evolution of the JAK-STAT pathway , 2013, JAK-STAT.

[17]  L. Stein,et al.  Annotating Cancer Variants and Anti-Cancer Therapeutics in Reactome , 2012, Cancers.

[18]  B. Hemmings,et al.  PI3K-PKB/Akt pathway. , 2012, Cold Spring Harbor perspectives in biology.

[19]  N. Tsuchida,et al.  Ras oncogenes in oral cancer: the past 20 years. , 2012, Oral oncology.

[20]  J. Darnell,et al.  The JAK-STAT pathway at twenty. , 2012, Immunity.

[21]  D. Calvisi,et al.  AKT (v‐akt murine thymoma viral oncogene homolog 1) and N‐Ras (neuroblastoma ras viral oncogene homolog) coactivation in the mouse liver promotes rapid carcinogenesis by way of mTOR (mammalian target of rapamycin complex 1), FOXM1 (forkhead box M1)/SKP2, and c‐Myc pathways , 2012, Hepatology.

[22]  Scott M Lippman,et al.  Targeting the MAPK–RAS–RAF signaling pathway in cancer therapy , 2012, Expert opinion on therapeutic targets.

[23]  B. Hemmings,et al.  PI3K/AKT, MAPK and AMPK signalling: protein kinases in glucose homeostasis , 2012, Expert Reviews in Molecular Medicine.

[24]  B. Goh,et al.  The small GTPase Rac1 is a novel binding partner of Bcl-2 and stabilizes its antiapoptotic activity. , 2011, Blood.

[25]  A. Brunet,et al.  Energy metabolism in adult neural stem cell fate , 2011, Progress in Neurobiology.

[26]  José M. Peña,et al.  Finding Consensus Bayesian Network Structures , 2011, J. Artif. Intell. Res..

[27]  Jennifer Y. Zhang,et al.  BCL2 inhibits Cell Adhesion, Spreading, and Motility by Enhancing Actin Polymerization , 2010, Cell Research.

[28]  Johann S de Bono,et al.  Targeting the PI3K/AKT Pathway for the Treatment of Prostate Cancer , 2009, Clinical Cancer Research.

[29]  A. Yoshimura Regulation of cytokine signaling by the SOCS and Spred family proteins. , 2009, The Keio journal of medicine.

[30]  K. Voelkerding,et al.  Next-generation sequencing: from basic research to diagnostics. , 2009, Clinical chemistry.

[31]  Stefan Wiemann,et al.  KEGGgraph: a graph approach to KEGG PATHWAY in R and bioconductor , 2009, Bioinform..

[32]  F. Denoeud,et al.  Annotating genomes with massive-scale RNA sequencing , 2008, Genome Biology.

[33]  Christophe Lemetre,et al.  An introduction to artificial neural networks in bioinformatics - application to complex microarray and mass spectrometry datasets in cancer studies , 2008, Briefings Bioinform..

[34]  Hiu Kiu,et al.  SOCS regulation of the JAK/STAT signalling pathway. , 2008, Seminars in cell & developmental biology.

[35]  Carlos Caldas,et al.  Meta-analysis confirms BCL2 is an independent prognostic marker in breast cancer , 2008, BMC Cancer.

[36]  Deyu Zhou,et al.  Methodological Review: Extracting interactions between proteins from the literature , 2008 .

[37]  S. Pierre,et al.  Association of PI3K-Akt signaling pathway with digitalis-induced hypertrophy of cardiac myocytes. , 2007, American journal of physiology. Cell physiology.

[38]  K. Geering,et al.  Evolution of Na,K-ATPase βm-subunit into a coregulator of transcription in placental mammals , 2007, Proceedings of the National Academy of Sciences.

[39]  D. Husmeier,et al.  Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge , 2007, Statistical applications in genetics and molecular biology.

[40]  P. Murray The JAK-STAT Signaling Pathway: Input and Output Integration1 , 2007, The Journal of Immunology.

[41]  Y. Zhang,et al.  IntAct—open source resource for molecular interaction data , 2006, Nucleic Acids Res..

[42]  C. Chuong,et al.  PubFocus: semantic MEDLINE/PubMed citations analytics through integration of controlled biomedical dictionaries and ranking algorithm , 2006, BMC Bioinformatics.

[43]  Z. Zhao,et al.  Antagonism or Synergism , 2006, Journal of Biological Chemistry.

[44]  V. Chan,et al.  Thyrotoxic periodic paralysis and polymorphisms of sodium–potassium ATPase genes , 2006, Clinical endocrinology.

[45]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[46]  Yiling Lu,et al.  Exploiting the PI3K/AKT Pathway for Cancer Drug Discovery , 2005, Nature Reviews Drug Discovery.

[47]  D. Jo,et al.  Intracellular protein therapy with SOCS3 inhibits inflammation and apoptosis , 2005, Nature Medicine.

[48]  Claus-Wilhelm von der Lieth,et al.  PubFinder: a tool for improving retrieval rate of relevant PubMed abstracts , 2005, Nucleic Acids Res..

[49]  H. Waldmann,et al.  Inhibitors of protein tyrosine phosphatases: next-generation drugs? , 2005, Angewandte Chemie.

[50]  Daniel Berleant,et al.  MedKit: a helper toolkit for automatic mining of MEDLINE/PubMed citations , 2005, Bioinform..

[51]  Lincoln Stein,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Res..

[52]  Zheng Rong Yang,et al.  Biological applications of support vector machines , 2004, Briefings Bioinform..

[53]  M. Oshimura,et al.  PI3K-Akt pathway: Its functions and alterations in human cancer , 2004, Apoptosis.

[54]  Hao Chen,et al.  Content-rich biological network constructed by mining PubMed abstracts , 2004, BMC Bioinformatics.

[55]  Daniel H. Huson,et al.  Phylogenetic super-networks from partial trees , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[56]  N. Sonenberg,et al.  Upstream and downstream of mTOR. , 2004, Genes & development.

[57]  Vincent Moulton,et al.  Using consensus networks to visualize contradictory evidence for species phylogeny. , 2004, Molecular biology and evolution.

[58]  D. Harrison,et al.  The JAK/STAT signaling pathway , 2004, Journal of Cell Science.

[59]  Masato Kubo,et al.  Suppressors of cytokine signaling and immunity , 2003, Nature Immunology.

[60]  Miguel A. Andrade-Navarro,et al.  Update on XplorMed: a web server for exploring scientific literature , 2003, Nucleic Acids Res..

[61]  J. McCubrey,et al.  Involvement of PI3K/Akt pathway in cell cycle progression, apoptosis, and neoplastic transformation: a target for cancer chemotherapy , 2003, Leukemia.

[62]  Emanuel F. Petricoin,et al.  Medical applications of microarray technologies: a regulatory science perspective , 2002, Nature Genetics.

[63]  C. Sawyers,et al.  The phosphatidylinositol 3-Kinase–AKT pathway in human cancer , 2002, Nature Reviews Cancer.

[64]  Lewis C Cantley,et al.  The phosphoinositide 3-kinase pathway. , 2002, Science.

[65]  Zijian Xie,et al.  Na(+)/K(+)-ATPase as a signal transducer. , 2002, European journal of biochemistry.

[66]  Barbara Hoffman,et al.  The proto-oncogene c-myc in hematopoietic development and leukemogenesis , 2002, Oncogene.

[67]  Mary L Marazita,et al.  A mutation in the SOS1 gene causes hereditary gingival fibromatosis type 1. , 2002, American journal of human genetics.

[68]  Michael A. Patton,et al.  Mutations in PTPN11, encoding the protein tyrosine phosphatase SHP-2, cause Noonan syndrome , 2001, Nature Genetics.

[69]  T. Yoshino,et al.  Reduction of hematopoietic cell-specific tyrosine phosphatase SHP-1 gene expression in natural killer cell lymphoma and various types of lymphomas/leukemias : combination analysis with cDNA expression array and tissue microarray. , 2001, The American journal of pathology.

[70]  I. Roninson,et al.  Growth retardation and increased apoptosis in mice with homozygous disruption of the Akt1 gene. , 2001, Genes & development.

[71]  P Bork,et al.  XplorMed: a tool for exploring MEDLINE abstracts. , 2001, Trends in biochemical sciences.

[72]  P. Igaz,et al.  Biological and clinical significance of the JAK-STAT pathway; lessons from knockout mice , 2001, Inflammation Research.

[73]  L. Mei,et al.  Requirement of SHP2 Binding to Grb2-associated Binder-1 for Mitogen-activated Protein Kinase Activation in Response to Lysophosphatidic Acid and Epidermal Growth Factor* , 2000, The Journal of Biological Chemistry.

[74]  C. Der,et al.  Understanding Ras: 'it ain't over 'til it's over'. , 2000, Trends in cell biology.

[75]  B J Stapley,et al.  Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[76]  G. Feng Shp-2 tyrosine phosphatase: signaling one cell or many. , 1999, Experimental cell research.

[77]  S. Sugiura Actin-myosin interaction. , 1999, Cardiovascular research.

[78]  G. Evan,et al.  The opposing roles of the Akt and c-Myc signalling pathways in survival from CD95-mediated apoptosis , 1998, Oncogene.

[79]  R. Haller,et al.  Reduced levels of skeletal muscle Na+K+-ATPase in McArdle disease , 1998, Neurology.

[80]  J. Sedivy,et al.  Phenotypes of c-Myc-deficient rat fibroblasts isolated by targeted homologous recombination. , 1997, Cell growth & differentiation : the molecular biology journal of the American Association for Cancer Research.

[81]  T. Pawson,et al.  Abnormal mesoderm patterning in mouse embryos mutant for the SH2 tyrosine phosphatase Shp‐2 , 1997, The EMBO journal.

[82]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[83]  E. Krebs,et al.  The MAPK signaling cascade , 1995, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[84]  G. Evan,et al.  c‐Myc‐induced apoptosis in fibroblasts is inhibited by specific cytokines. , 1994, The EMBO journal.

[85]  G. Evan,et al.  The c‐Myc protein induces cell cycle progression and apoptosis through dimerization with Max. , 1993, The EMBO journal.

[86]  Gregory F. Cooper,et al.  A Bayesian Method for Constructing Bayesian Belief Networks from Databases , 1991, UAI.

[87]  R. Eisenman,et al.  c-myc and c-myb protein degradation: effect of metabolic inhibitors and heat shock , 1988, Molecular and cellular biology.

[88]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[89]  小森和樹 Gene Expression Omnibus利用方法の検討 , 2016 .

[90]  M. Desouza,et al.  2012 Landes Bioscience. Do not distribute. The actin cytoskeleton as a sensor and mediator of apoptosis , 2012 .

[91]  Richard Bonneau,et al.  Integrated inference and analysis of regulatory networks from multi-level measurements. , 2012, Methods in cell biology.

[92]  Aparna Sriram,et al.  Predicting Gene Relations Using Bayesian Networks , 2011 .

[93]  D. Berleant,et al.  Assistant : A Biologist-Friendly Interface for Enhanced PubMed Search , 2006 .

[94]  M. Ackerman,et al.  Bmc Medical Informatics and Decision Making Slim: an Alternative Web Interface for Medline/pubmed Searches – a Preliminary Study , 2005 .

[95]  J. Downward Targeting RAS signalling pathways in cancer therapy , 2003, Nature Reviews Cancer.

[96]  M. Kanehisa,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[97]  S F Altschul,et al.  BRCA1 protein products ... Functional motifs... , 1996, Nature genetics.

[98]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .