Using machine learning algorithms to identify genes essential for cell survival

Background: With the explosion of data comes a proportional opportunity to identify novel knowledge with the potential for application in targeted therapies. In spite of this huge amount of data, solutions for treating complex diseases remain elusive. One reason is that these diseases are driven by a network of genes that must be targeted together in order to understand and treat them effectively. Part of the solution lies in mining and integrating information from various disciplines. Here we propose a machine learning method for mining publicly available literature on RNA interference, with the goal of identifying genes essential for cell survival.

Results: A total of 32,164 RNA interference abstracts were identified from 10.5 million PubMed abstracts (2001 to 2015). These abstracts spanned 1,467 cancer cell lines and 4,373 genes, representing a total of 25,891 cell-gene associations. Of the 1,467 cell lines, 88% had between 1 and 25 genes studied. Of the 4,373 genes, 96% were studied in between 1 and 25 different cell lines.

Conclusions: Identifying genes that are crucial for cell survival can provide a critical piece of information, especially in treating complex diseases such as cancer. The efficacy of a therapeutic intervention is multifactorial in nature, and in many cases therapeutic disruption may come from an unsuspected source. Machine learning algorithms help to narrow down the search and provide information about essential genes in different cancer types. They also provide the building blocks to generate a network of interconnected genes and processes. The information thus gained can be used to generate hypotheses that can be experimentally validated to improve our understanding of what triggers and maintains the growth of cancerous cells.
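The coverage figures reported in the Results (the share of cell lines with 1 to 25 genes studied, and the share of genes studied in 1 to 25 cell lines) can be derived from a flat list of cell line and gene pairs. The following is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `summarize_associations`, the input format, and the toy pairs are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def summarize_associations(pairs, low=1, high=25):
    """Summarize (cell_line, gene) pairs extracted from abstracts.

    `pairs` is an assumed input format: an iterable of 2-tuples.
    Duplicate pairs (the same association found in several abstracts)
    are counted once.
    """
    unique_pairs = set(pairs)

    genes_by_cell = defaultdict(set)   # cell line -> genes studied in it
    cells_by_gene = defaultdict(set)   # gene -> cell lines it was studied in
    for cell, gene in unique_pairs:
        genes_by_cell[cell].add(gene)
        cells_by_gene[gene].add(cell)

    def share_in_range(groups):
        # Fraction of keys whose member count lies in [low, high].
        if not groups:
            return 0.0
        hits = sum(1 for members in groups.values()
                   if low <= len(members) <= high)
        return hits / len(groups)

    return {
        "associations": len(unique_pairs),
        "cell_lines": len(genes_by_cell),
        "genes": len(cells_by_gene),
        "cell_line_share": share_in_range(genes_by_cell),
        "gene_share": share_in_range(cells_by_gene),
    }


# Toy, made-up pairs purely for illustration (not data from the study).
toy_pairs = [
    ("MCF-7", "AKT1"),
    ("MCF-7", "BCL2"),
    ("HepG2", "AKT1"),
    ("HCT 116", "AKT1"),
    ("MCF-7", "AKT1"),  # duplicate association, counted once
]
summary = summarize_associations(toy_pairs, low=1, high=2)
```

With the default bounds of 1 and 25, `cell_line_share` and `gene_share` correspond to the kind of percentages quoted above; the toy call narrows the upper bound to 2 only so that the small example produces a non-trivial split.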
