Predicting Genes Involved in Human Cancer Using Network Contextual Information

Summary Protein-Protein Interaction (PPI) networks have been widely used for the task of predicting proteins involved in cancer. Previous research has shown that functional information about the protein for which a prediction is made, proximity to specific other proteins in the PPI network, as well as local network structure are informative features in this respect. In this work, we introduce two new types of input features, reflecting additional information: (1) Functional Context: the functions of proteins interacting with the target protein (rather than the protein itself); and (2) Structural Context: the relative position of the target protein with respect to specific other proteins selected according to a novel ANOVA (analysis of variance) based measure. We also introduce a selection strategy to pinpoint the most informative features. Results show that the proposed feature types and feature selection strategy yield informative features. A standard machine learning method (Naive Bayes) that uses the features proposed here outperforms the current state-of-the-art methods by more than 5% with respect to F-measure. In addition, manual inspection confirms the biological relevance of the top-ranked features.

[1]  Limsoon Wong,et al.  Exploiting Indirect Neighbours and Topological Weight to Predict Protein Function from Protein-Protein Interactions , 2006, BioDM.

[2]  Iver Petersen,et al.  IGFBP7 is a p53 target gene inactivated in human lung cancer by DNA hypermethylation. , 2011, Lung cancer.

[3]  Glenn Duval Francis Breast Cancer Database , 2013 .

[4]  S. Tsui,et al.  FHL2 exhibits anti-proliferative and anti-apoptotic activities in liver cancer cells. , 2011, Cancer letters.

[5]  Melanie Keppler,et al.  The SUMO modification pathway is involved in the BRCA1 response to genotoxic stress , 2009, Nature.

[6]  K. N. Chandrika,et al.  Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets , 2006, Nature Genetics.

[7]  Zelmina Lubovac,et al.  Combining functional and topological properties to identify core modules in protein interaction networks , 2006, Proteins.

[8]  Qun Lu,et al.  Human homolog of Drosophila Hairy and enhancer of split 1, Hes1, negatively regulates δ-catenin (CTNND2) expression in cooperation with E2F1 in prostate cancer , 2010, Molecular Cancer.

[9]  Kunihiro Tsuchida,et al.  Follistatin Suppresses the Production of Experimental Multiple-Organ Metastasis by Small Cell Lung Cancer Cells in Natural Killer Cell–Depleted SCID Mice , 2008, Clinical Cancer Research.

[10]  H. Lehrach,et al.  A protein interaction network links GIT1, an enhancer of huntingtin aggregation, to Huntington's disease. , 2004, Molecular cell.

[11]  Elin Karlsson,et al.  Clinical potential of the mTOR targets S6K1 and S6K2 in breast cancer , 2011, Breast Cancer Research and Treatment.

[12]  V. Koteliansky,et al.  Interference with TGF‐β1 and ‐β3 in tumor stroma lowers tumor interstitial fluid pressure independently of growth in experimental carcinoma , 2002, International journal of cancer.

[13]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[14]  Andreas Bender,et al.  Predicting the functions of proteins in Protein-Protein Interaction networks from global information , 2009, MLSB.

[15]  David R Critchley,et al.  Tetraspanin CD9 in cell migration. , 2011, Biochemical Society transactions.

[16]  P. Radivojac,et al.  An integrated approach to inferring gene–disease associations in humans , 2008, Proteins.

[17]  Andrew M Hanby,et al.  Response to mTOR inhibition: activity of eIF4E predicts sensitivity in cell lines and acquired changes in eIF4E regulation in breast cancer , 2011, Molecular Cancer.

[18]  J. Rashbass Online Mendelian Inheritance in Man. , 1995, Trends in genetics : TIG.

[19]  Thomas Hielscher,et al.  The insulin-like growth factor binding proteins 3 and 7 are associated with colorectal cancer and liver metastasis , 2011, Cancer biology & therapy.

[20]  Tomasz Milczek,et al.  Selective gene expression profiling of mTOR-associated tumor suppressor and oncogenes in ovarian cancer. , 2011, Folia histochemica et cytobiologica.

[21]  Sherry L. Jenkins,et al.  Network analysis of FDA approved drugs and their targets. , 2007, The Mount Sinai journal of medicine, New York.

[22]  Christian Langer,et al.  Prognostic significance of, and gene and microRNA expression signatures associated with, CEBPA mutations in cytogenetically normal acute myeloid leukemia with high-risk molecular features: a Cancer and Leukemia Group B Study. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[23]  M. Katoh,et al.  Identification and characterization of human SNAIL3 (SNAI3) gene in silico. , 2003, International journal of molecular medicine.

[24]  Lars Bolund,et al.  Haplotype frequencies in a sub-region of chromosome 19q13.3, related to risk and prognosis of cancer, differ dramatically between ethnic groups , 2009, BMC Medical Genetics.

[25]  H. Ngan,et al.  Semi-quantitative fluorescent PCR analysis identifies PRKAA1 on chromosome 5 as a potential candidate cancer gene of cervical cancer. , 2006, Gynecologic oncology.

[26]  B. Snel,et al.  Predicting disease genes using protein–protein interactions , 2006, Journal of Medical Genetics.

[27]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[28]  Ines Gockel,et al.  PDGFRalpha/beta expression correlates with the metastatic behavior of human colorectal cancer: a possible rationale for a molecular targeting strategy. , 2008, Oncology reports.

[29]  T. Milenković,et al.  Systems-level cancer gene identification from protein interaction network topology applied to melanogenesis-related functional genomics data , 2010, Journal of The Royal Society Interface.

[30]  Tijana Milenkoviæ,et al.  Uncovering Biological Network Function via Graphlet Degree Signatures , 2008, Cancer informatics.

[31]  Yishan Jiao,et al.  Faster and more accurate global protein function assignment from protein interaction networks using the MFGO algorithm , 2006, FEBS letters.

[32]  Alain Guénoche,et al.  Clustering proteins from interaction networks for the prediction of cellular functions , 2004, BMC Bioinformatics.

[33]  G M Clark,et al.  Insulin-like growth factor-I receptors are overexpressed and predict a low risk in human breast cancer. , 1993, Cancer research.

[34]  S. Kumar,et al.  Prognostic significance of TGF beta 1 and TGF beta 3 in human breast carcinoma. , 2000, Anticancer research.

[35]  S. Gendler,et al.  MUC1 alters oncogenic events and transcription in human breast cancer cells , 2006, Breast Cancer Research.

[36]  D. Peehl,et al.  Insulin-like growth factors (IGFs), IGF receptors, and IGF-binding proteins in primary cultures of prostate epithelial cells. , 1991, The Journal of clinical endocrinology and metabolism.

[37]  Christoph Engel,et al.  FHL2 expression in peritumoural fibroblasts correlates with lymphatic metastasis in sporadic but not in HNPCC-associated colon cancer , 2011, Laboratory Investigation.

[38]  Heung Tae Kim,et al.  Association of SUMO1 and UBC9 genotypes with tumor response in non-small-cell lung cancer treated with irinotecan-based chemotherapy , 2010, The Pharmacogenomics Journal.

[39]  W E Fisher,et al.  Insulin promotes pancreatic cancer: evidence for endocrine influence on exocrine pancreatic tumors. , 1996, Journal of Surgical Research.

[40]  Dimitrios Iliopoulos,et al.  Neurotensin signaling activates microRNAs-21 and -155 and Akt, promotes tumor growth in mice, and is increased in human colon tumors. , 2011, Gastroenterology.

[41]  荻野 広和,et al.  Follistatin suppresses the production of experimental multiple-organ metastasis by small cell lung cancer cells in natural killer cell-depleted SCID mice , 2008 .

[42]  G. Cavet,et al.  Inferring the functional effects of mutation through clusters of mutations in homologous proteins , 2010, Human mutation.

[43]  Jing Jin,et al.  E1B-55kD-deleted oncolytic adenovirus armed with canstatin gene yields an enhanced anti-tumor efficacy on pancreatic cancer. , 2009, Cancer letters.

[44]  Kyu Kwang Kim,et al.  Protein interaction network related to Helicobacter pylori infection response. , 2009, World journal of gastroenterology.

[45]  Rajiv Dhir,et al.  Talin1 promotes tumor invasion and metastasis via focal adhesion signaling and anoikis resistance. , 2010, Cancer research.

[46]  Daohai Zhang,et al.  Silencing of the PP2A catalytic subunit causes HER-2/neu positive breast cancer cells to undergo apoptosis. , 2010, Experimental cell research.

[47]  T. Gibson,et al.  Systematic Discovery of New Recognition Peptides Mediating Protein Interaction Networks , 2005, PLoS biology.

[48]  Yuan Qi,et al.  Molecular anatomy of breast cancer stroma and its prognostic value in estrogen receptor-positive and -negative cancers. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[49]  Jörg Rahnenführer,et al.  Changes in cortical cytoskeletal and extracellular matrix gene expression in prostate cancer are related to oncogenic ERG deregulation , 2010, BMC Cancer.

[50]  Carol Friedman,et al.  Discovery of Protein Interaction Networks Shared by Diseases , 2006, Pacific Symposium on Biocomputing.

[51]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[52]  M. Kerin,et al.  Influence of stromal–epithelial interactions on breast cancer in vitro and in vivo , 2011, Breast Cancer Research and Treatment.

[53]  S. L. Wong,et al.  Combining biological networks to predict genetic interactions. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[54]  R. Wolff,et al.  Genetic variation in RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, and PDK1 and risk of colon or rectal cancer. , 2011, Mutation research.

[55]  F Chen JUNB (jun B proto-oncogene) , 2011 .

[56]  Xuebing Wu,et al.  Cancer Gene Prediction Using a Network Approach , 2009 .

[57]  David P. Davis,et al.  Discovering cancer genes by integrating network and functional properties , 2009, BMC Medical Genomics.

[58]  Jing Chen,et al.  Disease candidate gene identification and prioritization using protein interaction networks , 2009, BMC Bioinformatics.

[59]  Christos A Ouzounis,et al.  Structural and functional properties of genes involved in human cancer , 2006, BMC Genomics.

[60]  A. Bauer,et al.  Human protein-protein interaction networks and the value for drug discovery. , 2007, Drug discovery today.

[61]  Andreas Bender,et al.  Collaboration-Based Function Prediction in Protein-Protein Interaction Networks , 2011, IDA.

[62]  Eddie Reed,et al.  Modulation of Protein Phosphatase 2A Activity Alters Androgen-Independent Growth of Prostate Cancer Cells: Therapeutic Implications , 2011, Molecular Cancer Therapeutics.

[63]  A. F. Scott,et al.  OMIM: Online Mendelian Inheritance in Man , 2002 .

[64]  G. Cao,et al.  Identification of novel hub genes associated with liver metastasis of gastric cancer , 2009, International journal of cancer.

[65]  B. Zetter,et al.  Evidence of a Role for Antizyme and Antizyme Inhibitor as Regulators of Human Cancer , 2011, Molecular Cancer Research.

[66]  Xiaoke Huang,et al.  Endoglin suppresses human prostate cancer metastasis , 2010, Clinical & Experimental Metastasis.

[67]  Florence Dalenc,et al.  Formation of the eIF4F translation-initiation complex determines sensitivity to anticancer drugs targeting the EGFR and HER2 receptors. , 2011, Cancer research.

[68]  Robert Johansson,et al.  Polymorphisms in predicted microRNA-binding sites in integrin genes and breast cancer: ITGB4 as prognostic marker. , 2008, Carcinogenesis.

[69]  Q. Ye,et al.  Synergistic repression of estrogen receptor transcriptional activity by FHL2 and Smad4 in breast cancer cells , 2010, IUBMB life.

[70]  Boris Zhivotovsky,et al.  Phosphoproteomic profiling of NSCLC cells reveals that ephrin B3 regulates pro-survival signaling through Akt1-mediated phosphorylation of the EphA2 receptor. , 2011, Journal of proteome research.

[71]  Felice Petraglia,et al.  Differential expression of follistatin and FLRG in human breast proliferative disorders , 2009, BMC Cancer.

[72]  P. Dennis,et al.  Akt/protein kinase B is constitutively active in non-small cell lung cancer cells and promotes cellular survival and resistance to chemotherapy and radiation. , 2001, Cancer research.

[73]  Baldomero Oliva,et al.  Predicting cancer involvement of genes from heterogeneous data , 2008, BMC Bioinformatics.

[74]  Raghu Kalluri,et al.  Fibroblasts in cancer , 2006, Nature Reviews Cancer.

[75]  Igor Jurisica,et al.  Protein complex prediction via cost-based clustering , 2004, Bioinform..

[76]  Dipanwita Roy Chowdhury,et al.  Human protein reference database as a discovery resource for proteomics , 2004, Nucleic Acids Res..

[77]  Phillip G Febbo,et al.  MYC activity mitigates response to rapamycin in prostate cancer through 4EBP1-mediated inhibition of autophagy , 2010, Autophagy.

[78]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[79]  A Sasaki,et al.  Negative regulation of cytokine signaling pathways. , 2000, Annual review of immunology.

[80]  R. Wooster The cancer genome project , 2002 .

[81]  Tao Wang,et al.  Increased nucleotide polymorphic changes in the 5′-untranslated region of δ-catenin (CTNND2) gene in prostate cancer , 2008, Oncogene.

[82]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[83]  Hideki Takai,et al.  cAMP and fibroblast growth factor 2 regulate bone sialoprotein gene expression in human prostate cancer cells. , 2011, Gene.

[84]  Y. Fujii,et al.  Identification of candidate genes involved in the radiosensitivity of esophageal cancer cells by microarray analysis. , 2008, Diseases of the esophagus : official journal of the International Society for Diseases of the Esophagus.

[85]  Torsten Haferlach,et al.  Expression of IGFBP7 in acute leukemia is regulated by DNA methylation , 2011, Cancer science.

[86]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[87]  Hee June Choi,et al.  SUMOylation of pontin chromatin-remodeling complex reveals a signal integration code in prostate cancer cells , 2007, Proceedings of the National Academy of Sciences.

[88]  Yongjin Li,et al.  Discovering disease-genes by topological features in human protein-protein interaction network , 2006, Bioinform..

[89]  Jolanta Lissowska,et al.  TGF-β Signaling Pathway and Breast Cancer Susceptibility , 2011, Cancer Epidemiology, Biomarkers & Prevention.

[90]  Lei Wan,et al.  Talin‐1 overexpression defines high risk for aggressive oral squamous cell carcinoma and promotes cancer metastasis , 2011, The Journal of pathology.