Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences

Recent advances in high-throughput sequencing have accelerated the accumulation of omics data on the same tumor tissue from multiple sources. Intensive study of multi-omics integration on tumor samples can stimulate progress in precision medicine and is promising in detecting potential biomarkers. However, current methods are restricted owing to highly unbalanced dimensions of omics data or difficulty in assigning weights between different data sources. Therefore, the appropriate approximation and constraints of integrated targets remain a major challenge. In this paper, we proposed an omics data integration method, named high-order path elucidated similarity (HOPES). HOPES fuses the similarities derived from various omics data sources to solve the dimensional discrepancy, and progressively elucidate the similarities from each type of omics data into an integrated similarity with various high-order connected paths. Through a series of incremental constraints for commonality, HOPES can take both specificity of single data and consistency between different data types into consideration. The fused similarity matrix gives global insight into patients' correlation and efficiently distinguishes subgroups. We tested the performance of HOPES on both a simulated dataset and several empirical tumor datasets. The test datasets contain three omics types including gene expression, DNA methylation, and microRNA data for five different TCGA cancer projects. Our method was shown to achieve superior accuracy and high robustness compared with several benchmark methods on simulated data. Further experiments on five cancer datasets demonstrated that HOPES achieved superior performances in cancer classification. The stratified subgroups were shown to have statistically significant differences in survival. We further located and identified the key genes, methylation sites, and microRNAs within each subgroup. They were shown to achieve high potential prognostic value and were enriched in many cancer-related biological processes or pathways.

[1]  A. Morgun,et al.  New approach reveals CD28 and IFNG gene interaction in the susceptibility to cervical cancer. , 2008, Human molecular genetics.

[2]  A. Frigessi,et al.  Principles and methods of integrative genomic analyses in cancer , 2014, Nature Reviews Cancer.

[3]  M. Noguchi,et al.  Drebrin: A new oncofetal biomarker associated with prognosis of lung adenocarcinoma. , 2016, Lung cancer.

[4]  N. van Baren,et al.  A novel cancer-germline transcript carrying pro-metastatic miR-105 and TET-targeting miR-767 induced by DNA hypomethylation in tumors , 2014, Epigenetics.

[5]  U. Lehmann,et al.  Epigenetic inactivation of microRNA gene hsa‐mir‐9‐1 in human breast cancer , 2008, The Journal of pathology.

[6]  J. Gu,et al.  Hsa-miR-9 methylation status is associated with cancer development and metastatic recurrence in patients with clear cell renal cell carcinoma , 2010, Oncogene.

[7]  Naimei Tang,et al.  Akt, FoxO and regulation of apoptosis. , 2011, Biochimica et biophysica acta.

[8]  Ranjana Mitra,et al.  Prediction of Postoperative Recurrence-Free Survival in Non–Small Cell Lung Cancer by Using an Internationally Validated Gene Expression Model , 2011, Clinical Cancer Research.

[9]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[10]  T. Ochiya,et al.  Novel combination of serum microRNA for detecting breast cancer in the early stage , 2016, Cancer science.

[11]  B. Leone,et al.  Expression of ERCC1 and TUBB3 in Locally Advanced Cervical Squamous Cell Cancer and its Correlation with Different Therapeutic Regimens , 2015, The International journal of biological markers.

[12]  S A Forbes,et al.  The Catalogue of Somatic Mutations in Cancer (COSMIC) , 2008, Current protocols in human genetics.

[13]  Bernhard Kuster,et al.  moCluster: Identifying Joint Patterns Across Multiple Omics Data Sets. , 2016, Journal of proteome research.

[14]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[15]  L. Cantley,et al.  Understanding the Warburg Effect: The Metabolic Requirements of Cell Proliferation , 2009, Science.

[16]  K. Conway,et al.  Racial Variation in Breast Tumor Promoter Methylation in the Carolina Breast Cancer Study , 2015, Cancer Epidemiology, Biomarkers & Prevention.

[17]  Dong Wang,et al.  Predictive value of APE1, BRCA1, ERCC1 and TUBB3 expression in patients with advanced non-small cell lung cancer (NSCLC) receiving first-line platinum–paclitaxel chemotherapy , 2014, Cancer Chemotherapy and Pharmacology.

[18]  Abdel Kareem Azab,et al.  The role of hypoxia in cancer progression, angiogenesis, metastasis, and resistance to therapy , 2015, Hypoxia.

[19]  Bo Liu,et al.  Down-regulated miR-9 and miR-433 in human gastric carcinoma , 2009, Journal of experimental & clinical cancer research : CR.

[20]  T. Ushijima,et al.  The presence of RNA polymerase II, active or stalled, predicts epigenetic fate of promoter CpG islands. , 2009, Genome research.

[21]  Steven J. M. Jones,et al.  Integrated genomic and molecular characterization of cervical cancer , 2017, Nature.

[22]  M. Schindl,et al.  Overexpression of hypoxia-inducible factor 1alpha is a marker for an unfavorable prognosis in early-stage invasive cervical cancer. , 2000, Cancer research.

[23]  A. Balakrishnan,et al.  Molecular profiling of the “plexinome” in melanoma and pancreatic cancer , 2009, Human mutation.

[24]  C. Croce,et al.  Oncogenic role of miR-483-3p at the IGF2/483 locus. , 2010, Cancer research.

[25]  Frank Speleman,et al.  miR-9, a MYC/MYCN-activated microRNA, regulates E-cadherin and cancer metastasis , 2010, Nature Cell Biology.

[26]  Lana X. Garmire,et al.  More Is Better: Recent Progress in Multi-Omics Data Integration Methods , 2017, Front. Genet..

[27]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[28]  S. Croul,et al.  The role of drebrin in glioma migration and invasion. , 2013, Experimental cell research.

[29]  R. Shamir,et al.  Multi-omic and multi-view clustering algorithms: review and cancer benchmark , 2018, bioRxiv.

[30]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[31]  Ron Shamir,et al.  Multi-omic and multi-view clustering algorithms: review and cancer benchmark , 2018 .

[32]  Lorenz Wernisch,et al.  Clusternomics: Integrative context-dependent clustering for heterogeneous datasets , 2017, bioRxiv.

[33]  Hwee Tong Tan,et al.  iTRAQ analysis of colorectal cancer cell lines suggests Drebrin (DBN1) is overexpressed during liver metastasis , 2014, Proteomics.

[34]  R. Medema,et al.  AFX-like Forkhead transcription factors mediate cell-cycle regulation by Ras and PKB through p27kip1 , 2000, Nature.

[35]  Jill P. Mesirov,et al.  Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data , 2003, Machine Learning.

[36]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[37]  Rebecca SY Wong,et al.  Apoptosis in cancer: from pathogenesis to treatment , 2011, Journal of experimental & clinical cancer research : CR.

[38]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[39]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[40]  C. Sander,et al.  Pattern discovery and cancer gene identification in integrated cancer genomic data , 2013, Proceedings of the National Academy of Sciences.

[41]  Anthony J Gill,et al.  miR-195 and miR-483-5p Identified as Predictors of Poor Prognosis in Adrenocortical Cancer , 2009, Clinical Cancer Research.

[42]  Iris Barshack,et al.  MiR‐92b and miR‐9/9* Are Specifically Expressed in Brain Primary Tumors and Can Be Used to Differentiate Primary from Metastatic Brain Tumors , 2008, Brain pathology.

[43]  L. Lasky,et al.  The Forkhead Transcription Factor FOXO4 Induces the Down-regulation of Hypoxia-inducible Factor 1α by a von Hippel-Lindau Protein-independent Mechanism* , 2003, Journal of Biological Chemistry.

[44]  A. Services,et al.  Integrated genomic and molecular characterization of cervical cancer. , 2017 .

[45]  T. Shirao,et al.  A novel role for drebrin in regulating progranulin bioactivity in bladder cancer , 2015, Oncotarget.

[46]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[47]  P. Laird,et al.  Discovery of multi-dimensional modules by integrative analysis of cancer genomic data , 2012, Nucleic acids research.

[48]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[49]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[50]  Olli Yli-Harja,et al.  A joint finite mixture model for clustering genes from independent Gaussian and beta distributed data , 2009, BMC Bioinformatics.

[51]  Hsien-Da Huang,et al.  miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions , 2017, Nucleic Acids Res..

[52]  S. de Vos,et al.  FOXO‐dependent expression of the proapoptotic protein Bim: pivotal role for apoptosis signaling in endothelial progenitor cells , 2005, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[53]  H. Ngan,et al.  Activation of AMPK inhibits cervical cancer cell growth through AKT/FOXO3a/FOXM1 signaling cascade , 2013, BMC Cancer.

[54]  Juan Liu,et al.  Pattern fusion analysis by adaptive alignment of multiple heterogeneous omics data , 2017, Bioinform..

[55]  Pamela N. Munster,et al.  IN HUMAN BREAST CANCER , 2007 .

[56]  Harald Binder,et al.  Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures , 2014, PloS one.

[57]  Y. Asmann,et al.  A Tissue Biomarker Panel Predicting Systemic Progression after PSA Recurrence Post-Definitive Prostate Cancer Therapy , 2008, PloS one.

[58]  May D. Wang,et al.  –Omic and Electronic Health Record Big Data Analytics for Precision Medicine , 2017, IEEE Transactions on Biomedical Engineering.

[59]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[60]  M. Hung,et al.  Aberrant Expression of proPTPRN2 in Cancer Cells Confers Resistance to Apoptosis. , 2015, Cancer research.

[61]  Kaiming Gao,et al.  Identification of intrinsic subtype-specific prognostic microRNAs in primary glioblastoma , 2014, Journal of experimental & clinical cancer research : CR.