Integrating Omic Data with a Multiplex Network-based Approach for the Identification of Cancer Subtypes.

Comprehensive characterization and identification of cancer subtypes have a number of applications and implications in life science and cancer research. Technologies centered on the integration of omics data hold great promise in this endeavor. This paper proposed a multiplex network-based approach for integrative analysis of heterogeneous omics data. It represents a useful alternative network-based solution to the problem and a significant step forward to the methods in which each type of data is treated independently. It has been tested on the identification of the subtypes of glioblastoma multiforme and breast invasive carcinoma from three omics data. The results obtained have shown that it has achieved the performance comparable to state-of-the-art techniques (Normalized Mutual Information > 0.8). In comparison to traditional systems biology tools, the proposed methodology has several significant advantages. It has the ability to correlate and integrate multiple data levels in a holistic manner which may be useful to facilitate our understanding of the pathogenesis of diseases and to capture the heterogeneity of biological processes and the complexity of phenotypes.

[1]  D G Altman,et al.  Survival probabilities (the Kaplan-Meier method) , 1998, BMJ.

[2]  David T. W. Jones,et al.  Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma. , 2012, Cancer cell.

[3]  Serdar Bozdag,et al.  Age-specific signatures of glioblastoma at the genomic, genetic, and epigenetic levels. , 2013, PloS one.

[4]  David W. Hosmer,et al.  Applied Survival Analysis: Regression Modeling of Time-to-Event Data , 2008 .

[5]  D. Pe’er,et al.  An Integrated Approach to Uncover Drivers of Cancer , 2010, Cell.

[6]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[7]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.

[8]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[9]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[10]  R. Gibbs,et al.  Genomic analyses identify molecular subtypes of pancreatic cancer , 2016, Nature.

[11]  Ju Han Kim,et al.  Synergistic effect of different levels of genomic data for cancer clinical outcome prediction , 2012, J. Biomed. Informatics.

[12]  Yuan Qi,et al.  Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA , IDH 1 , EGFR , and NF 1 Citation Verhaak , 2010 .

[13]  D. Haussler,et al.  The Somatic Genomic Landscape of Glioblastoma , 2013, Cell.

[14]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.

[15]  Rory Stark,et al.  Progesterone receptor modulates estrogen receptor-α action in breast cancer , 2015, Nature.

[16]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[17]  Jukka-Pekka Onnela,et al.  Community Structure in Time-Dependent, Multiscale, and Multiplex Networks , 2009, Science.

[18]  Nico Pfeifer,et al.  Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery , 2015, Bioinform..

[19]  K. Aldape,et al.  Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. , 2005, Cancer research.

[20]  R. Gelber,et al.  Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011 , 2011, Annals of oncology : official journal of the European Society for Medical Oncology.

[21]  Huiru Zheng,et al.  Integrating omics data for identifying disease subtypes: A multiplex network-based approach , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[22]  Ash A. Alizadeh,et al.  Toward understanding and exploiting tumor heterogeneity , 2015, Nature Medicine.

[23]  Nello Cristianini,et al.  A statistical framework for genomic data fusion , 2004, Bioinform..

[24]  David Gomez-Cabrero,et al.  Data integration in the era of omics: current and future challenges , 2014, BMC Systems Biology.