Integrating Omics Data With a Multiplex Network-Based Approach for the Identification of Cancer Subtypes

Comprehensive characterization and identification of cancer subtypes have a number of applications and implications in life science and cancer research. Technologies centered on the integration of omics data hold great promise in this endeavor. This paper proposed a multiplex network-based approach for integrative analysis of heterogeneous omics data. It represents a useful alternative network-based solution to the problem and a significant step forward to the methods in which each type of data is treated independently. It has been tested on the identification of the subtypes of glioblastoma multiforme and breast invasive carcinoma from three omics data. The results obtained have shown that it has achieved the performance comparable to state-of-the-art techniques (Normalized Mutual Information >0.8). In comparison to traditional systems biology tools, the proposed methodology has several significant advantages. It has the ability to correlate and integrate multiple data levels in a holistic manner which may be useful to facilitate our understanding of the pathogenesis of diseases and to capture the heterogeneity of biological processes and the complexity of phenotypes.

[1]  Ash A. Alizadeh,et al.  Toward understanding and exploiting tumor heterogeneity , 2015, Nature Medicine.

[2]  R. Gelber,et al.  Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011 , 2011, Annals of oncology : official journal of the European Society for Medical Oncology.

[3]  Huiru Zheng,et al.  Integrating omics data for identifying disease subtypes: A multiplex network-based approach , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[4]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[5]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Ju Han Kim,et al.  Synergistic effect of different levels of genomic data for cancer clinical outcome prediction , 2012, J. Biomed. Informatics.

[7]  Jukka-Pekka Onnela,et al.  Community Structure in Time-Dependent, Multiscale, and Multiplex Networks , 2009, Science.

[8]  David T. W. Jones,et al.  Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma. , 2012, Cancer cell.

[9]  Yuan Qi,et al.  Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA , IDH 1 , EGFR , and NF 1 Citation Verhaak , 2010 .

[10]  K. Aldape,et al.  Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. , 2005, Cancer research.

[11]  Nico Pfeifer,et al.  Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery , 2015, Bioinform..

[12]  D. Pe’er,et al.  An Integrated Approach to Uncover Drivers of Cancer , 2010, Cell.

[13]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[14]  Serdar Bozdag,et al.  Age-specific signatures of glioblastoma at the genomic, genetic, and epigenetic levels. , 2013, PloS one.

[15]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[16]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.

[17]  Rory Stark,et al.  Progesterone receptor modulates estrogen receptor-α action in breast cancer , 2015, Nature.

[18]  W. Marsden I and J , 2012 .

[19]  P. V. Rao,et al.  Applied Survival Analysis: Regression Modeling of Time to Event Data , 2000 .

[20]  Nello Cristianini,et al.  A statistical framework for genomic data fusion , 2004, Bioinform..

[21]  David Gomez-Cabrero,et al.  Data integration in the era of omics: current and future challenges , 2014, BMC Systems Biology.

[22]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.