Identification of cancer omics commonality and difference via community fusion

The analysis of cancer omics data is a "classic" problem; however, it still remains challenging. Advancing from early studies that are mostly focused on a single type of cancer, some recent studies have analyzed data on multiple "related" cancer types/subtypes, examined their commonality and difference, and led to insightful findings. In this article, we consider the analysis of multiple omics datasets, with each dataset on one type/subtype of "related" cancers. A Community Fusion (CoFu) approach is developed, which conducts marker selection and model building using a novel penalization technique, informatively accommodates the network community structure of omics measurements, and automatically identifies the commonality and difference of cancer omics markers. Simulation demonstrates its superiority over direct competitors. The analysis of TCGA lung cancer and melanoma data leads to interesting findings.

[1]  Chen Li,et al.  Identification of TRA2B-DNAH5 fusion as a novel oncogenic driver in human lung squamous cell carcinoma , 2016, Cell Research.

[2]  Qun Wang,et al.  Bioinformatics analyses of the differences between lung adenocarcinoma and squamous cell carcinoma using The Cancer Genome Atlas expression data , 2017, Molecular medicine reports.

[3]  J. Huo,et al.  MicroRNA-1246 promotes growth and metastasis of colorectal cancer cells involving CCNG2 reduction. , 2016, Molecular medicine reports.

[4]  Wei Zhang,et al.  Network-based machine learning and graph theory algorithms for precision oncology , 2017, npj Precision Oncology.

[5]  J. Ott,et al.  Genome-wide examination of genetic variants associated with response to platinum-based chemotherapy in patients with small-cell lung cancer , 2010, Pharmacogenetics and genomics.

[6]  V. Pesce,et al.  Increase in proteins involved in mitochondrial fission, mitophagy, proteolysis and antioxidant response in type I endometrial cancer as an adaptive response to respiratory complex I deficiency. , 2017, Biochemical and biophysical research communications.

[7]  Jian Huang,et al.  Integrative Analysis of High‐throughput Cancer Studies With Contrasted Penalization , 2014, Genetic epidemiology.

[8]  Xingjie Shi,et al.  Analysis of cancer gene expression data with an assisted robust marker identification approach , 2017, Genetic epidemiology.

[9]  Marián Boguñá,et al.  Extracting the multiscale backbone of complex weighted networks , 2009, Proceedings of the National Academy of Sciences.

[10]  M. Martinka,et al.  Stage-specific prognostic biomarkers in melanoma , 2015, Oncotarget.

[11]  Steven J. M. Jones,et al.  Comprehensive genomic characterization of squamous cell lung cancers , 2012, Nature.

[12]  Shuangge Ma,et al.  Promoting Similarity of Sparsity Structures in Integrative Analysis With Penalization , 2015, Journal of the American Statistical Association.

[13]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Habibollah Haron,et al.  Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[15]  Jian Huang,et al.  Integrative analysis of multiple cancer prognosis studies with gene expression measurements , 2011, Statistics in medicine.

[16]  Jin Liu,et al.  Promoting similarity of model sparsity structures in integrative analysis of cancer genetic data , 2017, Statistics in medicine.

[17]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[18]  M. Hestenes Multiplier and gradient methods , 1969 .

[19]  Michael Krauthammer,et al.  Integrated analysis of multidimensional omics data on cutaneous melanoma prognosis. , 2016, Genomics.

[20]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Jian Gu,et al.  Genome-wide association study of survival in non-small cell lung cancer patients receiving platinum-based chemotherapy. , 2011, Journal of the National Cancer Institute.

[22]  E. Korn,et al.  Stage‐specific alterations of the genome, transcriptome, and proteome during colorectal carcinogenesis , 2007, Genes, chromosomes & cancer.

[23]  Bas J. Wouters,et al.  C/EBPγ deregulation results in differentiation arrest in acute myeloid leukemia. , 2012, The Journal of clinical investigation.

[24]  Ashok Palaniappan,et al.  Computational Identification of Novel Stage-Specific Biomarkers in Colorectal Cancer Progression , 2016, PloS one.

[25]  Paolo Vineis,et al.  Deciphering the complex: Methodological overview of statistical models to derive OMICS‐based biomarkers , 2013, Environmental and molecular mutagenesis.

[26]  Michael Thomas,et al.  AURKA, DLGAP5, TPX2, KIF11 and CKAP5: Five specific mitosis-associated genes correlate with poor prognosis for non-small cell lung cancer patients , 2017, International journal of oncology.

[27]  Ling Guo,et al.  Morin inhibited lung cancer cells viability, growth, and migration by suppressing miR-135b and inducing its target CCNG2 , 2017, Tumour biology : the journal of the International Society for Oncodevelopmental Biology and Medicine.

[28]  Matthew J. Davis,et al.  Exome sequencing identifies recurrent somatic RAC1 mutations in melanoma , 2012, Nature Genetics.

[29]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[30]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[31]  Steven J. M. Jones,et al.  Genomic Classification of Cutaneous Melanoma , 2015, Cell.

[32]  Gary D Bader,et al.  Comprehensive identification of mutational cancer driver genes across 12 tumor types , 2013, Scientific Reports.

[33]  Weiwei Xiao,et al.  Preliminary investigation of the role of BTB domain-containing 3 gene in the proliferation and metastasis of hepatocellular carcinoma. , 2017, Oncology letters.