Integrative Analysis of Pathological Images and Multi-Dimensional Genomic Data for Early-Stage Cancer Prognosis

The integrative analysis of histopathological images and genomic data has received increasing attention for studying the complex mechanisms of driving cancers. However, most image-genomic studies have been restricted to combining histopathological images with the single modality of genomic data (e.g., mRNA transcription or genetic mutation), and thus neglect the fact that the molecular architecture of cancer is manifested at multiple levels, including genetic, epigenetic, transcriptional, and post-transcriptional events. To address this issue, we propose a novel ordinal multi-modal feature selection (OMMFS) framework that can simultaneously identify important features from both pathological images and multi-modal genomic data (i.e., mRNA transcription, copy number variation, and DNA methylation data) for the prognosis of cancer patients. Our model is based on a generalized sparse canonical correlation analysis framework, by which we also take advantage of the ordinal survival information among different patients for survival outcome prediction. We evaluate our method on three early-stage cancer datasets derived from The Cancer Genome Atlas (TCGA) project, and the experimental results demonstrated that both the selected image and multi-modal genomic markers are strongly correlated with survival enabling effective stratification of patients with distinct survival than the comparing methods, which is often difficult for early-stage cancer patients.

[1]  B. Yener,et al.  Automated cancer diagnosis based on histopathological images : a systematic survey , 2005 .

[2]  Jung-A Choi,et al.  Pro-survival of estrogen receptor-negative breast cancer cells is regulated by a BLT2-reactive oxygen species-linked signaling pathway. , 2010, Carcinogenesis.

[3]  Satoshi Inoue,et al.  Androgen-induced Long Noncoding RNA (lncRNA) SOCS2-AS1 Promotes Cell Growth and Inhibits Apoptosis in Prostate Cancer Cells* , 2016, The Journal of Biological Chemistry.

[4]  Maher Rizkalla,et al.  SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer , 2019, Front. Genet..

[5]  A. Jemal,et al.  Global Cancer Statistics , 2011 .

[6]  Jie Zhang,et al.  Pan-cancer analysis of frequent DNA co-methylation patterns reveals consistent epigenetic landscape changes in multiple cancers , 2017, BMC Genomics.

[7]  J Howard Jaster,et al.  Antibodies to Zic4 in paraneoplastic neurologic disorders and small-cell lung cancer , 2004, Neurology.

[8]  A. Jemal,et al.  Global cancer statistics, 2012 , 2015, CA: a cancer journal for clinicians.

[9]  Lawrence O. Hall,et al.  Nucleus segmentation in histology images with hierarchical multilevel thresholding , 2016, SPIE Medical Imaging.

[10]  Daniela M Witten,et al.  Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data , 2009, Statistical applications in genetics and molecular biology.

[11]  Qianjin Feng,et al.  Identification of topological features in renal tumor microenvironment associated with patient survival , 2017, Bioinform..

[12]  Kyoung-Mee Kim,et al.  Deregulation of immune response genes in patients with Epstein-Barr virus-associated gastric cancer and outcomes. , 2015, Gastroenterology.

[13]  Inmoo Rhee,et al.  Cytokine Signaling in Tumor Progression , 2017, Immune network.

[14]  Li Yu,et al.  Prognostic Significance of Mixed-Lineage Leukemia (MLL) Gene Detected by Real-Time Fluorescence Quantitative PCR Assay in Acute Myeloid Leukemia , 2016, Medical science monitor : international medical journal of experimental and clinical research.

[15]  John D. Storey,et al.  Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach , 2004 .

[16]  S. Humphries,et al.  Cytokine and cytokine receptor gene polymorphisms and their functionality. , 2009, Cytokine & growth factor reviews.

[17]  R. Perona,et al.  Control of oncogenesis and cancer therapy resistance , 2004, British Journal of Cancer.

[18]  A Kuzucu,et al.  Cytogenetic findings in thirty lung carcinoma patients. , 1998, Cancer genetics and cytogenetics.

[19]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Shin-Tai Chen,et al.  Evidence that RASSF1C Stimulation of Lung Cancer Cell Proliferation Depends on IGFBP-5 and PIWIL1 Expression Levels , 2014, PloS one.

[21]  Xiaoyang Tan,et al.  Mixed bi-subject kinship verification via multi-view multi-task learning , 2016, Neurocomputing.

[22]  Z. Krawczyk,et al.  HSPA2 is expressed in human tumors and correlates with clinical features in non-small cell lung carcinoma patients. , 2014, Anticancer research.

[23]  Per Magne Ueland,et al.  Genetic Polymorphisms in 15q25 and 19q13 Loci, Cotinine Levels, and Risk of Lung Cancer in EPIC , 2011, Cancer Epidemiology, Biomarkers & Prevention.

[24]  F. Markowetz,et al.  Quantitative Image Analysis of Cellular Heterogeneity in Breast Tumors Complements Genomic Profiling , 2012, Science Translational Medicine.

[25]  Jianzhong Wu,et al.  Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images , 2016, IEEE Transactions on Medical Imaging.

[26]  Peng Qiu,et al.  TCGA-Assembler: open-source software for retrieving and processing TCGA data , 2014, Nature Methods.

[27]  S. Yao,et al.  Prognostic Significance of Stromal Versus Intratumoral Infiltrating Lymphocytes in Different Subtypes of Breast Cancer Treated With Cytotoxic Neoadjuvant Chemotherapy , 2017, Applied immunohistochemistry & molecular morphology : AIMM.

[28]  George Michailidis,et al.  A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data , 2015, Bioinform..

[29]  Wenhui Liu,et al.  Nox1 is over‐expressed in human colon cancers and correlates with activating mutations in K‐Ras , 2008, International journal of cancer.

[30]  S. Srivastava,et al.  Naturally occurring human glutathione S-transferase GSTP1-1 isoforms with isoleucine and valine in position 104 differ in enzymic properties. , 1994, European journal of biochemistry.

[31]  N. Bander,et al.  Mechanisms of apoptosis in T cells from patients with renal cell carcinoma. , 1999, Clinical cancer research : an official journal of the American Association for Cancer Research.

[32]  David R. Cox,et al.  Proportional hazards model , 2018 .

[33]  L. Rönnstrand,et al.  Suppressor of cytokine signaling 2 (SOCS2) associates with FLT3 and negatively regulates downstream signaling , 2013, Molecular oncology.

[34]  Daoqiang Zhang,et al.  Ordinal Multi-modal Feature Selection for Survival Analysis of Early-Stage Renal Cancer , 2018, MICCAI.

[35]  F. Rizvi,et al.  NADPH oxidase expression and production of superoxide by human corneal stromal cells , 2009, Molecular vision.

[36]  Shi-Hua Zhang,et al.  Identifying multi-layer gene regulatory modules from multi-dimensional genomic data , 2012, Bioinform..

[37]  Mats Gustafsson,et al.  Antenna Current Optimization using MATLAB and CVX , 2016 .

[38]  Qianjin Feng,et al.  Integrative Analysis of Histopathological Images and Genomic Data Predicts Clear Cell Renal Cell Carcinoma Prognosis. , 2017, Cancer research.

[39]  D. Spandidos,et al.  Genetic Instability in Renal Cell Carcinoma , 1998, European Urology.

[40]  H. Akino,et al.  Prognostic value of nuclear area index in combination with the World Health Organization grading system for patients with renal cell carcinoma. , 2001, Urology.

[41]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[42]  Shaoming Li,et al.  Suppression of OSCC malignancy by oral glands derived‐PIP identified by iTRAQ combined with 2D LC‐MS/MS , 2019, Journal of cellular physiology.

[43]  Dacheng Tao,et al.  Multi-View Intact Space Learning , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .

[45]  M. Paul,et al.  Tyrosine kinase – Role and significance in Cancer , 2004, International journal of medical sciences.

[46]  Camille Stephan-Otto Attolini,et al.  Stromal gene expression defines poor-prognosis subtypes in colorectal cancer , 2015, Nature Genetics.

[47]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[48]  B. Mercier,et al.  A dual algorithm for the solution of nonlinear variational problems via finite element approximation , 1976 .

[49]  Bengt Klefsjö,et al.  Proportional hazards model: a review , 1994 .

[50]  Guy N. Brock,et al.  Mutational Mechanisms That Activate Wnt Signaling and Predict Outcomes in Colorectal Cancer Patients. , 2018, Cancer research.

[51]  James J. Chen,et al.  Assessment of performance of survival prediction models for cancer prognosis , 2012, BMC Medical Research Methodology.

[52]  Miki Ohira,et al.  Aberrant methylation of RASGRF2 and RASSF1A in human non-small cell lung cancer. , 2006, Oncology reports.

[53]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.

[54]  Jian Xian,et al.  Combined image and genomic analysis of high-grade serous ovarian cancer reveals PTEN loss as a common driver event and prognostic classifier , 2014, Genome Biology.