Prognostic analysis of histopathological images using pre-trained convolutional neural networks: application to hepatocellular carcinoma

Histopathological images contain rich phenotypic descriptions of the molecular processes underlying disease progression. Convolutional neural networks, state-of-the-art image analysis techniques in computer vision, automatically learn representative features from such images which can be useful for disease diagnosis, prognosis, and subtyping. Hepatocellular carcinoma (HCC) is the sixth most common type of primary liver malignancy. Despite the high mortality rate of HCC, little previous work has made use of CNN models to explore the use of histopathological images for prognosis and clinical survival prediction of HCC. We applied three pre-trained CNN models—VGG 16, Inception V3 and ResNet 50—to extract features from HCC histopathological images. Sample visualization and classification analyses based on these features showed a very clear separation between cancer and normal samples. In a univariate Cox regression analysis, 21.4% and 16% of image features on average were significantly associated with overall survival (OS) and disease-free survival (DFS), respectively. We also observed significant correlations between these features and integrated biological pathways derived from gene expression and copy number variation. Using an elastic net regularized Cox Proportional Hazards model of OS constructed from Inception image features, we obtained a concordance index (C-index) of 0.789 and a significant log-rank test (p = 7.6E−18). We also performed unsupervised classification to identify HCC subgroups from image features. The optimal two subgroups discovered using Inception model image features showed significant differences in both overall (C-index = 0.628 and p = 7.39E−07) and DFS (C-index = 0.558 and p = 0.012). Our work demonstrates the utility of extracting image features using pre-trained models by using them to build accurate prognostic models of HCC as well as highlight significant correlations between these features, clinical survival, and relevant biological pathways. Image features extracted from HCC histopathological images using the pre-trained CNN models VGG 16, Inception V3 and ResNet 50 can accurately distinguish normal and cancer samples. Furthermore, these image features are significantly correlated with survival and relevant biological pathways.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Derek Y. Chiang,et al.  Integrative transcriptome analysis reveals common molecular subclasses of human hepatocellular carcinoma. , 2009, Cancer research.

[3]  Deok Won Kim,et al.  The possibility of the combination of OCT and fundus images for improving the diagnostic accuracy of deep learning for age-related macular degeneration: a preliminary experiment , 2018, Medical & Biological Engineering & Computing.

[4]  Daniel Smilkov,et al.  Similar image search for histopathology: SMILY , 2019, npj Digital Medicine.

[5]  A. Ruifrok,et al.  Quantification of histochemical staining by color deconvolution. , 2001, Analytical and quantitative cytology and histology.

[6]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[7]  Jessica Zucman-Rossi,et al.  Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets , 2015, Nature Genetics.

[8]  Angel Cruz-Roa,et al.  High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection , 2018, PloS one.

[9]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[10]  L. Qin,et al.  The prognostic molecular markers in hepatocellular carcinoma. , 2002, World journal of gastroenterology.

[11]  D. Brat,et al.  Predicting cancer outcomes from histology and genomics using convolutional networks , 2017, Proceedings of the National Academy of Sciences.

[12]  H. Xi,et al.  Eph receptors and ephrins as targets for cancer therapy , 2012, Journal of cellular and molecular medicine.

[13]  G. Gores,et al.  Hepatocellular carcinoma , 2016, Nature Reviews Disease Primers.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Heinz Handels,et al.  Unsupervised pathology detection in medical images using conditional variational autoencoders , 2018, International Journal of Computer Assisted Radiology and Surgery.

[16]  Inji Park,et al.  EphB/ephrinB Signaling in Cell Adhesion and Migration , 2014, Molecules and cells.

[17]  Andrew H. Beck,et al.  Region of Interest Identification and Diagnostic Agreement in Breast Pathology , 2016, Modern Pathology.

[18]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[19]  Y. Hoshida,et al.  Molecular classification of hepatocellular carcinoma: potential therapeutic implications. , 2015, Hepatic oncology.

[20]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[21]  Roderick Murray-Smith,et al.  Pathology GAN: Learning deep representations of cancer tissue , 2019, MIDL.

[22]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[23]  Balaji Krishnapuram,et al.  On Ranking in Survival Analysis: Bounds on the Concordance Index , 2007, NIPS.

[24]  Richard K. G. Do,et al.  Convolutional neural networks: an overview and application in radiology , 2018, Insights into Imaging.

[25]  E. Casanova,et al.  Impairment of Hepatic Growth Hormone and Glucocorticoid Receptor Signaling Causes Steatosis and Hepatocellular Carcinoma in Mice , 2011, Hepatology.

[26]  Hamid R. Tizhoosh,et al.  Comparing LBP, HOG and Deep Features for Classification of Histopathology Images , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[27]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[28]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[29]  Catarina Eloy,et al.  Classification of breast cancer histology images using Convolutional Neural Networks , 2017, PloS one.

[30]  Alexander Rakhlin,et al.  Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis , 2018, bioRxiv.

[31]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[32]  B. Arteta,et al.  Role of liver ICAM-1 in metastasis. , 2017, Oncology letters.

[33]  Steven J. M. Jones,et al.  Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma , 2017, Cell.

[34]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[36]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[37]  Ruixin Li,et al.  The role of EPH receptors in cancer-related epithelial-mesenchymal transition , 2014, Chinese journal of cancer.

[38]  Nico Karssemeijer,et al.  Stain Specific Standardization of Whole-Slide Histopathological Images , 2016, IEEE Transactions on Medical Imaging.

[39]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[40]  Hidenori Ojima,et al.  High-resolution characterization of a hepatocellular carcinoma genome , 2011, Nature Genetics.

[41]  A. Rastogi Changing role of histopathology in the diagnosis and management of hepatocellular carcinoma , 2018, World journal of gastroenterology.

[42]  M. Kojiro,et al.  Histopathology of liver cancers. , 2005, Best practice & research. Clinical gastroenterology.

[43]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[44]  M. Pencina,et al.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation , 2004, Statistics in medicine.

[45]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[46]  Heung-Il Suk,et al.  Deep Learning in Medical Image Analysis. , 2017, Annual review of biomedical engineering.

[47]  J. S. Marron,et al.  A method for normalizing histology slides for quantitative analysis , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[48]  Qianjin Feng,et al.  Integrative Analysis of Histopathological Images and Genomic Data Predicts Clear Cell Renal Cell Carcinoma Prognosis. , 2017, Cancer research.

[49]  S. Imbeaud,et al.  Integrated analysis of somatic mutations and focal copy-number changes identifies key genes and pathways in hepatocellular carcinoma , 2012, Nature Genetics.

[50]  D. Waugh,et al.  The Interleukin-8 Pathway in Cancer , 2008, Clinical Cancer Research.

[51]  L. Terracciano,et al.  Histopathology of hepatocellular carcinoma. , 2014, World journal of gastroenterology.

[52]  Bing Lang,et al.  Deep Learning for Histopathological Image Analysis: Towards Computerized Diagnosis on Cancers , 2017, Deep Learning and Convolutional Neural Networks for Medical Image Computing.

[53]  Jonathan Schug,et al.  Glucocorticoid Receptor-Dependent Gene Regulatory Networks , 2005, PLoS genetics.

[54]  Kumardeep Chaudhary,et al.  Deep Learning–Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer , 2017, Clinical Cancer Research.

[55]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[56]  S. Imbeaud,et al.  Histological subtypes of hepatocellular carcinoma are related to gene mutations and molecular tumour classification. , 2017, Journal of hepatology.

[57]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[58]  Hoo-Chang Hoo-Chang Shin Shin,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, Ieee Transactions on Medical Imaging.

[59]  Andrew H. Beck,et al.  Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer , 2017, JAMA.

[60]  Kumardeep Chaudhary,et al.  Multimodal Meta-Analysis of 1,494 Hepatocellular Carcinoma Samples Reveals Significant Impact of Consensus Driver Genes on Phenotypes , 2018, Clinical Cancer Research.

[61]  Daisuke Komura,et al.  Machine Learning Methods for Histopathological Image Analysis , 2017, Computational and structural biotechnology journal.

[62]  Saiful Islam,et al.  Cancer diagnosis in histopathological image: CNN based approach , 2019, Informatics in Medicine Unlocked.