Simple non-iterative clustering and CNNs for coarse segmentation of breast cancer whole-slide images

Breast cancer is the dominant cancer among women as it accounts for about one-quarter of all cancer cases in females. The digitized images of Hematoxylin and Eosin (H&E) stained slides of breast cancer specimens carry valuable diagnostic information. However, inspecting these slides manually is a non-trivial task prone to subjective interpretation. Digital pathology (DP) and artificial intelligence (AI) open an opportunity for objective interpretation of the image data. It is challenging to automate the segmentation process in the whole slide images due to the visual complexity of tissue appearance without the need for tedious and time-consuming fine annotations. Many algorithms classify the tissue regions into different types instead of segmenting them, as the classification algorithms require coarse annotations that are easier to acquire. In this paper, we propose a new segmentation framework that combines the simple non-iterative clustering algorithm with a standard convolutional neural network (CNN) classifier to segment whole slide images into different tissue types. In addition, a graph-based post-processing step is applied to improve the framework segmentation performance further. The results show promising improvement to the CNN classifier based coarse segmentation, which would give better feasibility to quantify and study tissues’ mutual relationships.

[1]  Lisheng Xu,et al.  Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset , 2019, BioMedical Engineering OnLine.

[2]  Yao Lu,et al.  RIC-Unet: An Improved Neural Network Based on Unet for Nuclei Segmentation in Histology Images , 2019, IEEE Access.

[3]  J. Makki,et al.  Diversity of Breast Carcinoma: Histological Subtypes and Clinical Relevance , 2015, Clinical medicine insights. Pathology.

[4]  David B. A. Epstein,et al.  Fast and Accurate Tumor Segmentation of Histology Images using Persistent Homology and Deep Convolutional Features , 2018, Medical Image Anal..

[5]  David B. A. Epstein,et al.  Micro‐Net: A unified model for segmentation of various objects in microscopy images , 2018, Medical Image Anal..

[6]  Jin Tae Kwak,et al.  Hover-Net: Simultaneous segmentation and classification of nuclei in multi-tissue histology images , 2018, Medical Image Anal..

[7]  N. Rajpoot,et al.  HyMaP: A hybrid magnitude-phase approach to unsupervised segmentation of tumor areas in breast cancer histology images , 2013, Journal of pathology informatics.

[8]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[9]  Karl Pearson F.R.S. X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling , 2009 .

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Gökhan Bilgin,et al.  A Hybrid Method of Superpixel Segmentation Algorithm and Deep Learning Method in Histopathological Image Segmentation , 2018, 2018 Innovations in Intelligent Systems and Applications (INISTA).

[12]  Lin Chen,et al.  Fully Convolutional Neural Networks for Tissue Histopathology Image Classification and Segmentation , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[13]  Nasir M. Rajpoot,et al.  A Stochastic Polygons Model for Glandular Structures in Colon Histology Images , 2015, IEEE Transactions on Medical Imaging.

[14]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[15]  Francesco Bianconi,et al.  Multi-class texture analysis in colorectal cancer histology , 2016, Scientific Reports.

[16]  Mita Nasipuri,et al.  Patch-based system for Classification of Breast Histology images using deep learning , 2019, Comput. Medical Imaging Graph..

[17]  Hao Chen,et al.  MILD‐Net: Minimal information loss dilated network for gland instance segmentation in colon histology images , 2018, Medical Image Anal..

[18]  Sabine Süsstrunk,et al.  Superpixels and Polygons Using Simple Non-iterative Clustering , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .