A Cervical Histopathology Dataset for Computer Aided Diagnosis of Precancerous Lesions

Cervical cancer, as one of the most frequently diagnosed cancers worldwide, is curable when detected early. Histopathology images play an important role in precision medicine of the cervical lesions. However, few computer aided algorithms have been explored on cervical histopathology images due to the lack of public datasets. In this article, we release a new cervical histopathology image dataset for automated precancerous diagnosis. Specifically, 100 slides from 71 patients are annotated by three independent pathologists. To show the difficulty of the task, benchmarks are obtained through both fully and weakly supervised learning. Extensive experiments based on typical classification and semantic segmentation networks are carried out to provide strong baselines. In particular, a strategy of assembling classification, segmentation, and pseudo-labeling is proposed to further improve the performance. The Dice coefficient reaches 0.7833, indicating the feasibility of computer aided diagnosis and the effectiveness of our weakly supervised ensemble algorithm. The dataset and evaluation codes are publicly available. To the best of our knowledge, it is the first public cervical histopathology dataset for automated precancerous segmentation. We believe that this work will attract researchers to explore novel algorithms on cervical automated diagnosis, thereby assisting doctors and patients clinically.

[1]  Kaili Cao,et al.  An Improved Res-UNet Model for Tree Species Classification Using Airborne High-Resolution Images , 2020, Remote. Sens..

[2]  Ganapathy Krishnamurthi,et al.  A generalized deep learning framework for whole-slide image segmentation and analysis , 2020, Scientific Reports.

[3]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[4]  Ying Wang,et al.  Pathological Image Classification Based on Hard Example Guided CNN , 2020, IEEE Access.

[5]  Tatsuya Harada,et al.  Multi-Stage Pathological Image Classification Using Semantic Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[6]  Hong Liu,et al.  A Fast and Refined Cancer Regions Segmentation Framework in Whole-slide Breast Pathological Images , 2019, Scientific Reports.

[7]  Andrew H. Beck,et al.  Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer , 2017, JAMA.

[8]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Luiz Eduardo Soares de Oliveira,et al.  A Dataset for Breast Cancer Histopathological Image Classification , 2016, IEEE Transactions on Biomedical Engineering.

[10]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[11]  Kai Ma,et al.  Self-Loop Uncertainty: A Novel Pseudo-Label for Semi-Supervised Medical Image Segmentation , 2020, MICCAI.

[12]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[13]  Yanling Liu,et al.  PAIP 2019: Liver cancer segmentation challenge , 2020, Medical Image Anal..

[14]  Max Welling,et al.  Rotation Equivariant CNNs for Digital Pathology , 2018, MICCAI.

[15]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Maciej Pajak,et al.  Teacher-Student chain for efficient semi-supervised histology image classification , 2020, ArXiv.

[17]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[20]  Tyng-Luh Liu,et al.  Self-similarity Student for Partial Label Histopathology Image Segmentation , 2020, ECCV.

[21]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  David C Wilbur,et al.  The Lower Anogenital Squamous Terminology Standardization Project for HPV-Associated Lesions: background and consensus recommendations from the College of American Pathologists and the American Society for Colposcopy and Cervical Pathology. , 2012, Journal of lower genital tract disease.

[23]  Gustavo Carneiro,et al.  An Improved Joint Optimization of Multiple Level Set Functions for the Segmentation of Overlapping Cervical Cells , 2015, IEEE Transactions on Image Processing.

[24]  Catarina Eloy,et al.  BACH: Grand Challenge on Breast Cancer Histology Images , 2018, Medical Image Anal..

[25]  R. Joe Stanley,et al.  Nuclei-Based Features for Uterine Cervical Cancer Histology Image Analysis With Fusion-Based Classification , 2016, IEEE Journal of Biomedical and Health Informatics.

[26]  Xin-Ping Guan,et al.  Adversarial neural networks for basal membrane segmentation of microinvasive cervix carcinoma in histopathology images , 2017, 2017 International Conference on Machine Learning and Cybernetics (ICMLC).

[27]  Dong-Hyun Lee,et al.  Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[28]  Brian Kenji Iwana,et al.  Negative Pseudo Labeling using Class Proportion for Semantic Segmentation in Pathology , 2020, ECCV.

[29]  Alexander T. Pearson,et al.  Clinical-grade Detection of Microsatellite Instability in Colorectal Tumors by Deep Learning. , 2020, Gastroenterology.

[30]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Anne L. Martel,et al.  Automatic cellularity assessment from post‐treated breast surgical specimens , 2017, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[32]  R. Joe Stanley,et al.  A fusion-based approach for uterine cervical cancer histology image classification , 2013, Comput. Medical Imaging Graph..

[33]  Aleksey Boyko,et al.  Detecting Cancer Metastases on Gigapixel Pathology Images , 2017, ArXiv.

[34]  Xiangyu Zhang,et al.  ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[35]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Alexander W. Jung,et al.  Pan-cancer computational histopathology reveals mutations, tumor composition and prognosis , 2019, Nature Cancer.

[37]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[38]  Zhicheng Zhao,et al.  Adaptive Elastic Loss Based on Progressive Inter-Class Association for Cervical Histology Image Segmentation , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[39]  Jeroen van der Laak,et al.  HookNet: multi-resolution convolutional neural networks for semantic segmentation in histopathology whole-slide images , 2020, Medical Image Anal..

[40]  Zhiqiang Hu,et al.  Signet Ring Cell Detection with a Semi-supervised Learning Framework , 2019, IPMI.

[41]  Ashish Sharma,et al.  Utilizing Automated Breast Cancer Detection to Identify Spatial Distributions of Tumor Infiltrating Lymphocytes in Invasive Breast Cancer , 2019, The American journal of pathology.

[42]  R. Richart,et al.  A follow-up study of patients with cervical dysplasia. , 1969, American journal of obstetrics and gynecology.

[43]  R. Joe Stanley,et al.  A Hybrid Deep Learning and Handcrafted Feature Approach for Cervical Cancer Digital Histology Image Classification , 2019, Int. J. Heal. Inf. Syst. Informatics.

[44]  Georgios Dounias,et al.  Pap-smear Benchmark Data For Pattern Classification , 2005 .

[45]  Ghassan Hamarneh,et al.  Evaluation of Three Algorithms for the Segmentation of Overlapping Cervical Cells , 2017, IEEE Journal of Biomedical and Health Informatics.

[46]  Hao Chen,et al.  ScanNet: A Fast and Dense Scanning Framework for Metastastic Breast Cancer Detection from Whole-Slide Image , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[47]  A. Jemal,et al.  Cancer statistics, 2020 , 2020, CA: a cancer journal for clinicians.

[48]  Fei Su,et al.  ENS-Unet: End-to-End Noise Suppression U-Net for Brain Tumor Segmentation , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).