A Dataset and a Technique for Generalized Nuclear Segmentation for Computational Pathology

Nuclear segmentation in digital microscopic tissue images can enable extraction of high-quality features for nuclear morphometrics and other analysis in computational pathology. Conventional image processing techniques, such as Otsu thresholding and watershed segmentation, do not work effectively on challenging cases, such as chromatin-sparse and crowded nuclei. In contrast, machine learning-based segmentation can generalize across various nuclear appearances. However, training machine learning algorithms requires data sets of images, in which a vast number of nuclei have been annotated. Publicly accessible and annotated data sets, along with widely agreed upon metrics to compare techniques, have catalyzed tremendous innovation and progress on other image classification problems, particularly in object recognition. Inspired by their success, we introduce a large publicly accessible data set of hematoxylin and eosin (H&E)-stained tissue images with more than 21000 painstakingly annotated nuclear boundaries, whose quality was validated by a medical doctor. Because our data set is taken from multiple hospitals and includes a diversity of nuclear appearances from several patients, disease states, and organs, techniques trained on it are likely to generalize well and work right out-of-the-box on other H&E-stained images. We also propose a new metric to evaluate nuclear segmentation results that penalizes object- and pixel-level errors in a unified manner, unlike previous metrics that penalize only one type of error. We also propose a segmentation technique based on deep learning that lays a special emphasis on identifying the nuclear boundaries, including those between the touching or overlapping nuclei, and works well on a diverse set of test images.

[1]  Andrew H. Beck,et al.  Systematic Analysis of Breast Cancer Morphology Uncovers Stromal Features Associated with Survival , 2011, Science Translational Medicine.

[2]  Samy Bengio,et al.  Torch: a modular machine learning software library , 2002 .

[3]  Andrew H. Beck,et al.  Crowdsourcing image annotation for nucleus detection and segmentation in computational pathology: evaluating experts, automated methods, and the crowd. , 2014, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[4]  Bai Ying Lei,et al.  Accurate Segmentation of Cervical Cytoplasm and Nuclei Based on Multiscale Convolutional Network and Graph Partitioning , 2015, IEEE Transactions on Biomedical Engineering.

[5]  Nasir M. Rajpoot,et al.  Locality Sensitive Deep Learning for Detection and Classification of Nuclei in Routine Colon Cancer Histology Images , 2016, IEEE Trans. Medical Imaging.

[6]  A. Ruifrok,et al.  Quantification of histochemical staining by color deconvolution. , 2001, Analytical and quantitative cytology and histology.

[7]  Alexis B. Carter,et al.  Computational Pathology: A Path Ahead. , 2016, Archives of pathology & laboratory medicine.

[8]  Andrew H. Beck,et al.  Computational Pathology to Discriminate Benign from Malignant Intraductal Proliferations of the Breast , 2014, PloS one.

[9]  D. M. Titterington,et al.  t -Tests, F -Tests and Otsu's Methods for Image Thresholding , 2011, IEEE Trans. Image Process..

[10]  Christophoros Nikou,et al.  Overlapping Cell Nuclei Segmentation Using a Spatially Adaptive Active Physical Model , 2012, IEEE Transactions on Image Processing.

[11]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[12]  Xiaobo Zhou,et al.  Nuclei Segmentation Using Marker-Controlled Watershed, Tracking Using Mean-Shift, and Kalman Filter in Time-Lapse Microscopy , 2006, IEEE Transactions on Circuits and Systems I: Regular Papers.

[13]  Lin Yang,et al.  An Automatic Learning-Based Framework for Robust Nucleus Segmentation , 2016, IEEE Transactions on Medical Imaging.

[14]  Anne E Carpenter,et al.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes , 2006, Genome Biology.

[15]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[16]  Elli Angelopoulou,et al.  Retinal vessel segmentation by improved matched filtering: evaluation on a new high-resolution fundus image database , 2013, IET Image Process..

[17]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[18]  Nassir Navab,et al.  Structure-Preserving Color Normalization and Sparse Stain Separation for Histological Images , 2016, IEEE Transactions on Medical Imaging.

[19]  B. S. Manjunath,et al.  A biosegmentation benchmark for evaluation of bioimage analysis methods , 2009, BMC Bioinformatics.

[20]  Anant Madabhushi,et al.  An Integrated Region-, Boundary-, Shape-Based Active Contour for Multiple Object Overlap Resolution in Histological Imagery , 2012, IEEE Transactions on Medical Imaging.

[21]  Roman Monczak,et al.  Computer-Aided Breast Cancer Diagnosis Based on the Analysis of Cytological Images of Fine Needle Biopsies , 2013, IEEE Transactions on Medical Imaging.

[22]  H Llewellyn,et al.  Observer variation, dysplasia grading, and HPV typing: a review. , 2000, American journal of clinical pathology.

[23]  Neeraj Kumar,et al.  Empirical comparison of color normalization methods for epithelial-stromal classification in H and E images , 2016, Journal of pathology informatics.

[24]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[25]  Lin Yang,et al.  Robust Nucleus/Cell Detection and Segmentation in Digital Pathology and Microscopy Images: A Comprehensive Review , 2016, IEEE Reviews in Biomedical Engineering.

[26]  Andrew H. Beck,et al.  Abstract LB-285: Computational pathology for predicting prostate cancer recurrence , 2015 .

[27]  Min Zhang,et al.  Small Blob Identification in Medical Images Using Regional Features From Optimum Scale , 2015, IEEE Transactions on Biomedical Engineering.

[28]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Anant Madabhushi,et al.  Automated gland and nuclei segmentation for grading of prostate and breast cancer histopathology , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[30]  Kumar Neeraj,et al.  Detecting multiple sub-types of breast cancer in a single patient , 2016 .

[31]  Andrew Janowczyk,et al.  Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases , 2016, Journal of pathology informatics.

[32]  Bahram Parvin,et al.  Invariant Delineation of Nuclear Architecture in Glioblastoma Multiforme for Clinical and Molecular Association , 2013, IEEE Transactions on Medical Imaging.

[33]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[34]  Daniel Heim,et al.  Detection and Segmentation of Cell Nuclei in Virtual Microscopy Images: A Minimum-Model Approach , 2012, Scientific Reports.

[35]  T. Rebbeck,et al.  Co-Occurring Gland Angularity in Localized Subgraphs: Predicting Biochemical Recurrence in Intermediate-Risk Prostate Cancer Patients , 2014, PloS one.

[36]  Neeraj Kumar,et al.  Learning based super-resolution of histological images , 2016, 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI).

[37]  Yousef Al-Kofahi,et al.  Improved Automatic Detection and Segmentation of Cell Nuclei in Histopathology Images , 2010, IEEE Transactions on Biomedical Engineering.

[38]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[39]  Johannes E. Schindelin,et al.  Fiji: an open-source platform for biological-image analysis , 2012, Nature Methods.

[40]  Jagath C. Rajapakse,et al.  Segmentation of Clustered Nuclei With Shape Markers and Marking Function , 2009, IEEE Transactions on Biomedical Engineering.

[41]  Hao Chen,et al.  Gland segmentation in colon histology images: The glas challenge contest , 2016, Medical Image Anal..

[42]  Stephen J. McKenna,et al.  Local structure prediction for gland segmentation , 2016, 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI).

[43]  Hui Kong,et al.  Partitioning Histopathological Images: An Integrated Framework for Supervised Color-Texture Segmentation and Cell Splitting , 2011, IEEE Transactions on Medical Imaging.

[44]  A. Huisman,et al.  Automatic Nuclei Segmentation in H&E Stained Breast Cancer Histopathology Images , 2013, PloS one.

[45]  Amit Sethi,et al.  Towards generalized nuclear segmentation in histological images , 2013, 13th IEEE International Conference on BioInformatics and BioEngineering.

[46]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[47]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[48]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[49]  H. Irshad,et al.  Methods for Nuclei Detection, Segmentation, and Classification in Digital Histopathology: A Review—Current Status and Future Potential , 2014, IEEE Reviews in Biomedical Engineering.