A Methodology for Texture Feature-based Quality Assessment in Nucleus Segmentation of Histopathology Image

Context: Image segmentation pipelines often are sensitive to algorithm input parameters. Algorithm parameters optimized for a set of images do not necessarily produce good-quality-segmentation results for other images. Even within an image, some regions may not be well segmented due to a number of factors, including multiple pieces of tissue with distinct characteristics, differences in staining of the tissue, normal versus tumor regions, and tumor heterogeneity. Evaluation of quality of segmentation results is an important step in image analysis. It is very labor intensive to do quality assessment manually with large image datasets because a whole-slide tissue image may have hundreds of thousands of nuclei. Semi-automatic mechanisms are needed to assist researchers and application developers to detect image regions with bad segmentations efficiently. Aims: Our goal is to develop and evaluate a machine-learning-based semi-automated workflow to assess quality of nucleus segmentation results in a large set of whole-slide tissue images. Methods: We propose a quality control methodology, in which machine-learning algorithms are trained with image intensity and texture features to produce a classification model. This model is applied to image patches in a whole-slide tissue image to predict the quality of nucleus segmentation in each patch. The training step of our methodology involves the selection and labeling of regions by a pathologist in a set of images to create the training dataset. The image regions are partitioned into patches. A set of intensity and texture features is computed for each patch. A classifier is trained with the features and the labels assigned by the pathologist. At the end of this process, a classification model is generated. The classification step applies the classification model to unlabeled test images. Each test image is partitioned into patches. The classification model is applied to each patch to predict the patch's label. Results: The proposed methodology has been evaluated by assessing the segmentation quality of a segmentation method applied to images from two cancer types in The Cancer Genome Atlas; WHO Grade II lower grade glioma (LGG) and lung adenocarcinoma (LUAD). The results show that our method performs well in predicting patches with good-quality segmentations and achieves F1 scores 84.7% for LGG and 75.43% for LUAD. Conclusions: As image scanning technologies advance, large volumes of whole-slide tissue images will be available for research and clinical use. Efficient approaches for the assessment of quality and robustness of output from computerized image analysis workflows will become increasingly critical to extracting useful quantitative information from tissue images. Our work demonstrates the feasibility of machine-learning-based semi-automated techniques to assist researchers and algorithm developers in this process.

[1]  Brian C. Lovell,et al.  A methodology for quality control in cell nucleus segmentation , 1999 .

[2]  George Lee,et al.  Evaluating stability of histomorphometric features across scanner and staining variations: predicting biochemical recurrence from prostate cancer whole slide images , 2016, SPIE Medical Imaging.

[3]  Lin Yang,et al.  Robust Nucleus/Cell Detection and Segmentation in Digital Pathology and Microscopy Images: A Comprehensive Review , 2016, IEEE Reviews in Biomedical Engineering.

[4]  Shaoting Zhang,et al.  Toward consistent cell segmentation: quality assessment of cell segments via appearance and geometry features , 2015, Medical Imaging.

[5]  Edward R. Dougherty,et al.  Effect of separate sampling on classification accuracy , 2014, Bioinform..

[6]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[7]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[8]  Lin Yang,et al.  PathMiner: A Web-Based Tool for Computer-Assisted Diagnostics in Pathology , 2009, IEEE Transactions on Information Technology in Biomedicine.

[9]  Jun Kong,et al.  Computerized Pathological Image Analysis For Neuroblastoma Prognosis , 2007, AMIA.

[10]  Prabhakar R. Gudla,et al.  Automatic segmentation and supervised learning‐based selection of nuclei in cancer tissue images , 2012, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[11]  H. Irshad,et al.  Methods for Nuclei Detection, Segmentation, and Classification in Digital Histopathology: A Review—Current Status and Future Potential , 2014, IEEE Reviews in Biomedical Engineering.

[12]  Joel H. Saltz,et al.  Hierarchical nucleus segmentation in digital pathology images , 2016, SPIE Medical Imaging.

[13]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Tony Pan,et al.  ImageMiner: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology , 2011, J. Am. Medical Informatics Assoc..

[16]  A. Madabhushi,et al.  Automated Tubule Nuclei Quantification and Correlation with Oncotype DX risk categories in ER+ Breast Cancer Whole Slide Images , 2016, Scientific reports.

[17]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[18]  Joel H. Saltz,et al.  Histopathological Image Analysis Using Model-Based Intermediate Representations and Color Texture: Follicular Lymphoma Grading , 2009, J. Signal Process. Syst..

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Hui Zhang,et al.  Image segmentation evaluation: A survey of unsupervised methods , 2008, Comput. Vis. Image Underst..

[21]  Matti Pietikäinen,et al.  Automated classification of breast cancer morphology in histopathological images , 2013, Diagnostic Pathology.

[22]  Prabhakar R. Gudla,et al.  Ranked retrieval of segmented nuclei for objective assessment of cancer gene repositioning , 2012, BMC Bioinformatics.

[23]  Jiong Wu,et al.  Molecular subtype can predict the response and outcome of Chinese locally advanced breast cancer patients treated with preoperative therapy. , 2010, Oncology reports.

[24]  Xin Qi,et al.  Computer aided analysis of prostate histopathology images to support a refined Gleason grading system , 2017, Medical Imaging.

[25]  Olcay Sertel,et al.  Computer-assisted grading of neuroblastic differentiation. , 2008, Archives of pathology & laboratory medicine.

[26]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.