Segment Anything Model (SAM) for Digital Pathology: Assess Zero-shot Segmentation on Whole Slide Imaging

The segment anything model (SAM) was released as a foundation model for image segmentation. The promptable segmentation model was trained by over 1 billion masks on 11M licensed and privacy-respecting images. The model supports zero-shot image segmentation with various segmentation prompts (e.g., points, boxes, masks). It makes the SAM attractive for medical image analysis, especially for digital pathology where the training data are rare. In this study, we evaluate the zero-shot segmentation performance of SAM model on representative segmentation tasks on whole slide imaging (WSI), including (1) tumor segmentation, (2) non-tumor tissue segmentation, (3) cell nuclei segmentation. Core Results: The results suggest that the zero-shot SAM model achieves remarkable segmentation performance for large connected objects. However, it does not consistently achieve satisfying performance for dense instance object segmentation, even with 20 prompts (clicks/boxes) on each image. We also summarized the identified limitations for digital pathology: (1) image resolution, (2) multiple scales, (3) prompt selection, and (4) model fine-tuning. In the future, the few-shot fine-tuning with images from downstream pathological segmentation tasks might help the model to achieve better performance in dense object segmentation.

[1]  QUAN LIU,et al.  Omni-Seg: A Scale-Aware Dynamic Network for Renal Pathological Image Segmentation , 2022, IEEE Transactions on Biomedical Engineering.

[2]  Michael S. Bernstein,et al.  On the Opportunities and Risks of Foundation Models , 2021, ArXiv.

[3]  Yuankai Huo,et al.  SimTriplet: Simple Triplet Representation Learning with a Single GPU , 2021, MICCAI.

[4]  Ilya Sutskever,et al.  Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.

[5]  Alec Radford,et al.  Zero-Shot Text-to-Image Generation , 2021, ICML.

[6]  Xing Li,et al.  Beds: Bagging Ensemble Deep Segmentation For Nucleus Segmentation With Testing Stage Stain Augmentation , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[7]  Quoc V. Le,et al.  Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision , 2021, ICML.

[8]  QUAN LIU,et al.  AI Applications in Renal Pathology. , 2021, Kidney international.

[9]  Catherine P. Jayapandian,et al.  Development and evaluation of deep learning–based segmentation of histologic structures in the kidney cortex with multiple histologic stains , 2020, Kidney international.

[10]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[11]  Hao Chen,et al.  A Multi-Organ Nucleus Segmentation Challenge , 2020, IEEE Transactions on Medical Imaging.

[12]  Chunyan Miao,et al.  A Survey of Zero-Shot Learning , 2019, ACM Trans. Intell. Syst. Technol..

[13]  John Tomaszewski,et al.  Digital pathology evaluation in the multicenter Nephrotic Syndrome Study Network (NEPTUNE). , 2013, Clinical journal of the American Society of Nephrology : CJASN.