Learnable Ophthalmology SAM

Segmentation is vital for ophthalmology image analysis. However, the variety of imaging modalities in ophthalmology hinders the application of most existing segmentation algorithms, which either require a large number of labels for training or generalize poorly. Building on the Segment Anything Model (SAM), we propose a simple but effective learnable prompt layer suited to multi-target segmentation in multi-modal ophthalmology images, named Learnable Ophthalmology SAM. The learnable prompt layer learns medical prior knowledge from each transformer layer. During training, we train only the prompt layer and the task head, using a one-shot mechanism. We demonstrate the effectiveness of this idea on four medical segmentation tasks across nine publicly available datasets. Moreover, this work offers only a new line of thought for applying existing foundation CV models in the medical field. Our code is available at \href{https://github.com/Qsingle/LearnablePromptSAM}{website}.
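
The training scheme described above (frozen transformer layers, a learnable prompt layer at each of them, and a trainable task head) can be illustrated with a minimal PyTorch sketch. It is not the released implementation: the abstract does not specify the internals of the prompt layer, so the residual bottleneck MLP used here is an assumption, a generic `nn.TransformerEncoderLayer` stands in for SAM's image encoder, and the names `PromptLayer`, `PromptedSegmenter`, and `train_step` are hypothetical.

```python
import torch
import torch.nn as nn


class PromptLayer(nn.Module):
    """Hypothetical learnable prompt: a residual bottleneck MLP over tokens."""

    def __init__(self, dim: int, hidden: int = 16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.net(x)


class PromptedSegmenter(nn.Module):
    """Frozen transformer blocks, one trainable prompt per block, trainable head."""

    def __init__(self, dim: int = 32, depth: int = 4, heads: int = 4,
                 num_classes: int = 2):
        super().__init__()
        # Stand-in for a pretrained encoder (assumption: not SAM's real encoder).
        self.blocks = nn.ModuleList([
            nn.TransformerEncoderLayer(dim, heads, dim_feedforward=4 * dim,
                                       dropout=0.0, batch_first=True)
            for _ in range(depth)
        ])
        self.prompts = nn.ModuleList([PromptLayer(dim) for _ in range(depth)])
        self.head = nn.Linear(dim, num_classes)  # per-token task head
        # Freeze the backbone; only prompts and head keep requires_grad=True.
        for p in self.blocks.parameters():
            p.requires_grad = False

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        for prompt, block in zip(self.prompts, self.blocks):
            tokens = block(prompt(tokens))
        return self.head(tokens)  # (batch, tokens, num_classes)


def train_step(model: nn.Module, optimizer: torch.optim.Optimizer,
               tokens: torch.Tensor, labels: torch.Tensor) -> float:
    """One optimisation step on a single labelled sample (one-shot setting)."""
    logits = model(tokens)
    loss = nn.functional.cross_entropy(logits.flatten(0, 1), labels.flatten())
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


if __name__ == "__main__":
    torch.manual_seed(0)
    model = PromptedSegmenter()
    trainable = [p for p in model.parameters() if p.requires_grad]
    optimizer = torch.optim.AdamW(trainable, lr=1e-3)
    tokens = torch.randn(1, 16, 32)         # one image as 16 patch tokens
    labels = torch.randint(0, 2, (1, 16))   # one label per token
    print(train_step(model, optimizer, tokens, labels))
```

Passing only the parameters with `requires_grad=True` to the optimizer keeps the backbone weights untouched, which is the property the abstract relies on when it trains only the prompt layer and task head.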
