论文信息 - Multi-class motion-based semantic segmentation for ureteroscopy and laser lithotripsy

Multi-class motion-based semantic segmentation for ureteroscopy and laser lithotripsy

Kidney stones represent a considerable burden for public health-care systems, with the total health-care expenditure for kidney stones exceeding US $ 2 billion annually in the USA alone. Ureteroscopy with laser lithotripsy has evolved as the most commonly used technique for the treatment of kidney stones. Automated segmentation of kidney stones and laser fiber is an important initial step to performing any automated quantitative analysis of the stones, particularly stone-size estimation, that can be used by the surgeon to decide if the stone requires further fragmentation. Factors such as turbid fluid inside the cavity, specularities, motion blur due to kidney movements and camera motion, bleeding, and stone debris impact the quality of vision within the kidney and lead to extended operative times. To the best of our knowledge, this is the first attempt made towards multi-class segmentation in ureteroscopy and laser lithotripsy data. We propose an end-to-end convolution neural network (CNN) based learning framework for the segmentation of stones and laser fiber. The proposed approach utilizes two sub-networks: I) HybResUNet, a hybrid version of residual U-Net, that uses residual connections in the encoder path of U-Net to improve semantic predictions, and II) a DVFNet that generates deformation vector field (DVF) predictions by leveraging motion differences between the adjacent video frames which is then used to prune the prediction maps. We also present ablation studies that combine different dilated convolutions, recurrent and residual connections, atrous spatial pyramid pooling and attention gate model. Further, we propose a compound loss function that significantly boosts the segmentation performance in our data. We have also provided an ablation study to determine the optimal data augmentation strategy for our dataset. Our qualitative and quantitative results illustrate that our proposed method outperforms state-of-the-art methods such as UNet and DeepLabv3+ showing an improvement of 5.2% and 15.93%, respectively, for the combined mean of DSC and JI in our in vivo test dataset. We also show that our proposed model generalizes better on a new clinical dataset showing a mean improvement of 25.4%, 20%, and 11% over UNet, HybResUNet, and DeepLabv3+, respectively, for the same metric.

[1] Max A. Viergever,et al. A deep learning framework for unsupervised affine and deformable image registration , 2018, Medical Image Anal..

[2] Kazuhiko Hamamoto,et al. An image preprocessing method for kidney stone segmentation in CT scan images , 2018, 2018 International Conference on Computer Engineering, Network and Intelligent Multimedia (CENIM).

[3] Klaus H. Maier-Hein,et al. OR-UNet: an Optimized Robust Residual U-Net for Instrument Segmentation in Endoscopic Images , 2020, ArXiv.

[4] J. Lingeman,et al. Management of kidney stones , 2007, BMJ : British Medical Journal.

[5] Jiaming Liu,et al. Accuracy Improvement of UNet Based on Dilated Convolution , 2019 .

[6] Jun Zhang,et al. Inverse-Consistent Deep Networks for Unsupervised Deformable Image Registration , 2018, ArXiv.

[7] Loïc Le Folgoc,et al. Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.

[8] Dwarikanath Mahapatra,et al. Joint Registration And Segmentation Of Xray Images Using Generative Adversarial Networks , 2018, MLMI@MICCAI.

[9] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Vijayan K. Asari,et al. Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation , 2018, ArXiv.

[11] Sharib Ali,et al. Conv2Warp: An unsupervised deformable image registration with continuous convolution and warping , 2019, MLMI@MICCAI.

[12] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[13] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Abhishek Dutta,et al. The VIA Annotation Software for Images, Audio and Video , 2019, ACM Multimedia.

[15] Marc Niethammer,et al. Quicksilver: Fast predictive image registration – A deep learning approach , 2017, NeuroImage.

[16] P. Thangaraj,et al. Segmentation of Calculi from Ultrasound Kidney Images by Region Indicator with Contour Segmentation Method , 2011 .

[17] Dr. P. R. Tamiselvi. Segmentation of Renal Calculi Using Squared Euclidean Distance Method , 2013 .

[18] Jérôme Szewczyk,et al. An algorithm for calculi segmentation on ureteroscopic images , 2011, International Journal of Computer Assisted Radiology and Surgery.

[19] Daniel Rueckert,et al. Joint Learning of Motion Estimation and Segmentation for Cardiac MR Image Sequences , 2018, MICCAI.

[20] Shuai Wang,et al. Multi-Scale Context-Guided Deep Network for Automated Lesion Segmentation With Endoscopy Images of Gastrointestinal Tract , 2020, IEEE Journal of Biomedical and Health Informatics.

[21] Amy Loutfi,et al. Computer aided detection of ureteral stones in thin slice computed tomography volumes using Convolutional Neural Networks , 2018, Comput. Biol. Medicine.

[22] Hamid Soltanian-Zadeh,et al. Segmentation of Small Bowel Tumors in Wireless Capsule Endoscopy Using Level Set Method , 2014, 2014 IEEE 27th International Symposium on Computer-Based Medical Systems.

[23] Sharib Ali,et al. Real-Time Polyp Detection, Localisation and Segmentation in Colonoscopy Using Deep Learning , 2020, ArXiv.

[24] P. Thangaraj,et al. A Modified Watershed Segmentation Method to Segment Renal Calculi in Ultrasound Kidney Images , 2012, Int. J. Intell. Inf. Technol..

[25] P. Reddy,et al. Ureteroscopy: The standard of care in the management of upper tract urolithiasis in children , 2010, Indian journal of urology : IJU : journal of the Urological Society of India.

[26] Keisuke Nemoto,et al. Effective Use of Dilated Convolutions for Segmenting Small Object Instances in Remote Sensing Imagery , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[27] W. Roberts,et al. Holmium Laser Lithotripsy in the New Stone Age: Dust or Bust? , 2017, Front. Surg..

[28] B. Petros,et al. Kidney Stone Disease: An Update on Current Concepts , 2018, Advances in urology.

[29] V. B. Surya Prasath. Polyp Detection and Segmentation from Video Capsule Endoscopy: A Review , 2016, J. Imaging.

[30] Jacob Chakareski,et al. Effective Deep Learning for Semantic Segmentation Based Bleeding Zone Detection in Capsule Endoscopy Images , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[31] George Papandreou,et al. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[32] Prema T. Akkasaligar,et al. Kidney stone detection in computed tomography images , 2017, 2017 International Conference On Smart Technologies For Smart Nation (SmartTechCon).

[33] Irina Voiculescu,et al. A translational pathway of deep learning methods in GastroIntestinal Endoscopy , 2020, ArXiv.

[34] Dinggang Shen,et al. HAMMER: hierarchical attribute matching mechanism for elastic registration , 2002, IEEE Transactions on Medical Imaging.

[35] Frank Y. Shih,et al. Image Segmentation , 2007, Encyclopedia of Biometrics.

[36] Dinggang Shen,et al. Deformable Image Registration Using a Cue-Aware Deep Regression Network , 2018, IEEE Transactions on Biomedical Engineering.

[37] Thomas de Lange,et al. ResUNet++: An Advanced Architecture for Medical Image Segmentation , 2019, 2019 IEEE International Symposium on Multimedia (ISM).

[38] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[39] Max Q.-H. Meng,et al. A study on automated segmentation of blood regions in Wireless Capsule Endoscopy images using fully convolutional networks , 2017, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017).

[40] Qingjie Liu,et al. Road Extraction by Deep Residual U-Net , 2017, IEEE Geoscience and Remote Sensing Letters.

[41] Evgeny Burnaev,et al. Boundary Loss for Remote Sensing Imagery Semantic Segmentation , 2019, ISNN.

[42] O. Traxer,et al. Complications of ureteroscopy: a complete overview , 2019, World Journal of Urology.

[43] Sharib Ali,et al. MI-UNet: Improved Segmentation in Ureteroscopy , 2020, 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI).

[44] Max Q.-H. Meng,et al. Saliency Based Ulcer Detection for Wireless Capsule Endoscopy Diagnosis , 2015, IEEE Transactions on Medical Imaging.

[45] Milan Tuba,et al. An algorithm for automated segmentation for bleeding detection in endoscopic images , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[46] Tom Vercauteren,et al. Diffeomorphic demons: Efficient non-parametric image registration , 2009, NeuroImage.

[47] Peter Caccetta,et al. ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[48] Sharib Ali,et al. Motion induced segmentation of stone fragments in ureteroscopy video , 2020, Medical Imaging: Image-Guided Procedures.