论文信息 - Spatio-temporal deep learning model for distortion classification in laparoscopic video

Spatio-temporal deep learning model for distortion classification in laparoscopic video

Background: Laparoscopy is a surgery performed in the abdomen without making large incisions in the skin and with the aid of a video camera, resulting in laparoscopic videos. The laparoscopic video is prone to various distortions such as noise, smoke, uneven illumination, defocus blur, and motion blur. One of the main components in the feedback loop of video enhancement systems is distortion identification, which automatically classifies the distortions affecting the videos and selects the video enhancement algorithm accordingly. This paper aims to address the laparoscopic video distortion identification problem by developing fast and accurate multi-label distortion classification using a deep learning model. Current deep learning solutions based on convolutional neural networks (CNNs) can address laparoscopic video distortion classification, but they learn only spatial information. Methods: In this paper, utilization of both spatial and temporal features in a CNN-long short-term memory (CNN-LSTM) model is proposed as a novel solution to enhance the classification. First, pre-trained ResNet50 CNN was used to extract spatial features from each video frame by transferring representation from large-scale natural images to laparoscopic images. Next, LSTM was utilized to consider the temporal relation between the features extracted from the laparoscopic video frames to produce multi-label categories. A novel laparoscopic video dataset proposed in the ICIP2020 challenge was used for training and evaluation of the proposed method. Results: The experiments conducted show that the proposed CNN-LSTM outperforms the existing solutions in terms of accuracy (85%), and F1-score (94.2%). Additionally, the proposed distortion identification model is able to run in real-time with low inference time (0.15 sec). Conclusions: The proposed CNN-LSTM model is a feasible solution to be utilized in laparoscopic videos for distortion identification.

[1] Hui Huang,et al. Multi-Spectral RGB-NIR Image Classification Using Double-Channel CNN , 2019, IEEE Access.

[2] Mounir Kaaniche,et al. Efficient Enhancement of Stereo Endoscopic Images Based on Joint Wavelet Decomposition and Binocular Combination , 2019, IEEE Transactions on Medical Imaging.

[3] Alan C. Bovik,et al. A two-stage framework for blind image quality assessment , 2010, 2010 IEEE International Conference on Image Processing.

[4] Pablo Lamata,et al. Laparoscopic video analysis for training and image-guided surgery , 2011, Minimally invasive therapy & allied technologies : MITAT : official journal of the Society for Minimally Invasive Therapy.

[5] Mounir Kaaniche,et al. Towards a video quality assessment based framework for enhancement of laparoscopic videos , 2020, Medical Imaging.

[6] Munendra Singh,et al. Unsupervised smoke to desmoked laparoscopic surgery images using contrast driven Cyclic-DesmokeGAN , 2020, Comput. Biol. Medicine.

[7] Mateusz Buczkowski,et al. Convolutional Neural Network-Based Image Distortion Classification , 2019, 2019 International Conference on Systems, Signals and Image Processing (IWSSIP).

[8] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[9] Kai Xie,et al. Efficient medical image enhancement based on CNN-FBB model , 2019, IET Image Process..

[10] Congcong Wang,et al. A Smoke Removal Method for Laparoscopic Images , 2018, ArXiv.

[11] J. Dankelman,et al. Problems with technical equipment during laparoscopic surgery , 2007, Surgical Endoscopy.

[12] H. A. Karim,et al. Transfer Learning and Decision Fusion for Real Time Distortion Classification in Laparoscopic Videos , 2021, IEEE Access.

[13] Ganapathy Krishnamurthi,et al. Medical image retrieval using Resnet-18 , 2019, Medical Imaging.

[14] Alan C. Bovik,et al. Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality , 2011, IEEE Transactions on Image Processing.

[15] Andru Putra Twinanda,et al. EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos , 2016, IEEE Transactions on Medical Imaging.

[16] Guojun Lu,et al. Distortion Robust Image Classification Using Deep Convolutional Neural Network with Discrete Cosine Transform , 2018, 2019 IEEE International Conference on Image Processing (ICIP).

[17] Alan C. Bovik,et al. A Two-Step Framework for Constructing Blind Image Quality Indices , 2010, IEEE Signal Processing Letters.

[18] Mounir Kaaniche,et al. Residual Networks Based Distortion Classification and Ranking for Laparoscopic Image Quality Assessment , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[19] Mounir Kaaniche,et al. Joint Statistical Models for No-Reference Stereoscopic Image Quality Assessment , 2018, 2018 7th European Workshop on Visual Information Processing (EUVIP).

[20] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[21] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[22] Alan C. Bovik,et al. No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[23] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[24] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Md. Milon Islam,et al. A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images , 2020, Informatics in Medicine Unlocked.

[26] Congcong Wang,et al. Multiscale deep desmoking for laparoscopic surgery , 2019, Medical Imaging: Image Processing.

[27] Matthieu Cord,et al. Training data-efficient image transformers & distillation through attention , 2020, ICML.

[28] Seunghyeon Kim,et al. CNN-Based Semantic Segmentation Using Level Set Loss , 2019, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[29] Masimba Nyandowe,et al. Technical problems during laparoscopy: a systematic method of troubleshooting for surgeons , 2017, Innovative surgical sciences.