论文信息 - FetNet: a recurrent convolutional network for occlusion identification in fetoscopic videos

FetNet: a recurrent convolutional network for occlusion identification in fetoscopic videos

Purpose Fetoscopic laser photocoagulation is a minimally invasive surgery for the treatment of twin-to-twin transfusion syndrome (TTTS). By using a lens/fibre-optic scope, inserted into the amniotic cavity, the abnormal placental vascular anastomoses are identified and ablated to regulate blood flow to both fetuses. Limited field-of-view, occlusions due to fetus presence and low visibility make it difficult to identify all vascular anastomoses. Automatic computer-assisted techniques may provide better understanding of the anatomical structure during surgery for risk-free laser photocoagulation and may facilitate in improving mosaics from fetoscopic videos. Methods We propose FetNet, a combined convolutional neural network (CNN) and long short-term memory (LSTM) recurrent neural network architecture for the spatio-temporal identification of fetoscopic events. We adapt an existing CNN architecture for spatial feature extraction and integrated it with the LSTM network for end-to-end spatio-temporal inference. We introduce differential learning rates during the model training to effectively utilising the pre-trained CNN weights. This may support computer-assisted interventions (CAI) during fetoscopic laser photocoagulation. Results We perform quantitative evaluation of our method using 7 in vivo fetoscopic videos captured from different human TTTS cases. The total duration of these videos was 5551 s (138,780 frames). To test the robustness of the proposed approach, we perform 7-fold cross-validation where each video is treated as a hold-out or test set and training is performed using the remaining videos. Conclusion FetNet achieved superior performance compared to the existing CNN-based methods and provided improved inference because of the spatio-temporal information modelling. Online testing of FetNet, using a Tesla V100-DGXS-32GB GPU, achieved a frame rate of 114 fps. These results show that our method could potentially provide a real-time solution for CAI and automating occlusion and photocoagulation identification during fetoscopic procedures.

[1] Sébastien Ourselin,et al. Retrieval and registration of long-range overlapping frames for scalable mosaicking of in vivo fetoscopy , 2018, International Journal of Computer Assisted Radiology and Surgery.

[2] Matthieu Cord,et al. M2CAI Workflow Challenge: Convolutional Neural Networks with Time Smoothing and Hidden Markov Model for Video Frames Classification , 2016, ArXiv.

[3] Jean Ponce,et al. A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[4] Rory Windrim,et al. Fetoscopic laser therapy for twin-twin transfusion syndrome before 17 and after 26 weeks' gestation. , 2013, American journal of obstetrics and gynecology.

[5] Gregory D. Hager,et al. Recognizing Surgical Activities with Recurrent Neural Networks , 2016, MICCAI.

[6] Y. Ville,et al. Twin-to-twin transfusion syndrome (TTTS)* , 2011, Journal of perinatal medicine.

[7] Sotirios A. Tsaftaris,et al. Medical Image Computing and Computer Assisted Intervention , 2017 .

[8] Xenophon Papademetris,et al. Deep-learned placental vessel segmentation for intraoperative video enhancement in fetoscopic surgery , 2018, International Journal of Computer Assisted Radiology and Surgery.

[9] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[10] Jan Deprest,et al. The vascular anastomoses in monochorionic twin pregnancies and their clinical consequences. , 2012, American journal of obstetrics and gynecology.

[11] F. Walther,et al. Residual anastomoses after fetoscopic laser surgery in twin-to-twin transfusion syndrome: frequency, associated risks and outcome. , 2007, Placenta.

[12] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[13] Danail Stoyanov,et al. Pruning strategies for efficient online globally consistent mosaicking in fetoscopy , 2019, Journal of medical imaging.

[14] Sébastien Ourselin,et al. Towards computer-assisted TTTS: Laser ablation detection for workflow segmentation from fetoscopic video , 2018, International Journal of Computer Assisted Radiology and Surgery.

[15] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[17] R. Chmait,et al. Sequential selective laser photocoagulation of communicating vessels in twin–twin transfusion syndrome , 2007, The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians.

[18] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[19] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Andru Putra Twinanda,et al. EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos , 2016, IEEE Transactions on Medical Imaging.

[21] Sébastien Ourselin,et al. Real-time mosaicing of fetoscopic videos using SIFT , 2016, SPIE Medical Imaging.

[22] J. Deprest,et al. Alternative technique for Nd : YAG laser coagulation in twin‐to‐twin transfusion syndrome with anterior placenta , 1998, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[23] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[24] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[25] Chi-Wing Fu,et al. SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network , 2018, IEEE Transactions on Medical Imaging.

[26] J. Stockman,et al. Endoscopic Laser Surgery Versus Serial Amnioreduction for Severe Twin-to-Twin Transfusion Syndrome , 2006 .

[27] Sébastien Ourselin,et al. Deep Sequential Mosaicking of Fetoscopic Videos , 2019, MICCAI.