FetNet: a recurrent convolutional network for occlusion identification in fetoscopic videos

Purpose Fetoscopic laser photocoagulation is a minimally invasive surgery for the treatment of twin-to-twin transfusion syndrome (TTTS). By using a lens/fibre-optic scope, inserted into the amniotic cavity, the abnormal placental vascular anastomoses are identified and ablated to regulate blood flow to both fetuses. Limited field-of-view, occlusions due to fetus presence and low visibility make it difficult to identify all vascular anastomoses. Automatic computer-assisted techniques may provide better understanding of the anatomical structure during surgery for risk-free laser photocoagulation and may facilitate in improving mosaics from fetoscopic videos. Methods We propose FetNet, a combined convolutional neural network (CNN) and long short-term memory (LSTM) recurrent neural network architecture for the spatio-temporal identification of fetoscopic events. We adapt an existing CNN architecture for spatial feature extraction and integrated it with the LSTM network for end-to-end spatio-temporal inference. We introduce differential learning rates during the model training to effectively utilising the pre-trained CNN weights. This may support computer-assisted interventions (CAI) during fetoscopic laser photocoagulation. Results We perform quantitative evaluation of our method using 7 in vivo fetoscopic videos captured from different human TTTS cases. The total duration of these videos was 5551 s (138,780 frames). To test the robustness of the proposed approach, we perform 7-fold cross-validation where each video is treated as a hold-out or test set and training is performed using the remaining videos. Conclusion FetNet achieved superior performance compared to the existing CNN-based methods and provided improved inference because of the spatio-temporal information modelling. Online testing of FetNet, using a Tesla V100-DGXS-32GB GPU, achieved a frame rate of 114 fps. These results show that our method could potentially provide a real-time solution for CAI and automating occlusion and photocoagulation identification during fetoscopic procedures.

[1]  Sébastien Ourselin,et al.  Retrieval and registration of long-range overlapping frames for scalable mosaicking of in vivo fetoscopy , 2018, International Journal of Computer Assisted Radiology and Surgery.

[2]  Matthieu Cord,et al.  M2CAI Workflow Challenge: Convolutional Neural Networks with Time Smoothing and Hidden Markov Model for Video Frames Classification , 2016, ArXiv.

[3]  Jean Ponce,et al.  A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[4]  Rory Windrim,et al.  Fetoscopic laser therapy for twin-twin transfusion syndrome before 17 and after 26 weeks' gestation. , 2013, American journal of obstetrics and gynecology.

[5]  Gregory D. Hager,et al.  Recognizing Surgical Activities with Recurrent Neural Networks , 2016, MICCAI.

[6]  Y. Ville,et al.  Twin-to-twin transfusion syndrome (TTTS)* , 2011, Journal of perinatal medicine.

[7]  Sotirios A. Tsaftaris,et al.  Medical Image Computing and Computer Assisted Intervention , 2017 .

[8]  Xenophon Papademetris,et al.  Deep-learned placental vessel segmentation for intraoperative video enhancement in fetoscopic surgery , 2018, International Journal of Computer Assisted Radiology and Surgery.

[9]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[10]  Jan Deprest,et al.  The vascular anastomoses in monochorionic twin pregnancies and their clinical consequences. , 2012, American journal of obstetrics and gynecology.

[11]  F. Walther,et al.  Residual anastomoses after fetoscopic laser surgery in twin-to-twin transfusion syndrome: frequency, associated risks and outcome. , 2007, Placenta.

[12]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[13]  Danail Stoyanov,et al.  Pruning strategies for efficient online globally consistent mosaicking in fetoscopy , 2019, Journal of medical imaging.

[14]  Sébastien Ourselin,et al.  Towards computer-assisted TTTS: Laser ablation detection for workflow segmentation from fetoscopic video , 2018, International Journal of Computer Assisted Radiology and Surgery.

[15]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[17]  R. Chmait,et al.  Sequential selective laser photocoagulation of communicating vessels in twin–twin transfusion syndrome , 2007, The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians.

[18]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[19]  Trevor Darrell,et al.  Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Andru Putra Twinanda,et al.  EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos , 2016, IEEE Transactions on Medical Imaging.

[21]  Sébastien Ourselin,et al.  Real-time mosaicing of fetoscopic videos using SIFT , 2016, SPIE Medical Imaging.

[22]  J. Deprest,et al.  Alternative technique for Nd : YAG laser coagulation in twin‐to‐twin transfusion syndrome with anterior placenta , 1998, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[23]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[24]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[25]  Chi-Wing Fu,et al.  SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network , 2018, IEEE Transactions on Medical Imaging.

[26]  J. Stockman,et al.  Endoscopic Laser Surgery Versus Serial Amnioreduction for Severe Twin-to-Twin Transfusion Syndrome , 2006 .

[27]  Sébastien Ourselin,et al.  Deep Sequential Mosaicking of Fetoscopic Videos , 2019, MICCAI.