论文信息 - Attention-Based Surgical Phase Boundaries Detection in Laparoscopic Videos

Attention-Based Surgical Phase Boundaries Detection in Laparoscopic Videos

A new deep learning-based method is proposed for identifying the boundaries of all surgical phases in a laparoscopic video. The model is designed based on the sequence-to-sequence architecture with an attention mechanism, to map the extracted visual features to the frame numbers of the beginning and the ending of each phase. The main novelty is that the alignment vectors for each phase are taken as the outputs, and are trained directly to select the indices. We evaluated our model using a large publicly available dataset of laparoscopic cholecystectomy procedure and obtained the Mean Absolute Error (MAE) of 48 seconds.

[1] Babak Namazi,et al. Automatic Detection of Surgical Phases in Laparoscopic Videos , 2018 .

[2] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.

[3] Andru Putra Twinanda,et al. EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos , 2016, IEEE Transactions on Medical Imaging.

[4] Chi-Wing Fu,et al. SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network , 2018, IEEE Transactions on Medical Imaging.

[5] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[6] Yoshua Bengio,et al. Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.

[7] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[8] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[9] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[10] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.

[11] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[12] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).