Paper information - CholecTriplet2021: A benchmark challenge for surgical action triplet recognition - 字舞流文

CholecTriplet2021: A benchmark challenge for surgical action triplet recognition

Helena R. Torres | Mohammad Hasan Sarhan | N. Padoy | D. Stoyanov | D. Mutter | G. Zheng | S. Bodenstedt | Huabin Chen | Jiacheng Wang | Liansheng Wang | Imanol Luengo | F. Jia | Winnie Pang | Chen Qian | Shuai Ding | Mobarakol Islam | Hongliang Ren | Zixu Zhao | Hao Wang | Li Zhang | P. Mascagni | B. Seeliger | Cristians Gonzalez | Zhen Li | S. Raviteja | A. Jenke | Ricardo Sánchez-Matilla | M. Robu | Bruno Oliveira | Liping Ling | Yuanbo Zhu | Tong Yu | Tobias Czempiel | Velmurugan Balasubramanian | R. Sathish | Deepak Alapatt | Mengya Xu | Armine Vardazaryan | Tong Xia | Satoshi Kondo | R. Tao | N. Getty | G. Bian | I. N. Wijma | Nithya Bhasker | R. Egging | Bokai Zhang | J. Abbing | D. Sheet | L. Seenivasan | Xiaotian Duan | João L. Vilaça | Pedro Morais | C. Nwoye | Fan Xia | Yuxuan Yang | De-Shuai Yu | Beerend G. A. Gerats | Finn Gaida | Jaime C. Fonseca | Jakob-Anton Aschenbrenner | Nicolas Elini van der Kar

[1] Pheng-Ann Heng, et al. Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark, 2021, Medical Image Anal.

[2] N. Padoy, et al. Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos, 2021, Medical Image Anal.

[3] Riccardo Muradore, et al. The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods, 2021, ArXiv.

[4] Arka Sadhu, et al. Visual Semantic Role Labeling for Video Understanding, 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Quoc V. Le, et al. EfficientNetV2: Smaller Models and Faster Training, 2021, ICML.

[6] Mobarakol Islam, et al. Learning Domain Adaptation with Model Calibration for Surgical Report Generation in Robotic Surgery, 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[7] Pheng-Ann Heng, et al. Temporal Memory Relation Network for Workflow Recognition From Surgical Video, 2021, IEEE Transactions on Medical Imaging.

[8] Jiashi Feng, et al. Coordinate Attention for Efficient Mobile Network Design, 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Lena Maier-Hein, et al. Endoscopic Vision Challenge 2021, 2021.

[10] N. Padoy, et al. Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures, 2021, International Journal of Computer Assisted Radiology and Surgery.

[11] Yang Zhao, et al. Deep High-Resolution Representation Learning for Visual Recognition, 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Stephen Lin, et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[13] Toktam Khatibi, et al. Proposing novel methods for gynecologic surgical action recognition on laparoscopic videos, 2020, Multimedia Tools and Applications.

[14] Jacques Marescaux, et al. Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets, 2020, MICCAI.

[15] Cewu Lu, et al. Detailed 2D-3D Joint Representation for Human-Object Interaction, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Nassir Navab, et al. TeCNO: Surgical Phase Recognition with Multi-Stage Temporal Convolutional Networks, 2020, MICCAI.

[17] Yi-Zhe Song, et al. The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification, 2020, IEEE Transactions on Image Processing.

[18] Nicolas Martin, et al. Assisted phase and step annotation for surgical videos, 2020, International Journal of Computer Assisted Radiology and Surgery.

[19] Jun-Wei Hsieh, et al. CSPNet: A New Backbone that can Enhance Learning Capability of CNN, 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20] Jinwen Ma, et al. Multi-Label Classification with Label Graph Superimposing, 2019, AAAI.

[21] Mathias Unberath, et al. CAI4CAI: The Rise of Contextual Artificial Intelligence in Computer-Assisted Interventions, 2019, Proceedings of the IEEE.

[22] Debdoot Sheet, et al. Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos, 2019, ArXiv.

[23] Gregory D. Hager, et al. Segmenting and classifying activities in robot-assisted surgery with recurrent neural networks, 2019, International Journal of Computer Assisted Radiology and Surgery.

[24] Yazan Abu Farha, et al. MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Satoshi Kondo, et al. CATARACTS: Challenge on automatic tool annotation for cataRACT surgery, 2019, Medical Image Anal.

[26] Jitendra Malik, et al. SlowFast Networks for Video Recognition, 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[27] Didier Mutter, et al. Weakly supervised convolutional LSTM approach for tool tracking in laparoscopic videos, 2018, International Journal of Computer Assisted Radiology and Surgery.

[28] Ami Wiesel, et al. Learning to Detect, 2018, IEEE Transactions on Signal Processing.

[29] Didier Mutter, et al. Learning from a tiny dataset of manual annotations: a teacher/student approach for surgical phase recognition, 2018, ArXiv.

[30] Song-Chun Zhu, et al. Learning Human-Object Interactions by Graph Parsing Neural Networks, 2018, ECCV.

[31] Danail Stoyanov, et al. DeepPhase: Surgical Phase Recognition in CATARACTS Videos, 2018, MICCAI.

[32] Gwénolé Quellec, et al. Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks, 2018, Medical Image Anal.

[33] Sebastian Bodenstedt, et al. Temporal coherence-based self-supervised learning for laparoscopic workflow analysis, 2018, OR 2.0/CARE/CLIP/ISIC@MICCAI.

[34] Qi Wu, et al. HCVRD: A Benchmark for Large-Scale Human-Centered Visual Relationship Detection, 2018, AAAI.

[35] Shu Liu, et al. Path Aggregation Network for Instance Segmentation, 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[36] Kaiming He, et al. Detecting and Recognizing Human-Object Interactions, 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37] Jia Deng, et al. Learning to Detect Human-Object Interactions, 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[38] Russell H. Taylor, et al. Surgical data science for next-generation interventions, 2017, Nature Biomedical Engineering.

[39] Matthieu Cord, et al. WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40] Debdoot Sheet, et al. Learning Latent Temporal Connectionism of Deep Residual Visual Abstractions for Identifying Surgical Tools in Laparoscopy Procedures, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[41] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.

[42] Andrew Zisserman, et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43] Shih-Fu Chang, et al. Visual Translation Embedding Network for Visual Relation Detection, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Kilian Q. Weinberger, et al. Densely Connected Convolutional Networks, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Andru Putra Twinanda, et al. EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos, 2016, IEEE Transactions on Medical Imaging.

[46] Nassir Navab, et al. The TUM LapChole dataset for the M2CAI 2016 workflow challenge, 2016, ArXiv.

[47] Nassir Navab, et al. Sensor substitution for video-based action recognition, 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[48] Svetlana Lazebnik, et al. Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering, 2016, ECCV.

[49] Pierre Jannin, et al. Automatic data-driven real-time segmentation and recognition of surgical workflow, 2016, International Journal of Computer Assisted Radiology and Surgery.

[50] Jian Sun, et al. Deep Residual Learning for Image Recognition, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Jiaxuan Wang, et al. HICO: A Benchmark for Recognizing Human-Object Interactions in Images, 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[52] Dit-Yan Yeung, et al. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting, 2015, NIPS.

[53] Rüdiger Dillmann, et al. LapOntoSPM: an ontology for laparoscopic surgeries and its application to surgical phase recognition, 2015, International Journal of Computer Assisted Radiology and Surgery.

[54] Łukasz Bolikowski, et al. Quo vadis, CARS? First steps towards text-mining-based analysis of topics in the International Journal of Computer Assisted Radiology and Surgery, 2015.

[55] Rüdiger Dillmann, et al. Knowledge-Driven Formalization of Laparoscopic Surgeries for Rule-Based Intraoperative Context-Aware Assistance, 2014, IPCAI.

[56] Pietro Perona, et al. Microsoft COCO: Common Objects in Context, 2014, ECCV.

[57] Luc Van Gool, et al. The Pascal Visual Object Classes Challenge: A Retrospective, 2014, International Journal of Computer Vision.

[58] Hema Swetha Koppula, et al. Learning human activities and object affordances from RGB-D videos, 2012, Int. J. Robotics Res.

[59] Mubarak Shah, et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild, 2012, ArXiv.

[60] Thomas Serre, et al. HMDB: A large video database for human motion recognition, 2011, 2011 International Conference on Computer Vision.

[61] Paul J. M. Havinga, et al. Activity Recognition Using Inertial Sensing for Healthcare, Wellbeing and Sports Applications: A Survey, 2010, ARCS Workshops.

[62] James S. Duncan, et al. Medical Image Analysis, 1999, IEEE Pulse.

[63] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.