Surgical Phase and Instrument Recognition: How to identify appropriate Dataset Splits

Purpose: The development of machine learning models for surgical workflow and instrument recognition from temporal data represents a challenging task due to the complex nature of surgical workflows. In particular, the imbalanced distribution of data is one of the major challenges in the domain of surgical workflow recognition. In order to obtain meaningful results, careful partitioning of data into training, validation, and test sets, as well as the selection of suitable evaluation metrics are crucial. Methods: In this work, we present an openly available web-based application that enables interactive exploration of dataset partitions. The proposed visual framework facilitates the assessment of dataset splits for surgical workflow recognition, especially with regard to identifying sub-optimal dataset splits. Currently, it supports visualization of surgical phase and instrument annotations. Results: In order to validate the dedicated interactive visualizations, we use a dataset split of the Cholec80 dataset. This dataset split was specifically selected to reflect a case of strong data imbalance. Using our software, we were able to identify phases, phase transitions, and combinations of surgical instruments that were not represented in one of the sets. Conclusion: In order to obtain meaningful results in highly unbalanced class distributions, special care should be taken with respect to the selection of an appropriate split. Interactive data visualization represents a promising approach for the assessment of machine learning datasets. The source code is available at https://github.com/Cardio-AI/endovis-ml

[1]  M. Wagner,et al.  Interactive visual exploration of surgical process data , 2022, International Journal of Computer Assisted Radiology and Surgery.

[2]  D. Stoyanov,et al.  Reducing prediction volatility in the surgical workflow recognition of endoscopic pituitary surgery , 2022, International Journal of Computer Assisted Radiology and Surgery.

[3]  N. Padoy,et al.  Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos , 2021, Medical Image Anal..

[4]  S. T. Kim,et al.  OperA: Attention-Regularized Transformers for Surgical Phase Recognition , 2021, MICCAI.

[5]  Peter M. Full,et al.  Heidelberg colorectal data set for surgical data science in the sensor operating room , 2020, Scientific Data.

[6]  Satoshi Kondo,et al.  CATARACTS: Challenge on automatic tool annotation for cataRACT surgery , 2019, Medical Image Anal..

[7]  Russell H. Taylor,et al.  Surgical data science for next-generation interventions , 2017, Nature Biomedical Engineering.

[8]  Nassir Navab,et al.  The TUM LapChole dataset for the M2CAI 2016 workflow challenge , 2016, ArXiv.

[9]  Andru Putra Twinanda,et al.  EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos , 2016, IEEE Transactions on Medical Imaging.

[10]  Silvia Miksch,et al.  Radial Sets: Interactive Visual Analysis of Large Overlapping Sets , 2013, IEEE Transactions on Visualization and Computer Graphics.

[11]  Jenny Dankelman,et al.  Workflow mining and outlier detection from clinical activity logs , 2012, J. Biomed. Informatics.

[12]  Nassir Navab,et al.  Workflow mining for visualization and analysis of surgeries , 2008, International Journal of Computer Assisted Radiology and Surgery.

[13]  Thomas Neumuth,et al.  Analysis of surgical intervention populations using generic surgical process models , 2010, International Journal of Computer Assisted Radiology and Surgery.