“Deep-Onto” network for surgical workflow and context recognition

Purpose: Surgical workflow recognition and context-aware systems could support better decision making and surgical planning by providing focused information, which may eventually improve surgical outcomes. Current computer-assisted surgical systems mostly focus on recognizing surgical phases; they do not recognize the surgical workflow sequence or other contextual elements, e.g., “Instruments.” Our study proposes a hybrid approach that combines deep learning and knowledge representation to facilitate recognition of the surgical workflow.

Methods: We implemented the “Deep-Onto” network, an ensemble of deep learning models and knowledge management tools, namely an ontology and production rules. As a prototypical scenario, we chose robot-assisted partial nephrectomy (RAPN). We annotated RAPN videos with surgical entities, e.g., “Step” and so forth. We performed different experiments, including one on inter-subject variability, to recognize surgical steps. For each recognized step, the corresponding subsequent step and the other surgical contexts, i.e., “Actions,” “Phase” and “Instruments,” were also recognized.

Results: The system recognized 10 RAPN steps with a prevalence-weighted macro-average (PWMA) recall of 0.83, PWMA precision of 0.74, PWMA F1 score of 0.76, and an accuracy of 74.29% on 9 RAPN videos.

Conclusion: We found that the combined use of deep learning and knowledge representation techniques is a promising approach for the multi-level recognition of the RAPN surgical workflow.
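The prevalence-weighted scores reported in the results can be illustrated with a minimal sketch. It assumes a common definition in which per-class precision, recall and F1 are averaged with weights equal to each class's share of the ground-truth labels. The abstract does not state how the authors computed or aggregated their figures, so this need not reproduce them. The function name `prevalence_weighted_scores` and the labels "A" and "B" are hypothetical and stand in for RAPN step labels.

```python
from collections import Counter


def prevalence_weighted_scores(y_true, y_pred):
    """Average per-class precision/recall/F1, weighted by class prevalence in y_true.

    Assumed definition (not taken from the paper): weight of class c is
    (number of true samples of c) / (total number of samples).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    support = Counter(y_true)      # true samples per class
    predicted = Counter(y_pred)    # predicted samples per class
    true_pos = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    n = len(y_true)

    precision = recall = f1 = 0.0
    for label, count in support.items():
        tp = true_pos[label]
        p = tp / predicted[label] if predicted[label] else 0.0
        r = tp / count
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = count / n         # class prevalence
        precision += weight * p
        recall += weight * r
        f1 += weight * f

    accuracy = sum(true_pos.values()) / n
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Hypothetical frame-level labels for two steps, "A" and "B".
scores = prevalence_weighted_scores(["A", "A", "A", "B"], ["A", "A", "B", "B"])
print(scores)
```

When all frames are pooled like this, prevalence-weighted recall is mathematically equal to accuracy. Reported values that differ between the two would therefore imply some other aggregation, for example averaging per video.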
