An Application-Dependent Framework for the Recognition of High-Level Surgical Tasks in the OR

Surgical process analysis and modeling is a recent and important topic aiming at introducing a new generation of computer-assisted surgical systems. Among all of the techniques already in use for extracting data from the Operating Room, the use of image videos allows automating the surgeons' assistance without altering the surgical routine. We proposed in this paper an application-dependent framework able to automatically extract the phases of the surgery only by using microscope videos as input data and that can be adaptable to different surgical specialties. First, four distinct types of classifiers based on image processing were implemented to extract visual cues from video frames. Each of these classifiers was related to one kind of visual cue: visual cues recognizable through color were detected with a color histogram approach, for shape-oriented visual cues we trained a Haar classifier, for texture-oriented visual cues we used a bag-of-word approach with SIFT descriptors, and for all other visual cues we used a classical image classification approach including a feature extraction, selection, and a supervised classification. The extraction of this semantic vector for each video frame then permitted to classify time series using either Hidden Markov Model or Dynamic Time Warping algorithms. The framework was validated on cataract surgeries, obtaining accuracies of 95%.

[1]  Nassir Navab,et al.  Recovery of Surgical Workflow Without Explicit Models , 2006, MICCAI.

[2]  Pierre Jannin,et al.  Surgical Phases Detection from Microscope Videos by Combining SVM and HMM , 2010, MCV.

[3]  Terry M. Peters,et al.  Medical Image Computing and Computer-Assisted Intervention - MICCAI 2003 , 2003, Lecture Notes in Computer Science.

[4]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[5]  Guang-Zhong Yang,et al.  Episode Classification for the Analysis of Tissue/Instrument Interaction with Multiple Visual Cues , 2003, MICCAI.

[6]  D. H. Mellor,et al.  Real time , 1981 .

[7]  Nassir Navab,et al.  Medical Image Computing and Computer-Assisted Intervention - MICCAI 2010, 13th International Conference, Beijing, China, September 20-24, 2010, Proceedings, Part III , 2010, MICCAI.

[8]  Richard W. Hamming,et al.  Coding and Information Theory , 1980 .

[9]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[10]  Peter Fu-Ming Hu,et al.  Real-Time Identification of Operating Room State from Video , 2007, AAAI.

[11]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[12]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[13]  Nassir Navab,et al.  On-line Recognition of Surgical Activity for Monitoring in the Operating Room , 2008, AAAI.

[14]  Jenny Dankelman,et al.  Discovery of high-level tasks in the operating room , 2011, J. Biomed. Informatics.

[15]  G.D. Hager,et al.  Towards “real-time” tool-tissue interaction detection in robotically assisted laparoscopy , 2008, 2008 2nd IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics.

[16]  Pierre Jannin,et al.  Automatic Phases Recognition in Pituitary Surgeries by Microscope Images Classification , 2010, IPCAI.

[17]  Jianqin Zhou,et al.  On discrete cosine transform , 2011, ArXiv.

[18]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[19]  Georg Langs,et al.  Medical Computer Vision , 2011 .

[20]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[21]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[22]  Lasse Riis Østergaard,et al.  Active Surface Approach for Extraction of the Human Cerebral Cortex from MRI , 2006, MICCAI.

[23]  Russell H. Taylor,et al.  Information Processing in Computer-Assisted Interventions - Second International Conference, IPCAI 2011, Berlin, Germany, June 22, 2011. Proceedings , 2011, IPCAI.

[24]  Nassir Navab,et al.  Modeling and Segmentation of Surgical Workflow from Laparoscopic Video , 2010, MICCAI.

[25]  Nassir Navab,et al.  Automatic feature generation in endoscopic images , 2008, International Journal of Computer Assisted Radiology and Surgery.