Motion-based video retrieval with application to computer-assisted retinal surgery

In this paper, we address the problem of computer-aided ophthalmic surgery. In particular, a novel Content-Based Video Retrieval (CBVR) system is presented : given a video stream captured by a digital camera monitoring the current surgery, the system retrieves, within digital archives, videos that resemble the current surgery monitoring video. The search results may be used to guide surgeons' decisions, for example, let the surgeon know what a more experienced fellow worker would do in a similar situation. With this goal, we propose to use motion information contained in MPEG- 4 AVC/H.264 video standard to extract features from videos. We propose two approaches, one of which is based on motion histogram created for every frame of a compressed video sequence to extract motion direction and intensity statistics. The other combine segmentation and tracking to extract region displacements between consecutive frames and therefore characterize region trajectories. To compare videos, an extension of the fast dynamic time warping to multidimensional time series was adopted. The system is applied to a dataset of 69 video-recorded retinal surgery steps. Results are promising: the retrieval efficiency is higher than 69%.

[1]  Yu Cao,et al.  Computer-Aided Detection of Diagnostic and Therapeutic Operations in Colonoscopy Videos , 2007, IEEE Transactions on Biomedical Engineering.

[2]  F. L. Hitchcock The Distribution of a Product from Several Sources to Numerous Localities , 1941 .

[3]  Guang-Zhong Yang,et al.  Content-Based Surgical Workflow Representation Using Probabilistic Motion Modeling , 2010, MIAR.

[4]  Alvy Ray Smith,et al.  Color gamut transform pairs , 1978, SIGGRAPH.

[5]  Mathias Lux,et al.  Visualization of video motion in context of video browsing , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[6]  Gregory D. Hager,et al.  Real-Time Endoscopic Mosaicking , 2006, MICCAI.

[7]  Carlo Tomasi,et al.  Perceptual metrics for image database navigation , 1999 .

[8]  Wesley W. Chu,et al.  Efficient searches for similar subsequences of different lengths in sequence databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[9]  Ivan V. Bajic,et al.  Predictive Decoding for Delay Reduction in Video Communications , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[10]  Guang-Zhong Yang,et al.  Eye-Gaze Driven Surgical Workflow Segmentation , 2007, MICCAI.