Large-Scale Endoscopic Image and Video Linking with Gradient-Based Signatures

Given a large-scale video archive of surgical interventions and a medical image showing a specific moment of an operation, how to find the most image-related videos efficiently without the utilization of additional semantic characteristics? In this paper, we investigate a novel content-based approach of linking medical images with relevant video segments arising from endoscopic procedures. We propose to approximate the video segments' content-based features by gradient-based signatures and to index these signatures with the Minkowski distance in order to determine the most query-like video segments efficiently. We benchmark our approach on a large endoscopic image and video archive and show that our approach achieves a significant improvement in efficiency in comparison to the state-of-the-art while maintaining high accuracy.

[1]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[2]  Yiannis S. Boutalis,et al.  Searching images with MPEG-7 (& MPEG-7-like) Powered Localized dEscriptors: The SIMPLE answer to effective Content Based Image Retrieval , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).

[3]  Thomas Seidl,et al.  Signature Quadratic Form Distance , 2010, CIVR '10.

[4]  Mathias Lux,et al.  Visual information retrieval in endoscopic video archives , 2015, 2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI).

[5]  Thomas Seidl,et al.  Gradient-based Signatures for Efficient Similarity Search in Large-scale Multimedia Databases , 2015, CIKM.

[6]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[8]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[9]  Thomas Seidl,et al.  Signature matching distance for content-based image retrieval , 2013, ICMR.

[10]  Mathias Lux,et al.  Content-based retrieval in videos from laparoscopic surgery , 2016, SPIE Medical Imaging.

[11]  Mathias Lux,et al.  Endoscopic Video Retrieval: A Signature-Based Approach for Linking Endoscopic Images with Video Segments , 2015, 2015 IEEE International Symposium on Multimedia (ISM).