A smart atlas for endomicroscopy using automated video retrieval

To support the challenging task of early epithelial cancer diagnosis from in vivo endomicroscopy, we propose a content-based video retrieval method that uses an expert-annotated database. Motivated by the recent successes of non-medical content-based image retrieval, we first adjust the standard Bag-of-Visual-Words method to handle single endomicroscopic images. A local dense multi-scale description is proposed to keep the proper level of invariance, in our case to translations, in-plane rotations and affine transformations of the intensities. Since single images may have an insufficient field-of-view to make a robust diagnosis, we introduce a video-mosaicing technique that provides large field-of-view mosaic images. To remove outliers, retrieval is followed by a geometrical approach that captures a statistical description of the spatial relationships between the local features. Building on image retrieval, we then focus on efficient video retrieval. Our approach avoids the time-consuming parts of the video-mosaicing by relying on coarse registration results only to account for spatial overlap between images taken at different times. To evaluate the retrieval, we perform a simple nearest neighbors classification with leave-one-patient-out cross-validation. From the results of binary and multi-class classification, we show that our approach outperforms, with statistical significance, several state-of-the art methods. We obtain a binary classification accuracy of 94.2%, which is quite close to clinical expectations.

[1]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[2]  Jeffrey J Rodríguez,et al.  Computer-aided identification of ovarian cancer in confocal microendoscope images. , 2008, Journal of biomedical optics.

[3]  Bill Triggs,et al.  Multilevel Image Coding with Hyperfeatures , 2008, International Journal of Computer Vision.

[4]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[5]  Nicholas Ayache,et al.  Endomicroscopic video retrieval using mosaicing and visualwords , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[6]  Nicholas Ayache,et al.  Robust mosaicing with correction of motion distortions and tissue deformations for in vivo fibered microscopy , 2006, Medical Image Anal..

[7]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[8]  Nicholas Ayache,et al.  Endomicroscopic image retrieval and classification using invariant visual features , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[9]  Nicholas Ayache,et al.  Introducing Space and Time in Local Feature-Based Endomicroscopic Image Retrieval , 2009, MCBR-CDS.

[10]  Michael B. Wallace,et al.  Comparison of probe-based confocal laser endomicroscopy with virtual chromoendoscopy for classification of colon polyps. , 2010, Gastroenterology.

[11]  Michael B. Wallace,et al.  High-Resolution Confocal Endomicroscopy Probe System for In Vivo Diagnosis of Colorectal Neoplasia , 2008 .

[12]  Andrew Zisserman,et al.  Video Google: Efficient Visual Search of Videos , 2006, Toward Category-Level Object Recognition.

[13]  James S. Duncan,et al.  Medical Image Analysis , 1999, IEEE Pulse.

[14]  Antoine Geissbühler,et al.  A Review of Content{Based Image Retrieval Systems in Medical Applications { Clinical Bene(cid:12)ts and Future Directions , 2022 .

[15]  Heinz-Otto Peitgen,et al.  Medical image analysis , 1999, Medical Image Anal..

[16]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[17]  Florent Perronnin,et al.  Fisher Kernels on Visual Vocabularies for Image Categorization , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Nicholas Ayache,et al.  Towards Optical Biopsies with an Integrated Fibered Confocal Fluorescence Microscope , 2004, MICCAI.

[19]  Kim L. Boyer,et al.  Computer-aided evaluation of neuroblastoma on whole-slide histology images: Classifying grade of neuroblastic differentiation , 2009, Pattern Recognit..

[20]  Murat Dundar,et al.  A methodology for training and validating a CAD system and potential pitfalls , 2004, CARS.

[21]  Andrew Zisserman,et al.  Efficient Visual Search of Videos Cast as Text Retrieval , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Michael B. Wallace,et al.  T1166 Interobserver Agreement and Accuracy Among International Experts of Probe-Based Confocal Laser Microscopy (pCLE) in Predicting Colorectal Neoplasia , 2009 .

[23]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[24]  Luc Van Gool,et al.  Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[25]  Paul Fockens,et al.  Probe-based confocal laser endomicroscopy. , 2009, Gastroenterology.

[26]  Gang Hua,et al.  Descriptive visual words and visual phrases for image applications , 2009, ACM Multimedia.

[27]  C. Prinz,et al.  High-resolution miniprobe-based confocal microscopy in combination with video mosaicing (with video). , 2007, Gastrointestinal endoscopy.

[28]  Anant Madabhushi,et al.  A Boosting Cascade for Automated Detection of Prostate Cancer from Digitized Histology , 2006, MICCAI.

[29]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Josiane Zerubia,et al.  Estimation of Markov random field prior parameters using Markov chain Monte Carlo maximum likelihood , 1999, IEEE Trans. Image Process..

[31]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[32]  M. Vieth,et al.  Miniprobe confocal laser microscopy for the detection of invisible neoplasia in patients with Barrett’s oesophagus , 2008, Gut.

[33]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[34]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[35]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[36]  Michael Werman,et al.  Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[37]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[38]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[39]  D. Sheskin Handbook of Parametric and Nonparametric Statistical Procedures: Third Edition , 2000 .

[40]  Simon K Lo,et al.  The learning curve of in vivo probe-based confocal laser endomicroscopy for prediction of colorectal neoplasia. , 2009, Gastrointestinal endoscopy.

[41]  Cordelia Schmid,et al.  Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.

[42]  Roberta Piroddi,et al.  Texture recognition from sparsely and irregularly sampled data , 2006, Comput. Vis. Image Underst..

[43]  Andreas Uhl,et al.  Improving Pit-Pattern Classification of Endoscopy Images by a Combination of Experts , 2009, MICCAI.

[44]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[46]  Henning Müller,et al.  Overview of the ImageCLEFmed 2008 Medical Image Retrieval Task , 2008, CLEF.