Similar image search for histopathology: SMILY

The increasing availability of large institutional and public histopathology image datasets is enabling the searching of these datasets for diagnosis, research, and education. Although these datasets typically have associated metadata such as diagnosis or clinical notes, even carefully curated datasets rarely contain annotations of the location of regions of interest on each image. As pathology images are extremely large (up to 100,000 pixels in each dimension), further laborious visual search of each image may be needed to find the feature of interest. In this paper, we introduce a deep-learning-based reverse image search tool for histopathology images: Similar Medical Images Like Yours (SMILY). We assessed SMILY’s ability to retrieve search results in two ways: using pathologist-provided annotations, and via prospective studies where pathologists evaluated the quality of SMILY search results. As a negative control in the second evaluation, pathologists were blinded to whether search results were retrieved by SMILY or randomly. In both types of assessments, SMILY was able to retrieve search results with similar histologic features, organ site, and prostate cancer Gleason grade compared with the original query. SMILY may be a useful general-purpose tool in the pathologist’s arsenal, to improve the efficiency of searching large archives of histopathology images, without the need to develop and implement specific tools for each application.

[1]  Karl Rohr,et al.  Predicting breast tumor proliferation from whole‐slide images: The TUPAC16 challenge , 2018, Medical Image Anal..

[2]  I. Ellis,et al.  Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. , 2002, Histopathology.

[3]  Anant Madabhushi,et al.  Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces , 2015, Journal of pathology informatics.

[4]  Yang Song,et al.  Learning Fine-Grained Image Similarity with Deep Ranking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Fabio A. González,et al.  Content-based histopathology image retrieval using a kernel-based semantic annotation framework , 2011, J. Biomed. Informatics.

[6]  Vipin Chaudhary,et al.  Content based sub-image retrieval system for high resolution pathology images using salient interest points , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[7]  Manfredo Atzori,et al.  Deep Learning-Based Retrieval System for Gigapixel Histopathology Cases and the Open Access Literature , 2018, bioRxiv.

[8]  Thomas J. Fuchs,et al.  Terabyte-scale Deep Multiple Instance Learning for Classification and Localization in Pathology , 2018, ArXiv.

[9]  P. Harms,et al.  Histologic Mimics of Basal Cell Carcinoma. , 2017, Archives of pathology & laboratory medicine.

[10]  Daniel Fabbri,et al.  Toward content-based image retrieval with deep convolutional neural networks , 2015, Medical Imaging.

[11]  Ellery Wulczyn,et al.  Development and validation of a deep learning algorithm for improving Gleason scoring of prostate cancer , 2018, npj Digital Medicine.

[12]  James Zijun Wang,et al.  Pathfinder: multiresolution region-based searching of pathology images using IRM , 2000, AMIA.

[13]  Jin Tae Kwak,et al.  Automated prostate tissue referencing for cancer detection and diagnosis , 2016, BMC Bioinformatics.

[14]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Daisuke Komura,et al.  Luigi: Large-scale histopathological image retrieval system using deep texture representations , 2018, bioRxiv.

[16]  Zhiguo Jiang,et al.  Histopathological Whole Slide Image Analysis Using Context-Based CBIR , 2018, IEEE Transactions on Medical Imaging.

[17]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[18]  Lin Yang,et al.  Content-based histopathology image retrieval using CometCloud , 2014, BMC Bioinformatics.

[19]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[20]  Sung Wook Baik,et al.  SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs , 2017, PloS one.

[21]  Marc Berndl,et al.  Improving Phenotypic Measurements in High-Content Imaging Screens , 2017, bioRxiv.

[22]  Szilárd Vajda,et al.  FaceMatch: Real-World Face Image Retrieval , 2016, RTIP2R.

[23]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[24]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[25]  Antoine Geissbühler,et al.  A Review of Content{Based Image Retrieval Systems in Medical Applications { Clinical Bene(cid:12)ts and Future Directions , 2022 .

[26]  Martin Wattenberg,et al.  Human-Centered Tools for Coping with Imperfect Algorithms During Medical Decision-Making , 2019, CHI.

[27]  Luca Maria Gambardella,et al.  Assessment of algorithms for mitosis detection in breast cancer histopathology images , 2014, Medical Image Anal..

[28]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[29]  Hayit Greenspan,et al.  Content-Based Image Retrieval in Radiology: Current Status and Future Directions , 2010, Journal of Digital Imaging.

[30]  Clive R. Taylor,et al.  Whole Slide Imaging Versus Microscopy for Primary Diagnosis in Surgical Pathology , 2017, The American journal of surgical pathology.

[31]  Anant Madabhushi,et al.  Out-of-Sample Extrapolation utilizing Semi-Supervised Manifold Learning (OSE-SSL): Content Based Image Retrieval for Histopathology Images , 2016, Scientific Reports.

[32]  Shahryar Rahnamayan,et al.  Classification and Retrieval of Digital Pathology Scans: A New Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[33]  Sos Agaian,et al.  Computer-Aided Prostate Cancer Diagnosis From Digitized Histopathology: A Review on Texture-Based Systems , 2015, IEEE Reviews in Biomedical Engineering.

[34]  Krassimira Ivanova Content-Based Image Retrieval in Digital Libraries of Art Images Utilizing Colour Semantics , 2011, TPDL.

[35]  Meyke Hermsen,et al.  1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset , 2018, GigaScience.