Fast and Scalable Image Search For Histology

The expanding adoption of digital pathology has enabled the curation of large repositories of histology whole slide images (WSIs), which contain a wealth of information. Searching for similar pathology images offers the opportunity to comb through large historical repositories of gigapixel WSIs to identify cases with similar morphological features, and can be particularly useful for diagnosing rare diseases and for identifying similar cases to predict prognosis, treatment outcomes and potential clinical trial success. A critical challenge in developing a WSI search and retrieval system is scalability: the system must search a growing number of slides, each of which can consist of billions of pixels and be several gigabytes in size. Existing systems are typically slow, and their retrieval time often grows with the size of the repository being searched, which makes clinical adoption tedious and makes them infeasible for repositories that are constantly growing. Here we present Fast Image Search for Histopathology (FISH), a histology image search pipeline that is infinitely scalable and achieves constant search speed that is independent of the image database size, while being interpretable and without requiring detailed annotations. FISH uses self-supervised deep learning to encode meaningful representations from WSIs and a Van Emde Boas tree for fast search, followed by an uncertainty-based ranking algorithm to retrieve similar WSIs. We evaluated FISH on multiple tasks and datasets with over 22,000 patient cases spanning 56 disease subtypes. We additionally demonstrate that FISH can be used to assist with the diagnosis of rare cancer types for which sufficient cases may not be available to train traditional supervised deep models. FISH is available as an easy-to-use, open source software package (https://github.com/mahmoodlab/FISH).
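
To illustrate why a Van Emde Boas tree gives lookup times that do not depend on how many items are stored, the following is a minimal Python sketch of the data structure, not the FISH implementation. It assumes that each image patch has already been encoded as an integer key in a fixed universe [0, 2**bits); how FISH derives such keys is not described in this abstract. The names `VEB` and `nearest` are hypothetical. Insert, successor and predecessor each take O(log log M) steps for a universe of size M, regardless of the number of keys in the tree.

```python
class VEB:
    """Van Emde Boas tree over integer keys in [0, 2**bits). No deletion."""

    def __init__(self, bits):
        self.bits = bits
        self.min = None  # smallest key; kept here, not inside any cluster
        self.max = None  # largest key
        if bits > 1:
            self.lo_bits = bits // 2
            self.hi_bits = bits - self.lo_bits
            self.clusters = {}   # high part -> VEB(lo_bits), created lazily
            self.summary = None  # VEB(hi_bits) over non-empty cluster indices

    def _split(self, x):
        return x >> self.lo_bits, x & ((1 << self.lo_bits) - 1)

    def _join(self, hi, lo):
        return (hi << self.lo_bits) | lo

    def insert(self, x):
        if self.min is None:
            self.min = self.max = x
            return
        if x == self.min:
            return
        if x < self.min:
            x, self.min = self.min, x  # new minimum; push the old one down
        if self.bits > 1:
            hi, lo = self._split(x)
            cluster = self.clusters.get(hi)
            if cluster is None:
                cluster = self.clusters[hi] = VEB(self.lo_bits)
                if self.summary is None:
                    self.summary = VEB(self.hi_bits)
                self.summary.insert(hi)
            cluster.insert(lo)  # O(1) when the cluster was just created
        if x > self.max:
            self.max = x

    def __contains__(self, x):
        if x == self.min or x == self.max:
            return True
        if self.bits == 1:
            return False
        hi, lo = self._split(x)
        cluster = self.clusters.get(hi)
        return cluster is not None and lo in cluster

    def successor(self, x):
        """Smallest stored key strictly greater than x, or None."""
        if self.min is not None and x < self.min:
            return self.min
        if self.bits == 1:
            return 1 if (x == 0 and self.max == 1) else None
        hi, lo = self._split(x)
        cluster = self.clusters.get(hi)
        if cluster is not None and lo < cluster.max:
            return self._join(hi, cluster.successor(lo))
        nxt = self.summary.successor(hi) if self.summary is not None else None
        if nxt is None:
            return None
        return self._join(nxt, self.clusters[nxt].min)

    def predecessor(self, x):
        """Largest stored key strictly smaller than x, or None."""
        if self.max is not None and x > self.max:
            return self.max
        if self.bits == 1:
            return 0 if (x == 1 and self.min == 0) else None
        hi, lo = self._split(x)
        cluster = self.clusters.get(hi)
        if cluster is not None and lo > cluster.min:
            return self._join(hi, cluster.predecessor(lo))
        prv = self.summary.predecessor(hi) if self.summary is not None else None
        if prv is None:
            # Only the tree-level minimum (not stored in a cluster) can remain.
            return self.min if (self.min is not None and x > self.min) else None
        return self._join(prv, self.clusters[prv].max)


def nearest(tree, x):
    """Stored key closest to x (x itself if present), or None if tree is empty."""
    if x in tree:
        return x
    candidates = [k for k in (tree.predecessor(x), tree.successor(x)) if k is not None]
    return min(candidates, key=lambda k: abs(k - x)) if candidates else None


# Usage with made-up integer keys standing in for encoded patches.
tree = VEB(16)
for key in [3, 500, 40000, 41000, 65535]:
    tree.insert(key)
print(nearest(tree, 40900))  # 41000
```

In a retrieval setting, the keys returned by `nearest` would then be mapped back to the slides they came from and ranked; that ranking step is outside the scope of this sketch.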
