Interpretable multimodal deep learning for real-time pan-tissue pan-disease pathology search on social media

Pathologists are responsible for rapidly providing a diagnosis on critical health issues. Challenging cases benefit from additional opinions of pathologist colleagues. In addition to on-site colleagues, there is an active worldwide community of pathologists on social media for complementary opinions. Such access to pathologists worldwide has the capacity to improve diagnostic accuracy and generate broader consensus on next steps in patient care. From Twitter we curate 13,626 images from 6,351 tweets from 25 pathologists from 13 countries. We supplement the Twitter data with 113,161 images from 1,074,484 PubMed articles. We develop machine learning and deep learning models to (i) accurately identify histopathology stains, (ii) discriminate between tissues, and (iii) differentiate disease states. Area Under Receiver Operating Characteristic is 0.805-0.996 for these tasks. We repurpose the disease classifier to search for similar disease states given an image and clinical covariates. We report precision@k=1 = 0.7618±0.0018 (chance 0.397±0.004, mean±stdev). The classifiers find texture and tissue are important clinico-visual features of disease. Deep features trained only on natural images (e.g. cats and dogs) substantially improved search performance, while pathology-specific deep features and cell nuclei features further improved search to a lesser extent. We implement a social media bot (@pathobot on Twitter) to use the trained classifiers to aid pathologists in obtaining real-time feedback on challenging cases. If a social media post containing pathology text and images mentions the bot, the bot generates quantitative predictions of disease state (normal/artifact/infection/injury/nontumor, pre-neoplastic/benign/ low-grade-malignant-potential, or malignant) and lists similar cases across social media and PubMed. Our project has become a globally distributed expert system that facilitates pathological diagnosis and brings expertise to underserved regions or hospitals with less expertise in a particular disease. This is the first pan-tissue pan-disease (i.e. from infection to malignancy) method for prediction and search on social media, and the first pathology study prospectively tested in public on social media. We will share data throughpathobotology.org. We expect our project to cultivate a more connected world of physicians and improve patient care worldwide.

[1]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Allen R. Hanson,et al.  Computer Vision Systems , 1978 .

[3]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[4]  H Nazeran,et al.  Biomedical image processing in pathology: a review. , 1995, Australasian physical & engineering sciences in medicine.

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  James Zijun Wang,et al.  Pathfinder: multiresolution region-based searching of pathology images using IRM , 2000, AMIA.

[8]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[9]  Hyunjoong Kim,et al.  Classification Trees With Unbiased Multiway Splits , 2001 .

[10]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  G. Tang,et al.  Indian Hedgehog: A Mechanotransduction Mediator in Condylar Cartilage , 2004, Journal of dental research.

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[15]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[17]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[19]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[20]  Yiannis S. Boutalis,et al.  FCTH: Fuzzy Color and Texture Histogram - A Low Level Feature for Accurate Image Retrieval , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[21]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[22]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[23]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[24]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[25]  Andrew Zisserman,et al.  Efficient Visual Search of Videos Cast as Text Retrieval , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Vipin Chaudhary,et al.  Content based sub-image retrieval system for high resolution pathology images using salient interest points , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[27]  Jerad M Gardner,et al.  Diagnostic Approach and Prognostic Factors of Cancers , 2003, Advances in anatomic pathology.

[28]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[29]  Joachim M. Buhmann,et al.  Computational Pathology: Challenges and Promises for Tissue Analysis , 2015, Comput. Medical Imaging Graph..

[30]  O. Sanli,et al.  Oxidized regenerated cellulose granuloma mimicking recurrent mass lesion after laparoscopic nephron sparing surgery. , 2012, International journal of surgery case reports.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Matti Pietikäinen,et al.  Identification of tumor epithelium and stroma in tissue microarrays using texture analysis , 2012, Diagnostic Pathology.

[33]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[34]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[35]  Lin Yang,et al.  Content-based histopathology image retrieval using CometCloud , 2014, BMC Bioinformatics.

[36]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[37]  Jen-Hao Hsiao,et al.  Deep learning of binary hash codes for fast image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[38]  Anant Madabhushi,et al.  Content-based image retrieval of digitized histopathology in boosted spectrally embedded spaces , 2015, Journal of pathology informatics.

[39]  Sos Agaian,et al.  Computer-Aided Prostate Cancer Diagnosis From Digitized Histopathology: A Review on Texture-Based Systems , 2015, IEEE Reviews in Biomedical Engineering.

[40]  Wei Liu,et al.  Towards Large-Scale Histopathological Image Analysis: Hashing-Based Image Retrieval , 2015, IEEE Transactions on Medical Imaging.

[41]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[42]  Johannes Gehrke,et al.  Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[43]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Janne Heikkilä,et al.  Transfer Learning for Cell Nuclei Classification in Histopathology Images , 2016, ECCV Workshops.

[45]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Michael C. Montalto,et al.  An industry perspective: An update on the adoption of whole slide imaging , 2016, Journal of pathology informatics.

[47]  Francesco Bianconi,et al.  Multi-class texture analysis in colorectal cancer histology , 2016, Scientific Reports.

[48]  Andrew J. Schaumberg,et al.  DeepScope: Nonintrusive Whole Slide Saliency Annotation and Prediction from Pathologists at the Microscope , 2016, bioRxiv.

[49]  Un Desa Transforming our world : The 2030 Agenda for Sustainable Development , 2016 .

[50]  G. Crane,et al.  Pathology Image-Sharing on Social Media: Recommendations for Protecting Privacy While Motivating Education. , 2016, AMA journal of ethics.

[51]  Nima Tajbakhsh,et al.  Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? , 2016, IEEE Transactions on Medical Imaging.

[52]  D. Gleason,et al.  PREDICTION OF PROGNOSIS FOR PROSTATIC ADENOCARCINOMA BY COMBINED HISTOLOGICAL GRADING AND CLINICAL STAGING , 2017, The Journal of urology.

[53]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[54]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[55]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[56]  Shuang Bai,et al.  Growing random forest on deep convolutional neural networks for scene categorization , 2017, Expert Syst. Appl..

[57]  S. Joseph Sirintrapun,et al.  DeepScope: Nonintrusive Whole Slide Saliency Annotation and Prediction from Pathologists at the Microscope , 2017 .

[58]  F. Cabitza,et al.  Unintended Consequences of Machine Learning in Medicine , 2017, JAMA.

[59]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[60]  Henning Müller,et al.  The Parallel Distributed Image Search Engine (ParaDISE) , 2017, ArXiv.

[61]  Zhiguo Jiang,et al.  Histopathological Whole Slide Image Analysis Using Context-Based CBIR , 2018, IEEE Transactions on Medical Imaging.

[62]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[63]  Matthew D. Klimek,et al.  Neural network-based approach to phase space integration , 2018, SciPost Physics.

[64]  Hongyi Zhang,et al.  mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[65]  Daisuke Komura,et al.  Luigi: Large-scale histopathological image retrieval system using deep texture representations , 2018, bioRxiv.

[66]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[67]  R. Wu,et al.  #EBUSTwitter: Novel Use of Social Media for Conception, Coordination and Completion of an International, Multi-Center Pathology Study , 2018, Journal of the American Society of Cytopathology.

[68]  D. Anthony,et al.  Neuropathology Education Using Social Media , 2018, Journal of neuropathology and experimental neurology.

[69]  J. Gardner,et al.  Effective use of Twitter and Facebook in pathology practice. , 2018, Human pathology.

[70]  Samuel J. Yang,et al.  In Silico Labeling: Predicting Fluorescent Labels in Unlabeled Images , 2018, Cell.

[71]  Andrew J. Schaumberg,et al.  D R A F T H&E-stained Whole Slide Image Deep Learning Predicts SPOP Mutation State in Prostate Cancer , 2017 .

[72]  Geraint Rees,et al.  Clinically applicable deep learning for diagnosis and referral in retinal disease , 2018, Nature Medicine.

[73]  Manfredo Atzori,et al.  Deep Learning-Based Retrieval System for Gigapixel Histopathology Cases and the Open Access Literature , 2018, bioRxiv.

[74]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[75]  F. Dirilenoglu,et al.  A welcoming guide to social media for cytopathologists: Tips, tricks, and the best practices of social cytopathology , 2019, CytoJournal.

[76]  T. Allen,et al.  Keep Calm and Tweet On: Legal and Ethical Considerations for Pathologists Using Social Media. , 2018, Archives of pathology & laboratory medicine.

[77]  Finale Doshi-Velez,et al.  Evaluating Machine Learning Articles. , 2019, JAMA.

[78]  Nassir Navab,et al.  Multi-task Learning of a Deep K-Nearest Neighbour Network for Histopathological Image Classification and Retrieval , 2019, MICCAI.

[79]  Po-Hsuan Cameron Chen,et al.  How to Read Articles That Use Machine Learning: Users' Guides to the Medical Literature. , 2019, JAMA.

[80]  Tanmoy Bhattacharya,et al.  The need for uncertainty quantification in machine-assisted medical decision making , 2019, Nat. Mach. Intell..

[81]  Thomas J. Fuchs,et al.  Clinical-grade computational pathology using weakly supervised deep learning on whole slide images , 2019, Nature Medicine.

[82]  Xi Chen,et al.  Machine learning to predict the long-term risk of myocardial infarction and cardiac death based on clinical risk, coronary calcium, and epicardial adipose tissue: a prospective study. , 2019, Cardiovascular research.

[83]  Karl Rohr,et al.  Predicting breast tumor proliferation from whole‐slide images: The TUPAC16 challenge , 2018, Medical Image Anal..

[84]  Daniel Smilkov,et al.  Similar image search for histopathology: SMILY , 2019, npj Digital Medicine.

[85]  Navid Farahani,et al.  A Practical Guide to Whole Slide Imaging: A White Paper From the Digital Pathology Association. , 2018, Archives of pathology & laboratory medicine.

[86]  Interpretable multimodal deep learning for real-time pan-tissue pan-disease pathology search on social media , 2020, Modern Pathology.