论文信息 - InfoMask: Masked Variational Latent Representation to Localize Chest Disease

InfoMask: Masked Variational Latent Representation to Localize Chest Disease

The scarcity of richly annotated medical images is limiting supervised deep learning based solutions to medical image analysis tasks, such as localizing discriminatory radiomic disease signatures. Therefore, it is desirable to leverage unsupervised and weakly supervised models. Most recent weakly supervised localization methods apply attention maps or region proposals in a multiple instance learning formulation. While attention maps can be noisy, leading to erroneously highlighted regions, it is not simple to decide on an optimal window/bag size for multiple instance learning approaches. In this paper, we propose a learned spatial masking mechanism to filter out irrelevant background signals from attention maps. The proposed method minimizes mutual information between a masked variational representation and the input while maximizing the information between the masked representation and class labels. This results in more accurate localization of discriminatory regions. We tested the proposed model on the ChestX-ray8 dataset to localize pneumonia from chest X-ray images without using any pixel-level or bounding-box annotations.

[1] Chong Wang,et al. Weakly Supervised Object Localization with Latent Category Learning , 2014, ECCV.

[2] Zaïd Harchaoui,et al. On learning to localize objects with minimal supervision , 2014, ICML.

[3] Jinjun Xiong,et al. TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection , 2018, ECCV.

[4] Yang Wang,et al. Attention Networks for Weakly Supervised Object Localization , 2016, BMVC.

[5] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.

[6] Cengiz Öztireli,et al. Towards better understanding of gradient-based attribution methods for Deep Neural Networks , 2017, ICLR.

[7] Kyunghyun Cho,et al. Classifier-agnostic saliency map extraction , 2020, Comput. Vis. Image Underst..

[8] Ruoyu Li,et al. Weakly Supervised Deep Learning for Thoracic Disease Classification and Localization on Chest X-rays , 2018, BCB.

[9] Yong Jae Lee,et al. Weakly-supervised Discovery of Visual Pattern Configurations , 2014, NIPS.

[10] Qitao Huang,et al. Weakly Supervised Deep Learning for Whole Slide Lung Cancer Image Analysis , 2020, IEEE Transactions on Cybernetics.

[11] Ramprasaath R. Selvaraju,et al. Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization , 2016 .

[12] Markus H. Gross,et al. Gradient-Based Attribution Methods , 2019, Explainable AI.

[13] Qitao Huang,et al. Weakly Supervised Learning for Whole Slide Lung Cancer Image Classification , 2018 .

[14] B. S. Manjunath,et al. Weakly Supervised Localization Using Deep Feature Maps , 2016, ECCV.

[15] Daphne Koller,et al. Self-Paced Learning for Latent Variable Models , 2010, NIPS.

[16] Martin Wattenberg,et al. SmoothGrad: removing noise by adding noise , 2017, ArXiv.

[17] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.

[18] Ronald M. Summers,et al. ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[19] Lijie Fan,et al. Adversarial Localization Network , 2017 .

[20] Kyunghyun Cho,et al. Classifier-agnostic saliency map extraction , 2018, AAAI.

[21] Avanti Shrikumar,et al. Learning Important Features Through Propagating Activation Differences , 2017, ICML.

[22] Andrea Vedaldi,et al. Weakly Supervised Deep Detection Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Andrew Y. Ng,et al. CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[24] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.