SmoothGrad: removing noise by adding noise

Explaining the output of a deep network remains a challenge. For an image classifier, one type of explanation identifies the pixels that most strongly influence the final decision. A natural starting point is the gradient of the class score with respect to the input image: this gradient can be interpreted as a sensitivity map, and several techniques elaborate on this basic idea. This paper makes two contributions: it introduces SmoothGrad, a simple method that visually sharpens gradient-based sensitivity maps by averaging the gradients of noisy copies of the input, and it discusses lessons learned in visualizing these maps. We release the code for our experiments along with a website showing our results.
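The core of SmoothGrad is to average the sensitivity map over many noise-perturbed copies of the input. A minimal sketch of that averaging step, using a toy analytic score function in place of a real network's gradient (the function `smoothgrad` and the toy `grad_fn` below are illustrative names, not from the paper's released code):

```python
import numpy as np

def smoothgrad(grad_fn, x, n_samples=50, sigma=0.15, seed=0):
    """Average d(score)/d(input) over n_samples noisy copies of x.

    grad_fn: callable returning the gradient of the class score
             with respect to one input.
    sigma:   noise level expressed as a fraction of the input's
             value range, following the paper's convention.
    """
    rng = np.random.default_rng(seed)
    noise_scale = sigma * (x.max() - x.min())
    total = np.zeros_like(x, dtype=float)
    for _ in range(n_samples):
        # Perturb the input with Gaussian noise, then accumulate
        # the gradient of the perturbed input.
        noisy = x + rng.normal(0.0, noise_scale, size=x.shape)
        total += grad_fn(noisy)
    return total / n_samples

# Toy stand-in for a network: score(x) = sum(x**2), whose gradient
# is 2*x, so the smoothed map should stay close to 2*x.
grad_fn = lambda x: 2.0 * x
x = np.linspace(0.0, 1.0, 8)
sal = smoothgrad(grad_fn, x)
```

In practice `grad_fn` would backpropagate a class logit through the network to the input pixels; the averaging is what suppresses the high-frequency fluctuations that make raw gradient maps noisy.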
