论文信息 - The Building Blocks of Interpretability - 字舞流文

The Building Blocks of Interpretability

Arvind Satyanarayan | Ian Johnson | Alexander Mordvintsev | Shan Carter | Ludwig Schubert | Christopher Olah | Katherine Q. Ye | C. Olah | A. Mordvintsev | Shan Carter | Ludwig Schubert | I. Johnson | Arvind Satyanarayan

[1] Dumitru Erhan,et al. The (Un)reliability of saliency methods , 2017, Explainable AI.

[2] Martin Wattenberg,et al. TCAV: Relative concept importance testing with Linear Concept Activation Vectors , 2018 .

[3] Andrea Vedaldi,et al. Net2Vec: Quantifying and Explaining How Concepts are Encoded by Filters in Deep Neural Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Xiaoming Liu,et al. Do Convolutional Neural Networks Learn Class Hierarchy? , 2017, IEEE Transactions on Visualization and Computer Graphics.

[5] Minsuk Kahng,et al. ActiVis: Visual Exploration of Industry-Scale Deep Neural Network Models , 2017, IEEE Transactions on Visualization and Computer Graphics.

[6] Alexander M. Rush,et al. LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks , 2016, IEEE Transactions on Visualization and Computer Graphics.

[7] Shan Carter,et al. Using Artificial Intelligence to Augment Human Intelligence , 2017 .

[8] Jascha Sohl-Dickstein,et al. SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability , 2017, NIPS.

[9] Shane Legg,et al. Deep Reinforcement Learning from Human Preferences , 2017, NIPS.

[10] Klaus-Robert Müller,et al. PatternNet and PatternLRP - Improving the interpretability of neural networks , 2017, ArXiv.

[11] Bolei Zhou,et al. Network Dissection: Quantifying Interpretability of Deep Visual Representations , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Andrea Vedaldi,et al. Interpretable Explanations of Black Boxes by Meaningful Perturbation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[13] Percy Liang,et al. Understanding Black-box Predictions via Influence Functions , 2017, ICML.

[14] Ankur Taly,et al. Axiomatic Attribution for Deep Networks , 2017, ICML.

[15] Yoshua Bengio,et al. Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Abhishek Das,et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[17] Vincent Dumoulin,et al. Deconvolution and Checkerboard Artifacts , 2016 .

[18] Kenney Ng,et al. Interacting with Predictions: Visual Inspection of Black-box Machine Learning Models , 2016, CHI.

[19] Bolei Zhou,et al. Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[21] David J. Fleet,et al. Adversarial Manipulation of Deep Representations , 2015, ICLR.

[22] Hod Lipson,et al. Understanding Neural Networks Through Deep Visualization , 2015, ArXiv.

[23] David Maxwell Chickering,et al. ModelTracker: Redesigning Performance Analysis Tools for Machine Learning , 2015, CHI.

[24] Thomas Brox,et al. Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[25] Jason Yosinski,et al. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Alexander Mordvintsev,et al. Inceptionism: Going Deeper into Neural Networks , 2015 .

[28] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[29] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.

[30] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[31] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[32] Desney S. Tan,et al. Interactive optimization for steering machine classification , 2010, CHI.

[33] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[34] Deborah Silver,et al. Feature Visualization , 1994, Scientific Visualization.

[35] Jock D. Mackinlay,et al. Automating the design of graphical presentations of relational information , 1986, TOGS.