Diverse feature visualizations reveal invariances in early layers of deep neural networks

Visualizing features in deep neural networks (DNNs) can help us understand their computations. Many previous studies have aimed to visualize the selectivity of individual units by finding meaningful images that maximize their activation. Comparatively little attention, however, has been paid to visualizing which image transformations leave the responses of units in DNNs unchanged. Here we propose a method to discover invariances in the responses of hidden-layer units of deep neural networks. Our approach simultaneously searches for a batch of images that strongly activate a unit while being as distinct from each other as possible. We find that even early convolutional layers in VGG-19 exhibit various forms of response invariance: near-perfect phase invariance in some units and invariance to local diffeomorphic transformations in others. At the same time, we uncover representational differences with ResNet-50 in its corresponding layers. We conclude that invariance transformations are a major computational component learned by DNNs, and we provide a systematic method to study them.
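The core objective — jointly optimizing a batch of inputs so that each strongly drives a unit while the batch stays mutually diverse — can be sketched in a few lines. The sketch below is a toy illustration only, not the paper's implementation: a fixed random linear filter `w` stands in for a trained VGG-19 unit, the diversity term is a simple pairwise dot-product repulsion with a hypothetical weight `lam`, and inputs are kept on the unit sphere by renormalization. The paper's actual objective, diversity measure, and optimizer may differ.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 64                       # flattened "image" dimension (toy size, not from the paper)
w = rng.standard_normal(d)   # stand-in linear filter for a hidden unit
w /= np.linalg.norm(w)

# Batch of inputs optimized jointly; each row is one candidate visualization.
batch = rng.standard_normal((4, d))
batch /= np.linalg.norm(batch, axis=1, keepdims=True)

lam, lr = 0.5, 0.1           # diversity weight and step size (illustrative values)
for _ in range(200):
    # Gradient of  sum_i w.x_i  -  lam * sum_{i != j} x_i.x_j  w.r.t. each x_i:
    # attraction toward the filter, repulsion from the other batch members.
    grad = w - lam * (batch.sum(axis=0) - batch)
    batch += lr * grad
    batch /= np.linalg.norm(batch, axis=1, keepdims=True)  # project back to unit sphere

activations = batch @ w       # how strongly each image drives the unit
sims = batch @ batch.T        # pairwise similarities: low off-diagonal = diverse batch
```

Without the repulsion term (`lam = 0`), all batch members collapse onto the single maximally activating input; with it, they settle into distinct directions that still drive the unit, which is what exposes the transformations the unit is invariant to.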
