Style Memory: Making a Classifier Network Generative

Deep networks have shown excellent performance on classification tasks. However, the parameters learned by a classifier network usually discard stylistic information about the input, in favour of information strictly relevant to classification. We introduce a network that can perform both classification and reconstruction by adding a “style memory” to the output layer of the network. We also show how to train such a network as a deep multi-layer autoencoder, jointly minimizing both classification and reconstruction losses. The generative capacity of our network demonstrates that combining the style-memory neurons with the classifier neurons yields good reconstructions of the input when the classification is correct. We further investigate the nature of the style memory and how it relates to composing digits and letters. Finally, we propose that this architecture enables the bidirectional flow of information used in predictive coding, and that such bidirectional networks can help guard against being fooled by ambiguous or adversarial input.

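The abstract describes the core construction: a classifier whose output layer is augmented with style-memory neurons, with the whole network trained as a deep autoencoder under a joint classification-plus-reconstruction objective. The sketch below shows one way such a model could be wired up in PyTorch. It is a minimal illustration under assumptions made here, not the paper's implementation; names such as StyleMemoryNet, style_dim, and recon_weight are hypothetical.

```python
# Minimal sketch: classifier with a "style memory" appended to the output layer,
# trained with a joint classification + reconstruction loss.
# Architecture details (layer sizes, activations) are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class StyleMemoryNet(nn.Module):
    def __init__(self, in_dim=784, hidden_dim=256, num_classes=10, style_dim=16):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
        self.classifier = nn.Linear(hidden_dim, num_classes)  # class logits
        self.style = nn.Linear(hidden_dim, style_dim)          # "style memory"
        # Decoder reconstructs the input from [class probabilities, style memory]
        self.decoder = nn.Sequential(
            nn.Linear(num_classes + style_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, in_dim), nn.Sigmoid(),
        )

    def forward(self, x):
        h = self.encoder(x)
        logits = self.classifier(h)
        style = torch.sigmoid(self.style(h))
        recon = self.decoder(torch.cat([F.softmax(logits, dim=1), style], dim=1))
        return logits, style, recon

def joint_loss(logits, recon, x, y, recon_weight=1.0):
    # Jointly minimize classification and reconstruction losses.
    return F.cross_entropy(logits, y) + recon_weight * F.mse_loss(recon, x)
```

A training step would run a batch through the model, compute joint_loss, and backpropagate. Because the decoder sees only the class vector concatenated with the style memory, good reconstruction of the input depends on the classification being correct, which is the behaviour the abstract highlights.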