Generalized Dropout

Deep Neural Networks often require good regularizers to generalize well. Dropout is one such regularizer that is widely used among Deep Learning practitioners. Recent work has shown that Dropout can also be viewed as performing Approximate Bayesian Inference over the network parameters. In this work, we generalize this notion and introduce a rich family of regularizers which we call Generalized Dropout. One set of methods in this family, called Dropout++, is a version of Dropout with trainable parameters; classical Dropout emerges as a special case of this method. Another member of this family selects the width of neural network layers. Experiments show that these methods improve generalization performance over Dropout.
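To make the Dropout++ idea concrete, the sketch below shows one plausible way to implement a dropout layer whose keep probabilities are themselves trainable. The class name TrainableDropout, the sigmoid parameterization of the keep probabilities, and the straight-through gradient estimator are illustrative assumptions for this sketch, not the paper's exact formulation.

```python
import torch
import torch.nn as nn


class TrainableDropout(nn.Module):
    """Dropout with per-unit trainable keep probabilities (illustrative sketch)."""

    def __init__(self, num_features, init_keep_prob=0.5):
        super().__init__()
        # Store keep probabilities as unconstrained logits so optimization cannot
        # push them outside (0, 1); sigmoid recovers the probability.
        init_logit = torch.logit(torch.tensor(float(init_keep_prob)))
        self.logits = nn.Parameter(torch.full((num_features,), init_logit.item()))

    def forward(self, x):
        keep_prob = torch.sigmoid(self.logits)
        if self.training:
            # Sample hard Bernoulli gates, then apply the straight-through trick so
            # gradients reach keep_prob even though sampling is non-differentiable.
            hard_mask = torch.bernoulli(keep_prob.expand_as(x))
            mask = hard_mask + keep_prob - keep_prob.detach()
            return x * mask
        # At test time, scale by the expected gate value, as in standard Dropout.
        return x * keep_prob


# Usage: a hidden layer followed by the trainable dropout gate.
layer = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), TrainableDropout(256))
```

If the logits are frozen at their initial value, the layer behaves like ordinary Dropout with a fixed rate, which mirrors the abstract's claim that classical Dropout is a special case.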
