Deep Boltzmann Machines and the Centering Trick
[1] Geoffrey E. Hinton,et al. Learning and relearning in Boltzmann machines , 1986 .
[2] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .
[3] Boris Polyak,et al. Acceleration of stochastic approximation by averaging , 1992 .
[4] G. Kane. Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .
[5] Barak A. Pearlmutter. Fast Exact Multiplication by the Hessian , 1994, Neural Computation.
[6] Nicol N. Schraudolph,et al. Centering Neural Network Gradient Factors , 1996, Neural Networks.
[7] Bernhard Schölkopf,et al. Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.
[8] Gunnar Rätsch,et al. Input space versus feature space in kernel-based methods , 1999, IEEE Trans. Neural Networks.
[9] Radford M. Neal. Annealed importance sampling , 1998, Stat. Comput..
[10] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.
[11] Ruslan Salakhutdinov,et al. On the quantitative analysis of deep belief networks , 2008, ICML '08.
[12] Joachim M. Buhmann,et al. On Relevant Dimensions in Kernel Feature Spaces , 2008, J. Mach. Learn. Res..
[13] Tijmen Tieleman,et al. Training restricted Boltzmann machines using approximations to the likelihood gradient , 2008, ICML '08.
[14] R. Salakhutdinov. Learning and Evaluating Boltzmann Machines , 2008 .
[15] Geoffrey E. Hinton,et al. Using fast weights to improve persistent contrastive divergence , 2009, ICML '09.
[16] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.
[17] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[18] Tapani Raiko,et al. Enhanced Gradient and Adaptive Learning Rate for Training Restricted Boltzmann Machines , 2011, ICML.
[19] Ilya Sutskever,et al. Data Normalization in the Learning of Restricted Boltzmann Machines , 2011 .
[20] Klaus-Robert Müller,et al. Kernel Analysis of Deep Networks , 2011, J. Mach. Learn. Res..
[21] Wei Xu,et al. Towards Optimal One Pass Large Scale Learning with Averaged Stochastic Gradient Descent , 2011, ArXiv.
[22] Klaus-Robert Müller,et al. Deep Boltzmann Machines as Feed-Forward Hierarchies , 2012, AISTATS.
[23] Grégoire Montavon,et al. Neural Networks: Tricks of the Trade , 2012, Lecture Notes in Computer Science.
[24] Klaus-Robert Müller,et al. Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.
[25] Geoffrey E. Hinton,et al. An Efficient Learning Procedure for Deep Boltzmann Machines , 2012, Neural Computation.
[26] Geoffrey E. Hinton. A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.