暂无分享,去创建一个
Yoshua Bengio | Aaron C. Courville | Alex Lamb | Guillaume Alain | Chinnadhurai Sankar | Yoshua Bengio | Guillaume Alain | Alex Lamb | Chinnadhurai Sankar
[1] Ian J. Goodfellow,et al. Efficient Per-Example Gradient Computations , 2015, ArXiv.
[2] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.
[3] David J. C. MacKay,et al. Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.
[4] Xinyun Chen. Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .
[5] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[6] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .
[7] Robert E. Kass,et al. Importance sampling: a review , 2010 .
[8] Tong Zhang,et al. Stochastic Optimization with Importance Sampling , 2014, ArXiv.
[9] Itamar Arel,et al. Low-Rank Approximations for Conditional Feedforward Computation in Deep Neural Networks , 2013, ICLR.
[10] Jürgen Schmidhuber,et al. Compete to Compute , 2013, NIPS.
[11] Guillaume Bouchard,et al. Accelerating Stochastic Gradient Descent via Online Learning to Sample , 2015, ArXiv.
[12] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.
[13] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[14] Yijun Huang,et al. Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization , 2015, NIPS.
[15] Stephen J. Wright,et al. Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.
[16] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[17] John C. Duchi,et al. Distributed delayed stochastic optimization , 2011, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).