Deep Nets Don't Learn via Memorization
暂无分享,去创建一个
Tegan Maharaj | Aaron C. Courville | Stanislaw Jastrzebski | Asja Fischer | Devansh Arpit | Nicolas Ballas | Emmanuel Bengio | David Krueger | Maxinder S. Kanwal | Nicolas Ballas | David Krueger | Stanislaw Jastrzebski | Devansh Arpit | Asja Fischer | Tegan Maharaj | Emmanuel Bengio
[1] Jorge Nocedal,et al. On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima , 2016, ICLR.
[2] Samy Bengio,et al. Understanding deep learning requires rethinking generalization , 2016, ICLR.
[3] Barak A. Pearlmutter,et al. Automatic Learning Rate Maximization by On-Line Estimation of the Hessian's Eigenvectors , 1992, NIPS 1992.