暂无分享,去创建一个
This technical report describes an efficient technique for computing the norm of the gradient of the loss function for a neural network with respect to its parameters. This gradient norm can be computed efficiently for every example.
[1] Tong Zhang,et al. Stochastic Optimization with Importance Sampling , 2014, ArXiv.