Analysis of Gradient Descent Methods With Nondiminishing Bounded Errors
暂无分享,去创建一个
[1] J. Kiefer,et al. Stochastic Estimation of the Maximum of a Regression Function , 1952 .
[2] J. Aubin,et al. Differential inclusions set-valued maps and viability theory , 1984 .
[3] J. Spall. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation , 1992 .
[4] O. Mangasarian,et al. Serial and parallel backpropagation convergence via nonmonotone perturbed minimization , 1994 .
[5] M. Hurley. Chain recurrence, semiflows, and gradients , 1995 .
[6] M. Benaïm. A Dynamical System Approach to Stochastic Approximations , 1996 .
[7] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[8] James C. Spall,et al. Adaptive stochastic approximation by the simultaneous perturbation method , 2000, IEEE Trans. Autom. Control..
[9] Sean P. Meyn,et al. The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning , 2000, SIAM J. Control. Optim..
[10] John N. Tsitsiklis,et al. Gradient Convergence in Gradient methods with Errors , 1999, SIAM J. Optim..
[11] Josef Hofbauer,et al. Stochastic Approximations and Differential Inclusions , 2005, SIAM J. Control. Optim..
[12] James C. Spall,et al. Introduction to Stochastic Search and Optimization. Estimation, Simulation, and Control (Spall, J.C. , 2007 .
[13] V. Borkar. Stochastic Approximation: A Dynamical Systems Viewpoint , 2008 .
[14] Simon Haykin,et al. Neural Networks and Learning Machines , 2010 .
[15] Arnaud Doucet,et al. Asymptotic bias of stochastic gradient search , 2011, IEEE Conference on Decision and Control and European Control Conference.
[16] Josef Hofbauer,et al. Perturbations of Set-Valued Dynamical Systems, with Applications to Game Theory , 2012, Dyn. Games Appl..