Dynamics of stochastic approximation with Markov iterate-dependent noise with the stability of the iterates not ensured
暂无分享,去创建一个
[1] Vivek S. Borkar,et al. On the Lock-in Probability of Stochastic Approximation , 2002, Combinatorics, Probability and Computing.
[2] Han-Fu Chen. Stochastic approximation and its applications , 2002 .
[3] Mark Broadie,et al. General Bounds and Finite-Time Improvement for the Kiefer-Wolfowitz Stochastic Approximation Algorithm , 2011, Oper. Res..
[4] Eric Moulines,et al. Stability of Stochastic Approximation under Verifiable Conditions , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.
[5] John N. Tsitsiklis,et al. Linear stochastic approximation driven by slowly varying Markov chains , 2003, Syst. Control. Lett..
[6] Odile Brandière,et al. Some Pathological Traps for Stochastic Approximation , 1998 .
[7] Sameer Kamal,et al. On the Convergence, Lock-In Probability, and Sample Complexity of Stochastic Approximation , 2010, SIAM J. Control. Optim..
[8] Shalabh Bhatnagar,et al. Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning , 2015, IEEE Transactions on Automatic Control.
[9] Shie Mannor,et al. Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning , 2017, COLT.
[10] C. Andrieu,et al. On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic , 2012, 1205.4181.
[11] Alexander Shapiro,et al. Stochastic Approximation approach to Stochastic Programming , 2013 .
[12] R. Pemantle,et al. Nonconvergence to Unstable Points in Urn Models and Stochastic Approximations , 1990 .
[13] Sean P. Meyn,et al. The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning , 2000, SIAM J. Control. Optim..
[14] Shalabh Bhatnagar,et al. Two Timescale Stochastic Approximation with Controlled Markov noise , 2015, Math. Oper. Res..
[15] V. Borkar,et al. A Concentration Bound for Stochastic Approximation via Alekseev’s Formula , 2015, Stochastic Systems.