暂无分享,去创建一个
Xian Wu | Prateek Jain | Praneeth Netrapalli | Guy Bresler | Dheeraj Nagaraj | Prateek Jain | Praneeth Netrapalli | Guy Bresler | Dheeraj M. Nagaraj | Xian Wu
[1] A. Mokkadem. Mixing properties of ARMA processes , 1988 .
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] Imre Csiszár,et al. Context tree estimation for not necessarily finite memory processes, via BIC and MDL , 2005, IEEE Transactions on Information Theory.
[4] D. Paulin. Concentration inequalities for Markov chains by Marton couplings and spectral methods , 2012, 1212.2015.
[5] Jalaj Bhandari,et al. A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation , 2018, COLT.
[6] R. K. Agrawal,et al. An Introductory Study on Time Series Modeling and Forecasting , 2013, ArXiv.
[7] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[8] R. Srikant,et al. Finite-Time Error Bounds For Linear Stochastic Approximation and TD Learning , 2019, COLT.
[9] Prateek Jain,et al. Parallelizing Stochastic Gradient Descent for Least Squares Regression: Mini-batching, Averaging, and Model Misspecification , 2016, J. Mach. Learn. Res..
[10] Siddhartha V. Jayanti,et al. Learning from weakly dependent data under Dobrushin's condition , 2019, COLT.
[11] H. Kushner,et al. Stochastic Approximation and Recursive Algorithms and Applications , 2003 .
[12] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.
[13] Jaouad Mourtada. Exact minimax risk for linear least squares, and the lower tail of sample covariance matrices , 2019 .
[14] Michael I. Jordan,et al. Ergodic mirror descent , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).
[15] Constantinos Daskalakis,et al. Regression from dependent observations , 2019, STOC.
[16] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[17] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.