Q-Learning in Regularized Mean-field Games
[1] Maxim Raginsky,et al. Approximate Markov-Nash Equilibria for Discrete-Time Risk-Sensitive Mean-Field Games , 2020, Math. Oper. Res..
[2] Yongxin Chen,et al. Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games , 2019, ICLR.
[3] Can Deha Kariksiz,et al. Value Iteration Algorithm for Mean-field Games , 2019, Syst. Control. Lett..
[4] Naci Saldi,et al. Discrete-time average-cost mean-field games on Polish spaces , 2019, ArXiv.
[5] Piotr Więcek,et al. Discrete-Time Ergodic Mean-Field Games with Average Reward on Compact Spaces , 2019, Dynamic Games and Applications.
[6] Ali Devran Kara,et al. Robustness to Incorrect System Models in Stochastic Control , 2018, SIAM J. Control. Optim..
[7] Naci Saldi,et al. Fitted Q-Learning in Mean-field Games , 2019, ArXiv.
[8] Mathieu Lauriere,et al. Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods , 2019, ArXiv.
[9] J. Pérolat,et al. Approximate Fictitious Play for Mean Field Games , 2019, ArXiv.
[10] Matthieu Geist,et al. A Theory of Regularized Markov Decision Processes , 2019, ICML.
[11] Renyuan Xu,et al. Learning Mean-Field Games , 2019, NeurIPS.
[12] Serdar Yüksel,et al. Robustness to incorrect priors in partially observed stochastic control , 2018, SIAM J. Control. Optim..
[13] Maxim Raginsky,et al. Approximate Nash Equilibria in Partially Observed Stochastic Games with Mean-Field Interactions , 2017, Math. Oper. Res..
[14] Hongyuan Zha,et al. Learning Deep Mean Field Games for Modeling Large Population Behavior , 2017, ICLR.
[15] Hongyuan Zha,et al. Deep Mean Field Games for Learning Optimal Behavior Policy of Large Populations , 2017, ICLR 2018.
[16] Vicenç Gómez,et al. A unified view of entropy-regularized Markov decision processes , 2017, ArXiv.
[17] Tamer Basar,et al. Markov-Nash equilibria in mean-field games with discounted cost , 2016, 2017 American Control Conference (ACC).
[18] Tamer Basar,et al. Robust mean field games for coupled Markov jump linear systems , 2016, Int. J. Control.
[19] A. Biswas. Mean Field Games with Ergodic cost for Discrete Time Markov Processes , 2015, 1510.08968.
[20] Tamer Basar,et al. Discrete-time decentralized control using the risk-sensitive performance criterion in the large population regime: A mean field approach , 2015, 2015 American Control Conference (ACC).
[21] Ramesh Johari,et al. Equilibria of Dynamic Games with Many Players: Existence, Approximation, and Market Structure , 2010, J. Econ. Theory.
[22] Diogo A. Gomes,et al. Mean Field Games Models—A Brief Survey , 2013, Dynamic Games and Applications.
[23] Sean P. Meyn,et al. Learning in Mean-Field Games , 2014, IEEE Transactions on Automatic Control.
[24] Quanyan Zhu,et al. Risk-Sensitive Mean-Field Games , 2012, IEEE Transactions on Automatic Control.
[25] Girish N. Nair,et al. Linear-quadratic-Gaussian mean field games under high rate quantization , 2013, 52nd IEEE Conference on Decision and Control.
[26] A. Bensoussan,et al. Mean Field Games and Mean Field Type Control Theory , 2013 .
[27] Xun Li,et al. Discrete time mean-field stochastic linear-quadratic optimal control problems , 2013, Autom..
[28] René Carmona,et al. Probabilistic Analysis of Mean-field Games , 2013 .
[29] P. Cardaliaguet,et al. Mean Field Games , 2020, Lecture Notes in Mathematics.
[30] Eitan Altman,et al. Stationary Anonymous Sequential Games with Undiscounted Rewards , 2011, Journal of Optimization Theory and Applications.
[31] D. Gomes,et al. Discrete Time, Finite State Space Mean Field Games , 2010 .
[32] Minyi Huang,et al. Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle , 2009, SIAM J. Control. Optim..
[33] Sean P. Meyn,et al. Q-learning and Pontryagin's Minimum Principle , 2009, Proceedings of the 48th IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.
[34] K. Ramanan,et al. Concentration Inequalities for Dependent Random Variables via the Martingale Method , 2006, math/0609835.
[35] Csaba Szepesvári,et al. Fitted Q-iteration in continuous action-space MDPs , 2007, NIPS.
[36] Minyi Huang,et al. Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.
[37] P. Lions,et al. Mean field games , 2007 .
[38] Shai Shalev-Shwartz,et al. Online learning: theory, algorithms and applications , 2007 .
[39] Peter E. Caines,et al. Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle , 2006, Commun. Inf. Syst..
[40] Mathukumalli Vidyasagar,et al. Learning and Generalization: With Applications to Neural Networks , 2002 .
[41] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[42] Hans-Otto Georgii,et al. Gibbs Measures and Phase Transitions , 1988 .