Q-Learning in Regularized Mean-field Games

[1]  Maxim Raginsky,et al.  Approximate Markov-Nash Equilibria for Discrete-Time Risk-Sensitive Mean-Field Games , 2020, Math. Oper. Res..

[2]  Yongxin Chen,et al.  Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games , 2019, ICLR.

[3]  Can Deha Kariksiz,et al.  Value Iteration Algorithm for Mean-field Games , 2019, Syst. Control. Lett..

[4]  Naci Saldi,et al.  Discrete-time average-cost mean-field games on Polish spaces , 2019, ArXiv.

[5]  Piotr Więcek,et al.  Discrete-Time Ergodic Mean-Field Games with Average Reward on Compact Spaces , 2019, Dynamic Games and Applications.

[6]  Ali Devran Kara,et al.  Robustness to Incorrect System Models in Stochastic Control , 2018, SIAM J. Control. Optim..

[7]  Naci Saldi,et al.  Fitted Q-Learning in Mean-field Games , 2019, ArXiv.

[8]  Mathieu Lauriere,et al.  Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods , 2019, ArXiv.

[9]  J. Pérolat,et al.  Approximate Fictitious Play for Mean Field Games , 2019, ArXiv.

[10]  Matthieu Geist,et al.  A Theory of Regularized Markov Decision Processes , 2019, ICML.

[11]  Renyuan Xu,et al.  Learning Mean-Field Games , 2019, NeurIPS.

[12]  Serdar Yüksel,et al.  Robustness to incorrect priors in partially observed stochastic control , 2018, SIAM J. Control. Optim..

[13]  Maxim Raginsky,et al.  Approximate Nash Equilibria in Partially Observed Stochastic Games with Mean-Field Interactions , 2017, Math. Oper. Res..

[14]  Hongyuan Zha,et al.  Learning Deep Mean Field Games for Modeling Large Population Behavior , 2017, ICLR.

[15]  Hongyuan Zha,et al.  Deep Mean Field Games for Learning Optimal Behavior Policy of Large Populations , 2017, ICLR 2018.

[16]  Vicenç Gómez,et al.  A unified view of entropy-regularized Markov decision processes , 2017, ArXiv.

[17]  Tamer Basar,et al.  Markov-Nash equilibria in mean-field games with discounted cost , 2016, 2017 American Control Conference (ACC).

[18]  Tamer Basar,et al.  Robust mean field games for coupled Markov jump linear systems , 2016, Int. J. Control.

[19]  A. Biswas Mean Field Games with Ergodic cost for Discrete Time Markov Processes , 2015, 1510.08968.

[20]  Tamer Basar,et al.  Discrete-time decentralized control using the risk-sensitive performance criterion in the large population regime: A mean field approach , 2015, 2015 American Control Conference (ACC).

[21]  Ramesh Johari,et al.  Equilibria of Dynamic Games with Many Players: Existence, Approximation, and Market Structure , 2010, J. Econ. Theory.

[22]  Diogo A. Gomes,et al.  Mean Field Games Models—A Brief Survey , 2013, Dynamic Games and Applications.

[23]  Sean P. Meyn,et al.  Learning in Mean-Field Games , 2014, IEEE Transactions on Automatic Control.

[24]  Quanyan Zhu,et al.  Risk-Sensitive Mean-Field Games , 2012, IEEE Transactions on Automatic Control.

[25]  Girish N. Nair,et al.  Linear-quadratic-Gaussian mean field games under high rate quantization , 2013, 52nd IEEE Conference on Decision and Control.

[26]  A. Bensoussan,et al.  Mean Field Games and Mean Field Type Control Theory , 2013 .

[27]  Xun Li,et al.  Discrete time mean-field stochastic linear-quadratic optimal control problems , 2013, Autom..

[28]  René Carmona,et al.  Probabilistic Analysis of Mean-field Games , 2013 .

[29]  P. Cardaliaguet,et al.  Mean Field Games , 2020, Lecture Notes in Mathematics.

[30]  Eitan Altman,et al.  Stationary Anonymous Sequential Games with Undiscounted Rewards , 2011, Journal of Optimization Theory and Applications.

[31]  D. Gomes,et al.  Discrete Time, Finite State Space Mean Field Games , 2010 .

[32]  Minyi Huang,et al.  Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle , 2009, SIAM J. Control. Optim..

[33]  Sean P. Meyn,et al.  Q-learning and Pontryagin's Minimum Principle , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[34]  K. Ramanan,et al.  Concentration Inequalities for Dependent Random Variables via the Martingale Method , 2006, math/0609835.

[35]  Csaba Szepesvári,et al.  Fitted Q-iteration in continuous action-space MDPs , 2007, NIPS.

[36]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[37]  P. Lions,et al.  Mean field games , 2007 .

[38]  Shai Shalev-Shwartz,et al.  Online learning: theory, algorithms and applications (למידה מקוונת.) , 2007 .

[39]  Peter E. Caines,et al.  Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle , 2006, Commun. Inf. Syst..

[40]  Mathukumalli Vidyasagar,et al.  Learning and Generalization: With Applications to Neural Networks , 2002 .

[41]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[42]  Hans-Otto Georgii,et al.  Gibbs Measures and Phase Transitions , 1988 .