Balancing Safety and Exploitability in Opponent Modeling
Jan Peters | Katharina Mülling | Zhikun Wang | Abdeslam Boularias
[1] O. H. Brownlee, et al. Activity Analysis of Production and Allocation, 1952.
[2] H. Simon, et al. Bounded Rationality and Organizational Learning, 1991.
[3] Manuela M. Veloso, et al. Multiagent learning using a variable learning rate, 2002, Artif. Intell.
[4] Martin Zinkevich, et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent, 2003, ICML.
[5] Peter McCracken, et al. Safe Strategies for Agent Modelling in Games, 2004, AAAI Technical Report.
[6] Shaul Markovitch, et al. Learning and Exploiting Relative Weaknesses of Opponent Agents, 2005, Autonomous Agents and Multi-Agent Systems.
[7] Michael H. Bowling, et al. Convergence and No-Regret in Multiagent Learning, 2004, NIPS.
[8] Yoav Shoham, et al. A general criterion and an algorithmic framework for learning in multi-agent systems, 2007, Machine Learning.
[9] Vincent Conitzer, et al. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents, 2003, Machine Learning.
[10] Michael H. Bowling, et al. Data Biased Robust Counter Strategies, 2009, AISTATS.
[11] Naftali Tishby, et al. PAC-Bayesian Analysis of Co-clustering and Beyond, 2010, J. Mach. Learn. Res.
[12] Jan Peters, et al. A biomimetic approach to robot table tennis, 2010, IROS.