Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
暂无分享,去创建一个
Kun Kuang | Fei Wu | Baoxiang Wang | Furui Liu | Long Chen | Jun Xiao | Jiahui Li | Kun Kuang | Baoxiang Wang | Long Chen | Furui Liu | Jun Xiao | Fei Wu | Jiahui Li
[1] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[2] Natalia Díaz Rodríguez,et al. Explainability in Deep Reinforcement Learning , 2020, Knowl. Based Syst..
[3] Jianye Hao,et al. Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning , 2020, ArXiv.
[4] Frans A. Oliehoek,et al. A Concise Introduction to Decentralized POMDPs , 2016, SpringerBriefs in Intelligent Systems.
[5] Nicholas R. Jennings,et al. A linear approximation method for the Shapley value , 2008, Artif. Intell..
[6] Suresh Venkatasubramanian,et al. Problems with Shapley-value-based explanations as feature importance measures , 2020, ICML.
[7] Le Song,et al. L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data , 2018, ICLR.
[8] Yuk Ying Chung,et al. Learning Implicit Credit Assignment for Multi-Agent Actor-Critic , 2020, ArXiv.
[9] Yujing Hu,et al. Q-value Path Decomposition for Deep Multiagent Reinforcement Learning , 2020, ICML.
[10] Kagan Tumer,et al. Optimal Payoff Functions for Members of Collectives , 2001, Adv. Complex Syst..
[11] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[12] James Y. Zou,et al. Data Shapley: Equitable Valuation of Data for Machine Learning , 2019, ICML.
[13] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[14] Yi Wu,et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.
[15] Shimon Whiteson,et al. The StarCraft Multi-Agent Challenge , 2019, AAMAS.
[16] Fei Sha,et al. Actor-Attention-Critic for Multi-Agent Reinforcement Learning , 2018, ICML.
[17] Jing Peng,et al. Function Optimization using Connectionist Reinforcement Learning Algorithms , 1991 .
[18] Yunjie Gu,et al. Shapley Q-value: A Local Reward Approach to Solve Global Reward Games , 2020, AAAI.
[19] Fan-Yong Meng. The Core and Shapley Function for Games on Augmenting Systems with a Coalition Structure , 2012 .
[20] Long Chen,et al. Counterfactual Critic Multi-Agent Training for Scene Graph Generation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[21] Jesús Mario Bilbao,et al. The Shapley value on convex geometries , 2000, Discret. Appl. Math..
[22] Markus H. Gross,et al. Explaining Deep Neural Networks with a Polynomial Time Algorithm for Shapley Values Approximation , 2019, ICML.
[23] Sarvapali D. Ramchurn,et al. Decentralized Coordination in RoboCup Rescue , 2010, Comput. J..
[24] Yung Yi,et al. QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning , 2019, ICML.
[25] Nikos A. Vlassis,et al. Optimal and Approximate Q-value Functions for Decentralized POMDPs , 2008, J. Artif. Intell. Res..
[26] Mukund Sundararajan,et al. The many Shapley values for model explanation , 2019, ICML.
[27] Rahul Savani,et al. Lenient Multi-Agent Deep Reinforcement Learning , 2017, AAMAS.
[28] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[29] Bikramjit Banerjee,et al. Multi-agent reinforcement learning as a rehearsal for decentralized planning , 2016, Neurocomputing.
[30] Colin Rowat,et al. Asymmetric Shapley values: incorporating causal knowledge into model-agnostic explainability , 2020, NeurIPS.
[31] L. Shapley. A Value for n-person Games , 1988 .
[32] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[33] Nicolas Le Roux,et al. Understanding the impact of entropy on policy optimization , 2018, ICML.
[34] Jenna Wiens,et al. Shapley Flow: A Graph-based Approach to Interpreting Model Predictions , 2020, ArXiv.
[35] Francesco Borrelli,et al. Decentralized Receding Horizon Control and Coordination of Autonomous Vehicle Formations , 2008, IEEE Transactions on Control Systems Technology.
[36] Yun Yang,et al. A Multi-Agent Framework for Packet Routing in Wireless Sensor Networks , 2015, Sensors.
[37] Wenwu Yu,et al. An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination , 2012, IEEE Transactions on Industrial Informatics.
[38] Shimon Whiteson,et al. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.
[39] Ming Zhou,et al. Mean Field Multi-Agent Reinforcement Learning , 2018, ICML.
[40] Guy Lever,et al. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward , 2018, AAMAS.
[41] Tom Claassen,et al. Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models , 2020, NeurIPS.
[42] Mykel J. Kochenderfer,et al. Cooperative Multi-agent Control Using Deep Reinforcement Learning , 2017, AAMAS Workshops.
[43] James Zou,et al. Neuron Shapley: Discovering the Responsible Neurons , 2020, NeurIPS.
[44] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.