Normalizing Flow Model for Policy Representation in Continuous Action Multi-agent Systems
暂无分享,去创建一个
[1] Guy Lever,et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning , 2018, Science.
[2] Max Welling,et al. Improved Variational Inference with Inverse Autoregressive Flow , 2016, NIPS 2016.
[3] Michael H. Bowling,et al. Actor-Critic Policy Optimization in Partially Observable Multiagent Environments , 2018, NeurIPS.
[4] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[5] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[6] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.
[7] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.
[8] Samy Bengio,et al. Density estimation using Real NVP , 2016, ICLR.
[9] Marco Pavone,et al. Multimodal Probabilistic Model-Based Planning for Human-Robot Interaction , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[10] David Silver,et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.
[11] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[12] Pieter Abbeel,et al. Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design , 2019, ICML.
[13] Hugo Larochelle,et al. MADE: Masked Autoencoder for Distribution Estimation , 2015, ICML.
[14] Katherine Rose Driggs-Campbell,et al. Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning , 2019, 2019 International Conference on Robotics and Automation (ICRA).
[15] Daan Wierstra,et al. Deep AutoRegressive Networks , 2013, ICML.
[16] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[17] Yoshua Bengio,et al. NICE: Non-linear Independent Components Estimation , 2014, ICLR.
[18] Gabriel Peyré,et al. Computational Optimal Transport , 2018, Found. Trends Mach. Learn..
[19] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[20] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.