论文信息 - Online Robust Policy Learning in the Presence of Unknown Adversaries

Online Robust Policy Learning in the Presence of Unknown Adversaries

The growing prospect of deep reinforcement learning (DRL) being used in cyber-physical systems has raised concerns around safety and robustness of autonomous agents. Recent work on generating adversarial attacks have shown that it is computationally feasible for a bad actor to fool a DRL policy into behaving sub optimally. Although certain adversarial attacks with specific attack models have been addressed, most studies are only interested in off-line optimization in the data space (e.g., example fitting, distillation). This paper introduces a Meta-Learned Advantage Hierarchy (MLAH) framework that is attack model-agnostic and more suited to reinforcement learning, via handling the attacks in the decision space (as opposed to data space) and directly mitigating learned bias introduced by the adversary. In MLAH, we learn separate sub-policies (nominal and adversarial) in an online manner, as guided by a supervisory master agent that detects the presence of the adversary by leveraging the advantage function for the sub-policies. We demonstrate that the proposed algorithm enables policy learning with significantly lower bias as compared to the state-of-the-art policy learning approaches even in the presence of heavy state information attacks. We present algorithm analysis and simulation results using popular OpenAI Gym environments.

Soumik Sarkar | Zhanhong Jiang | Aaron J. Havens | S. Sarkar | Zhanhong Jiang

[1] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[2] Silvio Savarese,et al. Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[3] Sandy H. Huang,et al. Adversarial Attacks on Neural Network Policies , 2017, ICLR.

[4] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[5] Ming-Yu Liu,et al. Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight , 2017, ArXiv.

[6] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[8] R.J. Williams,et al. Reinforcement learning is direct adaptive optimal control , 1991, IEEE Control Systems.

[9] Dawn Xiaodong Song,et al. Delving into adversarial attacks on deep policies , 2017, ICLR.

[10] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[11] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[12] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[13] Tei-Wei Kuo,et al. Designing CPS/IoT applications for smart buildings and cities , 2016, IET Cyper-Phys. Syst.: Theory & Appl..

[14] Plamen Angelov,et al. A general purpose intelligent surveillance system for mobile devices using Deep Learning , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[15] David A. Wagner,et al. Towards Evaluating the Robustness of Neural Networks , 2016, 2017 IEEE Symposium on Security and Privacy (SP).

[16] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[17] Abhinav Gupta,et al. Robust Adversarial Reinforcement Learning , 2017, ICML.

[18] Gongjun Yan,et al. Towards intelligent transportation Cyber-Physical Systems: Real-time computing and communications perspectives , 2015, SoutheastCon 2015.

[19] Balaraman Ravindran,et al. EPOpt: Learning Robust Neural Network Policies Using Model Ensembles , 2016, ICLR.

[20] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[21] Michael I. Jordan,et al. Bayesian Nonparametric Inference of Switching Dynamic Linear Models , 2010, IEEE Transactions on Signal Processing.

[22] Girish Chowdhary,et al. Robust Deep Reinforcement Learning with Adversarial Attacks , 2017, AAMAS.

[23] Pieter Abbeel,et al. Meta Learning Shared Hierarchies , 2017, ICLR.

[24] Aleksander Madry,et al. Towards Deep Learning Models Resistant to Adversarial Attacks , 2017, ICLR.

[25] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.