Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations
暂无分享,去创建一个
Silvio Savarese | Li Fei-Fei | Yuke Zhu | Animesh Garg | Ajay Mandlekar | Li Fei-Fei | S. Savarese | Animesh Garg | Yuke Zhu | Ajay Mandlekar
[1] David J. N. Limebeer,et al. Linear Robust Control , 1994 .
[2] C. He. On the distance to uncontrollability and the distance to instability and their relation to some condition numbers in control , 1997 .
[3] Alexei A. Mailybaev. Uncontrollability for Linear Autonomous Multi-input Dynamical Systems Depending on Parameters , 2003, SIAM J. Control. Optim..
[4] Laurent El Ghaoui,et al. Robust Control of Markov Decision Processes with Uncertain Transition Matrices , 2005, Oper. Res..
[5] Pieter Abbeel,et al. Using inaccurate models in reinforcement learning , 2006, ICML.
[6] Blaine Nelson,et al. Can machine learning be secure? , 2006, ASIACCS '06.
[7] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[8] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[9] Steven Boyce. The Distance to Uncontrollability via Linear Matrix Inequalities , 2010 .
[10] Patrick Jaillet,et al. Loss bounds for uncertain transition probabilities in Markov decision processes , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).
[11] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[12] Fabio Roli,et al. Evasion Attacks against Machine Learning at Test Time , 2013, ECML/PKDD.
[13] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..
[14] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.
[15] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[16] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.
[17] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..
[18] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[19] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[20] Seyed-Mohsen Moosavi-Dezfooli,et al. DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Razvan Pascanu,et al. Progressive Neural Networks , 2016, ArXiv.
[22] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[23] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[24] Ananthram Swami,et al. Practical Black-Box Attacks against Deep Learning Systems using Adversarial Examples , 2016, ArXiv.
[25] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[26] Brijen Thananjeyan,et al. Multilateral surgical pattern cutting in 2D orthotropic gauze with deep reinforcement learning policies for tensioning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[27] Patrick D. McDaniel,et al. On the (Statistical) Detection of Adversarial Examples , 2017, ArXiv.
[28] Seyed-Mohsen Moosavi-Dezfooli,et al. Universal Adversarial Perturbations , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Arslan Munir,et al. Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks , 2017, MLDM.
[30] Silvio Savarese,et al. ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems , 2017, ISRR.
[31] Razvan Pascanu,et al. Sim-to-Real Robot Learning from Pixels with Progressive Nets , 2016, CoRL.
[32] Greg Turk,et al. Preparing for the Unknown: Learning a Universal Policy with Online System Identification , 2017, Robotics: Science and Systems.
[33] Beilun Wang,et al. DeepMask: Masking DNN Models for robustness against adversarial samples , 2017, ArXiv.
[34] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[35] Sergey Levine,et al. Learning modular neural network policies for multi-task and multi-robot transfer , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[36] Balaraman Ravindran,et al. EPOpt: Learning Robust Neural Network Policies Using Model Ensembles , 2016, ICLR.
[37] Samy Bengio,et al. Adversarial examples in the physical world , 2016, ICLR.
[38] Sandy H. Huang,et al. Adversarial Attacks on Neural Network Policies , 2017, ICLR.
[39] Sergey Levine,et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..