暂无分享,去创建一个
Herke van Hoof | Frans A. Oliehoek | Elise van der Pol | Daniel E. Worrall | Max Welling | M. Welling | F. Oliehoek | H. V. Hoof
[1] Gabriel J. Brostow,et al. CubeNet: Equivariance to 3D Rotation and Translation , 2018, ECCV.
[2] Daniel E. Worrall,et al. Deep Scale-spaces: Equivariance Over Scale , 2019, NeurIPS.
[3] Mitko Veta,et al. Roto-Translation Covariant Convolutional Networks for Medical Image Analysis , 2018, MICCAI.
[4] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[5] Ilya Kostrikov,et al. Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels , 2020, ArXiv.
[6] Theja Tulabandhula,et al. Symmetry Learning for Function Approximation in Reinforcement Learning , 2017, ArXiv.
[7] Kibok Lee,et al. Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning , 2020, ICLR.
[8] Alexander G. Schwing,et al. PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning , 2019, CoRL.
[9] Max Welling,et al. 3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data , 2018, NeurIPS.
[10] Kim Peter Wabersich,et al. Linear Model Predictive Safety Certification for Learning-Based Control , 2018, 2018 IEEE Conference on Decision and Control (CDC).
[11] Balaraman Ravindran,et al. Symmetries and Model Minimization in Markov Decision Processes , 2001 .
[12] Dmitry Yarotsky,et al. Universal Approximations of Invariant Maps by Neural Networks , 2018, Constructive Approximation.
[13] Doina Precup,et al. Bounding Performance Loss in Approximate MDP Homomorphisms , 2008, NIPS.
[14] Balaraman Ravindran,et al. SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.
[15] Max Welling,et al. Steerable CNNs , 2016, ICLR.
[16] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[17] Pieter Abbeel,et al. rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch , 2019, ArXiv.
[18] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.
[19] Maurice Weiler,et al. A General Theory of Equivariant CNNs on Homogeneous Spaces , 2018, NeurIPS.
[20] Nichita Diaconu,et al. Learning to Convolve: A Generalized Weight-Tying Approach , 2019, ICML.
[21] Balaraman Ravindran. Approximate Homomorphisms : A framework for non-exact minimization in Markov Decision Processes , 2022 .
[22] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[23] Maurice Weiler,et al. Learning Steerable Filters for Rotation Equivariant CNNs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[24] Amos J. Storkey,et al. Training Deep Convolutional Neural Networks to Play Go , 2015, ICML.
[25] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[26] Frans A. Oliehoek,et al. Plannable Approximations to MDP Homomorphisms: Equivariance under Actions , 2020, AAMAS.
[27] Thomas J. Walsh,et al. Towards a Unified Theory of State Abstraction for MDPs , 2006, AI&M.
[28] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[29] Yisheng Guan,et al. Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning , 2019, IEEE Robotics and Automation Letters.
[30] Terrence J. Sejnowski,et al. Temporal Difference Learning of Position Evaluation in the Game of Go , 1993, NIPS.
[31] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[32] Taco Cohen,et al. 3D G-CNNs for Pulmonary Nodule Detection , 2018, ArXiv.
[33] Pieter Abbeel,et al. Reinforcement Learning with Augmented Data , 2020, NeurIPS.
[34] Farzad Abdolhosseini,et al. On Learning Symmetric Locomotion , 2019, MIG.
[35] Stephan J. Garbin,et al. Harmonic Networks: Deep Translation and Rotation Equivariance , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[37] Aditi Mavalankar,et al. Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion , 2020, ArXiv.
[38] Maurice Weiler,et al. General E(2)-Equivariant Steerable CNNs , 2019, NeurIPS.
[39] Robert Platt,et al. Online abstraction with MDP homomorphisms for Deep Learning , 2018, AAMAS.
[40] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[41] Max Welling,et al. Group Equivariant Convolutional Networks , 2016, ICML.
[42] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[43] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[44] Balaraman Ravindran,et al. On the hardness of finding symmetries in Markov decision processes , 2008, ICML '08.
[45] H. O. Foulkes. Abstract Algebra , 1967, Nature.
[46] Tiejun Huang,et al. Graph Convolutional Reinforcement Learning , 2020, ICLR.