暂无分享,去创建一个
Sergey Levine | Ruslan Salakhutdinov | Xinyang Geng | Benjamin Eysenbach | S. Levine | R. Salakhutdinov | Benjamin Eysenbach | Xinyang Geng
[1] Thomas M. Cover,et al. Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .
[2] Leslie Pack Kaelbling,et al. Learning to Achieve Goals , 1993, IJCAI.
[3] Geoffrey E. Hinton,et al. Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.
[4] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[5] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[6] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[7] Emanuel Todorov,et al. Linearly-solvable Markov decision problems , 2006, NIPS.
[8] Stefan Schaal,et al. Reinforcement learning by reward-weighted regression for operational space control , 2007, ICML '07.
[9] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[10] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[11] Emanuel Todorov,et al. General duality between optimal control and estimation , 2008, 2008 47th IEEE Conference on Decision and Control.
[12] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.
[13] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[14] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.
[15] Evangelos Theodorou,et al. Relative entropy and free energy dualities: Connections to Path Integral and KL control , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).
[16] Kee-Eung Kim,et al. Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions , 2012, NIPS.
[17] Vicenç Gómez,et al. Optimal control as a graphical model inference problem , 2009, Machine Learning.
[18] Sergey Levine,et al. Variational Policy Search via Trajectory Optimization , 2013, NIPS.
[19] Siddhartha S. Srinivasa,et al. Shared Autonomy via Hindsight Optimization , 2015, Robotics: Science and Systems.
[20] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[21] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[22] Razvan Pascanu,et al. Policy Distillation , 2015, ICLR.
[23] Ruslan Salakhutdinov,et al. Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning , 2015, ICLR.
[24] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[25] Chen Liang,et al. Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.
[26] C A Nelson,et al. Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.
[27] Anca D. Dragan,et al. Inverse Reward Design , 2017, NIPS.
[28] Chen Sun,et al. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[29] Sergey Levine,et al. Reinforcement Learning with Deep Energy-Based Policies , 2017, ICML.
[30] Yee Whye Teh,et al. Distral: Robust multitask reinforcement learning , 2017, NIPS.
[31] Vladlen Koltun,et al. Semi-parametric Topological Memory for Navigation , 2018, ICLR.
[32] Henry Zhu,et al. Soft Actor-Critic Algorithms and Applications , 2018, ArXiv.
[33] Eder Santana,et al. A Commute in Data: The comma2k19 Dataset , 2018, ArXiv.
[34] Alexei A. Efros,et al. Investigating Human Priors for Playing Video Games , 2018, ICML.
[35] Sergey Levine,et al. Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review , 2018, ArXiv.
[36] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[37] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[38] Sergey Levine,et al. Temporal Difference Models: Model-Free Deep RL for Model-Based Control , 2018, ICLR.
[39] Sergey Levine,et al. Divide-and-Conquer Reinforcement Learning , 2017, ICLR.
[40] Yuval Tassa,et al. Maximum a Posteriori Policy Optimisation , 2018, ICLR.
[41] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[42] Trevor Darrell,et al. BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling , 2018, ArXiv.
[43] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[44] Satinder Singh,et al. Self-Imitation Learning , 2018, ICML.
[45] S. Levine,et al. Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning , 2019, CoRL.
[46] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[47] Wojciech Czarnecki,et al. Multi-task Deep Reinforcement Learning with PopArt , 2018, AAAI.
[48] Simon Lucey,et al. Argoverse: 3D Tracking and Forecasting With Rich Maps , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Rémi Munos,et al. Recurrent Experience Replay in Distributed Reinforcement Learning , 2018, ICLR.
[50] Sergey Levine,et al. Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning , 2019, ArXiv.
[51] Rui Zhao,et al. Maximum Entropy-Regularized Multi-Goal Reinforcement Learning , 2019, ICML.
[52] Sergey Levine,et al. Search on the Replay Buffer: Bridging Planning and Reinforcement Learning , 2019, NeurIPS.
[53] Learning Latent Plans from Play , 2019, CoRL.
[54] S. Levine,et al. RoboNet: Large-Scale Multi-Robot Learning , 2019, Conference on Robot Learning.
[55] S. Levine,et al. Learning To Reach Goals Without Reinforcement Learning , 2019, ArXiv.
[56] Dragomir Anguelov,et al. Scalability in Perception for Autonomous Driving: Waymo Open Dataset , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[57] S. Levine,et al. Gradient Surgery for Multi-Task Learning , 2020, NeurIPS.
[58] Jimmy Ba,et al. Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning , 2020, ICML.
[59] Pieter Abbeel,et al. Generalized Hindsight for Reinforcement Learning , 2020, NeurIPS.
[60] Qiang Xu,et al. nuScenes: A Multimodal Dataset for Autonomous Driving , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[61] Sergey Levine,et al. Learning to Reach Goals via Iterated Supervised Learning , 2019, ICLR.