暂无分享,去创建一个
[1] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[2] Mohit Sharma,et al. Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information , 2018, ICLR.
[3] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[4] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.
[5] Pushmeet Kohli,et al. CompILE: Compositional Imitation Learning and Execution , 2018, ICML.
[6] John Valasek,et al. Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time , 2018, AAAI.
[7] Doina Precup,et al. Learning Options in Reinforcement Learning , 2002, SARA.
[8] Xin Zhang,et al. End to End Learning for Self-Driving Cars , 2016, ArXiv.
[9] Pieter Abbeel,et al. Variational Option Discovery Algorithms , 2018, ArXiv.
[10] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[11] Stefano Ermon,et al. InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations , 2017, NIPS.
[12] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[13] Martial Hebert,et al. Learning monocular reactive UAV control in cluttered natural environments , 2012, 2013 IEEE International Conference on Robotics and Automation.
[14] Vinay P. Namboodiri,et al. InfoRL: Interpretable Reinforcement Learning using Information Maximization , 2019, ArXiv.
[15] Nando de Freitas,et al. Robust Imitation of Diverse Behaviors , 2017, NIPS.
[16] Sergey Levine,et al. Diversity is All You Need: Learning Skills without a Reward Function , 2018, ICLR.
[17] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[18] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.