One-Shot Imitation Learning
暂无分享,去创建一个
Marcin Andrychowicz | Wojciech Zaremba | Pieter Abbeel | Ilya Sutskever | Yan Duan | Jonas Schneider | Jonathan Ho | Bradly C. Stadie | P. Abbeel | Marcin Andrychowicz | Yan Duan | Ilya Sutskever | Wojciech Zaremba | Jonathan Ho | Jonas Schneider | I. Sutskever
[1] G. Evans,et al. Learning to Optimize , 2008 .
[2] Sergey Levine,et al. (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.
[3] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[4] Yoshua Bengio,et al. Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations , 2016, ICLR.
[5] Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
[6] John N. Tsitsiklis,et al. Neuro-dynamic programming: an overview , 1995, Proceedings of 1995 34th IEEE Conference on Decision and Control.
[7] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[8] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[9] S. Shankar Sastry,et al. Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.
[10] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[11] Stefan Schaal,et al. 2008 Special Issue: Reinforcement learning of motor skills with policy gradients , 2008 .
[12] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[13] Sergey Levine,et al. Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning , 2017, ICLR.
[14] Stefan Schaal,et al. Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.
[15] Razvan Pascanu,et al. Progressive Neural Networks , 2016, ArXiv.
[16] Rong Yan,et al. Cross-domain video concept detection using adaptive svms , 2007, ACM Multimedia.
[17] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[18] Henk Nijmeijer,et al. Robot Programming by Demonstration , 2010, SIMPAR.
[19] Sergey Levine,et al. Adapting Deep Visuomotor Representations with Weak Pairwise Constraints , 2015, WAFR.
[20] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[21] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[22] Sergey Levine,et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes , 2011, NIPS.
[23] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[24] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .
[25] Geoffrey E. Hinton,et al. Using Fast Weights to Attend to the Recent Past , 2016, NIPS.
[26] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[27] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[28] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[29] Yishay Mansour,et al. Domain Adaptation: Learning Bounds and Algorithms , 2009, COLT.
[30] Richard J. Mammone,et al. Meta-neural networks that learn by learning , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.
[31] Pieter Abbeel,et al. Third-Person Imitation Learning , 2017, ICLR.
[32] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[33] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[34] Michael I. Jordan,et al. Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.
[35] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.
[36] Yoshua Bengio,et al. On the Optimization of a Synaptic Learning Rule , 2007 .
[37] Sergey Levine,et al. Towards Adapting Deep Visuomotor Representations from Simulated to Real Environments , 2015, ArXiv.
[38] Jürgen Schmidhuber,et al. Learning to Control Fast-Weight Memories: An Alternative to Dynamic Recurrent Networks , 1992, Neural Computation.
[39] Joshua B. Tenenbaum,et al. A Compositional Object-Based Approach to Learning Physical Dynamics , 2016, ICLR.
[40] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[41] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[42] Trevor Darrell,et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.
[43] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.
[44] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[45] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[46] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[47] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[48] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[49] Gregory R. Koch,et al. Siamese Neural Networks for One-Shot Image Recognition , 2015 .
[50] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[51] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[52] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[53] Amos J. Storkey,et al. Towards a Neural Statistician , 2016, ICLR.
[54] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.
[55] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[56] Ivor W. Tsang,et al. Learning with Augmented Features for Heterogeneous Domain Adaptation , 2012, ICML.
[57] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[58] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[59] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[60] Trevor Darrell,et al. Deep Domain Confusion: Maximizing for Domain Invariance , 2014, CVPR 2014.
[61] Trevor Darrell,et al. Efficient Learning of Domain-invariant Image Representations , 2013, ICLR.
[62] Daan Wierstra,et al. One-Shot Generalization in Deep Generative Models , 2016, ICML.
[63] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.
[64] Trevor Darrell,et al. What you saw is not what you get: Domain adaptation using asymmetric kernel transforms , 2011, CVPR 2011.
[65] C A Nelson,et al. Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.
[66] Andrew Zisserman,et al. Tabula rasa: Model transfer for object category detection , 2011, 2011 International Conference on Computer Vision.