[1] Yann LeCun,et al. Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[2] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[3] Shie Mannor,et al. Bayesian Reinforcement Learning: A Survey , 2015, Found. Trends Mach. Learn.
[4] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[5] Sergey Levine,et al. Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction , 2019, NeurIPS.
[6] Marley M. B. R. Vellasco,et al. Automatic parameters selection in machine learning , 2012, Neurocomputing.
[7] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[8] Martin A. Riedmiller,et al. Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning , 2020, ICLR.
[9] Sebastian Risi,et al. Behind DeepMind’s AlphaStar AI that Reached Grandmaster Level in StarCraft II , 2020, KI - Künstliche Intelligenz.
[10] Aviv Tamar,et al. Offline Meta Reinforcement Learning , 2020, ArXiv.
[11] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.
[12] Alexander J. Smola,et al. Meta-Q-Learning , 2020, ICLR.
[13] Michael L. Littman,et al. Planning and acting in partially observable stochastic domains , 1998 .
[14] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[15] Feiping Nie,et al. Cauchy Graph Embedding , 2011, ICML.
[16] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[17] Ricardo Vilalta,et al. A perspective view and survey of meta-learning , 2002 .
[18] Rishabh Agarwal,et al. An Optimistic Perspective on Offline Reinforcement Learning , 2020, ICML.
[19] Doina Precup,et al. Off-Policy Deep Reinforcement Learning without Exploration , 2018, ICML.
[20] Sergey Levine,et al. Residual Reinforcement Learning for Robot Control , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[21] Gregory R. Koch,et al. Siamese Neural Networks for One-Shot Image Recognition , 2015 .
[22] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[23] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.
[24] Kihyuk Sohn,et al. Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.
[25] Tao Xiang,et al. Learning to Compare: Relation Network for Few-Shot Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[26] John Nisbet,et al. Learning to learn , 2017 .
[27] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[28] Hao Su,et al. Multi-task Batch Reinforcement Learning with Metric Learning , 2020, NeurIPS.