[1] Patrick M. Pilarski, et al. Adaptive artificial limbs: a real-time approach to prediction and anticipation, 2013, IEEE Robotics & Automation Magazine.
[2] Etienne Barnard, et al. Temporal-difference methods and Markov models, 1993, IEEE Trans. Syst. Man Cybern.
[3] Sergey Levine, et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks, 2017, ICML.
[4] Richard S. Sutton, et al. Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta, 1992, AAAI.
[5] Robert C. Wilson, et al. Inferring Relevance in a Changing World, 2012, Front. Hum. Neurosci.
[6] Will Dabney, et al. Adaptive Step-Sizes for Reinforcement Learning, 2014.
[7] David Silver, et al. Meta-Gradient Reinforcement Learning, 2018, NeurIPS.
[8] Patrick M. Pilarski, et al. Tuning-free step-size adaptation, 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Andrew G. Barto, et al. Adaptive Step-Size for Online Temporal Difference Learning, 2012, AAAI.
[10] Patrick M. Pilarski, et al. Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding, 2017, 2017 International Conference on Rehabilitation Robotics (ICORR).
[11] Richard S. Sutton, et al. True Online TD(lambda), 2014, ICML.
[12] M. R. Dawson, et al. Development of the Bento Arm: An Improved Robotic Arm for Myoelectric Training and Research, 2014.
[13] R. S. Sutton, et al. Dynamic switching and real-time machine learning for improved human control of assistive biomedical robots, 2012, 2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob).
[14] Linda B. Smith, et al. From the lexicon to expectations about kinds: a role for associative learning, 2005, Psychological Review.
[15] M. Arbib, et al. A model of cerebellar metaplasticity, 1998, Learning & Memory.
[16] Richard S. Sutton, et al. Representation Search through Generate and Test, 2013, AAAI Workshop: Learning Rich Representations from Low-Level Sensors.
[17] Nicol N. Schraudolph, et al. Local Gain Adaptation in Stochastic Gradient Descent, 1999.
[18] Matthew E. Taylor, et al. Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control, 2018, IJCAI.
[19] Pierre-Yves Oudeyer, et al. What is Intrinsic Motivation? A Typology of Computational Approaches, 2007, Frontiers Neurorobotics.
[20] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[21] Patrick M. Pilarski, et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction, 2011, AAMAS.
[22] Patrick M. Pilarski, et al. Machine learning and unlearning to autonomously switch between the functions of a myoelectric arm, 2016, 2016 6th IEEE International Conference on Biomedical Robotics and Biomechatronics (BioRob).