Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments
暂无分享,去创建一个
Pierre-Yves Oudeyer | C'edric Colas | Katja Hofmann | R'emy Portelas | Cédric Colas | Katja Hofmann | P. Oudeyer | Rémy Portelas
[1] Rui Wang,et al. Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions , 2019, ArXiv.
[2] Pierre-Yves Oudeyer,et al. In Search of the Neural Circuits of Intrinsic Motivation , 2007, Front. Neurosci..
[3] Pierre-Yves Oudeyer,et al. CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning , 2018, ICML.
[4] H. Bozdogan. Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .
[5] Pieter Abbeel,et al. Automatic Goal Generation for Reinforcement Learning Agents , 2017, ICML.
[6] Pierre-Yves Oudeyer,et al. Self-organization of early vocal development in infants and machines: the role of intrinsic motivation , 2014, Front. Psychol..
[7] Pierre-Yves Oudeyer,et al. Active learning of inverse models with intrinsically motivated goal exploration in robots , 2013, Robotics Auton. Syst..
[8] Kai A. Krueger,et al. Flexible shaping: How learning in small steps helps , 2009, Cognition.
[9] Siddharth Mysore. Reward-guided Curriculum for Robust Reinforcement Learning , 2019 .
[10] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[11] Pierre-Yves Oudeyer,et al. The strategic student approach for life-long exploration and learning , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).
[12] Katja Hofmann,et al. The Malmo Platform for Artificial Intelligence Experimentation , 2016, IJCAI.
[13] J. Elman. Learning and development in neural networks: the importance of starting small , 1993, Cognition.
[14] Jon Louis Bentley,et al. Multidimensional binary search trees used for associative searching , 1975, CACM.
[15] Pierre-Yves Oudeyer,et al. Multi-Armed Bandits for Intelligent Tutoring Systems , 2013, EDM.
[16] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[17] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[18] Pierre-Yves Oudeyer,et al. Curiosity Driven Exploration of Learned Disentangled Goal Spaces , 2018, CoRL.
[19] Alex Graves,et al. Automated Curriculum Learning for Neural Networks , 2017, ICML.
[20] John Schulman,et al. Teacher–Student Curriculum Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[21] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[22] Pierre-Yves Oudeyer,et al. Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.
[23] Pierre-Yves Oudeyer,et al. Modular active curiosity-driven discovery of tool use , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[24] Deepak Kumar,et al. BRINGING UP ROBOT: FUNDAMENTAL MECHANISMS FOR CREATING A SELF-MOTIVATED, SELF-ORGANIZING ARCHITECTURE , 2005, Cybern. Syst..
[25] Carl E. Rasmussen,et al. The Infinite Gaussian Mixture Model , 1999, NIPS.
[26] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[27] Pierre-Yves Oudeyer,et al. R-IAC: Robust Intrinsically Motivated Exploration and Active Learning , 2009, IEEE Transactions on Autonomous Mental Development.
[28] Pierre-Yves Oudeyer,et al. Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning , 2017, J. Mach. Learn. Res..
[29] David Ha,et al. Reinforcement Learning for Improving Agent Design , 2018, Artificial Life.