论文信息 - Toward high-performance, memory-efficient, and fast reinforcement learning—Lessons from decision neuroscience

Toward high-performance, memory-efficient, and fast reinforcement learning—Lessons from decision neuroscience

Insights from decision neuroscience raise hope for intelligent brain-inspired solutions to robot learning in real dynamic environments. Recent insights from decision neuroscience raise hope for the development of intelligent brain-inspired solutions to robot learning in real dynamic environments full of noise and unpredictability.

[1] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.

[2] Shimon Whiteson,et al. Learning with Opponent-Learning Awareness , 2017, AAMAS.

[3] Shinsuke Shimojo,et al. Neural Computations Mediating One-Shot Learning in the Human Brain , 2013, PLoS biology.

[4] P. Kollock. SOCIAL DILEMMAS: The Anatomy of Cooperation , 1998 .

[5] Timothy E. J. Behrens,et al. Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[6] Joel Z. Leibo,et al. Prefrontal cortex as a meta-reinforcement learning system , 2018, bioRxiv.

[7] Shinsuke Shimojo,et al. Neural Computations Underlying Arbitration between Model-Based and Model-free Learning , 2013, Neuron.

[8] N. Daw,et al. Deciding How To Decide: Self-Control and Meta-Decision Making , 2015, Trends in Cognitive Sciences.

[9] Joshua B. Tenenbaum,et al. Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction , 2016, CogSci.

[10] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[11] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[12] H. Lau,et al. How to measure metacognition , 2014, Front. Hum. Neurosci..

[13] Alexander Peysakhovich,et al. Maintaining cooperation in complex social dilemmas using deep reinforcement learning , 2017, ArXiv.

[14] E. Koechlin,et al. Executive control and decision-making in the prefrontal cortex , 2015, Current Opinion in Behavioral Sciences.

[15] Stefan Elfwing,et al. Parallel reward and punishment control in humans and robots: Safe reinforcement learning using the MaxPain algorithm , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).