Toward high-performance, memory-efficient, and fast reinforcement learning—Lessons from decision neuroscience

Insights from decision neuroscience raise hope for intelligent brain-inspired solutions to robot learning in real dynamic environments. Recent insights from decision neuroscience raise hope for the development of intelligent brain-inspired solutions to robot learning in real dynamic environments full of noise and unpredictability.

[1]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[2]  Shimon Whiteson,et al.  Learning with Opponent-Learning Awareness , 2017, AAMAS.

[3]  Shinsuke Shimojo,et al.  Neural Computations Mediating One-Shot Learning in the Human Brain , 2013, PLoS biology.

[4]  P. Kollock SOCIAL DILEMMAS: The Anatomy of Cooperation , 1998 .

[5]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[6]  Joel Z. Leibo,et al.  Prefrontal cortex as a meta-reinforcement learning system , 2018, bioRxiv.

[7]  Shinsuke Shimojo,et al.  Neural Computations Underlying Arbitration between Model-Based and Model-free Learning , 2013, Neuron.

[8]  N. Daw,et al.  Deciding How To Decide: Self-Control and Meta-Decision Making , 2015, Trends in Cognitive Sciences.

[9]  Joshua B. Tenenbaum,et al.  Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction , 2016, CogSci.

[10]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[11]  Joshua B. Tenenbaum,et al.  Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[12]  H. Lau,et al.  How to measure metacognition , 2014, Front. Hum. Neurosci..

[13]  Alexander Peysakhovich,et al.  Maintaining cooperation in complex social dilemmas using deep reinforcement learning , 2017, ArXiv.

[14]  E. Koechlin,et al.  Executive control and decision-making in the prefrontal cortex , 2015, Current Opinion in Behavioral Sciences.

[15]  Stefan Elfwing,et al.  Parallel reward and punishment control in humans and robots: Safe reinforcement learning using the MaxPain algorithm , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).