Paper information - How do people learn how to plan?


How does the brain learn how to plan? We reverse-engineer people's underlying learning mechanisms by combining rational process models of cognitive plasticity with recently developed empirical methods that allow us to trace the temporal evolution of people's planning strategies. We find that our Learned Value of Computation (LVOC) model accurately captures people's average learning curve. However, there were also substantial individual differences in metacognitive learning that are best understood in terms of multiple different learning mechanisms, including strategy selection learning. Furthermore, we observed that LVOC could not fully capture people's ability to adaptively decide when to stop planning. We successfully extended the LVOC model to address these discrepancies. Our models broadly capture people's ability to improve their decision mechanisms and represent a significant step towards reverse-engineering how the brain learns increasingly effective cognitive strategies through its interaction with the environment.
