An automatic method for discovering rational heuristics for risky choice

What is the optimal way to make a decision given that your time is limited and your cognitive resources are bounded? To answer this question, we formalized the bounded optimal decision process as the solution to a meta-level Markov decision process whose actions are costly computations. We approximated the optimal solution and evaluated its predictions against human choice behavior in the Mouselab paradigm, which is widely used to study decision strategies. Our computational method rediscovered well-known heuristic strategies and the conditions under which they are used, as well as novel heuristics. A Mouselab experiment confirmed our model’s main predictions. These findings are a proof-of-concept that optimal cognitive strategies can be automatically derived as the rational use of finite time and bounded cognitive resources.

[1]  G Gigerenzer,et al.  Reasoning the fast and frugal way: models of bounded rationality. , 1996, Psychological review.

[2]  Alexandre Pouget,et al.  Optimal policy for value-based decision-making , 2016, Nature Communications.

[3]  H. Simon,et al.  Rational choice and the structure of the environment. , 1956, Psychological review.

[4]  Steven M. Shugan The Cost Of Thinking , 1980 .

[5]  Thomas L. Griffiths,et al.  Rational Use of Cognitive Resources: Levels of Analysis Between the Computational and the Algorithmic , 2015, Top. Cogn. Sci..

[6]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[7]  R. Selten,et al.  Bounded rationality: The adaptive toolbox , 2000 .

[8]  David Tolpin,et al.  Selecting Computations: Theory and Applications , 2012, UAI.

[9]  Shipra Agrawal,et al.  Thompson Sampling for Contextual Bandits with Linear Payoffs , 2012, ICML.

[10]  Thomas L. Griffiths,et al.  "Burn-in, bias, and the rationality of anchoring" , 2012, NIPS.

[11]  Eric J. Johnson,et al.  Adaptive Strategy Selection in Decision Making. , 1988 .

[12]  John W. Payne,et al.  Monitoring Information Processing and Decisions: The Mouselab System , 1989 .

[13]  A. Tversky,et al.  Judgment under Uncertainty , 1982 .

[14]  Thomas L. Griffiths,et al.  Algorithm selection by rational metareasoning as a model of human strategy selection , 2014, NIPS.

[15]  Thomas L. Griffiths,et al.  The high availability of extreme events serves resource-rational decision-making , 2014, CogSci.

[16]  Stuart J. Russell,et al.  Principles of Metareasoning , 1989, Artif. Intell..

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.