Including cognitive biases and distance-based rewards in a connectionist model of complex problem solving

We present a cognitive, connectionist model of complex problem solving that integrates cognitive biases with distance-based and environmental rewards under a temporal-difference learning mechanism. The model is tested against experimental data obtained in a well-defined, planning-intensive problem. We show that incorporating cognitive biases (symmetry and simplicity) into a temporal-difference learning rule (SARSA) increases model adequacy: the solution space explored by biased models better fits observed human solutions. While learning from explicit rewards alone is intrinsically slow, adding distance-based rewards, a measure of closeness to the goal, to the learning rule significantly accelerates learning. Finally, the model correctly predicts that explicit rewards have little impact on problem solvers' ability to discover optimal solutions.
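
To make the learning rule concrete, the following is a minimal sketch, not the authors' implementation: a tabular SARSA update whose reward combines a sparse explicit reward with a distance-based shaping term measuring progress toward the goal. The toy line-walk environment, the constant LAMBDA_DIST, and all function names are illustrative assumptions introduced only to keep the example self-contained and runnable.

```python
import random

# Hedged sketch of SARSA with a distance-based reward term (assumed toy setup).
ALPHA, GAMMA, EPSILON, LAMBDA_DIST = 0.1, 0.9, 0.1, 0.5
GOAL = 10  # rightmost position of the assumed toy line-walk problem


def distance_to_goal(state):
    """Closeness-to-goal heuristic; a real problem would supply its own."""
    return GOAL - state


def step(state, action):
    """Move left (-1) or right (+1); explicit reward is given only at the goal."""
    next_state = max(0, min(GOAL, state + action))
    done = next_state == GOAL
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(Q, state):
    """Pick a random action with probability EPSILON, otherwise the greedy one."""
    if random.random() < EPSILON:
        return random.choice((-1, 1))
    return max((-1, 1), key=lambda a: Q.get((state, a), 0.0))


def sarsa_episode(Q):
    state, action = 0, epsilon_greedy(Q, 0)
    done = False
    while not done:
        next_state, extrinsic, done = step(state, action)
        # Shaped reward: sparse explicit reward plus progress toward the goal,
        # the kind of distance-based term the paper reports accelerates learning.
        reward = extrinsic + LAMBDA_DIST * (
            distance_to_goal(state) - distance_to_goal(next_state)
        )
        next_action = epsilon_greedy(Q, next_state)
        target = reward + (0.0 if done else GAMMA * Q.get((next_state, next_action), 0.0))
        Q[(state, action)] = Q.get((state, action), 0.0) + ALPHA * (
            target - Q.get((state, action), 0.0)
        )
        state, action = next_state, next_action


if __name__ == "__main__":
    Q = {}
    for _ in range(200):
        sarsa_episode(Q)
    print(sorted(Q.items()))
```

In this sketch the shaping term rewards any move that reduces the distance to the goal, so the agent receives informative feedback long before it first reaches the sparse explicit reward; the cognitive biases discussed in the paper would act elsewhere, by constraining which actions are considered in the first place.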
