Comparison of basic assumptions embedded in learning models for experience-based decision making

The present study examined basic assumptions embedded in learning models for predicting behavior in decisions based on experience. In such decisions, the probabilities and payoffs are initially unknown and are learned from repeated choice with payoff feedback. We examined combinations of two rules for updating past experience with new payoff feedback and of two choice rule assumptions for mapping experience onto choices. The combination of these assumptions produced four classes of models that were systematically compared. Two methods were employed to evaluate the success of learning models for approximating players’ choices: One was based on estimating parameters from each person’s data to maximize the prediction of choices one step ahead, conditioned by the observed past history of feedback. The second was based on making a priori predictions for the entire sequence of choices using parameters estimated from a separate experiment. The results indicated the advantage of a class of models incorporating decay of previous experience, whereas the ranking of choice rules depended on the evaluation method used.

[1]  Jörg Rieskamp,et al.  How do people learn to allocate resources? Comparing two learning theories. , 2003, Journal of experimental psychology. Learning, memory, and cognition.

[2]  Timothy C. Salmon An Evaluation of Econometric Models of Adaptive Learning , 2001 .

[3]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[4]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[5]  W. Estes,et al.  A theory of stimulus variability in learning. , 1953, Psychological review.

[6]  David W Harless,et al.  The predictive utility of generalized expected utility theories , 1994 .

[7]  A. Damasio,et al.  Insensitivity to future consequences following damage to human prefrontal cortex , 1994, Cognition.

[8]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[9]  Klaus Oberauer,et al.  Beyond resources: Formal models of complexity effects and age differences in working memory , 2001 .

[10]  A. Cournot Researches into the Mathematical Principles of the Theory of Wealth , 1898, Forerunners of Realizable Values Accounting in Financial Reporting.

[11]  Peter M. Todd,et al.  Biases to the left, fallacies to the right: stuck in the middle with null hypothesis significance testing , 2000 .

[12]  R. Duncan Luce,et al.  Individual Choice Behavior , 1959 .

[13]  D. Stahl Boundedly rational rule learning in a guessing game , 1996 .

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[15]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[16]  R. Sarin,et al.  Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice , 1999 .

[17]  A. Tversky,et al.  Prospect theory: analysis of decision under risk , 1979 .

[18]  T. Robbins,et al.  Decision-making deficits in drug addiction , 2002, Trends in Cognitive Sciences.

[19]  G. Bower,et al.  From conditioning to category learning: an adaptive network model. , 1988 .

[20]  A. Newell Unified Theories of Cognition and the Role of Soar , 1992 .

[21]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[22]  Hanna Damasio,et al.  Decision-making and addiction (part I): impaired activation of somatic states in substance dependent individuals when pondering decisions with negative future consequences , 2002, Neuropsychologia.

[23]  A. Rapoport,et al.  Coordination, “Magic,” and Reinforcement Learning in a Market Entry Game , 1998 .

[24]  Nick Feltovich,et al.  Reinforcement-based vs. Belief-based Learning Models in Experimental Asymmetric-information Games , 2000 .

[25]  A. Tversky,et al.  Prospect theory: an analysis of decision under risk — Source link , 2007 .

[26]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[27]  Jerome R. Busemeyer,et al.  An adaptive approach to human decision making: Learning theory, decision theory, and human performance. , 1992 .

[28]  R. Hertwig,et al.  Decisions from Experience and the Effect of Rare Events in Risky Choice , 2004, Psychological science.

[29]  D. Fudenberg,et al.  Consistency and Cautious Fictitious Play , 1995 .

[30]  John R. Anderson,et al.  Explorations of an Incremental, Bayesian Algorithm for Categorization , 1992, Machine Learning.

[31]  Daniel Friedman,et al.  Individual Learning in Normal Form Games: Some Laboratory Results☆☆☆ , 1997 .

[32]  D. Broadbent Perception and communication , 1958 .

[33]  I. Erev,et al.  Small feedback‐based decisions and their limited correspondence to description‐based decisions , 2003 .

[34]  J. Busemeyer,et al.  Model Comparisons and Model Selections Based on Generalization Criterion Methodology. , 2000, Journal of mathematical psychology.

[35]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[36]  Fred D Merritt Book Review:Researches into the Mathematical Principles of the Theory of Wealth Augustin Cournot, Nathaniel T. Bacon , 1898 .

[37]  Farshid Vahid,et al.  Predicting How People Play Games: A Simple Dynamic Model of Choice , 2001, Games Econ. Behav..

[38]  Jerome R. Busemeyer,et al.  Cognitive models for evaluating basic decision processes in clinical populations , 2007 .

[39]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[40]  Ido Erev,et al.  On the Application and Interpretation of Learning Models , 2002 .

[41]  C. Anderson The Psychology of Doing Nothing: Forms of Decision Avoidance Result from Reason and Emotion , 2003, Psychological bulletin.

[42]  J. Busemeyer,et al.  The effect of foregone payoffs on underweighting small probability events , 2006 .

[43]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[44]  Jerome R. Busemeyer,et al.  Individual differences in the response to forgone payoffs: an examination of high functioning drug abusers , 2005 .

[45]  David Elkind,et al.  Learning: An Introduction , 1968 .

[46]  Erev,et al.  Organizational Behavior and Human Decision Processes Accidents and Decision Making under Uncertainty: a Comparison of Four Models , 2022 .

[47]  Darryl A. Seale,et al.  Reinforcement-Based Adaptive Learning in Asymmetric Two-Person Bargaining with Incomplete Information , 1998 .

[48]  I. Erev,et al.  On adaptation, maximization, and reinforcement learning among cognitive strategies. , 2005, Psychological review.

[49]  Cleotilde Gonzalez,et al.  Instance-based learning in dynamic decision making , 2003, Cogn. Sci..

[50]  Richard C. Atkinson,et al.  Human Memory: A Proposed System and its Control Processes , 1968, Psychology of Learning and Motivation.