On designing economic agents that behave like human agents

This paper explores the idea of constructing theoretical economic agents that behave like actual human agents and using them in neoclassical economic models. It does this in a repeated-choice setting by postulating “artificial agents” who use a learning algorithm calibrated against human learning data from psychological experiments. The resulting calibrated algorithm appears to replicate human learning behavior to a high degree and reproduces several “stylized facts” of learning. It can therefore be used to replace the idealized, perfectly rational agents in appropriate neoclassical models with “calibrated agents” that represent actual human behavior. The paper discusses the possibilities of using the algorithm to represent human learning in normal-form stage games and in more general neoclassical models in economics. It explores the likelihood of convergence to long-run optimality and to Nash behavior, and the “characteristic learning time” implicit in human adaptation in the economy.

[1]  Richard C. Atkinson,et al.  Stimulus Sampling Theory , 1967 .

[2]  J. E. Mazur,et al.  Choice behavior in transition: development of preference for the higher probability of reinforcement. , 1990, Journal of the experimental analysis of behavior.

[3]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[4]  Edmond P. Odescalchi Can a Machine Think , 1958 .

[5]  R. Herrnstein,et al.  Utility maximization and melioration: Internalities in individual choice , 1993 .

[6]  Mikhail Borisovich Nevelʹson,et al.  Stochastic Approximation and Recursive Estimation , 1976 .

[7]  R. Lucas ASSET PRICES IN AN EXCHANGE ECONOMY , 1978 .

[8]  B. Skinner,et al.  Principles of Behavior , 1944 .

[9]  E. Tolman Purposive behavior in animals and men , 1932 .

[10]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[11]  I︠a︡. Z. T︠S︡ypkin,et al.  Foundations of the theory of learning systems , 1973 .

[12]  Gardner Ra Probability-learning with two and three choices. , 1957 .

[13]  M. Rothschild A two-armed bandit theory of market pricing , 1974 .

[14]  R. Pemantle,et al.  Nonconvergence to Unstable Points in Urn Models and Stochastic Approximations , 1990 .

[15]  D. O. Hebb,et al.  The organization of behavior , 1988 .

[16]  Merrill M. Flood,et al.  On Stochastic Learning Theory , 1952 .

[17]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[18]  R. Has’minskiĭ,et al.  Stochastic Approximation and Recursive Estimation , 1976 .

[19]  R. Herrnstein,et al.  CHAPTER 5 – Melioration and Behavioral Allocation1 , 1980 .

[20]  Paul R. Milgrom,et al.  Adaptive and sophisticated learning in normal form games , 1991 .

[21]  J. Goodnow Determinants of choice-distribution in two-choice situations. , 1955, The American journal of psychology.

[22]  M. Bray Learning, estimation, and the stability of rational expectations , 1982 .

[23]  M. L. Tsetlin,et al.  Automaton theory and modeling of biological systems , 1973 .

[24]  James G. Greeno,et al.  Introduction to mathematical psychology , 1970 .

[25]  T. Sargent,et al.  Convergence of Least Squares Learning Mechanisms in Self- Referential Linear Stochastic Models* , 1989 .

[26]  John H. Holland,et al.  Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .

[27]  Ellen R. McGrattan,et al.  Money as a medium of exchange in an economy with artificially intelligent agents , 1990 .

[28]  Richard J. Herrnstein,et al.  Derivatives of Matching. , 1979 .

[29]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .