An Economist's Perspective on Probability Matching

The experimental phenomenon known as ‘probability matching’ is often offered as evidence in support of adaptive learning models and against the idea that people maximise their expected utility. Recent interest in dynamic-based equilibrium theories means the term re-appears in Economics. However, there seems to be conflicting views on what is actually meant by the term and about the validity of the data. The purpose of this paper is therefore threefold: First, to introduce today’s readers to what is meant by probability matching, and in particular to clarify which aspects of this phenomenon challenge the utility-maximisation hypothesis. Second, to familiarise the reader with the different theoretical approaches to behaviour in such circumstances, and to focus on the differences in predictions between these theories in light of recent advances. Third, to provide a comprehensive survey of repeated, binary choice experiments.

[1]  E. Thorndike “Animal Intelligence” , 1898, Nature.

[2]  L. Humphreys Acquisition and extinction of verbal expectations in a situation analogous to conditioning. , 1939 .

[3]  E. Brunswik Probability as a determiner of rat behavior. , 1939 .

[4]  M. Jarvik,et al.  Probability learning and a negative recency effect in the serial anticipation of alternative symbols. , 1951, Journal of experimental psychology.

[5]  H. Hake,et al.  Perception of the statistical structure of a random series of binary symbols. , 1953, Journal of experimental psychology.

[6]  W. Estes,et al.  Rate of verbal conditioning in relation to stimulus variability. , 1954, Journal of experimental psychology.

[7]  W. Estes,et al.  Analysis of a verbal conditioning situation in terms of statistical learning theory , 1954 .

[8]  J. Goodnow Determinants of choice-distribution in two-choice situations. , 1955, The American journal of psychology.

[9]  H. Simon,et al.  Rational choice and the structure of the environment. , 1956, Psychological review.

[10]  E. Neimark Effects of type of nonreinforcement and number of alternative responses in two verbal conditioning situations. , 1956, Journal of experimental psychology.

[11]  W. Edwards Reward probability, amount, and information as determiners of sequential two-alternative decisions. , 1956, Journal of experimental psychology.

[12]  W. K. Estes,et al.  Theory of learning with constant, variable, or contingent probabilities of reinforcement , 1957 .

[13]  Kenneth J. Arrow,et al.  Utilities, Attitudes, Choices: A Review Note , 1958 .

[14]  J. Engler Marginal and conditional stimulus and response probabilities in verbal conditioning. , 1958, Journal of experimental psychology.

[15]  A. Rechtschaffen,et al.  Replication report: two- and three-choice verbal-conditioning phenomena. , 1958, Journal of experimental psychology.

[16]  E. Galanter,et al.  Some experiments on a simple thought-problem. , 1958, The American journal of psychology.

[17]  E. H. Shuford,et al.  Comparison of predictions and estimates in a probability learning situation. , 1959, Journal of experimental psychology.

[18]  D. C. Nicks Prediction of sequential two-choice decisions from event runs. , 1959, Journal of experimental psychology.

[19]  S. Siegel,et al.  Decision making behavior in a two-choice uncertain outcome situation. , 2010, Journal of experimental psychology.

[20]  I RUBINSTEIN,et al.  Some factors in probability matching. , 1959, Journal of experimental psychology.

[21]  P. D. Mccormack Spatial Generalization and Probability-Learning in a Five-Choice Situation , 1959 .

[22]  W. Runquist,et al.  Probability-matching with an unscheduled random sequence. , 1960, The American journal of psychology.

[23]  N. Anderson,et al.  Likelihood judgments and sequential effects in a two-choice probability learning situation. , 1960, Journal of experimental psychology.

[24]  N. Anderson Effect of first-order conditional probability in two-choice learning situation. , 1960, Journal of experimental psychology.

[25]  M. Stone,et al.  Studies in mathematical learning theory. , 1960 .

[26]  J. Laurie Snell,et al.  Studies in mathematical learning theory. , 1960 .

[27]  S. Siegel DECISION MAKING AND LEARNING UNDER VARYING CONDITIONS OF REINFORCEMENT * , 1961 .

[28]  Patrick Suppes,et al.  Behavioristic foundations of utility , 1961 .

[29]  W. Edwards,et al.  Probability learning in 1000 trials. , 1961, Journal of experimental psychology.

[30]  R J HERRNSTEIN,et al.  Relative and absolute strength of response as a function of frequency of reinforcement. , 1961, Journal of the experimental analysis of behavior.

[31]  Y. Brackbill,et al.  Magnitude of reward and probability learning. , 1962, Journal of experimental psychology.

[32]  Patrick Suppes,et al.  Markov learning models for multiperson interactions , 1962 .

[33]  M E Bitterman,et al.  Probability Learning. , 1962, Science.

[34]  Y. Brackbill,et al.  Supplementary report: the utility of correctly predicting infrequent events. , 1962, Journal of Experimental Psychology.

[35]  J. L. Myers,et al.  DIFFERENTIAL MONETARY GAINS AND LOSSES AND EVENT PROBABILITY IN A TWO-CHOICE SITUATION. , 1963, Journal of experimental psychology.

[36]  H. Gelfand,et al.  The learning of choices between bets , 1964 .

[37]  Frank Restle,et al.  Psychology of judgment and choice: A theoretical essay. , 1962 .

[38]  A. Tversky,et al.  Information versus reward in binary choices. , 1966, Journal of experimental psychology.

[39]  Amnon Rapoport,et al.  Handbook of Mathematical Psychology, Volume III. , 1967 .

[40]  P. Derks,et al.  Simple strategies in binary prediction by children and adults. , 1967 .

[41]  W M Baum,et al.  Choice as time allocation. , 1969, Journal of the experimental analysis of behavior.

[42]  M. Fiorina A note on probability matching and rational choice , 1971 .

[43]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[44]  J. Cross A Stochastic Learning Model of Economic Behavior , 1973 .

[45]  W M Baum,et al.  On two types of deviation from the matching law: bias and undermatching. , 1974, Journal of the experimental analysis of behavior.

[46]  M. Rothschild A two-armed bandit theory of market pricing , 1974 .

[47]  R J Herrnstein,et al.  Formal properties of the matching law. , 1974, Journal of the experimental analysis of behavior.

[48]  R. Herrnstein,et al.  Maximizing and matching on concurrent ratio schedules. , 1975, Journal of the experimental analysis of behavior.

[49]  Richard J. Herrnstein,et al.  MAXIMIZING AND MATCHING ON CONCURRENT RATIO SCHEDULES1 , 1975 .

[50]  J. Myerson,et al.  The kinetics of choice: An operant systems analysis. , 1980 .

[51]  M. Davison,et al.  The matching law: A research review. , 1988 .

[52]  D R Shanks,et al.  Connectionism and the Learning of Probabilistic Concepts , 1990, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[53]  Reinhard Selten,et al.  Anticipatory Learning in Two-Person Games , 1991 .

[54]  W. Estes Toward a Statistical Theory of Learning. , 1994 .

[55]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[56]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[57]  Fang-Fang Tang,et al.  Anticipatory Learning in Two-Person Games: An Experimental Study, Part II. Learning , 1996 .

[58]  Teck-Hua Ho,et al.  Experience-Weighted Attraction Learning in Games: A Unifying Approach , 1997 .

[59]  Tilman Börgers,et al.  Learning Through Reinforcement and Replicator Dynamics , 1997 .

[60]  Bereby-Meyer,et al.  On Learning To Become a Successful Loser: A Comparison of Alternative Abstractions of Learning Processes in the Loss Domain. , 1998, Journal of mathematical psychology.

[61]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[62]  Tilman Börgers,et al.  Naive Reinforcement Learning With Endogenous Aspirations , 2000 .