The matching law and melioration learning
