Learning to play 3×3 games: Neural networks as bounded-rational players

[1]  H. Simon,et al.  A Behavioral Model of Rational Choice , 1955 .

[2]  H. Simon,et al.  Theories of Decision-Making in Economics and Behavioural Science , 1966 .

[3]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[4]  I. Gilboa The Complexity of Computing Best-Response Automata in Repeated Games , 1988 .

[5]  Eduardo D. Sontag,et al.  Backpropagation Can Give Rise to Spurious Local Minima Even for Networks without Hidden Layers , 1989, Complex Syst..

[6]  Eitan Zemel,et al.  Nash and correlated equilibria: Some complexity considerations , 1989 .

[7]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[8]  Elchanan Ben-Porath The complexity of computing a best response automaton in repeated games with mixed strategies , 1990 .

[9]  Halbert White,et al.  Artificial Neural Networks: Approximation and Learning Theory , 1992 .

[10]  Ariel Rubinstein,et al.  On Price Recognition and Computational Complexity in a Monopolistic Model , 1993, Journal of Political Economy.

[11]  D. Stahl,et al.  Experimental evidence on players' models of other players , 1994 .

[12]  In-Koo Cho,et al.  Bounded rationality, neural network and folk theorem in repeated games with discounting , 1994 .

[13]  Peter Auer,et al.  Exponentially many local minima for single neurons , 1995, NIPS.

[14]  D. Stahl,et al.  On Players' Models of Other Players: Theory and Experimental Evidence , 1995 .

[15]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[16]  In-Koo Cho,et al.  Perceptrons Play the Repeated Prisoner′s Dilemma , 1995 .

[17]  Raymond J. Mooney,et al.  Induction of First-Order Decision Lists: Results on Learning the Past Tense of English Verbs , 1995, J. Artif. Intell. Res..

[18]  Thomas J. Sargent,et al.  Chapter 9 Neural networks for encoding and adapting in dynamic economies , 1996 .

[19]  Teck-Hua Ho,et al.  Finite automata play repeated prisoner's dilemma with information processing costs , 1996 .

[20]  In-Koo Cho Perceptrons Play Repeated Games with Imperfect Monitoring , 1996 .

[21]  Eduardo D. Stonag Critical points for least-squares problems involving certain analytic functions, with applications to sigmoidal nets , 1996, Adv. Comput. Math..

[22]  John H. Miller,et al.  The coevolution of automata in the repeated Prisoner's Dilemma , 1996 .

[23]  Michael W. Macy,et al.  Natural Selection and Social Learning in Prisoner's Dilemma: Coadaptation with Genetic Algorithms and Artificial Neural Networks , 1996 .

[24]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[25]  Dale O. Stahl,et al.  Population Rule Learning in Symmetric Normal-Form Games , 2001 .

[26]  Edmund T. Rolls,et al.  Introduction to Connectionist Modelling of Cognitive Processes , 1998 .

[27]  Robert Slonim,et al.  Learning rules of thumb or learning more rational rules , 1999 .

[28]  Miguel A. Costa-Gomes,et al.  Cognition and Behavior in Normal-Form Games: An Experimental Study , 1998 .

[29]  Peter L. Bartlett,et al.  Neural Network Learning - Theoretical Foundations , 1999 .

[30]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[31]  Dale O. Stahl,et al.  Rule Learning in Symmetric Normal-Form Games: Theory and Evidence , 2000, Games Econ. Behav..

[32]  Kenji Fukumizu,et al.  Local minima and plateaus in hierarchical structures of multilayer perceptrons , 2000, Neural Networks.

[33]  Brian Hazlehurst,et al.  Learning in the Cultural Process , 2000 .

[34]  Dale O. Stahl,et al.  Action-Reinforcement Learning Versus Rule Learning , 2000 .

[35]  Itzhak Gilboa,et al.  A theory of case-based decisions , 2001 .

[36]  Daniel Sgroi,et al.  Bounded-Rational Behavior by Neural Networks in Normal Form Games , 2001 .

[37]  D. Sgroi,et al.  Strategy Learning in 3x3 Games by Neural Networks , 2002 .

[38]  Daniel John Zizzo,et al.  Neurobiological measurements of cardinal utility: Hedonimeters or learning algorithms? , 2002, Soc. Choice Welf..

[39]  J. Kagel,et al.  Lessons Learned: Generalizing Learning Across Games , 2003 .

[40]  Phillip Ein-Dor,et al.  Neural nets in a group decision process , 2003, Int. J. Game Theory.

[41]  Roberto A. Weber 'Learning' with no feedback in a competitive guessing game , 2003, Games Econ. Behav..

[42]  Daniel Sgroi,et al.  Using Neural Networks to Model Bounded Rationality in Interactive Decision-Making , 2003 .

[43]  Jason Barr,et al.  Cournot Competition, Organization and Learning ∗ , 2005 .

[44]  Daniel John Zizzo,et al.  Transfer of knowledge in economic decision making , 2005 .

[45]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.