论文信息 - Learning to play 3×3 games: Neural networks as bounded-rational players - 字舞流文

Learning to play 3×3 games: Neural networks as bounded-rational players

D. Sgroi | D. Zizzo | D. J. Zizzo

[1] H. Simon,et al. A Behavioral Model of Rational Choice , 1955 .

[2] H. Simon,et al. Theories of Decision-Making in Economics and Behavioural Science , 1966 .

[3] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[4] I. Gilboa. The Complexity of Computing Best-Response Automata in Repeated Games , 1988 .

[5] Eduardo D. Sontag,et al. Backpropagation Can Give Rise to Spurious Local Minima Even for Networks without Hidden Layers , 1989, Complex Syst..

[6] Eitan Zemel,et al. Nash and correlated equilibria: Some complexity considerations , 1989 .

[7] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[8] Elchanan Ben-Porath. The complexity of computing a best response automaton in repeated games with mixed strategies , 1990 .

[9] Halbert White,et al. Artificial Neural Networks: Approximation and Learning Theory , 1992 .

[10] Ariel Rubinstein,et al. On Price Recognition and Computational Complexity in a Monopolistic Model , 1993, Journal of Political Economy.

[11] D. Stahl,et al. Experimental evidence on players' models of other players , 1994 .

[12] In-Koo Cho,et al. Bounded rationality, neural network and folk theorem in repeated games with discounting , 1994 .

[13] Peter Auer,et al. Exponentially many local minima for single neurons , 1995, NIPS.

[14] D. Stahl,et al. On Players' Models of Other Players: Theory and Experimental Evidence , 1995 .

[15] A. Roth,et al. Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[16] In-Koo Cho,et al. Perceptrons Play the Repeated Prisoner′s Dilemma , 1995 .

[17] Raymond J. Mooney,et al. Induction of First-Order Decision Lists: Results on Learning the Past Tense of English Verbs , 1995, J. Artif. Intell. Res..

[18] Thomas J. Sargent,et al. Chapter 9 Neural networks for encoding and adapting in dynamic economies , 1996 .

[19] Teck-Hua Ho,et al. Finite automata play repeated prisoner's dilemma with information processing costs , 1996 .

[20] In-Koo Cho. Perceptrons Play Repeated Games with Imperfect Monitoring , 1996 .

[21] Eduardo D. Stonag. Critical points for least-squares problems involving certain analytic functions, with applications to sigmoidal nets , 1996, Adv. Comput. Math..

[22] John H. Miller,et al. The coevolution of automata in the repeated Prisoner's Dilemma , 1996 .

[23] Michael W. Macy,et al. Natural Selection and Social Learning in Prisoner's Dilemma: Coadaptation with Genetic Algorithms and Artificial Neural Networks , 1996 .

[24] A. Roth,et al. Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[25] Dale O. Stahl,et al. Population Rule Learning in Symmetric Normal-Form Games , 2001 .

[26] Edmund T. Rolls,et al. Introduction to Connectionist Modelling of Cognitive Processes , 1998 .

[27] Robert Slonim,et al. Learning rules of thumb or learning more rational rules , 1999 .

[28] Miguel A. Costa-Gomes,et al. Cognition and Behavior in Normal-Form Games: An Experimental Study , 1998 .

[29] Peter L. Bartlett,et al. Neural Network Learning - Theoretical Foundations , 1999 .

[30] Colin Camerer,et al. Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[31] Dale O. Stahl,et al. Rule Learning in Symmetric Normal-Form Games: Theory and Evidence , 2000, Games Econ. Behav..

[32] Kenji Fukumizu,et al. Local minima and plateaus in hierarchical structures of multilayer perceptrons , 2000, Neural Networks.

[33] Brian Hazlehurst,et al. Learning in the Cultural Process , 2000 .

[34] Dale O. Stahl,et al. Action-Reinforcement Learning Versus Rule Learning , 2000 .

[35] Itzhak Gilboa,et al. A theory of case-based decisions , 2001 .

[36] Daniel Sgroi,et al. Bounded-Rational Behavior by Neural Networks in Normal Form Games , 2001 .

[37] D. Sgroi,et al. Strategy Learning in 3x3 Games by Neural Networks , 2002 .

[38] Daniel John Zizzo,et al. Neurobiological measurements of cardinal utility: Hedonimeters or learning algorithms? , 2002, Soc. Choice Welf..

[39] J. Kagel,et al. Lessons Learned: Generalizing Learning Across Games , 2003 .

[40] Phillip Ein-Dor,et al. Neural nets in a group decision process , 2003, Int. J. Game Theory.

[41] Roberto A. Weber. 'Learning' with no feedback in a competitive guessing game , 2003, Games Econ. Behav..

[42] Daniel Sgroi,et al. Using Neural Networks to Model Bounded Rationality in Interactive Decision-Making , 2003 .

[43] Jason Barr,et al. Cournot Competition, Organization and Learning ∗ , 2005 .

[44] Daniel John Zizzo,et al. Transfer of knowledge in economic decision making , 2005 .

[45] Philipp Slusallek,et al. Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.