Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence

Game theory treats players as special: A description of a game contains a full, explicit enumeration of all players. This isn’t a realistic assumption for autonomous intelligent agents. In this paper, we propose a framework in which agents and their environments are both modelled as probablistic oracle machines with access to a “reflective” oracle, which is able to answer questions about the outputs of other machines with access to the same oracle. These oracles avoid diagonalization problems by answering some queries randomly. Agents make decisions by asking the oracle questions about their environment, which they model as an arbitrary oracle machines. Since agents are themselves oracle machines, the environment can contain other agents as non-distinguished subprocesses, removing the special treatment of players in the classical theory. We show that agents interacting in this way play Nash equilibria.

[1]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[2]  Nick Bostrom,et al.  Superintelligence: Paths, Dangers, Strategies , 2014 .

[3]  Benja Fallenstein,et al.  Aligning Superintelligence with Human Interests: A Technical Research Agenda , 2015 .

[4]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part II , 1964, Inf. Control..

[5]  Stuart J. Russell,et al.  Research Priorities for Robust and Beneficial Artificial Intelligence , 2015, AI Mag..

[6]  Haim Gaifman,et al.  Reasoning with Limited Resources and Assigning Probabilities to Arithmetical Statements , 2004, Synthese.

[7]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[8]  David Lewis,et al.  Causal decision theory , 1981 .

[9]  Marcus Hutter,et al.  Universal Artificial Intellegence - Sequential Decisions Based on Algorithmic Probability , 2005, Texts in Theoretical Computer Science. An EATCS Series.

[10]  Roger B. Myerson,et al.  Game theory - Analysis of Conflict , 1991 .

[11]  B. Jones BOUNDED RATIONALITY , 1999 .

[12]  E. Eells Causal Decision Theory , 1984, PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.

[13]  D. C. Cooper,et al.  Theory of Recursive Functions and Effective Computability , 1969, The Mathematical Gazette.

[14]  Marcus Hutter,et al.  Universal Artificial Intelligence: Sequential Decisions Based on Algorithmic Probability (Texts in Theoretical Computer Science. An EATCS Series) , 2006 .

[15]  Ken Binmore,et al.  Modeling Rational Players: Part II , 1987, Economics and Philosophy.

[16]  David M. Kreps,et al.  A Course in Microeconomic Theory , 2020 .

[17]  Ken Binmore,et al.  Modeling rational players I , 1987 .

[18]  Dr. Marcus Hutter,et al.  Universal artificial intelligence , 2004 .

[19]  J. Schreiber Foundations Of Statistics , 2016 .

[20]  Benja Fallenstein,et al.  Reflective Variants of Solomonoff Induction and AIXI , 2015, AGI.

[21]  Benja Fallenstein,et al.  Reflective Oracles: A Foundation for Classical Game Theory , 2015, ArXiv.

[22]  K. Fan Fixed-point and Minimax Theorems in Locally Convex Topological Linear Spaces. , 1952, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part I , 1964, Inf. Control..

[24]  Laurent Orseau,et al.  Space-Time Embedded Intelligence , 2012, AGI.