Universal Decision Models

Humans are universal decision makers: we reason causally to understand the world; we act competitively to gain advantage in commerce, games, and war; and we are able to learn to make better decisions through trial and error. Whilst these individual modalities of decision making have been studied for decades in various subfields of AI and ML, there has been commensurately less effort in developing formalisms that unify these various modalities into common framework. In this paper, we propose Universal Decision Model (UDM), a mathematical formalism based on category theory, to address this challenge. Decision objects in a UDM correspond to instances of decision tasks, ranging from causal models and dynamical systems such as Markov decision processes and predictive state representations, to network multiplayer games and Witsenhausen’s intrinsic models, which generalizes all these previous formalisms. A UDM is a category of objects, which include decision objects, observation objects, and solution objects. Bisimulation morphisms map between decision objects that capture structure-preserving abstractions. We formulate universal properties of UDMs, including information integration, decision solvability, and hierarchical abstraction. Information integration consolidates data from heterogeneous sources by forming products or limits in the UDM category. Abstraction simulates complex decision processes by simpler processes through bisimulation morphisms by forming quotients, co-products and co-limits in the UDM category. Finally, solvability of a UDM decision object is defined by a fixed point equation, and it corresponds to an isotonic order-preserving morphism across the topology induced by UDM objects. We describe universal functorial representations of UDMs, and propose an algorithm for computing the minimal object in a UDM using algebraic topology. We sketch out an application of UDMs to causal inference in network economics, using a complex multiplayer producer-consumer two-sided marketplace.

[1]  Tilman Wolf,et al.  A Cournot–Nash–Bertrand game theory model of a service-oriented Internet with price and quality competition among network transport providers , 2014, Comput. Manag. Sci..

[2]  Ashutosh Nayyar,et al.  Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach , 2012, IEEE Transactions on Automatic Control.

[3]  STEPHEN PATRIAS ENUMERATION OF HOMOTOPY CLASSES OF FINITE T0 TOPOLOGICAL SPACES , 2008 .

[4]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[5]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[6]  H. Witsenhausen On Information Structures, Feedback and Causality , 1971 .

[7]  Hans S. Witsenhausen,et al.  A standard form for sequential stochastic control , 1973, Mathematical systems theory.

[8]  Vishal Soni,et al.  Abstraction in Predictive State Representations , 2007, AAAI.

[9]  Glynn Winskel,et al.  Bisimulation from Open Maps , 1994, Inf. Comput..

[10]  Devika Subramanian,et al.  Provably Bounded Optimal Agents , 1993, IJCAI.

[11]  Hans S. Witsenhausen,et al.  The Intrinsic Model for Discrete Stochastic Control: Some Open Problems , 1975 .

[12]  Herbert Jaeger,et al.  Links between multiplicity automata, observable operator models and predictive state representations: a unified learning framework , 2015, J. Mach. Learn. Res..

[13]  J. Mooij,et al.  Markov Properties for Graphical Models with Cycles and Latent Variables , 2017, 1710.08775.

[14]  Jean-Philippe Chancelier,et al.  Stochastic Multi-Stage Optimization , 2015 .

[15]  Ashutosh Nayyar,et al.  Common Knowledge and Sequential Team Problems , 2019, IEEE Transactions on Automatic Control.

[16]  Dimitri P. Bertsekas,et al.  Dynamic programming and optimal control, 3rd Edition , 2005 .

[17]  Richard Edwin Stearns,et al.  Some Dangers in State Reduction of Sequential Machines , 1962, Inf. Control..

[18]  Jean-Philippe Chancelier,et al.  Causal Inference Theory with Information Dependency Models , 2021, ArXiv.

[19]  Brendan Fong,et al.  The algebra of open and interconnected systems , 2016, 1609.05382.

[20]  Yoav Shoham,et al.  Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[21]  Pulkit Grover,et al.  Information Structures, the Witsenhausen Counterexample, and Communicating Using Actions , 2015, Encyclopedia of Systems and Control.

[22]  Jonathan Ariel Barmak,et al.  Algebraic Topology of Finite Topological Spaces and Applications , 2011 .

[23]  Robert Givan,et al.  Model Minimization in Markov Decision Processes , 1997, AAAI/IAAI.

[24]  J. K. Hunter,et al.  Measure Theory , 2007 .

[25]  Dimitri P. Bertsekas,et al.  Incremental constraint projection methods for variational inequalities , 2014, Math. Program..

[26]  Michael A. Arbib,et al.  Basic concepts of category theory applicable to computation and control , 1974, Category Theory Applied to Computation and Control.

[27]  Tamer Basar,et al.  Dynamic stochastic games with asymmetric information , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[28]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[29]  A. Nagurney Network Economics: A Variational Inequality Approach , 1992 .

[30]  Hiroshi Imura,et al.  Finite Topological Spaces , 1992 .

[31]  Balaraman Ravindran,et al.  SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.

[32]  James R. Munkres,et al.  Elements of algebraic topology , 1984 .

[33]  Tim Roughgarden,et al.  Algorithmic Game Theory , 2007 .

[34]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[35]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[36]  R. Goldblatt Topoi, the Categorial Analysis of Logic , 1979 .

[37]  Glynn Winskel,et al.  Bisimulation and open maps , 1993, [1993] Proceedings Eighth Annual IEEE Symposium on Logic in Computer Science.

[38]  Zhi Zhou,et al.  Edge Intelligence in the Making: Optimization, Deep Learning, and Applications , 2020, Edge Intelligence in the Making.

[39]  Sridhar Mahadevan,et al.  Causal Inference in Network Economics , 2021, ArXiv.

[40]  Bart De Moor,et al.  Subspace algorithms for the stochastic identification problem, , 1993, Autom..

[41]  Polly S Nichols,et al.  Agreeing to disagree. , 2005, General dentistry.

[42]  F. Facchinei,et al.  Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .

[43]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[44]  R. Aumann Borel structures for function spaces , 1961 .

[45]  Hans S. Witsenhausen,et al.  On the Relations Between the Value of a Game and Its Information Structure , 1971, Inf. Control..

[46]  Alison Gopnik,et al.  Children's causal inferences from indirect evidence: Backwards blocking and Bayesian reasoning in preschoolers , 2004, Cogn. Sci..

[47]  Katta G. Murty,et al.  Linear complementarity, linear and nonlinear programming , 1988 .

[48]  Hans S. Witsenhausen,et al.  Equivalent stochastic control problems , 1988, Math. Control. Signals Syst..

[49]  G. M. Korpelevich The extragradient method for finding saddle points and other problems , 1976 .

[50]  Kim G. Larsen,et al.  Bisimulation through Probabilistic Testing , 1991, Inf. Comput..

[51]  Ashutosh Nayyar,et al.  Optimal Control Strategies in Delayed Sharing Information Structures , 2010, IEEE Transactions on Automatic Control.