Elements of a Utilitarian Theory of Knowledge and Action

According to the utilitarian paradigm, an autonomous intelligent agent's interactions with the environment should be guided by the principle of expected utility maximization. We apply this paradigm to reasoning about an agent's physical actions and exploratory behavior in urgent, time-constrained situations. We model an agent's knowledge with a temporalized version of Kripke structures--as a set of branching time lines described by fluents, with accessibility relations holding among the states comprising the time lines. We describe how to compute utility based on this model which reflects the urgency that the environment imposes on time. Since the physical and exploratory actions that an agent could undertake transform the model of branching time lines in specific ways, the expected utilities of these actions can be computed, dictating rational tradeoffs among them depending on the agent's state of knowledge and the urgency of the situation.

[1]  Drew McDermott,et al.  A Temporal Logic for Reasoning About Processes and Plans , 1982, Cogn. Sci..

[2]  Peter Haddawy,et al.  A Temporal Probability Logic for Representing Actions , 1991, KR.

[3]  Edmund H. Durfee,et al.  A decision-theoretic approach to coordinating multiagent interactions , 1991, IJCAI 1991.

[4]  Steve Hanks,et al.  Practical Temporal Projection , 1990, AAAI.

[5]  von Weizäscker,et al.  Existence of Optimal Programs of Accumulation for an Infinite Time Horizon , 1965 .

[6]  Robert C. Moore A Formal Theory of Knowledge and Action , 1984 .

[7]  L. McKenzie,et al.  Optimal economic growth, turnpike theorems and comparative dynamics , 1986 .

[8]  Edmund H. Durfee,et al.  A Logic of Knowledge and Belief for Recursive Modeling: A Preliminary Report , 1992, AAAI.

[9]  Matthew L. Ginsberg,et al.  Computational Considerations in Reasoning about Action , 1991, KR.

[10]  Jon Doyle,et al.  Doyle See Infer Choose Do Perceive Act , 2009 .

[11]  Michael P. Wellman,et al.  Planning and Control , 1991 .

[12]  Joseph Y. Halpern,et al.  Reasoning about knowledge: a survey , 1995 .

[13]  Ralph L. Keeney,et al.  Decisions with multiple objectives: preferences and value tradeoffs , 1976 .

[14]  Stuart J. Russell,et al.  On Optimal Game-Tree Search using Rational Meta-Reasoning , 1989, IJCAI.

[15]  Mark S. Boddy,et al.  An Analysis of Time-Dependent Planning , 1988, AAAI.

[16]  R. N. Pelavin A formal approach to planning with concurrent actions and external events , 1988 .

[17]  Edmund H. Durfee,et al.  The Utility of Communication in Coordinating Intelligent Agents , 1991, AAAI.

[18]  Joseph Y. Halpern,et al.  A Guide to the Modal Logics of Knowledge and Belief: Preliminary Draft , 1985, IJCAI.

[19]  Roy Radner,et al.  Equilibrium under uncertainty , 1993 .

[20]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  F. B. Vernadat,et al.  Decisions with Multiple Objectives: Preferences and Value Tradeoffs , 1994 .

[22]  Eric Horvitz,et al.  Reflection and Action Under Scarce Resources: Theoretical Principles and Empirical Study , 1989, IJCAI.

[23]  Leora Morgenstern,et al.  Knowledge Preconditions for Actions and Plans , 1988, IJCAI.

[24]  Yoav Shoham,et al.  Time for Action: On the Relation Between Time, Knowledge and Action , 1989, IJCAI.

[25]  P. Cohen Intention = Choice + Commitment1 , 2022 .

[26]  Mark S. Boddy,et al.  Solving Time-Dependent Planning Problems , 1989, IJCAI.