A theory of Markovian time-inconsistent stochastic control in discrete time

We develop a theory for a general class of discrete-time stochastic control problems that, in various ways, are time-inconsistent in the sense that they do not admit a Bellman optimality principle. We attack these problems by viewing them within a game theoretic framework, and we look for subgame perfect Nash equilibrium points. For a general controlled Markov process and a fairly general objective functional, we derive an extension of the standard Bellman equation, in the form of a system of nonlinear equations, for the determination of the equilibrium strategy as well as the equilibrium value function. Most known examples of time-inconsistent stochastic control problems in the literature are easily seen to be special cases of the present theory. We also prove that for every time-inconsistent problem, there exists an associated time-consistent problem such that the optimal control and the optimal value function for the consistent problem coincide with the equilibrium control and value function, respectively for the time-inconsistent problem. To exemplify the theory, we study some concrete examples, such as hyperbolic discounting and mean–variance control.

[1]  Suleyman Basak,et al.  Dynamic Mean-Variance Asset Allocation , 2009 .

[2]  Tomas Bjork,et al.  A General Theory of Markovian Time Inconsistent Stochastic Control Problems , 2010 .

[3]  B. Peleg,et al.  On the Existence of a Consistent Course of Action when Tastes are Changing , 1973 .

[4]  Esben Masotti Kryger,et al.  Some Solvable Portfolio Problems with Quadratic and Collective Objectives , 2010 .

[5]  Robert J. Barro,et al.  Ramsey Meets Laibson in the Neoclassical Growth Model , 1999 .

[6]  Christoph Czichowsky,et al.  Time-consistent mean-variance portfolio selection in discrete and continuous time , 2012, Finance and Stochastics.

[7]  I. Ekeland,et al.  Investment and consumption without commitment , 2007, 0708.0588.

[8]  R. H. Strotz Myopia and Inconsistency in Dynamic Utility Maximization , 1955 .

[9]  Anthony A. Smith,et al.  Consumption--Savings Decisions with Quasi--Geometric Discounting , 2003 .

[10]  David I. Laibson,et al.  Dynamic Choices of Hyperbolic Consumers , 2001 .

[11]  Anthony A. Smith Consumption and Savings Decisions with Quasi-Geometric Discounting , 1999 .

[12]  J. Weibull,et al.  Multiple solutions under quasi-exponential discounting , 2009 .

[13]  Thomas Mariotti,et al.  Subjective Discounting in an Exchange Economy , 2003, Journal of Political Economy.

[14]  R. Pollak,et al.  SECOND-BEST NATIONAL SAVING AND GAME-EQUILIBRIUM GROWTH , 1980 .

[15]  X. Zhou,et al.  MEAN–VARIANCE PORTFOLIO OPTIMIZATION WITH STATE‐DEPENDENT RISK AVERSION , 2014 .

[16]  A Theory of Markovian Time Inconsistent Stochastic Control in Continuous Time , 2016 .

[17]  I. Ekeland,et al.  Being serious about non-commitment: subgame perfect equilibrium in continuous time , 2006, math/0604264.