Path Integral Formulation of Stochastic Optimal Control with Generalized Costs

Abstract Path integral control solves a class of stochastic optimal control problems with a Monte Carlo (MC) method for an associated Hamilton-Jacobi-Bellman (HJB) equation. The MC approach avoids the need for a global grid of the domain of the HJB equation and, therefore, path integral control is in principle applicable to control problems of moderate to large dimension. The class of problems path integral control can solve, however, is defined by requirements on the cost function, the noise covariance matrix and the control input matrix. We relax the requirements on the cost function by introducing a new state that represents an augmented running cost. In our new formulation the cost function can contain stochastic integral terms and linear control costs, which are important in applications in engineering, economics and finance. We find an efficient numerical implementation of our grid-free MC approach and demonstrate its performance and usefulness in examples from hierarchical electric load management. The dimension of one of our examples is large enough to make classical grid-based HJB solvers impractical.

[1]  R Bellman,et al.  DYNAMIC PROGRAMMING AND LAGRANGE MULTIPLIERS. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[2]  R. C. Sonderegger Dynamic models of house heating based on equivalent thermal parameters , 1978 .

[3]  Paul R. Milgrom,et al.  AGGREGATION AND LINEARITY IN THE PROVISION OF INTERTEMPORAL INCENTIVES , 1987 .

[4]  W. Fleming,et al.  Controlled Markov processes and viscosity solutions , 1992 .

[5]  Alex M. Andrew,et al.  Level Set Methods and Fast Marching Methods: Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science (2nd edition) , 2000 .

[6]  Hui Ou-Yang,et al.  Optimal Contracts in a Continuous-Time Delegated Portfolio Management Problem , 2003 .

[7]  Ronald Fedkiw,et al.  Level set methods and dynamic implicit surfaces , 2002, Applied mathematical sciences.

[8]  H. Kappen Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.

[9]  A. Chorin,et al.  Stochastic Tools in Mathematics and Science , 2005 .

[10]  H. Kappen Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[11]  W.M. McEneaney,et al.  Curse-of-complexity attenuation in the curse-of-dimensionality-free method for HJB PDEs , 2008, 2008 American Control Conference.

[12]  Emanuel Todorov,et al.  Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.

[13]  A. Chorin,et al.  Implicit sampling for particle filters , 2009, Proceedings of the National Academy of Sciences.

[14]  William M. McEneaney,et al.  Convergence Rate for a Curse-of-Dimensionality-Free Method for a Class of HJB PDEs , 2009, SIAM J. Control. Optim..

[15]  Matthias Morzfeld,et al.  Implicit particle filters for data assimilation , 2010, 1005.4002.

[16]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[17]  Warren B. Powell,et al.  Adaptive Stochastic Control for the Smart Grid , 2011, Proceedings of the IEEE.

[18]  Evangelos A. Theodorou,et al.  An iterative path integral stochastic optimal control approach for learning robotic tasks , 2011 .

[19]  Ian A. Hiskens,et al.  Achieving Controllability of Electric Loads , 2011, Proceedings of the IEEE.

[20]  A. Chorin,et al.  Implicit particle filtering for models with partial noise, and an application to geomagnetic data assimilation , 2011, 1109.3664.

[21]  Matthias Morzfeld,et al.  A random map implementation of implicit filters , 2011, J. Comput. Phys..

[22]  Matthias Morzfeld,et al.  Implicit Sampling for Path Integral Control, Monte Carlo Localization, and SLAM , 2013, 1309.3615.

[23]  A. Chorin,et al.  Implicit Particle Methods and Their Connection with Variational Data Assimilation , 2012, 1205.1830.

[24]  Claire J. Tomlin,et al.  Dynamic contracts with partial observations: Application to indirect load control , 2014, 2014 American Control Conference.

[25]  Matthias Morzfeld Implicit sampling for path integral control , 2014, 2014 American Control Conference.