Relaxed long run average continuous control of piecewise deterministic Markov processes

In this paper we consider the long run average continuous control problem of piecewise-deterministic Markov processes (PDP's for short). The control variable acts on the jump rate λ and transition measure Q of the PDP. We consider relaxed open loop policies which choose, at each jump time, randomized (rather than deterministic) control actions. The advantage of allowing randomized actions is that the optimality equation for the continuous-time problem can be re-written as a discrete-time Markov decision process with compact action space. The main goal of this paper is to show the compactness proprieties of the action space for discrete-time problem as well as to prove the equivalence between the optimality equations of the continuous and discrete-time problems.

[1]  O. Hernández-Lerma,et al.  Discrete-time Markov control processes , 1999 .

[2]  O. Costa,et al.  Impulse and continuous control of piecewise deterministic Markov processes , 2000 .

[3]  ANTHONY ALMUDEVAR,et al.  A Dynamic Programming Algorithm for the Optimal Control of Piecewise Deterministic Markov Processes , 2001, SIAM J. Control. Optim..

[4]  Mark H. Davis Markov Models and Optimization , 1995 .

[5]  Oswaldo Luiz do Valle Costa,et al.  Stability of Piecewise-Deterministic Markov Processes , 1999 .

[6]  Mark H. A. Davis Piecewise‐Deterministic Markov Processes: A General Class of Non‐Diffusion Stochastic Models , 1984 .

[7]  Quanxin Zhu,et al.  Average optimality for Markov decision processes in borel spaces: a new condition and approach , 2006, Journal of Applied Probability.

[8]  Oswaldo Luiz do Valle Costa,et al.  Average Continuous Control of Piecewise Deterministic Markov Processes , 2005, CDC 2005.

[9]  O.L.V. Costa,et al.  Optimal stopping with continuous control of piecewise deterministic Markov processes , 2000 .

[10]  Juan Ye Optimal control of piecewise deterministic Markov processes. , 1990 .

[11]  A. A. Yushkevich,et al.  Continuous time markov decision processes with interventions , 1983 .

[12]  Jane J. Ye,et al.  Impulse Control of Piecewise Deterministic Markov Processes , 1995 .

[13]  S. Ethier,et al.  Markov Processes: Characterization and Convergence , 2005 .

[14]  Arie Hordijk,et al.  Markov Decision Drift Processes; Conditions for Optimality Obtained by Discretization , 1985, Math. Oper. Res..

[15]  Dariusz Gatarek,et al.  Optimality conditions for impulsive control of piecewise-deterministic processes , 1992, Math. Control. Signals Syst..

[16]  A. A. Yushkevich,et al.  Bellman inequalities in markov decision deterministic drift processes , 1988 .

[17]  François Dufour,et al.  On the Poisson Equation for Piecewise-Deterministic Markov Processes , 2003, SIAM J. Control. Optim..

[18]  J. Schwartz,et al.  Linear Operators. Part I: General Theory. , 1960 .

[19]  Jane J. Ye,et al.  Necessary and sufficient optimality conditions for control of piecewise deterministic markov processes , 1992 .

[20]  U. S. Gugerui Optimal stopping of a piecewise-deterministic markov process , 1986 .

[21]  Arie Hordijk,et al.  Discretization and Weak Convergence in Markov Decision Drift Processes , 1984, Math. Oper. Res..