Potentials based optimization with embedded Markov chain for stochastic constrained system

In this paper, a RBF neural network based on-line optimization algorithm with performance potentials analysis method is presented for a class of stochastic constrained dynamic systems. The control signals of the considered systems are constrained to a range according to a subset of the whole state space. With the conception of an embedded Markov chain, an optimization approach on the basis of potentials is presented for a stochastic constrained system, where the optimization criterion is the long-time average performance. With this approach, the computation burden has been reduced because it only requires one to compute the control strategy on the states concerned, which are a subset of the whole state space. Furthermore, with the characteristic of approximation performance of RBF neural network, the potentials and the transition probability matrix are estimated conveniently by a sample path compared with the statistic approach or the method by solving the Poisson equation. The effectiveness of the optimization approach has been shown by the simulation results, finally.

[1]  R. Bellman Dynamic programming. , 1957, Science.

[2]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[3]  Xi-Ren Cao,et al.  Perturbation realization, potentials, and sensitivity analysis of Markov processes , 1997, IEEE Trans. Autom. Control..

[4]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[5]  Onésimo Hernández-Lerma,et al.  Minimax Control of Discrete-Time Stochastic Systems , 2002, SIAM J. Control. Optim..

[6]  Zhiyuan Ren,et al.  A time aggregation approach to Markov decision processes , 2002, Autom..

[7]  Xianping Guo,et al.  Drift and monotonicity conditions for continuous-time controlled Markov chains with an average criterion , 2003, IEEE Trans. Autom. Control..

[8]  Daniel N. Nikovski,et al.  Non-Linear Stochastic Control in Continuous State Spaces by Exact Integration in Bellman's Equations , 2003 .

[9]  H. Kappen Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.

[10]  Xi-Ren Cao,et al.  Basic Ideas for Event-Based Optimization of Markov Systems , 2005, Discret. Event Dyn. Syst..

[11]  Alexander Zadorojniy,et al.  Robustness of policies in constrained Markov decision processes , 2006, IEEE Transactions on Automatic Control.

[12]  Xi-Ren Cao,et al.  Event-Based Optimization of Markov Systems , 2008, IEEE Transactions on Automatic Control.

[13]  Xi Chen,et al.  Policy iteration based feedback control , 2008, Autom..

[14]  Adam Shwartz,et al.  Exact finite approximations of average-cost countable Markov decision processes , 2007, Autom..

[15]  Andrea Castelletti,et al.  Receding horizon control for water resources management , 2008, Appl. Math. Comput..

[16]  Lei Wang,et al.  Asymptotic coefficients for Gaussian radial basis function interpolants , 2010, Appl. Math. Comput..