Paper information - Control of pH neutralization process using simulation based dynamic programming

Control of pH neutralization process using simulation based dynamic programming

The pH neutralization process has long been taken as a representative benchmark problem in nonlinear chemical process control because of its nonlinearity and time-varying nature. General nonlinear processes are difficult to control with linear model-based methods, so nonlinear control must be considered. Among the numerous approaches suggested, the most rigorous is dynamic optimization. However, as the size of the problem grows, the dynamic programming approach suffers from the curse of dimensionality. To avoid this problem, the Neuro-Dynamic Programming (NDP) approach was proposed by Bertsekas and Tsitsiklis [1996]. The NDP approach uses all the collected data to build an approximation of the optimal cost-to-go function, which is then used to find the optimal input move in real-time control. The approximator can be any type of function, such as a polynomial or a neural network. In this study, an algorithm based on the NDP approach was applied to a pH neutralization process to investigate the feasibility of the NDP algorithm and to deepen the understanding of its basic characteristics. Two approximators are investigated: a neural network, which requires training, and the k-nearest neighbor method, which requires querying instead of training. The approximator has to use data from the optimal control strategy. If the optimal control strategy is not readily available, a suboptimal control strategy can be used instead, but the laborious Bellman iterations are then necessary. For the pH neutralization process, it is rather easy to devise an optimal control strategy, so we used one and did not perform the Bellman iteration. The effects of constraints on control moves are also studied. In the simulations, the NDP method outperformed conventional PID control.
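The idea described above (collect closed-loop data, approximate the cost-to-go with a k-nearest neighbor query, then pick the input that minimizes stage cost plus approximate cost-to-go) can be sketched as follows. This is a minimal illustration and not the paper's implementation: the toy process in `step`, the proportional law in `base_policy` (a stand-in for the starting control strategy), the cost weights, and all numeric values are assumptions chosen only to keep the example short.

```python
import math

SETPOINT = 7.0  # target pH (illustrative value)
U_MAX = 1.0     # constraint on the control input (illustrative value)

def step(ph, u):
    """Hypothetical toy process, NOT the paper's model: the gain peaks near pH 7."""
    gain = 0.2 + 2.0 * math.exp(-((ph - 7.0) ** 2))
    return ph + gain * u

def stage_cost(ph, u):
    """Assumed quadratic cost on setpoint error plus a small input penalty."""
    return (ph - SETPOINT) ** 2 + 0.01 * u ** 2

def base_policy(ph):
    """Assumed stand-in for the starting control strategy: clipped proportional law."""
    return max(-U_MAX, min(U_MAX, 0.3 * (SETPOINT - ph)))

def collect_data(initial_phs, horizon=60):
    """Simulate the base policy and store (state, cost-to-go) pairs."""
    data = []
    for ph0 in initial_phs:
        states, costs = [], []
        ph = ph0
        for _ in range(horizon):
            u = base_policy(ph)
            states.append(ph)
            costs.append(stage_cost(ph, u))
            ph = step(ph, u)
        # Accumulate stage costs backward; the tail beyond the horizon is truncated.
        cost_to_go = 0.0
        for ph_k, c_k in zip(reversed(states), reversed(costs)):
            cost_to_go += c_k
            data.append((ph_k, cost_to_go))
    return data

def knn_cost_to_go(data, ph, k=3):
    """Query-based approximator: average cost-to-go of the k nearest stored states."""
    nearest = sorted(data, key=lambda d: abs(d[0] - ph))[:k]
    return sum(j for _, j in nearest) / len(nearest)

def ndp_control(data, ph, n_candidates=41):
    """One-step lookahead over a grid of admissible inputs."""
    candidates = [-U_MAX + 2.0 * U_MAX * i / (n_candidates - 1)
                  for i in range(n_candidates)]
    return min(candidates,
               key=lambda u: stage_cost(ph, u) + knn_cost_to_go(data, step(ph, u)))

# Data from nine simulated runs starting at pH 3, 4, ..., 11.
DATA = collect_data([3.0 + i for i in range(9)])

def simulate(ph0, steps=40):
    """Run the lookahead controller in closed loop and return the final pH."""
    ph = ph0
    for _ in range(steps):
        ph = step(ph, ndp_control(DATA, ph))
    return ph
```

Calling `simulate(4.0)` drives the toy process from an acidic start toward the setpoint. The k-nearest neighbor query needs no training step, which matches the contrast with the neural network drawn in the abstract; the input grid bounded by `U_MAX` is one simple way to respect a constraint on the control input.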

[1] Arthur E. Bryson, et al. Dynamic Optimization, 1998.

[2] Dale E. Seborg, et al. Adaptive nonlinear control of a pH neutralization process, 1994, IEEE Trans. Control. Syst. Technol.

[3] Michael Athans, et al. Nonlinear and Adaptive Control, 1989.

[4] Chang-Soo Han, et al. Development of the Medical Support Service Robot Using Ergonomic Design, 2003.

[5] M. A. Henson, et al. Adaptive input–output linearization of a pH neutralization process, 1997.

[6] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.

[7] Alan S. I. Zinober, et al. Nonlinear and Adaptive Control, 2003.

[8] Jong Min Lee, et al. Simulation-based learning of cost-to-go for control of nonlinear processes, 2004.

[9] Jin Bae Park, et al. Self-Recurrent Wavelet Neural Network Based Direct Adaptive Control for Stable Path Tracking of Mobile Robots, 2004.

[10] S. H. Sung, et al. Biomimetic Hopping Strategy for Robots, 2003.

[11] Jin Bae Park, et al. The Modeling of Chaotic Nonlinear System Using Wavelet Based Fuzzy Neural Network, 2004.

[12] T. Gustafsson, et al. Nonlinear and adaptive control of pH, 1992.

[13] Niket S. Kaisare, et al. Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor, 2003.