Data-driven controller design for general MIMO nonlinear systems via virtual reference feedback tuning and neural networks

In this paper, we develop a novel data-driven multivariate nonlinear controller design method for multi-input-multi-output (MIMO) nonlinear systems via virtual reference feedback tuning (VRFT) and neural networks. To the best of authors' knowledge, it is the first time to introduce VRFT to MIMO nonlinear systems in theory. Unlike the standard VRFT for linear systems, we restate the model reference control problem with time-domain model in the absence of transfer functions and simplify the objective function of VRFT without a linear filter. Then, we prove that the objective function of VRFT reaches the minimum at the same point as the optimization problem of model reference control and give the relationship between the bounds of the two optimization problems of model reference control and VRFT. A three-layer neural network is used to implement the developed method. Finally, two simulations are conducted to verify the validity of our method.

[1]  Sergio M. Savaresi,et al.  Non-iterative direct data-driven controller tuning for multivariable systems: theory and application , 2012 .

[2]  Zhuo Wang,et al.  From model-based control to data-driven control: Survey, classification and perspective , 2013, Inf. Sci..

[3]  Qinglai Wei,et al.  Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming , 2012, Autom..

[4]  Derong Liu,et al.  Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique , 2013, Neurocomputing.

[5]  Derong Liu,et al.  Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm , 2014, Neurocomputing.

[6]  Simone Formentin,et al.  Enhancing statistical performance of data-driven controller tuning via L2-regularization , 2014, Autom..

[7]  Derong Liu,et al.  Online Synchronous Approximate Optimal Learning Algorithm for Multi-Player Non-Zero-Sum Games With Unknown Dynamics , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[8]  Derong Liu,et al.  Data-Based Controllability and Observability Analysis of Linear Discrete-Time Systems , 2011, IEEE Transactions on Neural Networks.

[9]  Marco Forgione,et al.  Data-driven model improvement for model-based control , 2015, Autom..

[10]  D. Owens Iterative learning control-convergence using high gain feedback , 1992, [1992] Proceedings of the 31st IEEE Conference on Decision and Control.

[11]  Shangtai Jin,et al.  A unified data-driven design framework of optimality-based generalized iterative learning control , 2015, Comput. Chem. Eng..

[12]  Simone Formentin,et al.  A comparison of model‐based and data‐driven controller tuning , 2014 .

[13]  Pedro Albertos,et al.  Neural networks in virtual reference tuning , 2011, Eng. Appl. Artif. Intell..

[14]  Haibo He,et al.  Online Learning Control Using Adaptive Critic Designs With Sparse Kernel Machines , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Yongsheng Ding,et al.  Immunological mechanism inspired iterative learning control , 2014, Neurocomputing.

[16]  Matteo Corno,et al.  Data-Driven Design of Braking Control Systems , 2013, IEEE Transactions on Control Systems Technology.

[17]  WangZhuo,et al.  From model-based control to data-driven control , 2013 .

[18]  Radu-Emil Precup,et al.  Iterative Data-Driven Tuning of Controllers for Nonlinear Systems With Constraints , 2014, IEEE Transactions on Industrial Electronics.

[19]  Min-Sen Chiu,et al.  Integrated neuro-fuzzy model and dynamic R-parameter based quadratic criterion-iterative learning control for batch process , 2012, Neurocomputing.

[20]  Jakob Kjøbsted Huusom,et al.  A design algorithm using external perturbation to improve Iterative Feedback Tuning convergence , 2011, Autom..

[21]  Håkan Hjalmarsson,et al.  Iterative feedback tuning—an overview , 2002 .

[22]  Shangtai Jin,et al.  A Novel Data-Driven Control Approach for a Class of Discrete-Time Nonlinear Systems , 2011, IEEE Transactions on Control Systems Technology.

[23]  Xiong Yang,et al.  Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints , 2014, Int. J. Control.

[24]  MF Marcel Heertjes,et al.  Robust Data-Driven Control for the Stage Synchronization Problem , 2014 .

[25]  Derong Liu,et al.  Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming , 2013, Int. J. Control.

[26]  Derong Liu,et al.  Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm , 2013, Neurocomputing.

[27]  Masaru Uchiyama,et al.  Formation of High-Speed Motion Pattern of a Mechanical Arm by Trial , 1978 .

[28]  Sergio M. Savaresi,et al.  Noniterative data-driven design of multivariable controllers , 2011, IEEE Conference on Decision and Control and European Control Conference.

[29]  Shangtai Jin,et al.  Data-Driven Model-Free Adaptive Control for a Class of MIMO Nonlinear Discrete-Time Systems , 2011, IEEE Transactions on Neural Networks.

[30]  J. Spall,et al.  Model-free control of general discrete-time systems , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[31]  Huaguang Zhang,et al.  Model-free optimal control design for a class of linear discrete-time systems with multiple delays using adaptive dynamic programming , 2014, Neurocomputing.

[32]  Xuesong Wang,et al.  Multi-source transfer ELM-based Q learning , 2014, Neurocomputing.

[33]  Michael J. Grimble,et al.  Iterative Learning Control for Deterministic Systems , 1992 .

[34]  YangQuan Chen,et al.  Iterative Learning Control: Convergence, Robustness and Applications , 1999 .

[35]  Frank L. Lewis,et al.  Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Derong Liu,et al.  Data-based stability analysis of a class of nonlinear discrete-time systems , 2013, Inf. Sci..

[37]  Kevin L. Moore,et al.  Iterative learning control in optimal tracking problems with specified data points , 2013, Autom..

[38]  Derong Liu,et al.  Data-based Self-learning Optimal Control: Research Progress and Prospects , 2013 .

[39]  M. Nakamoto An application of the virtual reference feedback tuning for an MIMO process , 2004, SICE 2004 Annual Conference.

[40]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[41]  Qi Wang,et al.  Rate of convergence analysis of discrete simultaneous perturbation stochastic approximation algorithm , 2013, 2013 American Control Conference.

[42]  A. Karimi,et al.  Data‐driven model reference control with asymptotically guaranteed stability , 2011 .

[43]  Sergio M. Savaresi,et al.  Direct nonlinear control design: the virtual reference feedback tuning (VRFT) approach , 2006, IEEE Transactions on Automatic Control.

[44]  Sergio M. Savaresi,et al.  Virtual reference direct design method: an off-line approach to data-based control system design , 2000, IEEE Trans. Autom. Control..

[45]  Sergio M. Savaresi,et al.  A Data-Driven Approach to Control of Batch Processes With an Application to a Gravimetric Blender , 2014, IEEE Transactions on Industrial Electronics.

[46]  Torsten Jeinsch,et al.  Data-driven self-tuning control by iterative learning control with application to optimize the control parameter of turbocharged engines , 2014, 2014 19th International Conference on Methods and Models in Automation and Robotics (MMAR).

[47]  Derong Liu,et al.  Data-Driven Neuro-Optimal Temperature Control of Water–Gas Shift Reaction Using Stable Iterative Adaptive Dynamic Programming , 2014, IEEE Transactions on Industrial Electronics.

[48]  Huaguang Zhang,et al.  Multi-objective optimal control for a class of unknown nonlinear systems based on finite-approximation-error ADP algorithm , 2013, Neurocomputing.

[49]  S. Gunnarsson,et al.  A convergent iterative restricted complexity control design scheme , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[50]  Huaguang Zhang,et al.  Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions , 2009, Neurocomputing.