A neuro-adaptive architecture for extremum seeking control using hybrid learning dynamics

This paper presents a novel approach to achieve online multivariable hybrid optimization of response maps associated to set-valued dynamical systems, without requiring the use of averaging theory. In particular, we propose a prescriptive framework for the analysis and design of a class of adaptive control architectures based on neural networks (NN) and learning dynamics described by hybrid dynamical systems (HDS). The NNs are used as model-free gradient approximators that are online tuned in order to obtain an arbitrarily precise estimation on a compact set of the gradient of the response map of the system under control. For the closed-loop system a semi-global practical asymptotic stability result is obtained, and the results are illustrated via numerical examples.

[1]  M. Krstić,et al.  Real-Time Optimization by Extremum-Seeking Control , 2003 .

[2]  Jorge Cortés,et al.  Finite-time convergent gradient flows with applications to network consensus , 2006, Autom..

[3]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[4]  Denis Dochain,et al.  Extremum-seeking control of retention for a microparticulate system , 2008 .

[5]  Martin Guay,et al.  Model adequacy for a precise optimization using extremum seeking control , 2016, 2016 European Control Conference (ECC).

[6]  Denis Dochain,et al.  A time-varying extremum-seeking control approach , 2013, ACC.

[7]  Denis Dochain,et al.  A constrained extremum‐seeking control approach , 2015 .

[8]  A. Teel,et al.  Solving smooth and nonsmooth multivariable extremum seeking problems by the methods of nonlinear programming , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[9]  A. Teel,et al.  Semi-global practical asymptotic stability and averaging , 1999 .

[10]  Dragan Nesic,et al.  Analysis for a class of singularly perturbed hybrid systems via averaging , 2012, Autom..

[11]  Chunlei Zhang,et al.  Extremum-Seeking Control and Applications: A Numerical Optimization-Based Approach , 2011 .

[12]  Andrew R. Teel,et al.  A framework for a class of hybrid extremum seeking controllers with dynamic inclusions , 2017, Autom..

[13]  Andrew R. Teel,et al.  A Robust Event-Triggered Approach for Fast Sampled-Data Extremization and Learning , 2017, IEEE Transactions on Automatic Control.

[14]  Frank L. Lewis,et al.  Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2010, Autom..

[15]  Frank L. Lewis,et al.  Reinforcement Learning and Approximate Dynamic Programming for Feedback Control , 2012 .

[16]  Ricardo G. Sanfelice,et al.  On singular perturbations due to fast actuators in hybrid control systems , 2011, Autom..

[17]  Milos S. Stankovic,et al.  Lie bracket approximation of extremum seeking systems , 2011, Autom..

[18]  Petros A. Ioannou,et al.  Robust Adaptive Control , 2012 .

[19]  R. Sanfelice,et al.  Hybrid dynamical systems , 2009, IEEE Control Systems.

[20]  Shouchuan Hu Differential equations with discontinuous right-hand sides☆ , 1991 .

[21]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[22]  Ying Tan,et al.  On non-local stability properties of extremum seeking control , 2006, Autom..