Switching Supervisory Control Using Calibrated Forecasts

In this paper, we approach supervisory control as an online decision problem. In particular, we introduce ldquocalibrated forecastsrdquo as a mechanism for controller selection in supervisory control. The forecasted quantity is a candidate controller's performance level, or reward, over finite implementation horizon. Controller selection is based on using the controller with the maximum calibrated forecast of the reward. The proposed supervisor does not perform a pre-routed search of candidate controllers and does not require the presence of exogenous inputs for excitation or identification. Assuming the existence of a stabilizing controller within the set of candidate controllers, we show that under the proposed supervisory controller, the output of the system remains bounded for any bounded disturbance, even if the disturbance is chosen in an adversarial manner. The use of calibrated forecasts enables one to establish overall performance guarantees for the supervisory scheme even though non-stabilizing controllers may be persistently selected by the supervisor because of the effects of initial conditions, exogenous disturbances, or random selection. The main results are obtained for a general class of system dynamics and specialized to linear systems.

[1]  M. Benaïm Dynamics of stochastic approximation algorithms , 1999 .

[2]  Joao P. Hespanha,et al.  Logic-based switching control of a nonholonomic system with parametric modeling uncertainty , 1999 .

[3]  A. Paul,et al.  Cost detectability and stability of adaptive control systems , 2007 .

[4]  A. Dawid The Well-Calibrated Bayesian: Rejoinder , 1982 .

[5]  A. Dawid The Well-Calibrated Bayesian , 1982 .

[6]  Nimrod Megiddo,et al.  Combining expert advice in reactive environments , 2006, JACM.

[7]  A. Morse Supervisory control of families of linear set-point controllers Part I. Exact matching , 1996, IEEE Trans. Autom. Control..

[8]  P. Ramadge,et al.  Controller switching based on output prediction errors , 1998, IEEE Trans. Autom. Control..

[9]  Dean P. Foster,et al.  Regret in the On-Line Decision Problem , 1999 .

[10]  B. Barmish,et al.  Adaptive stabilization of linear systems via switching control , 1986, 1986 25th IEEE Conference on Decision and Control.

[11]  João Pedro Hespanha,et al.  Postprints from CCDC Title Hysteresis-based switching algorithms for supervisory control of uncertain systems Permalink , 2002 .

[12]  João Pedro Hespanha,et al.  Overcoming the limitations of adaptive control by means of logic-based switching , 2003, Syst. Control. Lett..

[13]  Michael G. Safonov,et al.  The unfalsified control concept and learning , 1997 .

[14]  Sham M. Kakade,et al.  Deterministic calibration and Nash equilibrium , 2004, J. Comput. Syst. Sci..

[15]  A. Dawid Self-Calibrating Priors Do Not Exist: Comment , 1985 .

[16]  Shie Mannor,et al.  Online calibrated forecasts: Memory efficiency versus universality for learning in games , 2006, Machine Learning.

[17]  Ali H. Sayed,et al.  Fundamentals Of Adaptive Filtering , 2003 .

[18]  A. Morse Supervisory control of families of linear set-point controllers. 2. Robustness , 1997, IEEE Trans. Autom. Control..

[19]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..