论文信息 - Learning and information for dual control

Learning and information for dual control

In dual control problems, the aim is to concurrently learn and control an unknown system. However, actively learning the system conflicts directly with any given control objective as it involves disturbing the system for exploration. This paper presents a multi-objective approach to dual control, which explicitly quantifies both the learning and control objectives. Mutual information and relative entropy from information theory are used to quantify the information gain in active learning as part of the exploration process. The information gain is then balanced against a standard control objective. The presented approach is illustrated using Gaussian process regression, which provides a framework for learning nonlinear systems and is used as a demonstrative example. It is shown that the derived information measures are closely related to the variance of the predictive Gaussian distribution estimating the system.

Tansu Alpcan | Michael Cantoni | Iman Shames | Girish N. Nair

[1] B. Wittenmark. Adaptive dual control , 2002 .

[2] Dan Wang,et al. A Neural Network Based Method for Solving Discrete-Time Nonlinear Output Regulation Problem in Sampled-Data Systems , 2004, ISNN.

[3] D. Mackay,et al. Introduction to Gaussian processes , 1998 .

[4] James B. Rawlings,et al. Tutorial overview of model predictive control , 2000 .

[5] TANSU ALPCAN,et al. A Risk-Based Approach to Optimisation under Limited Information , 2012 .

[6] Tamer Basar,et al. Dual Control Theory , 2001 .

[7] Dan Wang,et al. A neural network-based approximation method for discrete-time nonlinear servomechanism problem , 2001, IEEE Trans. Neural Networks.

[8] U. Ammann,et al. Model Predictive Control—A Simple and Powerful Method to Control Power Converters , 2009, IEEE Transactions on Industrial Electronics.

[9] Tansu Alpcan,et al. A framework for optimization under limited information , 2011, Journal of Global Optimization.

[10] Vladimír Havlena,et al. MPC‐based approximate dual controller by information matrix maximization , 2013 .

[11] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[12] Michael Cantoni,et al. Model predictive control for systems with scheduled load and its application to automated irrigation channels , 2011, 2011 International Conference on Networking, Sensing and Control.

[13] Shuhai Quan,et al. Model predictive control of water management in PEMFC , 2008 .

[14] D. V. Gokhale,et al. Entropy expressions and their estimators for multivariate distributions , 1989, IEEE Trans. Inf. Theory.

[15] Giancarlo Marafioti,et al. Enhanced Model Predictive Control:Dual Control Approach and State Estimation Issues , 2010 .

[16] Rob A. Rutenbar,et al. Simulated annealing algorithms: an overview , 1989, IEEE Circuits and Devices Magazine.

[17] A. Atiya,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[18] Christopher K. I. Williams,et al. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[19] Björn Wittenmark,et al. Adaptive Dual Control Methods: An Overview , 1995 .