Optimistic planning with a limited number of action switches for near-optimal nonlinear control

We consider infinite-horizon optimal control of nonlinear systems where the actions (inputs) are discrete. With the goal of limiting computations, we introduce a search algorithm for action sequences constrained to switch at most a given number of times between different actions. The new algorithm belongs to the optimistic planning class originating in artificial intelligence, and is called optimistic switch-limited planning (OSP). It inherits the generality of the OP class, so it works for nonlinear, nonsmooth systems with nonquadratic costs. We develop analysis showing that the switch constraint leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of state-of-the-art OP; and to a correspondingly faster convergence. The degree of the polynomial varies with the problem and is a meaningful measure for the difficulty of solving it. We study this degree in two representative, opposite cases. In simulations we first apply OSP to a problem where limited-switch sequences are near-optimal, and then in a networked control setting where the switch constraint must be satisfied in closed loop.

[1]  B. De Schutter Optimal traffic light control for a single intersection , 1999 .

[2]  Bart De Schutter,et al.  Optimal Control of a Class of Linear Hybrid Systems with Saturation , 1999, SIAM J. Control. Optim..

[3]  Alberto Bemporad,et al.  The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[4]  Alberto Bemporad,et al.  An algorithm for multi-parametric quadratic programming and explicit MPC solutions , 2003, Autom..

[5]  Alberto Bemporad,et al.  Evaluation of piecewise affine control via binary search tree , 2003, Autom..

[6]  Olivier Teytaud,et al.  Modification of UCT with Patterns in Monte-Carlo Go , 2006 .

[7]  Csaba Szepesvári,et al.  Bandit Based Monte-Carlo Planning , 2006, ECML.

[8]  João C. N. Clímaco,et al.  A review of interactive methods for multiobjective integer and mixed-integer programming , 2007, Eur. J. Oper. Res..

[9]  Paulo Tabuada,et al.  Event-Triggered Real-Time Scheduling of Stabilizing Control Tasks , 2007, IEEE Transactions on Automatic Control.

[10]  Alberto Bemporad,et al.  On Hybrid Model Predictive Control of Sewer Networks , 2007 .

[11]  Rémi Munos,et al.  Optimistic Planning of Deterministic Systems , 2008, EWRL.

[12]  Michael Cantoni,et al.  A {0, 1} linear program for fixed-profile load scheduling and demand management in automated irrigation channels , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[13]  Bart De Schutter,et al.  Approximate dynamic programming with a fuzzy parameterization , 2010, Autom..

[14]  B. De Schutter,et al.  Time-instant optimization for hybrid model predictive control of the Rhine–Meuse delta , 2010 .

[15]  Tor Arne Johansen,et al.  Using hash tables to manage the time-storage complexity in a point location problem: Application to explicit model predictive control , 2011, Autom..

[16]  Michael L. Littman,et al.  Sample-Based Planning for Continuous Action Markov Decision Processes , 2011, ICAPS.

[17]  Xiang Li,et al.  Model predictive control with robust feasibility , 2011 .

[18]  Wen-Hua Chen,et al.  Piecewise constant model predictive control for autonomous helicopters , 2011, Robotics Auton. Syst..

[19]  Lucian Busoniu,et al.  Optimistic planning for Markov decision processes , 2012, AISTATS.

[20]  Claudio De Persis,et al.  Robust Self-Triggered Coordination With Ternary Controllers , 2012, IEEE Transactions on Automatic Control.

[21]  Rémi Munos,et al.  From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning , 2014, Found. Trends Mach. Learn..

[22]  B. Schutter,et al.  Event-driven hierarchical control of irrigation canals , 2016 .