Regression-based LP solver for chance-constrained finite horizon optimal control with nonconvex constraints

This paper presents a novel algorithm for finite-horizon optimal control problems subject to additive Gaussian-distributed stochastic disturbance and chance constraints that are defined over feasible, non-convex state spaces. Our previous work [1] proposed a branch and bound-based algorithm that can find a near-optimal solution by iteratively solving non-linear convex optimization problems, as well as their LP relaxations called Fixed Risk Relaxation (FRR) problems. The aim of this work is to significantly reduce the computation time of the previous algorithm so that it can be applied to practical problems, such as a path planning with multiple obstacles. Our approach is to use machine learning to efficiently estimate the objective function values of FRRs within an error bound that is fixed for a given problem domain and choice of model complexity. We exploit the fact that all the FRR problems associated with the branch-and-bound tree nodes are similar to each other, both in terms of the solutions as well as the objective function and constraint coefficients. A standard optimizer is first used to generate a training data set in the form of optimal FRR solutions. Matrix transformations and boosting trees are then applied to generate learning models; fast inference is performed at run-time for new but similar FRR problems that occur when the system dynamics and/or the environment changes slightly. By using this regression technique to estimate the lower bound of the cost function value, and subsequently solving the convex optimization problems exactly at the leaf nodes of the branch-and-bound tree, we achieve 10-35 times reduction in the computation time without compromising the optimality of the solution.

[1]  Dagfinn Gangsaas,et al.  Wind models for flight simulator certification of landing and approach guidance and control systems , 1974 .

[2]  R. Wets,et al.  Stochastic programming , 1989 .

[3]  M. Kothare,et al.  Robust constrained model predictive control using linear matrix inequalities , 1994, Proceedings of 1994 American Control Conference - ACC '94.

[4]  Thomas G. Dietterich,et al.  High-Performance Job-Shop Scheduling With A Time-Delay TD(λ) Network , 1995, NIPS 1995.

[5]  András Prékopa The use of discrete moment bounds in probabilisticconstrained stochastic programming models , 1999, Ann. Oper. Res..

[6]  Michael Nikolaou,et al.  Chance‐constrained model predictive control , 1999 .

[7]  E. Kerrigan Robust Constraint Satisfaction: Invariant Sets and Predictive Control , 2000 .

[8]  M. Wendt,et al.  Robust model predictive control under chance constraints , 2000 .

[9]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[10]  C. Scherer,et al.  LMI-based closed-loop economic optimization of stochastic process operation under state and input constraints , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[11]  Pu Li,et al.  A probabilistically constrained model predictive controller , 2002, Autom..

[12]  Yoshiaki Kuwata,et al.  Real-time Trajectory Design for Unmanned Aerial Vehicles using Receding Horizon Control , 2003 .

[13]  J. Löfberg Minimax approaches to robust model predictive control , 2003 .

[14]  Aachen,et al.  Stochastic Inequality Constrained Closed-loop Model Predictive Control: With Application To Chemical Process Operation , 2004 .

[15]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[16]  M. Manfrin Ant Colony Optimization for the Vehicle Routing Problem , 2004 .

[17]  Patrick R. McMullen,et al.  Ant colony optimization techniques for the vehicle routing problem , 2004, Adv. Eng. Informatics.

[18]  Hui Li,et al.  Generalized Conflict Learning for Hybrid Discrete/Linear Optimization , 2005, CP.

[19]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[20]  Pieter Abbeel,et al.  An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.

[21]  Giuseppe Carlo Calafiore,et al.  The scenario approach to robust control design , 2006, IEEE Transactions on Automatic Control.

[22]  L. Blackmore A Probabilistic Particle Control Approach to Optimal, Robust Predictive Control , 2006 .

[23]  Alexander Shapiro,et al.  Convex Approximations of Chance Constrained Programs , 2006, SIAM J. Optim..

[24]  Hui X. Li,et al.  A probabilistic approach to optimal robust path planning with obstacles , 2006, 2006 American Control Conference.

[25]  James A. Primbs Stochastic Receding Horizon Control of Constrained Linear Systems with State and Control Multiplicative Noise , 2007, ACC.

[26]  A. Richards,et al.  Robust Receding Horizon Control using Generalized Constraint Tightening , 2007, 2007 American Control Conference.

[27]  Masahiro Ono,et al.  An Efficient Motion Planning Algorithm for Stochastic Dynamic Systems with Constraints on Probability of Failure , 2008, AAAI.

[28]  Masahiro Ono,et al.  Iterative Risk Allocation: A new approach to robust Model Predictive Control with a joint chance constraint , 2008, 2008 47th IEEE Conference on Decision and Control.

[29]  Bostjan Slivnik,et al.  Improving Job Scheduling in GRID Environments with Use of Simple Machine Learning Methods , 2009, 2009 Sixth International Conference on Information Technology: New Generations.

[30]  L. Blackmore,et al.  Convex Chance Constrained Predictive Control without Sampling , 2009 .

[31]  Hanif D. Sherali,et al.  Disjunctive Programming , 2009, Encyclopedia of Optimization.

[32]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[33]  M Ono,et al.  Chance constrained finite horizon optimal control with nonconvex constraints , 2010, Proceedings of the 2010 American Control Conference.

[34]  Nicholas Roy,et al.  Learning Solutions of Similar Linear Programming Problems using Boosting Trees , 2010 .

[35]  P. Parthiban,et al.  Optimization of Multiple Vehicle Routing Problems using Approximation Algorithms , 2010, ArXiv.

[36]  W. Marsden I and J , 2012 .

[37]  A Chance Constrained Programming , 2012 .

[38]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.