Learning for Constrained Optimization: Identifying Optimal Active Constraint Sets

In many engineered systems, optimization is used for decision making at time scales ranging from real-time operation to long-term planning. This process often involves solving similar optimization problems over and over again with slightly modified input parameters, often under tight latency requirements. We consider the problem of using the information available through this repeated solution process to learn important characteristics of the optimal solution as a function of the input parameters. Our proposed method is based on learning relevant sets of active constraints, from which the optimal solution can be obtained efficiently. Using active sets as features preserves information about the physics of the system, enables interpretable results, accounts for relevant safety constraints, and is easy to represent and encode. However, the total number of active sets is also very large, as it grows exponentially with system size. The key contribution of this paper is a streaming algorithm that learns the relevant active sets from training samples consisting of the input parameters and the corresponding optimal solution, without any restrictions on the problem type, problem structure or probability distribution of the input parameters. The algorithm comes with theoretical performance guarantees and is shown to converge fast for problem instances with a small number of relevant active sets. It can thus be used to establish simultaneously learn the relevant active sets and the practicability of the learning method. Through case studies in optimal power flow, supply chain planning, and shortest path routing, we demonstrate that often only a few active sets are relevant in practice, suggesting that active sets provide an appropriate level of abstraction for a learning algorithm to target.

[1]  Daniel K. Molzahn,et al.  Examining the limits of the application of semidefinite programming to power flow problems , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[2]  Benjamin Karg,et al.  Efficient Representation and Approximation of Model Predictive Control Laws via Deep Learning , 2018, IEEE Transactions on Cybernetics.

[3]  Louis Wehenkel,et al.  A Machine Learning-Based Approximation of Strong Branching , 2017, INFORMS J. Comput..

[4]  Tor Arne Johansen,et al.  Using hash tables to manage the time-storage complexity in a point location problem: Application to explicit model predictive control , 2011, Autom..

[5]  Manfred Morari,et al.  Evaluation of piecewise affine control law via graph traversal , 2013, 2013 European Control Conference (ECC).

[6]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[7]  Jaime F. Fisac,et al.  A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems , 2017, IEEE Transactions on Automatic Control.

[8]  Colin N. Jones,et al.  Optimized decision trees for point location in polytopic data sets - application to explicit MPC , 2010, Proceedings of the 2010 American Control Conference.

[9]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[10]  A. Bemporad,et al.  Efficient on-line computation of constrained optimal control , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[11]  Alan Edelman,et al.  Julia: A Fast Dynamic Language for Technical Computing , 2012, ArXiv.

[12]  James E. Price,et al.  Reduced network modeling of WECC as a market design prototype , 2011, 2011 IEEE Power and Energy Society General Meeting.

[13]  Pieter Abbeel,et al.  Safe Exploration in Markov Decision Processes , 2012, ICML.

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  Alberto Bemporad,et al.  The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[16]  Iain Dunning,et al.  JuMP: A Modeling Language for Mathematical Optimization , 2015, SIAM Rev..

[17]  Sidhant Misra,et al.  Statistical Learning for DC Optimal Power Flow , 2018, 2018 Power Systems Computation Conference (PSCC).

[18]  Thomas J. Overbye,et al.  Grid Structural Characteristics as Validation Criteria for Synthetic Networks , 2017, IEEE Transactions on Power Systems.

[19]  Jakob Krarup,et al.  LP formulations of the shortest path tree problem , 2004, 4OR.

[20]  Mohammad Shahidehpour,et al.  The IEEE Reliability Test System-1996. A report prepared by the Reliability Test System Task Force of the Application of Probability Methods Subcommittee , 1999 .

[21]  Tor Arne Johansen,et al.  Approximate explicit constrained linear model predictive control via orthogonal search tree , 2003, IEEE Trans. Autom. Control..

[22]  Helfried Peyrl,et al.  Accelerating space traversal methods for explicit model predictive control via space partitioning trees , 2014, 2014 European Control Conference (ECC).

[23]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[24]  Craig Boutilier,et al.  Safe Exploration for Identifying Linear Systems via Robust Optimization , 2017, ArXiv.

[25]  Javier García,et al.  A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[26]  Alberto Bemporad,et al.  Model predictive control based on linear programming - the explicit solution , 2002, IEEE Transactions on Automatic Control.

[27]  C. Pantelides,et al.  Design of Multi-echelon Supply Chain Networks under Demand Uncertainty , 2001 .

[28]  Ju Zhang,et al.  K-d tree based approach for point location problem in explicit model predictive control , 2018, J. Frankl. Inst..

[29]  I. Wangensteen,et al.  Transmission management in the deregulated environment , 2000, Proceedings of the IEEE.

[30]  Fangxing Li,et al.  Small test systems for power system economic studies , 2010, IEEE PES General Meeting.

[31]  Manfred Morari,et al.  Optimal complexity reduction of polyhedral piecewise affine systems , 2008, Autom..

[32]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[33]  Alberto Bemporad,et al.  Evaluation of piecewise affine control via binary search tree , 2003, Autom..

[34]  Tor Arne Johansen,et al.  Flexible Piecewise Function Evaluation Methods Based on Truncated Binary Search Trees and Lattice Representation in Explicit MPC , 2012, IEEE Transactions on Control Systems Technology.

[35]  Hu Biaobiao,et al.  Using a Two-Level Structure to Manage the Point Location Problem in Explicit Model Predictive Control , 2016 .

[36]  Min Wu,et al.  Safety Verification of Deep Neural Networks , 2016, CAV.