Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information

Recent works in learning-integrated optimization have shown promise in settings where the optimization problem is only partially observed or where general-purpose optimizers perform poorly without expert tuning. By learning an optimizer $\mathbf{g}$ to tackle these challenging problems with $f$ as the objective, the optimization process can be substantially accelerated by leveraging past experience. The optimizer can be trained with supervision from known optimal solutions or implicitly by optimizing the compound function $f\circ \mathbf{g}$. The implicit approach may not require optimal solutions as labels and is capable of handling problem uncertainty; however, it is slow to train and deploy due to frequent calls to optimizer $\mathbf{g}$ during both training and testing. The training is further challenged by sparse gradients of $\mathbf{g}$, especially for combinatorial solvers. To address these challenges, we propose using a smooth and learnable Landscape Surrogate $M$ as a replacement for $f\circ \mathbf{g}$. This surrogate, learnable by neural networks, can be computed faster than the solver $\mathbf{g}$, provides dense and smooth gradients during training, can generalize to unseen optimization problems, and is efficiently learned via alternating optimization. We test our approach on both synthetic problems, including shortest path and multidimensional knapsack, and real-world problems such as portfolio optimization, achieving comparable or superior objective values compared to state-of-the-art baselines while reducing the number of calls to $\mathbf{g}$. Notably, our approach outperforms existing methods for computationally expensive high-dimensional problems.

[1]  P. V. Hentenryck,et al.  Reinforcement Learning from Optimization Proxy for Ride-Hailing Vehicle Relocation , 2022, J. Artif. Intell. Res..

[2]  Benoit Steiner,et al.  SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems , 2022, ICML.

[3]  S. Wright,et al.  BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach , 2022, NeurIPS.

[4]  Elias Boutros Khalil,et al.  PyEPO: A PyTorch-based End-to-End Predict-then-Optimize Library for Linear and Integer Programming , 2022, ArXiv.

[5]  Jasper C. H. Lee,et al.  Branch & Learn for Recursively and Iteratively Solvable Problems in Predict+Optimize , 2022, NeurIPS.

[6]  Seonho Park,et al.  Learning Optimization Proxies for Large-Scale Security-Constrained Economic Dispatch , 2021, Electric Power Systems Research.

[7]  Tias Guns,et al.  Decision-Focused Learning: Through the Lens of Learning to Rank , 2021, ICML.

[8]  Pasquale Minervini,et al.  Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions , 2021, NeurIPS.

[9]  G. Martius,et al.  CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints , 2021, ICML.

[10]  Kotagiri Ramamohanarao,et al.  Dynamic Programming for Predict+Optimise , 2020, AAAI.

[11]  G. Martius,et al.  Differentiation of Blackbox Combinatorial Solvers , 2019, ICLR.

[12]  Tias Guns,et al.  Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems , 2019, AAAI.

[13]  Stephen P. Boyd,et al.  Differentiable Convex Optimization Layers , 2019, NeurIPS.

[14]  Stephen Gould,et al.  Deep Declarative Networks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Milind Tambe,et al.  MIPaaL: Mixed Integer Program as a Layer , 2019, AAAI.

[16]  Milind Tambe,et al.  End to end learning and optimization on graphs , 2019, NeurIPS.

[17]  Priya L. Donti,et al.  SATNet: Bridging deep learning and logical reasoning using a differentiable satisfiability solver , 2019, ICML.

[18]  Milind Tambe,et al.  Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization , 2018, AAAI.

[19]  Adam N. Elmachtoub,et al.  Smart "Predict, then Optimize" , 2017, Manag. Sci..

[20]  Elias Boutros Khalil,et al.  Learning Combinatorial Optimization Algorithms over Graphs , 2017, NIPS.

[21]  Priya L. Donti,et al.  Task-based End-to-end Model Learning in Stochastic Optimization , 2017, NIPS.

[22]  J. Zico Kolter,et al.  OptNet: Differentiable Optimization as a Layer in Neural Networks , 2017, ICML.

[23]  Anoop Cherian,et al.  On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization , 2016, ArXiv.

[24]  Stephen P. Boyd,et al.  CVXPY: A Python-Embedded Modeling Language for Convex Optimization , 2016, J. Mach. Learn. Res..

[25]  Christian Kirches,et al.  Mixed-integer nonlinear optimization*† , 2013, Acta Numerica.

[26]  Daniela Rus,et al.  Practical Route Planning Under Delay Uncertainty: Stochastic Shortest Path Queries , 2012, Robotics: Science and Systems.

[27]  Adam N. Letchford,et al.  Non-convex mixed-integer nonlinear programming: A survey , 2012 .

[28]  Justin Domke,et al.  Generic Methods for Optimization-Based Modeling , 2012, AISTATS.

[29]  Yi-Hao Kao,et al.  Directed Regression , 2009, NIPS.

[30]  Tobias Achterberg,et al.  SCIP: solving constraint integer programs , 2009, Math. Program. Comput..

[31]  John Langford,et al.  The offset tree for learning with partial labels , 2008, KDD.

[32]  Jens Vygen,et al.  The Book Review Column1 , 2020, SIGACT News.

[33]  R. Kalaba,et al.  Arriving on Time , 2005 .

[34]  John Liechty,et al.  Portfolio selection with higher moments , 2004 .

[35]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[36]  Long Ji Lin,et al.  Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.

[37]  A. Perrault,et al.  Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses , 2022, NeurIPS.

[38]  G. Hunanyan,et al.  Portfolio Selection , 2019, Finanzwirtschaft, Banken und Bankmanagement I Finance, Banks and Bank Management.

[39]  Dimitris Bertsimas,et al.  Portfolio Construction Through Mixed-Integer Programming at Grantham, Mayo, Van Otterloo and Company , 1999, Interfaces.