Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

The predict-then-optimize framework is fundamental in practical stochastic decision-making problems: first predict unknown parameters of an optimization model, then solve the problem using the predicted values. A natural loss function in this setting is defined by measuring the decision error induced by the predicted parameters, which was named the Smart Predict-thenOptimize (SPO) loss by Elmachtoub and Grigas [2021]. Since the SPO loss is typically nonconvex and possibly discontinuous, Elmachtoub and Grigas [2021] introduced a convex surrogate, called the SPO+ loss, that importantly accounts for the underlying structure of the optimization model. In this paper, we greatly expand upon the consistency results for the SPO+ loss provided by Elmachtoub and Grigas [2021]. We develop risk bounds and uniform calibration results for the SPO+ loss relative to the SPO loss, which provide a quantitative way to transfer the excess surrogate risk to excess true risk. By combining our risk bounds with generalization bounds, we show that the empirical minimizer of the SPO+ loss achieves low excess true risk with high probability. We first demonstrate these results in the case when the feasible region of the underlying optimization problem is a polyhedron, and then we show that the results can be strengthened substantially when the feasible region is a level set of a strongly convex function. We perform experiments to empirically demonstrate the strength of the SPO+ surrogate, as compared to standard `1 and squared `2 prediction error losses, on portfolio allocation and cost-sensitive multi-class classification problems.

[1]  Yi-Hao Kao,et al.  Directed Regression , 2009, NIPS.

[2]  Yurii Nesterov,et al.  Generalized Power Method for Sparse Principal Component Analysis , 2008, J. Mach. Learn. Res..

[3]  Richard Pibernik,et al.  Prescriptive Analytics for Flexible Capacity Management , 2019, Manag. Sci..

[4]  Tong Zhang,et al.  Statistical Analysis of Some Multi-Category Large Margin Classification Methods , 2004, J. Mach. Learn. Res..

[5]  Anil K. Bera,et al.  Optimal Portfolio Diversification Using the Maximum Entropy Principle , 2008 .

[6]  Francis R. Bach,et al.  On Structured Prediction Theory with Calibrated Convex Surrogate Losses , 2017, NIPS.

[7]  Ambuj Tewari,et al.  On the Consistency of Multiclass Classification Methods , 2007, J. Mach. Learn. Res..

[8]  Andreas Maurer,et al.  A Vector-Contraction Inequality for Rademacher Complexities , 2016, ALT.

[9]  Priya L. Donti,et al.  Task-based End-to-end Model Learning in Stochastic Optimization , 2017, NIPS.

[10]  Ingo Steinwart How to Compare Different Loss Functions and Their Risks , 2007 .

[11]  P.R. Srivastava,et al.  On Data-Driven Prescriptive Analytics with Side Information: A Regularized Nadaraya-Watson Approach , 2021, 2110.04855.

[12]  Adam N. Elmachtoub,et al.  Smart "Predict, then Optimize" , 2017, Manag. Sci..

[13]  Nathan Kallus,et al.  Fast Rates for Contextual Linear Optimization , 2020, Manag. Sci..

[14]  Peter L. Bartlett,et al.  Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..

[15]  Dimitris Bertsimas,et al.  From Predictive to Prescriptive Analytics , 2014, Manag. Sci..

[16]  Nam Ho-Nguyen,et al.  Risk Guarantees for End-to-End Prediction and Optimization Processes , 2020, Manag. Sci..

[17]  P. Massart,et al.  Risk bounds for statistical learning , 2007, math/0702683.

[18]  Jean-Philippe P. Richard,et al.  Objective-Aligned Regression for Two-Stage Linear Programs , 2019, SSRN Electronic Journal.

[19]  Michael I. Jordan,et al.  Convexity, Classification, and Risk Bounds , 2006 .

[20]  Tong Zhang Statistical behavior and consistency of classification methods based on convex risk minimization , 2003 .

[21]  Ferdinando Fioretto,et al.  End-to-End Constrained Optimization Learning: A Survey , 2021, IJCAI.

[22]  G. Hunanyan,et al.  Portfolio Selection , 2019, Finanzwirtschaft, Banken und Bankmanagement I Finance, Banks and Bank Management.