Strong mixed-integer programming formulations for trained neural networks

We present strong mixed-integer programming (MIP) formulations for high-dimensional piecewise linear functions that correspond to trained neural networks. These formulations can be used for a number of important tasks, such as verifying that an image classification network is robust to adversarial inputs, or solving decision problems whose objective function is a machine learning model. We present a generic framework, which may be of independent interest, for constructing sharp or ideal formulations of the maximum of d affine functions over arbitrary polyhedral input domains. We apply this result to derive MIP formulations for several of the most popular nonlinear operations (e.g., ReLU and max pooling) that are strictly stronger than other approaches from the literature. We corroborate this computationally, showing that our formulations offer substantial improvements in solve time on verification tasks for image classification networks.
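For context, the following is a minimal sketch of the standard big-M formulation from the prior literature that formulations of this kind aim to strengthen; it is not the paper's own formulation, and the bound notation L, U and the binary indicator z are assumptions of this sketch. For a single ReLU neuron y = max{0, w^T x + b} whose pre-activation is known to satisfy finite bounds L <= w^T x + b <= U with L < 0 < U, the big-M formulation is

\[
\begin{aligned}
y &\ge w^\top x + b, \\
y &\ge 0, \\
y &\le w^\top x + b - L(1 - z), \\
y &\le U z, \\
z &\in \{0, 1\}.
\end{aligned}
\]

When z = 0 the constraints force y = 0 and w^T x + b <= 0 (the neuron is inactive); when z = 1 they force y = w^T x + b >= 0 (the neuron is active), so integer-feasible points recover the ReLU exactly. Its LP relaxation, however, can be loose when |L| and U are large, which is precisely the weakness that the sharp and ideal formulations described in the abstract are designed to eliminate.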
