Paper information - Deep neural networks and mixed integer linear optimization

Deep neural networks and mixed integer linear optimization

Deep Neural Networks (DNNs) are very popular and are the subject of intense investigation. A DNN is made up of layers of internal units (or neurons), each of which computes an affine combination of the outputs of the units in the previous layer, applies a nonlinear operator, and outputs the corresponding value (also known as the activation). A commonly used nonlinear operator is the rectified linear unit (ReLU), whose output is the maximum of its input value and zero. In this case (and in similar ones such as max pooling, where the max operation involves more than one input value), for fixed parameters one can model the DNN as a 0-1 Mixed Integer Linear Program (0-1 MILP), where the continuous variables correspond to the output values of each unit and a binary variable is associated with each ReLU to model whether the unit is active or inactive. In this paper we discuss the peculiarities of this kind of 0-1 MILP model and describe an effective bound-tightening technique intended to ease its solution. We also present possible applications of the 0-1 MILP model arising in feature visualization and in the construction of adversarial examples. Computational results are reported, aimed at investigating (on small DNNs) the computational performance of a state-of-the-art MILP solver when applied to a known test case, namely, handwritten digit recognition.
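To make the modeling idea concrete, here is a minimal sketch of one standard way to express a single ReLU with a binary variable, the so-called big-M encoding. It is a textbook formulation and not necessarily the exact one used in the paper. For a pre-activation a with known bounds lo <= a <= hi, output x, and binary z, the constraints are x >= 0, x >= a, x <= hi * z, and x <= a - lo * (1 - z). The function names below are hypothetical, and the bounds are obtained by plain interval arithmetic, which is a simple stand-in and not the bound-tightening technique the abstract refers to.

```python
from typing import List, Sequence, Tuple


def preactivation_bounds(
    weights: Sequence[float],
    bias: float,
    lower: Sequence[float],
    upper: Sequence[float],
) -> Tuple[float, float]:
    """Interval bounds on a = w.y + b when each input y_i lies in [lower_i, upper_i].

    Assumption: simple interval arithmetic, used here only to obtain valid
    big-M constants (lo, hi) for the encoding below.
    """
    lo = bias + sum(w * (l if w >= 0 else u) for w, l, u in zip(weights, lower, upper))
    hi = bias + sum(w * (u if w >= 0 else l) for w, l, u in zip(weights, lower, upper))
    return lo, hi


def relu_solutions(a: float, lo: float, hi: float) -> List[Tuple[float, int]]:
    """Enumerate the (x, z) pairs allowed by the big-M constraints for a fixed a.

    Constraints (z binary):
        x >= 0,  x >= a,  x <= hi * z,  x <= a - lo * (1 - z)
    For each z the feasible x values form an interval; when it is non-empty
    it collapses to the single point x = max(a, 0).
    """
    solutions = []
    for z in (0, 1):
        x_min = max(0.0, a)
        x_max = min(hi * z, a - lo * (1 - z))
        if x_min <= x_max:
            solutions.append((x_min, z))
    return solutions


# One unit with two inputs in [0, 1] (illustrative numbers, not from the paper).
lo, hi = preactivation_bounds([1.0, -2.0], 0.5, [0.0, 0.0], [1.0, 1.0])
active = relu_solutions(0.7, lo, hi)     # unit is active: z = 1, x = a
inactive = relu_solutions(-0.4, lo, hi)  # unit is inactive: z = 0, x = 0
```

The constants lo and hi enter the constraints directly, so tighter bounds give a tighter linear relaxation. This is one reason bound tightening can ease the solution of such models.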

Matteo Fischetti | Jason Jo
