Convex Programs for Global Optimization of Convolutional Neural Networks in Polynomial-Time

We study the training of Convolutional Neural Networks (CNNs) with ReLU activations and introduce exact convex optimization formulations whose complexity is polynomial in the number of data samples, the number of neurons, and the data dimension. In particular, we develop a convex analytic framework based on semi-infinite duality to obtain equivalent convex optimization problems for two-layer CNNs, in which the convex problems are regularized by a sum of ℓ2 norms of the variables.
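
To make the construction concrete, below is a minimal CVXPY sketch of this style of convex program for the two-layer, scalar-output case. It is an illustration under stated assumptions, not the paper's exact CNN formulation: the sign patterns D_i are sampled from random directions rather than enumerated (the exact program enumerates all ReLU sign patterns of the data, which is polynomial in number when the data matrix has fixed rank), and the plain data matrix X stands in for the matrix of image patches used in the convolutional case. All names and sizes (`n`, `d`, `beta`, `num_samples`) are illustrative.

```python
import numpy as np
import cvxpy as cp

# Toy problem: n samples in d dimensions with scalar targets. In the
# convolutional case, X would be replaced by the matrix of image patches.
np.random.seed(0)
n, d = 20, 5
X = np.random.randn(n, d)
y = np.random.randn(n)
beta = 0.1  # regularization strength (illustrative)

# Sample ReLU sign patterns D_i = diag(1[X u >= 0]) from random directions u.
# The exact formulation enumerates all such patterns; random sampling
# only approximates that enumeration.
num_samples = 10
patterns = {tuple((X @ np.random.randn(d) >= 0).astype(int))
            for _ in range(num_samples)}
D = [np.diag(p) for p in patterns]
m = len(D)

# Convex program: squared loss plus a group-lasso penalty (sum of l2 norms),
# with cone constraints forcing each (v_i, w_i) to respect its pattern D_i.
V = [cp.Variable(d) for _ in range(m)]
W = [cp.Variable(d) for _ in range(m)]
pred = sum(Di @ X @ (Vi - Wi) for Di, Vi, Wi in zip(D, V, W))
loss = 0.5 * cp.sum_squares(pred - y)
reg = beta * sum(cp.norm(Vi, 2) + cp.norm(Wi, 2) for Vi, Wi in zip(V, W))
constraints = []
for Di, Vi, Wi in zip(D, V, W):
    A = (2 * Di - np.eye(n)) @ X  # rows flip sign where the ReLU is inactive
    constraints += [A @ Vi >= 0, A @ Wi >= 0]

problem = cp.Problem(cp.Minimize(loss + reg), constraints)
problem.solve()
print("optimal objective:", problem.value)
```

In this line of work, a ReLU network is then recovered from the optimal variables: each nonzero v_i or w_i yields a hidden neuron whose first-layer weights are a rescaling of that vector, with the corresponding output weight set by its norm.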
