论文信息 - Towards Task and Architecture-Independent Generalization Gap Predictors

Towards Task and Architecture-Independent Generalization Gap Predictors

Can we use deep learning to predict when deep learning works? Our results suggest the affirmative. We created a dataset by training 13,500 neural networks with different architectures, on different variations of spiral datasets, and using different optimization parameters. We used this dataset to train task-independent and architecture-independent generalization gap predictors for those neural networks. We extend Jiang et al. (2018) to also use DNNs and RNNs and show that they outperform the linear model, obtaining $R^2=0.965$. We also show results for architecture-independent, task-independent, and out-of-distribution generalization gap prediction tasks. Both DNNs and RNNs consistently and significantly outperform linear models, with RNNs obtaining $R^2=0.584$.

[1] Guillermo Sapiro,et al. Robust Large Margin Deep Neural Networks , 2016, IEEE Transactions on Signal Processing.

[2] Hossein Mobahi,et al. Large Margin Deep Networks for Classification , 2018, NeurIPS.

[3] Vijay Vasudevan,et al. Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Yi Zhang,et al. Stronger generalization bounds for deep nets via a compression approach , 2018, ICML.

[5] Samy Bengio,et al. Understanding deep learning requires rethinking generalization , 2016, ICLR.

[6] Hossein Mobahi,et al. Predicting the Generalization Gap in Deep Networks with Margin Distributions , 2018, ICLR.

[7] Yann LeCun,et al. Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks , 2018, ArXiv.

[8] Yuanzhi Li,et al. Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers , 2018, NeurIPS.

[9] Nathan Srebro,et al. Exploring Generalization in Deep Learning , 2017, NIPS.

[10] Stephan K. Chalup,et al. Variations of the two-spiral task , 2007, Connect. Sci..