Accelerating Neural Architecture Search with Rank-Preserving Surrogate Models

Over the past few years, deep learning has enabled significant progress in several tasks, such as image recognition, speech recognition, and language modelling. Novel neural architectures are behind these achievements. However, designing these architectures manually is time-consuming and error-prone. Neural architecture search (NAS) automates the design process by searching for the best architecture in a huge search space. This search requires evaluating each sampled architecture through time-consuming training. To speed up NAS algorithms, several existing approaches use surrogate models that predict the accuracy of candidate architectures instead of training each sampled one. In this paper, we propose RS-NAS (Rank-preserving Surrogate model for NAS), a surrogate model trained with a rank-preserving loss function. We posit that the search algorithm does not need to know the exact accuracy of a candidate architecture; it only needs to know whether the candidate is better or worse than others. We thoroughly evaluate our surrogate models with state-of-the-art search algorithms. Using the rank-preserving surrogate models, local search on the DARTS search space finds an architecture that is 2% more accurate than the one found with the NAS-Bench-301 surrogate model within the same search time. The code and models are available at https://github.com/IHIaadj/ranked_nas.
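The abstract does not spell out the loss used by RS-NAS. As a minimal sketch of the rank-preserving idea, one common choice is a pairwise hinge (ranking) loss over a batch of architectures, assuming a PyTorch surrogate that maps an architecture encoding to a scalar score; the class name PairwiseRankingLoss, the margin value, and the helper names below are illustrative, not taken from the paper.

import torch
import torch.nn as nn

class PairwiseRankingLoss(nn.Module):
    """Hinge-style pairwise ranking loss: penalizes the surrogate whenever it
    orders a pair of architectures differently from their true accuracies."""

    def __init__(self, margin: float = 0.05):
        super().__init__()
        self.margin = margin

    def forward(self, pred: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
        # All pairwise differences within the batch: entry (i, j) compares
        # architecture j against architecture i.
        pred_diff = pred.unsqueeze(0) - pred.unsqueeze(1)        # (B, B)
        target_diff = target.unsqueeze(0) - target.unsqueeze(1)  # (B, B)
        # Sign of the true ordering (+1, -1, or 0 for ties / the diagonal).
        sign = torch.sign(target_diff)
        # Hinge: the predicted difference must agree in sign, with a margin.
        loss = torch.clamp(self.margin - sign * pred_diff, min=0.0)
        loss = loss * sign.abs()  # ignore ties and self-comparisons
        return loss.sum() / sign.abs().sum().clamp(min=1.0)

# Usage sketch: `surrogate` maps an architecture encoding to a scalar score
# whose ordering (not its absolute value) should match the true accuracies.
#   scores = surrogate(arch_encodings)            # shape (B,)
#   loss = PairwiseRankingLoss()(scores, accs)    # accs: true accuracies, (B,)

A loss of this form only constrains the ordering of the predicted scores, which matches the paper's premise that the search algorithm needs relative comparisons rather than exact accuracy values.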

[1] Junjie Yan et al. Peephole: Predicting Network Performance Before Training. arXiv, 2017.

[2] Yu Wang et al. A Generic Graph-based Neural Architecture Encoding Scheme for Predictor-based NAS. ECCV, 2020.

[3] Bo Chen et al. MnasNet: Platform-Aware Neural Architecture Search for Mobile. CVPR, 2019.

[4] Aaron Klein et al. NAS-Bench-101: Towards Reproducible Neural Architecture Search. ICML, 2019.

[5] Yiming Yang et al. DARTS: Differentiable Architecture Search. ICLR, 2019.

[6] Li Fei-Fei et al. Progressive Neural Architecture Search. ECCV, 2018.

[7] Rong Yan et al. NeuNetS: An Automated Synthesis Engine for Neural Network Design. arXiv, 2019.

[8] Jure Leskovec et al. How Powerful are Graph Neural Networks? ICLR, 2019.

[9] Margret Keuper et al. NAS-Bench-301 and the Case for Surrogate Benchmarks for Neural Architecture Search. arXiv, 2020.

[10] Willie Neiswanger et al. BANANAS: Bayesian Optimization with Neural Architectures for Neural Architecture Search. AAAI, 2021.

[11] Song Han et al. ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware. ICLR, 2019.

[12] Yi Yang et al. NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search. ICLR, 2020.

[13] Max Welling et al. Semi-Supervised Classification with Graph Convolutional Networks. ICLR, 2017.

[14] Martin Wistuba et al. Learning to Rank Learning Curves. ICML, 2020.

[15] Hamza Ouarnoughi et al. Hardware-Aware Neural Architecture Search: Survey and Taxonomy. IJCAI, 2021.