AdvantageNAS: Efficient Neural Architecture Search with Credit Assignment

Neural architecture search (NAS) is an approach for automatically designing a neural network architecture without human effort or expert knowledge. However, the high computational cost of NAS limits its use in commercial applications. Two recent NAS paradigms, namely one-shot and sparse propagation, which reduce the time and space complexities, respectively, provide clues for solving this problem. In this paper, we propose a novel search strategy for one-shot, sparse-propagation NAS, called AdvantageNAS, which further reduces the time complexity of NAS by reducing the number of search iterations. AdvantageNAS is a gradient-based approach that improves search efficiency by introducing credit assignment into the gradient estimation used for architecture updates. Experiments on NAS-Bench-201 and the PTB dataset show that AdvantageNAS discovers architectures with higher performance under a limited time budget than existing sparse-propagation NAS methods. To further establish the reliability of AdvantageNAS, we analyze it theoretically and show that it monotonically improves the expected loss and therefore converges.
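
To make the idea concrete, the sketch below shows one plausible reading of a credit-assigned, sparse-propagation architecture update: a single operation is sampled per edge, only that sub-network is evaluated, and the architecture logits are updated with a baseline-subtracted (advantage-style) score-function gradient. The names (`advantage_step`, `arch_logits`, `evaluate`) and the running-average baseline are illustrative assumptions, not the exact estimator derived in the paper.

```python
import torch

def advantage_step(arch_logits, evaluate, baseline, lr=0.1, beta=0.9):
    """One architecture update with a baseline-subtracted (advantage-style)
    score-function gradient. A sketch of the general idea only; the exact
    AdvantageNAS estimator is derived in the paper.
    `arch_logits` must be a leaf tensor with requires_grad=True."""
    dist = torch.distributions.Categorical(logits=arch_logits)
    sample = dist.sample()            # one operation index per edge (sparse propagation)
    loss = float(evaluate(sample))    # validation loss of the sampled sub-network
    advantage = loss - baseline       # credit relative to a running baseline
    # Score-function gradient: (loss - baseline) * d/dtheta log p(sample | theta)
    log_prob = dist.log_prob(sample).sum()
    (grad,) = torch.autograd.grad(log_prob, arch_logits)
    with torch.no_grad():
        arch_logits -= lr * advantage * grad      # descend the estimated gradient
    return beta * baseline + (1.0 - beta) * loss  # updated running baseline


if __name__ == "__main__":
    # Toy usage: a cell with 4 edges and 5 candidate operations per edge.
    logits = torch.zeros(4, 5, requires_grad=True)
    toy_loss = lambda ops: (ops.float() - 2.0).abs().sum()  # stand-in for validation loss
    baseline = 0.0
    for _ in range(200):
        baseline = advantage_step(logits, toy_loss, baseline)
```

Subtracting a baseline reduces the variance of the gradient estimate, which is one way credit assignment can cut the number of search iterations needed; the paper's own estimator and its monotone-improvement guarantee should be consulted for the precise formulation.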
