Accuracy Prediction with Non-neural Model for Neural Architecture Search

Luo, Renqian; Tan, Xu; Wang, Rui; Qin, Tao; Chen, Enhong; Liu, Tie-Yan

Computer Science > Machine Learning

arXiv:2007.04785 (cs)

[Submitted on 9 Jul 2020 (v1), last revised 19 Jul 2021 (this version, v3)]

Title:Accuracy Prediction with Non-neural Model for Neural Architecture Search

Authors:Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Enhong Chen, Tie-Yan Liu

View PDF

Abstract:Neural architecture search (NAS) with an accuracy predictor that predicts the accuracy of candidate architectures has drawn increasing attention due to its simplicity and effectiveness. Previous works usually employ neural network-based predictors which require more delicate design and are easy to overfit. Considering that most architectures are represented as sequences of discrete symbols which are more like tabular data and preferred by non-neural predictors, in this paper, we study an alternative approach which uses non-neural model for accuracy prediction. Specifically, as decision tree based models can better handle tabular data, we leverage gradient boosting decision tree (GBDT) as the predictor for NAS. We demonstrate that the GBDT predictor can achieve comparable (if not better) prediction accuracy than neural network based predictors. Moreover, considering that a compact search space can ease the search process, we propose to prune the search space gradually according to important features derived from GBDT. In this way, NAS can be performed by first pruning the search space and then searching a neural architecture, which is more efficient and effective. Experiments on NASBench-101 and ImageNet demonstrate the effectiveness of using GBDT as predictor for NAS: (1) On NASBench-101, it is 22x, 8x, and 6x more sample efficient than random search, regularized evolution, and Monte Carlo Tree Search (MCTS) in finding the global optimum; (2) It achieves 24.2% top-1 error rate on ImageNet, and further achieves 23.4% top-1 error rate on ImageNet when enhanced with search space pruning. Code is provided at this https URL.

Comments:	Code is available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2007.04785 [cs.LG]
	(or arXiv:2007.04785v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2007.04785

Submission history

From: Renqian Luo [view email]
[v1] Thu, 9 Jul 2020 13:28:49 UTC (499 KB)
[v2] Fri, 16 Jul 2021 11:43:02 UTC (428 KB)
[v3] Mon, 19 Jul 2021 07:31:57 UTC (428 KB)

Computer Science > Machine Learning

Title:Accuracy Prediction with Non-neural Model for Neural Architecture Search

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Accuracy Prediction with Non-neural Model for Neural Architecture Search

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators