AIO-P: Expanding Neural Performance Predictors Beyond Image Classification

Mills, Keith G.; Niu, Di; Salameh, Mohammad; Qiu, Weichen; Han, Fred X.; Liu, Puyuan; Zhang, Jialin; Lu, Wei; Jui, Shangling

doi:10.1609/aaai.v37i8.26101

Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.17228 (cs)

[Submitted on 30 Nov 2022 (v1), last revised 24 Apr 2023 (this version, v2)]

Title:AIO-P: Expanding Neural Performance Predictors Beyond Image Classification

Authors:Keith G. Mills, Di Niu, Mohammad Salameh, Weichen Qiu, Fred X. Han, Puyuan Liu, Jialin Zhang, Wei Lu, Shangling Jui

View PDF

Abstract:Evaluating neural network performance is critical to deep neural network design but a costly procedure. Neural predictors provide an efficient solution by treating architectures as samples and learning to estimate their performance on a given task. However, existing predictors are task-dependent, predominantly estimating neural network performance on image classification benchmarks. They are also search-space dependent; each predictor is designed to make predictions for a specific architecture search space with predefined topologies and set of operations. In this paper, we propose a novel All-in-One Predictor (AIO-P), which aims to pretrain neural predictors on architecture examples from multiple, separate computer vision (CV) task domains and multiple architecture spaces, and then transfer to unseen downstream CV tasks or neural architectures. We describe our proposed techniques for general graph representation, efficient predictor pretraining and knowledge infusion techniques, as well as methods to transfer to downstream tasks/spaces. Extensive experimental results show that AIO-P can achieve Mean Absolute Error (MAE) and Spearman's Rank Correlation (SRCC) below 1% and above 0.5, respectively, on a breadth of target downstream CV tasks with or without fine-tuning, outperforming a number of baselines. Moreover, AIO-P can directly transfer to new architectures not seen during training, accurately rank them and serve as an effective performance estimator when paired with an algorithm designed to preserve performance while reducing FLOPs.

Comments:	AAAI 2023 Oral Presentation; version includes supplementary material; 16 Pages, 4 Figures, 22 Tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2211.17228 [cs.CV]
	(or arXiv:2211.17228v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.17228
Related DOI:	https://doi.org/10.1609/aaai.v37i8.26101

Submission history

From: Keith Mills [view email]
[v1] Wed, 30 Nov 2022 18:30:41 UTC (1,261 KB)
[v2] Mon, 24 Apr 2023 20:07:09 UTC (1,259 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AIO-P: Expanding Neural Performance Predictors Beyond Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AIO-P: Expanding Neural Performance Predictors Beyond Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators