iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

Zhang, Miao; Su, Steven; Pan, Shirui; Chang, Xiaojun; Abbasnejad, Ehsan; Haffari, Reza

arXiv:2106.10784 (cs)

[Submitted on 21 Jun 2021]

Abstract: Differentiable ARchiTecture Search (DARTS) has recently become the mainstream approach to neural architecture search (NAS) due to its efficiency and simplicity. Using gradient-based bi-level optimization, DARTS alternately optimizes the inner model weights and the outer architecture parameters in a weight-sharing supernet. A key challenge to the scalability and quality of the learned architectures is the need to differentiate through the inner-loop optimization. While several potentially fatal factors in DARTS have been discussed at length, the architecture gradient, a.k.a. the hypergradient, has received less attention. In this paper, we tackle the hypergradient computation in DARTS based on the implicit function theorem, so that it depends only on the obtained solution to the inner-loop optimization and is agnostic to the optimization path. To further reduce the computational requirements, we formulate a stochastic hypergradient approximation for differentiable NAS, and theoretically show that architecture optimization with the proposed method, named iDARTS, is expected to converge to a stationary point. Comprehensive experiments on two NAS benchmark search spaces and the common NAS search space verify the effectiveness of the proposed method. It leads to architectures that outperform, by large margins, those learned by the baseline methods.
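
To make the implicit-function-theorem idea concrete, here is a minimal sketch on a toy quadratic problem. It is not the authors' implementation. By the implicit function theorem, at an inner solution w*(alpha) the hypergradient is dL_val/dalpha minus the mixed derivative d/dalpha(dL_train/dw) transposed, applied to H^{-1} dL_val/dw, where H is the Hessian of the training loss in w. The toy losses, the matrices A and B, the step size gamma, the helper names, and the use of a truncated Neumann series in place of an exact Hessian inverse are all assumptions made for illustration; the abstract does not specify any of them.

```python
# Toy bi-level problem (all names and values are hypothetical):
#   inner:  L_train(w, alpha) = 0.5 * w^T A w - w^T B alpha   ->  w* = A^{-1} B alpha
#   outer:  L_val(w)          = 0.5 * ||w - target||^2        (no direct alpha term)
# Implicit hypergradient: B^T A^{-1} (w* - target), since d/dalpha(dL_train/dw) = -B.

def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def transpose(M):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def solve_2x2(M, v):
    """Exact solve of a 2x2 system, used only to get reference values."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [(d * v[0] - b * v[1]) / det, (-c * v[0] + a * v[1]) / det]

def neumann_inverse_hvp(hvp, v, gamma, num_terms):
    """Approximate H^{-1} v by gamma * sum_{k=0}^{K} (I - gamma H)^k v.

    Only Hessian-vector products are needed; the series converges when
    the spectral radius of (I - gamma H) is below 1.
    """
    term = list(v)
    total = list(v)
    for _ in range(num_terms):
        h = hvp(term)
        term = [t - gamma * hi for t, hi in zip(term, h)]
        total = [s + t for s, t in zip(total, term)]
    return [gamma * s for s in total]

def implicit_hypergradient(A, B, w_star, target, gamma, num_terms):
    """Hypergradient at w*, using only the inner solution, not the path to it."""
    grad_w_val = [w - t for w, t in zip(w_star, target)]   # dL_val/dw at w*
    q = neumann_inverse_hvp(lambda u: matvec(A, u), grad_w_val, gamma, num_terms)
    # Minus sign of the IFT term cancels with the mixed derivative -B.
    return matvec(transpose(B), q)

A = [[2.0, 0.5], [0.5, 1.0]]        # symmetric positive definite Hessian
B = [[1.0, 2.0], [0.0, 1.0]]
alpha = [0.5, -1.0]
target = [1.0, 0.0]

w_star = solve_2x2(A, matvec(B, alpha))                     # inner solution
approx = implicit_hypergradient(A, B, w_star, target, gamma=0.4, num_terms=100)
exact = matvec(transpose(B),
               solve_2x2(A, [w - t for w, t in zip(w_star, target)]))
print("Neumann approximation:", approx)
print("Closed-form reference:", exact)
```

The point of the sketch is that the hypergradient is evaluated from w* alone, so no inner-loop optimization trajectory has to be stored or unrolled. In a real supernet the Hessian-vector product would come from automatic differentiation on a minibatch, which is where a stochastic approximation would enter.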

Comments:	ICML 2021
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2106.10784 [cs.LG]
	(or arXiv:2106.10784v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.10784

Submission history

From: Miao Zhang
[v1] Mon, 21 Jun 2021 00:44:11 UTC (332 KB)
