The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization

LeJeune, Daniel; Javadi, Hamid; Baraniuk, Richard G.

Computer Science > Machine Learning

arXiv:2106.07769 (cs)

[Submitted on 14 Jun 2021 (v1), last revised 3 Jan 2022 (this version, v3)]

Title:The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization

Authors:Daniel LeJeune, Hamid Javadi, Richard G. Baraniuk

View PDF

Abstract:Among the most successful methods for sparsifying deep (neural) networks are those that adaptively mask the network weights throughout training. By examining this masking, or dropout, in the linear case, we uncover a duality between such adaptive methods and regularization through the so-called "$\eta$-trick" that casts both as iteratively reweighted optimizations. We show that any dropout strategy that adapts to the weights in a monotonic way corresponds to an effective subquadratic regularization penalty, and therefore leads to sparse solutions. We obtain the effective penalties for several popular sparsification strategies, which are remarkably similar to classical penalties commonly used in sparse optimization. Considering variational dropout as a case study, we demonstrate similar empirical behavior between the adaptive dropout method and classical methods on the task of deep network sparsification, validating our theory.

Comments:	19 pages, 2 figures. Appeared in NeurIPS 2021. Small typographical correction
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2106.07769 [cs.LG]
	(or arXiv:2106.07769v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.07769

Submission history

From: Daniel LeJeune [view email]
[v1] Mon, 14 Jun 2021 21:47:17 UTC (520 KB)
[v2] Fri, 22 Oct 2021 10:02:33 UTC (521 KB)
[v3] Mon, 3 Jan 2022 11:56:27 UTC (521 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-06

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Daniel LeJeune
Hamid Javadi
Richard G. Baraniuk

export BibTeX citation

Computer Science > Machine Learning

Title:The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators