The Geometric Occam's Razor Implicit in Deep Learning

Dherin, Benoit; Munn, Michael; Barrett, David G. T.

Computer Science > Machine Learning

arXiv:2111.15090 (cs)

[Submitted on 30 Nov 2021 (v1), last revised 1 Dec 2021 (this version, v2)]

Title:The Geometric Occam's Razor Implicit in Deep Learning

Authors:Benoit Dherin, Michael Munn, David G.T. Barrett

View PDF

Abstract:In over-parameterized deep neural networks there can be many possible parameter configurations that fit the training data exactly. However, the properties of these interpolating solutions are poorly understood. We argue that over-parameterized neural networks trained with stochastic gradient descent are subject to a Geometric Occam's Razor; that is, these networks are implicitly regularized by the geometric model complexity. For one-dimensional regression, the geometric model complexity is simply given by the arc length of the function. For higher-dimensional settings, the geometric model complexity depends on the Dirichlet energy of the function. We explore the relationship between this Geometric Occam's Razor, the Dirichlet energy and other known forms of implicit regularization. Finally, for ResNets trained on CIFAR-10, we observe that Dirichlet energy measurements are consistent with the action of this implicit Geometric Occam's Razor.

Comments:	Accepted as a NeurIPS 2021 workshop paper (OPT2021)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2111.15090 [cs.LG]
	(or arXiv:2111.15090v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.15090

Submission history

From: Benoit Dherin [view email]
[v1] Tue, 30 Nov 2021 03:05:11 UTC (230 KB)
[v2] Wed, 1 Dec 2021 04:54:50 UTC (230 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-11

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

David G. T. Barrett

export BibTeX citation

Computer Science > Machine Learning

Title:The Geometric Occam's Razor Implicit in Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Geometric Occam's Razor Implicit in Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators