Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks

Zhang, Leyang; Zhang, Yaoyu; Luo, Tao

Computer Science > Machine Learning

arXiv:2405.17501 (cs)

[Submitted on 26 May 2024]

Title:Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks

Authors:Leyang Zhang, Yaoyu Zhang, Tao Luo

View PDF HTML (experimental)

Abstract:This paper presents a comprehensive analysis of critical point sets in two-layer neural networks. To study such complex entities, we introduce the critical embedding operator and critical reduction operator as our tools. Given a critical point, we use these operators to uncover the whole underlying critical set representing the same output function, which exhibits a hierarchical structure. Furthermore, we prove existence of saddle branches for any critical set whose output function can be represented by a narrower network. Our results provide a solid foundation to the further study of optimization and training behavior of neural networks.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2405.17501 [cs.LG]
	(or arXiv:2405.17501v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17501

Submission history

From: Leyang Zhang [view email]
[v1] Sun, 26 May 2024 02:32:28 UTC (424 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-05

Change to browse by:

cs
math
math.OC

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators