Leveraging the Graph Structure of Neural Network Training Dynamics

Vahedian, Fatemeh; Li, Ruiyu; Trivedi, Puja; Jin, Di; Koutra, Danai

arXiv:2111.05410 (cs)

[Submitted on 9 Nov 2021 (v1), last revised 20 Feb 2023 (this version, v2)]

Abstract: Understanding the training dynamics of deep neural networks (DNNs) is important because it can lead to improved training efficiency and task performance. Recent works have demonstrated that representing the wiring of a DNN as a static graph cannot capture how DNNs change over the course of training. Thus, in this work, we propose a compact, expressive temporal graph framework that effectively captures the dynamics of many workhorse architectures in computer vision. Specifically, it extracts an informative summary of graph properties (e.g., eigenvector centrality) over a sequence of DNN graphs obtained during training. We demonstrate that our framework captures useful dynamics by accurately predicting trained task performance from a summary over early training epochs (fewer than 5) across four different architectures and two image datasets. Moreover, by using a novel, highly scalable DNN graph representation, we also show that the proposed framework captures generalizable dynamics: summaries extracted from smaller-width networks remain effective when evaluated on larger widths.
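
To make the idea of "a summary of graph properties over a sequence of DNN graphs" concrete, here is a minimal sketch in Python. It is not the authors' graph representation or summary, which the abstract does not specify. It assumes a plain MLP whose neurons are nodes and whose absolute weights are undirected edge weights, and it assumes a per-epoch mean and standard deviation of eigenvector centrality as the summary. The names `mlp_to_adjacency`, `eigenvector_centrality`, and `temporal_summary` are hypothetical.

```python
import numpy as np


def mlp_to_adjacency(weights):
    """Build a symmetric weighted adjacency matrix from MLP layer weights.

    Assumption: weights[i] has shape (n_in, n_out); each neuron is a node
    and |w| is the weight of the edge between two connected neurons.
    """
    sizes = [weights[0].shape[0]] + [w.shape[1] for w in weights]
    n = sum(sizes)
    adj = np.zeros((n, n))
    offset = 0
    for w in weights:
        n_in, n_out = w.shape
        rows = slice(offset, offset + n_in)
        cols = slice(offset + n_in, offset + n_in + n_out)
        adj[rows, cols] = np.abs(w)
        offset += n_in
    # Symmetrize so the graph is undirected.
    return adj + adj.T


def eigenvector_centrality(adj, iters=200, tol=1e-10):
    """Approximate the dominant eigenvector by power iteration."""
    n = adj.shape[0]
    x = np.full(n, 1.0 / np.sqrt(n))
    # Adding the identity keeps the same eigenvectors but prevents the
    # oscillation that plain power iteration shows on layered graphs.
    shifted = adj + np.eye(n)
    for _ in range(iters):
        nxt = shifted @ x
        nxt /= np.linalg.norm(nxt)
        converged = np.linalg.norm(nxt - x) < tol
        x = nxt
        if converged:
            break
    return x


def temporal_summary(snapshots):
    """Summarize a sequence of per-epoch weight snapshots.

    Assumption: the summary is the per-epoch mean and standard deviation
    of node centrality, concatenated into one feature vector.
    """
    cents = np.stack(
        [eigenvector_centrality(mlp_to_adjacency(w)) for w in snapshots]
    )
    return np.concatenate([cents.mean(axis=1), cents.std(axis=1)])
```

A feature vector of this kind, computed from the first few epochs, could then be fed to any standard regressor or classifier to predict final task performance; the choice of predictor is left open here because the abstract does not name one.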

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2111.05410 [cs.LG]
	(or arXiv:2111.05410v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.05410

Submission history

From: Fatemeh Vahedian
[v1] Tue, 9 Nov 2021 20:38:48 UTC (3,865 KB)
[v2] Mon, 20 Feb 2023 21:31:26 UTC (10,114 KB)
