Optimizing Neural Networks via Koopman Operator Theory

Dogra, Akshunna S.; Redman, William T

Computer Science > Neural and Evolutionary Computing

arXiv:2006.02361 (cs)

[Submitted on 3 Jun 2020 (v1), last revised 22 Oct 2020 (this version, v3)]

Title:Optimizing Neural Networks via Koopman Operator Theory

Authors:Akshunna S. Dogra, William T Redman

View PDF

Abstract:Koopman operator theory, a powerful framework for discovering the underlying dynamics of nonlinear dynamical systems, was recently shown to be intimately connected with neural network training. In this work, we take the first steps in making use of this connection. As Koopman operator theory is a linear theory, a successful implementation of it in evolving network weights and biases offers the promise of accelerated training, especially in the context of deep networks, where optimization is inherently a non-convex problem. We show that Koopman operator theoretic methods allow for accurate predictions of weights and biases of feedforward, fully connected deep networks over a non-trivial range of training time. During this window, we find that our approach is >10x faster than various gradient descent based methods (e.g. Adam, Adadelta, Adagrad), in line with our complexity analysis. We end by highlighting open questions in this exciting intersection between dynamical systems and neural network theory. We highlight additional methods by which our results could be expanded to broader classes of networks and larger training intervals, which shall be the focus of future work.

Comments:	11 main content pages (7 supplementary pages), 3 main content figures (3 supplementary figures), 2 main content Tables (5 supplementary Tables). 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada
Subjects:	Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP); Dynamical Systems (math.DS); Computational Physics (physics.comp-ph)
Cite as:	arXiv:2006.02361 [cs.NE]
	(or arXiv:2006.02361v3 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2006.02361
Journal reference:	Advances in Neural Information Processing Systems 33, 2087-2097 (2020)

Submission history

From: Akshunna S. Dogra [view email]
[v1] Wed, 3 Jun 2020 16:23:07 UTC (2,699 KB)
[v2] Thu, 11 Jun 2020 18:34:09 UTC (3,545 KB)
[v3] Thu, 22 Oct 2020 03:48:46 UTC (5,563 KB)

Computer Science > Neural and Evolutionary Computing

Title:Optimizing Neural Networks via Koopman Operator Theory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Optimizing Neural Networks via Koopman Operator Theory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators