Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks

Dangel, Felix; Müller, Johannes; Zeinhofer, Marius

Computer Science > Machine Learning

arXiv:2405.15603 (cs)

[Submitted on 24 May 2024 (v1), last revised 30 Oct 2024 (this version, v3)]

Title:Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks

Authors:Felix Dangel, Johannes Müller, Marius Zeinhofer

View PDF

Abstract:Physics-informed neural networks (PINNs) are infamous for being hard to train. Recently, second-order methods based on natural gradient and Gauss-Newton methods have shown promising performance, improving the accuracy achieved by first-order methods by several orders of magnitude. While promising, the proposed methods only scale to networks with a few thousand parameters due to the high computational cost to evaluate, store, and invert the curvature matrix. We propose Kronecker-factored approximate curvature (KFAC) for PINN losses that greatly reduces the computational cost and allows scaling to much larger networks. Our approach goes beyond the established KFAC for traditional deep learning problems as it captures contributions from a PDE's differential operator that are crucial for optimization. To establish KFAC for such losses, we use Taylor-mode automatic differentiation to describe the differential operator's computation graph as a forward network with shared weights. This allows us to apply KFAC thanks to a recently-developed general formulation for networks with weight sharing. Empirically, we find that our KFAC-based optimizers are competitive with expensive second-order methods on small problems, scale more favorably to higher-dimensional neural networks and PDEs, and consistently outperform first-order methods and LBFGS.

Subjects:	Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
Cite as:	arXiv:2405.15603 [cs.LG]
	(or arXiv:2405.15603v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.15603
Journal reference:	Advances in Neural Information Processing Systems (NeurIPS) 2024

Submission history

From: Felix Dangel [view email]
[v1] Fri, 24 May 2024 14:36:02 UTC (6,122 KB)
[v2] Mon, 27 May 2024 14:23:46 UTC (6,115 KB)
[v3] Wed, 30 Oct 2024 15:53:30 UTC (25,565 KB)

Computer Science > Machine Learning

Title:Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators