Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training

Suman, Uttam; Mamajiwala, Mariya; Saxena, Mukul; Tyagi, Ankit; Roy, Debasish

Computer Science > Machine Learning

arXiv:2410.14270 (cs)

[Submitted on 18 Oct 2024]

Title:Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training

Authors:Uttam Suman, Mariya Mamajiwala, Mukul Saxena, Ankit Tyagi, Debasish Roy

View PDF HTML (experimental)

Abstract:Our proposal is on a new stochastic optimizer for non-convex and possibly non-smooth objective functions typically defined over large dimensional design spaces. Towards this, we have tried to bridge noise-assisted global search and faster local convergence, the latter being the characteristic feature of a Newton-like search. Our specific scheme -- acronymed FINDER (Filtering Informed Newton-like and Derivative-free Evolutionary Recursion), exploits the nonlinear stochastic filtering equations to arrive at a derivative-free update that has resemblance with the Newton search employing the inverse Hessian of the objective function. Following certain simplifications of the update to enable a linear scaling with dimension and a few other enhancements, we apply FINDER to a range of problems, starting with some IEEE benchmark objective functions to a couple of archetypal data-driven problems in deep networks to certain cases of physics-informed deep networks. The performance of the new method vis-á-vis the well-known Adam and a few others bears evidence to its promise and potentialities for large dimensional optimization problems of practical interest.

Comments:	19 pages, 12 figures, 3 tables
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2410.14270 [cs.LG]
	(or arXiv:2410.14270v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.14270

Submission history

From: Uttam Suman [view email]
[v1] Fri, 18 Oct 2024 08:25:28 UTC (332 KB)

Computer Science > Machine Learning

Title:Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators