Warped geometric information on the optimisation of Euclidean functions

Hartmann, Marcelo; Williams, Bernardo; Yu, Hanlin; Girolami, Mark; Barp, Alessandro; Klami, Arto

Statistics > Machine Learning

arXiv:2308.08305 (stat)

[Submitted on 16 Aug 2023 (v1), last revised 18 Mar 2024 (this version, v2)]

Title:Warped geometric information on the optimisation of Euclidean functions

Authors:Marcelo Hartmann, Bernardo Williams, Hanlin Yu, Mark Girolami, Alessandro Barp, Arto Klami

View PDF HTML (experimental)

Abstract:We consider the fundamental task of optimising a real-valued function defined in a potentially high-dimensional Euclidean space, such as the loss function in many machine-learning tasks or the logarithm of the probability distribution in statistical inference. We use Riemannian geometry notions to redefine the optimisation problem of a function on the Euclidean space to a Riemannian manifold with a warped metric, and then find the function's optimum along this manifold. The warped metric chosen for the search domain induces a computational friendly metric-tensor for which optimal search directions associated with geodesic curves on the manifold becomes easier to compute. Performing optimization along geodesics is known to be generally infeasible, yet we show that in this specific manifold we can analytically derive Taylor approximations up to third-order. In general these approximations to the geodesic curve will not lie on the manifold, however we construct suitable retraction maps to pull them back onto the manifold. Therefore, we can efficiently optimize along the approximate geodesic curves. We cover the related theory, describe a practical optimization algorithm and empirically evaluate it on a collection of challenging optimisation benchmarks. Our proposed algorithm, using 3rd-order approximation of geodesics, tends to outperform standard Euclidean gradient-based counterparts in term of number of iterations until convergence.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2308.08305 [stat.ML]
	(or arXiv:2308.08305v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2308.08305

Submission history

From: Marcelo Hartmann [view email]
[v1] Wed, 16 Aug 2023 12:08:50 UTC (1,735 KB)
[v2] Mon, 18 Mar 2024 18:16:00 UTC (2,040 KB)

Statistics > Machine Learning

Title:Warped geometric information on the optimisation of Euclidean functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Warped geometric information on the optimisation of Euclidean functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators