Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

Sun, Zheng-zhi; Ran, Shi-ju; Su, Gang

doi:10.1103/PhysRevE.102.012152

Computer Science > Machine Learning

arXiv:2001.04029 (cs)

[Submitted on 10 Jan 2020]

Title:Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

Authors:Zheng-zhi Sun, Shi-ju Ran, Gang Su

View PDF

Abstract:The gradient-based optimization method for deep machine learning models suffers from gradient vanishing and exploding problems, particularly when the computational graph becomes deep. In this work, we propose the tangent-space gradient optimization (TSGO) for the probabilistic models to keep the gradients from vanishing or exploding. The central idea is to guarantee the orthogonality between the variational parameters and the gradients. The optimization is then implemented by rotating parameter vector towards the direction of gradient. We explain and testify TSGO in tensor network (TN) machine learning, where the TN describes the joint probability distribution as a normalized state $\left| \psi \right\rangle $ in Hilbert space. We show that the gradient can be restricted in the tangent space of $\left\langle \psi \right.\left| \psi \right\rangle = 1$ hyper-sphere. Instead of additional adaptive methods to control the learning rate in deep learning, the learning rate of TSGO is naturally determined by the angle $\theta $ as $\eta = \tan \theta $. Our numerical results reveal better convergence of TSGO in comparison to the off-the-shelf Adam.

Comments:	5 pages, 4 figures
Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
Cite as:	arXiv:2001.04029 [cs.LG]
	(or arXiv:2001.04029v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2001.04029
Journal reference:	Phys. Rev. E 102, 012152 (2020)
Related DOI:	https://doi.org/10.1103/PhysRevE.102.012152

Submission history

From: Zhengzhi Sun [view email]
[v1] Fri, 10 Jan 2020 16:40:40 UTC (184 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-01

Change to browse by:

cond-mat
cond-mat.dis-nn
cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zheng-Zhi Sun
Shi-Ju Ran
Gang Su

export BibTeX citation

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Machine Learning

Title:Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Machine Learning

Title:Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators