Memorized Sparse Backpropagation

Zhang, Zhiyuan; Yang, Pengcheng; Ren, Xuancheng; Su, Qi; Sun, Xu

doi:10.1016/j.neucom.2020.08.055

arXiv:1905.10194 (cs)

[Submitted on 24 May 2019 (v1), last revised 27 Oct 2020 (this version, v3)]

Abstract: Neural network learning is usually time-consuming, since backpropagation needs to compute full gradients and backpropagate them across multiple layers. Despite the success of existing work in accelerating backpropagation through sparseness, the relevant theoretical characteristics remain under-researched, and empirical studies have found that these methods suffer from the loss of information contained in unpropagated gradients. To tackle these problems, this paper presents a unified sparse backpropagation framework and provides a detailed analysis of its theoretical characteristics. The analysis reveals that, when applied to a multilayer perceptron, the framework essentially performs gradient descent using an estimated gradient that is sufficiently close to the true gradient, resulting in convergence in probability under certain conditions. Furthermore, a simple yet effective algorithm named memorized sparse backpropagation (MSBP) is proposed to remedy the information loss by storing unpropagated gradients in memory for use in subsequent learning steps. Experimental results demonstrate that the proposed MSBP is effective in alleviating the information loss of traditional sparse backpropagation while achieving comparable acceleration.
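
The memory idea described in the abstract can be illustrated with a minimal NumPy sketch. It rests on assumptions the abstract does not confirm: that sparsification keeps the `k` largest-magnitude entries of a gradient vector, and that the stored remainder is simply added to the next step's gradient. The function name `memorized_topk` and its parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def memorized_topk(grad, memory, k):
    """Sparsify a 1-D gradient, carrying unpropagated entries in memory.

    Assumption (not from the paper): top-k magnitude selection, with the
    remainder added unchanged to the next step's gradient.
    Returns (sparse_grad, new_memory).
    """
    # Fold the previously unpropagated gradient into the current one.
    combined = grad + memory
    if k >= combined.size:
        # Nothing is dropped, so nothing needs to be remembered.
        return combined.copy(), np.zeros_like(combined)
    # Indices of the k entries with the largest absolute value.
    keep = np.argpartition(np.abs(combined), -k)[-k:]
    sparse = np.zeros_like(combined)
    sparse[keep] = combined[keep]
    # Whatever was not propagated is stored for the following step.
    new_memory = combined - sparse
    return sparse, new_memory


# Two illustrative steps on a toy gradient vector.
memory = np.zeros(4)
step1, memory = memorized_topk(np.array([0.1, -3.0, 0.5, 2.0]), memory, k=2)
step2, memory = memorized_topk(np.array([0.1, 0.0, 0.6, 0.0]), memory, k=1)
```

In this sketch the propagated part and the stored part always sum to the incoming gradient plus the previous memory, so no gradient mass is discarded; it is only delayed. Whether the paper scales or decays the memory before reuse is not stated in the abstract.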

Comments:	Accepted to Neurocomputing
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.10194 [cs.LG]
	(or arXiv:1905.10194v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.10194
Journal reference:	Neurocomputing 415C (2020) pp. 397-407
Related DOI:	https://doi.org/10.1016/j.neucom.2020.08.055

Submission history

From: Zhiyuan Zhang
[v1] Fri, 24 May 2019 12:38:31 UTC (467 KB)
[v2] Sat, 1 Jun 2019 05:18:14 UTC (467 KB)
[v3] Tue, 27 Oct 2020 05:08:14 UTC (2,286 KB)
