Large-Scale Gradient-Free Deep Learning with Recursive Local Representation Alignment

Ororbia, Alexander; Mali, Ankur; Kifer, Daniel; Giles, C. Lee

arXiv:2002.03911v2 (cs)

[Submitted on 10 Feb 2020 (v1), revised 14 Jun 2020 (this version, v2), latest version 18 Sep 2020 (v3)]

Abstract: Training deep neural networks on large-scale datasets requires significant hardware resources whose costs (even on cloud platforms) put them out of reach of smaller organizations, groups, and individuals. Backpropagation (backprop), the workhorse for training these networks, is an inherently sequential process that is difficult to parallelize. Furthermore, it requires researchers to continually develop various tricks, such as specialized weight initializations and activation functions, in order to ensure a stable parameter optimization. Our goal is to seek an effective, parallelizable alternative to backprop that can be used to train deep networks. In this paper, we propose a gradient-free learning procedure, recursive local representation alignment, for training large-scale neural architectures. Experiments with deep residual networks on CIFAR-10 and the large-scale benchmark, ImageNet, show that our algorithm generalizes as well as backprop while converging sooner due to weight updates that are parallelizable and computationally less demanding. This is empirical evidence that a backprop-free algorithm can scale up to larger datasets. Another contribution is that we also significantly reduce total parameter count of our networks by utilizing fast, fixed noise maps in place of convolutional operations without compromising generalization.

Comments:	Revised submission. Additional experimental results, revisions, and substantial appendix with further details
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2002.03911 [cs.LG]
	(or arXiv:2002.03911v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.03911

Submission history

From: Alexander Ororbia
[v1] Mon, 10 Feb 2020 16:20:02 UTC (393 KB)
[v2] Sun, 14 Jun 2020 04:46:28 UTC (422 KB)
[v3] Fri, 18 Sep 2020 06:16:08 UTC (524 KB)
