Neural Networks with Few Multiplications

Lin, Zhouhan; Courbariaux, Matthieu; Memisevic, Roland; Bengio, Yoshua

arXiv:1510.03009 (cs)

[Submitted on 11 Oct 2015 (v1), last revised 26 Feb 2016 (this version, v3)]

Abstract: Training most deep learning algorithms is notoriously time-consuming. Since most of the computation in training neural networks is typically spent on floating-point multiplications, we investigate an approach to training that eliminates the need for most of them. Our method consists of two parts. First, we stochastically binarize weights to convert the multiplications involved in computing hidden states into sign changes. Second, while back-propagating error derivatives, in addition to binarizing the weights, we quantize the representations at each layer to convert the remaining multiplications into binary shifts. Experimental results across three popular datasets (MNIST, CIFAR10, SVHN) show that this approach not only preserves classification performance but can even outperform standard stochastic gradient descent training, paving the way to fast, hardware-friendly training of neural networks.
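The two ideas in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract gives no formulas, so the sampling rule P(+1) = (w + 1) / 2 on weights clipped to [-1, 1], the round-in-the-log-domain quantizer, and all function names below are assumptions chosen for illustration. The sketch uses only the Python standard library and shows (a) a dot product with binarized weights that needs only additions and sign changes, and (b) multiplication by a power-of-two value carried out as an exponent shift.

```python
import math
import random


def stochastic_binarize(w, rng):
    """Sample +1.0 or -1.0 for weight w, with P(+1) = (clip(w) + 1) / 2.

    The clipping range [-1, 1] and this probability rule are assumptions.
    """
    w = max(-1.0, min(1.0, w))
    return 1.0 if rng.random() < (w + 1.0) / 2.0 else -1.0


def binary_dot(xs, binary_ws):
    """Dot product with +/-1 weights: each term is x or -x, then summed."""
    total = 0.0
    for x, b in zip(xs, binary_ws):
        total += x if b > 0 else -x
    return total


def quantize_pow2(x):
    """Replace x by a signed power of two, rounding log2(|x|) to an integer.

    Zero is passed through unchanged (an assumption of this sketch).
    """
    if x == 0.0:
        return 0.0
    exponent = round(math.log2(abs(x)))
    return math.copysign(math.ldexp(1.0, exponent), x)


def shift_multiply(g, q):
    """Compute g * q for a power-of-two q using an exponent shift only."""
    if q == 0.0:
        return 0.0
    # For a power of two, frexp returns mantissa 0.5, so |q| = 2 ** (exp - 1).
    _, exp = math.frexp(abs(q))
    shifted = math.ldexp(g, exp - 1)
    return shifted if q > 0 else -shifted


if __name__ == "__main__":
    rng = random.Random(0)
    weights = [0.8, -0.3, 0.1]
    inputs = [1.5, 2.0, -0.5]
    sampled = [stochastic_binarize(w, rng) for w in weights]
    print("sampled weights:", sampled)
    print("forward value:", binary_dot(inputs, sampled))
    print("quantized inputs:", [quantize_pow2(x) for x in inputs])
```

With this sampling rule the expected value of a binarized weight equals the clipped real-valued weight, which is one reason a stochastic scheme is a natural fit here. In hardware, `shift_multiply` would correspond to a bit shift on a fixed-point value or an exponent adjustment on a float; `math.ldexp` stands in for that operation.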

Comments:	Published as a conference paper at ICLR 2016. 9 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1510.03009 [cs.LG]
	(or arXiv:1510.03009v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1510.03009

Submission history

From: Zhouhan Lin
[v1] Sun, 11 Oct 2015 04:32:39 UTC (85 KB)
[v2] Mon, 9 Nov 2015 20:16:10 UTC (111 KB)
[v3] Fri, 26 Feb 2016 05:24:30 UTC (111 KB)
