DHP: Differentiable Meta Pruning via HyperNetworks

Li, Yawei; Gu, Shuhang; Zhang, Kai; Van Gool, Luc; Timofte, Radu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2003.13683 (cs)

[Submitted on 30 Mar 2020 (v1), last revised 1 Aug 2020 (this version, v3)]

Title:DHP: Differentiable Meta Pruning via HyperNetworks

Authors:Yawei Li, Shuhang Gu, Kai Zhang, Luc Van Gool, Radu Timofte

View PDF

Abstract:Network pruning has been the driving force for the acceleration of neural networks and the alleviation of model storage/transmission burden. With the advent of AutoML and neural architecture search (NAS), pruning has become topical with automatic mechanism and searching based architecture optimization. Yet, current automatic designs rely on either reinforcement learning or evolutionary algorithm. Due to the non-differentiability of those algorithms, the pruning algorithm needs a long searching stage before reaching the convergence.
To circumvent this problem, this paper introduces a differentiable pruning method via hypernetworks for automatic network pruning. The specifically designed hypernetworks take latent vectors as input and generate the weight parameters of the backbone network. The latent vectors control the output channels of the convolutional layers in the backbone network and act as a handle for the pruning of the layers. By enforcing $\ell_1$ sparsity regularization to the latent vectors and utilizing proximal gradient solver, sparse latent vectors can be obtained. Passing the sparsified latent vectors through the hypernetworks, the corresponding slices of the generated weight parameters can be removed, achieving the effect of network pruning. The latent vectors of all the layers are pruned together, resulting in an automatic layer configuration. Extensive experiments are conducted on various networks for image classification, single image super-resolution, and denoising. And the experimental results validate the proposed method.

Comments:	ECCV camera-ready. Code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2003.13683 [cs.CV]
	(or arXiv:2003.13683v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2003.13683

Submission history

From: Yawei Li [view email]
[v1] Mon, 30 Mar 2020 17:59:18 UTC (4,698 KB)
[v2] Fri, 17 Jul 2020 11:16:27 UTC (5,498 KB)
[v3] Sat, 1 Aug 2020 10:59:30 UTC (2,728 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DHP: Differentiable Meta Pruning via HyperNetworks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DHP: Differentiable Meta Pruning via HyperNetworks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators