DMCP: Differentiable Markov Channel Pruning for Neural Networks

Guo, Shaopeng; Wang, Yujie; Li, Quanquan; Yan, Junjie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2005.03354 (cs)

[Submitted on 7 May 2020 (v1), last revised 8 May 2020 (this version, v2)]

Title:DMCP: Differentiable Markov Channel Pruning for Neural Networks

Authors:Shaopeng Guo, Yujie Wang, Quanquan Li, Junjie Yan

View PDF

Abstract:Recent works imply that the channel pruning can be regarded as searching optimal sub-structure from unpruned networks. However, existing works based on this observation require training and evaluating a large number of structures, which limits their application. In this paper, we propose a novel differentiable method for channel pruning, named Differentiable Markov Channel Pruning (DMCP), to efficiently search the optimal sub-structure. Our method is differentiable and can be directly optimized by gradient descent with respect to standard task loss and budget regularization (e.g. FLOPs constraint). In DMCP, we model the channel pruning as a Markov process, in which each state represents for retaining the corresponding channel during pruning, and transitions between states denote the pruning process. In the end, our method is able to implicitly select the proper number of channels in each layer by the Markov process with optimized transitions. To validate the effectiveness of our method, we perform extensive experiments on Imagenet with ResNet and MobilenetV2. Results show our method can achieve consistent improvement than state-of-the-art pruning methods in various FLOPs settings. The code is available at this https URL

Comments:	CVPR2020 Oral. Code has been released at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2005.03354 [cs.CV]
	(or arXiv:2005.03354v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2005.03354

Submission history

From: Shaopeng Guo [view email]
[v1] Thu, 7 May 2020 09:39:55 UTC (895 KB)
[v2] Fri, 8 May 2020 03:41:52 UTC (906 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:DMCP: Differentiable Markov Channel Pruning for Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:DMCP: Differentiable Markov Channel Pruning for Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators