Effective Layer Pruning Through Similarity Metric Perspective

Pons, Ian; Yamamoto, Bruno; Costa, Anna H. Reali; Jordao, Artur

Computer Science > Machine Learning

arXiv:2405.17081 (cs)

[Submitted on 27 May 2024 (v1), last revised 4 Nov 2024 (this version, v2)]

Title:Effective Layer Pruning Through Similarity Metric Perspective

Authors:Ian Pons, Bruno Yamamoto, Anna H. Reali Costa, Artur Jordao

View PDF HTML (experimental)

Abstract:Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction, most efforts focus on removing weights or filters. Studies have also been devoted to layer pruning as it promotes superior computational gains. However, layer pruning often hurts the network predictive ability (i.e., accuracy) at high compression rates. This work introduces an effective layer-pruning strategy that meets all underlying properties pursued by pruning methods. Our method estimates the relative importance of a layer using the Centered Kernel Alignment (CKA) metric, employed to measure the similarity between the representations of the unpruned model and a candidate layer for pruning. We confirm the effectiveness of our method on standard architectures and benchmarks, in which it outperforms existing layer-pruning strategies and other state-of-the-art pruning techniques. Particularly, we remove more than 75% of computation while improving predictive ability. At higher compression regimes, our method exhibits negligible accuracy drop, while other methods notably deteriorate model accuracy. Apart from these benefits, our pruned models exhibit robustness to adversarial and out-of-distribution samples.

Comments:	Published at International Conference on Pattern Recognition (ICPR), 2024. Oral presentation
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2405.17081 [cs.LG]
	(or arXiv:2405.17081v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17081

Submission history

From: Ian Pons [view email]
[v1] Mon, 27 May 2024 11:54:51 UTC (1,358 KB)
[v2] Mon, 4 Nov 2024 18:39:10 UTC (1,358 KB)

Computer Science > Machine Learning

Title:Effective Layer Pruning Through Similarity Metric Perspective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Effective Layer Pruning Through Similarity Metric Perspective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators