Knowledge Distillation with Refined Logits

Sun, Wujie; Chen, Defang; Lyu, Siwei; Chen, Genlang; Chen, Chun; Wang, Can

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.07703 (cs)

[Submitted on 14 Aug 2024 (v1), last revised 19 Aug 2024 (this version, v2)]

Title:Knowledge Distillation with Refined Logits

Authors:Wujie Sun, Defang Chen, Siwei Lyu, Genlang Chen, Chun Chen, Can Wang

View PDF HTML (experimental)

Abstract:Recent research on knowledge distillation has increasingly focused on logit distillation because of its simplicity, effectiveness, and versatility in model compression. In this paper, we introduce Refined Logit Distillation (RLD) to address the limitations of current logit distillation methods. Our approach is motivated by the observation that even high-performing teacher models can make incorrect predictions, creating a conflict between the standard distillation loss and the cross-entropy loss. This conflict can undermine the consistency of the student model's learning objectives. Previous attempts to use labels to empirically correct teacher predictions may undermine the class correlation. In contrast, our RLD employs labeling information to dynamically refine teacher logits. In this way, our method can effectively eliminate misleading information from the teacher while preserving crucial class correlations, thus enhancing the value and efficiency of distilled knowledge. Experimental results on CIFAR-100 and ImageNet demonstrate its superiority over existing methods. The code is provided at \text{this https URL}.

Comments:	11 pages, 7 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.07703 [cs.CV]
	(or arXiv:2408.07703v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.07703

Submission history

From: Wujie Sun [view email]
[v1] Wed, 14 Aug 2024 17:59:32 UTC (1,947 KB)
[v2] Mon, 19 Aug 2024 07:52:15 UTC (1,546 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Knowledge Distillation with Refined Logits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Knowledge Distillation with Refined Logits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators