A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

Sun, Feng; Xie, Ming-Kun; Huang, Sheng-Jun

doi:10.1007/s11633-023-1439-3

Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.02410 (cs)

[Submitted on 6 Jul 2022 (v1), last revised 6 May 2024 (this version, v2)]

Title:A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

Authors:Feng Sun, Ming-Kun Xie, Sheng-Jun Huang

View PDF HTML (experimental)

Abstract:In this paper, we study the partial multi-label (PML) image classification problem, where each image is annotated with a candidate label set consists of multiple relevant labels and other noisy labels. Existing PML methods typically design a disambiguation strategy to filter out noisy labels by utilizing prior knowledge with extra assumptions, which unfortunately is unavailable in many real tasks. Furthermore, because the objective function for disambiguation is usually elaborately designed on the whole training set, it can be hardly optimized in a deep model with SGD on mini-batches. In this paper, for the first time we propose a deep model for PML to enhance the representation and discrimination ability. On one hand, we propose a novel curriculum based disambiguation strategy to progressively identify ground-truth labels by incorporating the varied difficulties of different classes. On the other hand, a consistency regularization is introduced for model retraining to balance fitting identified easy labels and exploiting potential relevant labels. Extensive experimental results on the commonly used benchmark datasets show the proposed method significantly outperforms the SOTA methods.

Comments:	12 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2207.02410 [cs.CV]
	(or arXiv:2207.02410v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.02410
Journal reference:	Machine Intelligence Research, 2023
Related DOI:	https://doi.org/10.1007/s11633-023-1439-3

Submission history

From: Ming-Kun Xie [view email]
[v1] Wed, 6 Jul 2022 02:49:02 UTC (199 KB)
[v2] Mon, 6 May 2024 07:33:24 UTC (199 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators