Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Xu, Yinghao; Wei, Fangyun; Sun, Xiao; Yang, Ceyuan; Shen, Yujun; Dai, Bo; Zhou, Bolei; Lin, Stephen

Computer Science > Computer Vision and Pattern Recognition

arXiv:2112.09690 (cs)

[Submitted on 17 Dec 2021 (v1), last revised 18 Apr 2022 (this version, v2)]

Title:Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Authors:Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin

View PDF

Abstract:Semi-supervised action recognition is a challenging but important task due to the high cost of data annotation. A common approach to this problem is to assign unlabeled data with pseudo-labels, which are then used as additional supervision in training. Typically in recent work, the pseudo-labels are obtained by training a model on the labeled data, and then using confident predictions from the model to teach itself. In this work, we propose a more effective pseudo-labeling scheme, called Cross-Model Pseudo-Labeling (CMPL). Concretely, we introduce a lightweight auxiliary network in addition to the primary backbone, and ask them to predict pseudo-labels for each other. We observe that, due to their different structural biases, these two models tend to learn complementary representations from the same video clips. Each model can thus benefit from its counterpart by utilizing cross-model predictions as supervision. Experiments on different data partition protocols demonstrate the significant improvement of our framework over existing alternatives. For example, CMPL achieves $17.6\%$ and $25.1\%$ Top-1 accuracy on Kinetics-400 and UCF-101 using only the RGB modality and $1\%$ labeled data, outperforming our baseline model, FixMatch, by $9.0\%$ and $10.3\%$, respectively.

Comments:	CVPR 2022 camera-ready, Project webpage: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.09690 [cs.CV]
	(or arXiv:2112.09690v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.09690

Submission history

From: Yinghao Xu [view email]
[v1] Fri, 17 Dec 2021 18:59:41 UTC (868 KB)
[v2] Mon, 18 Apr 2022 12:03:08 UTC (869 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators