CoKe: Localized Contrastive Learning for Robust Keypoint Detection

Bai, Yutong; Wang, Angtian; Kortylewski, Adam; Yuille, Alan

arXiv:2009.14115 (cs)

[Submitted on 29 Sep 2020 (v1), last revised 5 Dec 2022 (this version, v4)]

Abstract: In this paper, we introduce a contrastive learning framework for keypoint detection (CoKe). Keypoint detection differs from other visual tasks where contrastive learning has been applied because the input is a set of images in which multiple keypoints are annotated. This requires the contrastive learning to be extended such that the keypoints are represented and detected independently, which enables the contrastive loss to make the keypoint features different from each other and from the background. Our approach has two benefits: it enables us to exploit contrastive learning for keypoint detection, and by detecting each keypoint independently the detection becomes more robust to occlusion compared to holistic methods, such as stacked hourglass networks, which attempt to detect all keypoints jointly. Our CoKe framework introduces several technical innovations. In particular, we introduce: (i) a clutter bank to represent non-keypoint features; (ii) a keypoint bank that stores prototypical representations of keypoints to approximate the contrastive loss between keypoints; and (iii) a cumulative moving average update to learn the keypoint prototypes while training the feature extractor. Our experiments on a range of diverse datasets (PASCAL3D+, MPII, ObjectNet3D) show that our approach performs as well as, or better than, alternative methods for keypoint detection, even for human keypoints, for which the literature is vast. Moreover, we observe that CoKe is exceptionally robust to partial occlusion and previously unseen object poses.
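The three components named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class `KeypointBank`, the function `contrastive_loss`, the softmax form of the loss, the L2 normalization, and the `temperature` value are all assumptions made for illustration. Only the ideas of a keypoint bank, a clutter bank, and a cumulative moving average update come from the abstract.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


class KeypointBank:
    """Hypothetical store holding one prototype vector per keypoint."""

    def __init__(self, num_keypoints, dim):
        self.prototypes = [[0.0] * dim for _ in range(num_keypoints)]
        self.counts = [0] * num_keypoints

    def update(self, k, feature):
        """Cumulative moving average: new_mean = old_mean + (x - old_mean) / n."""
        self.counts[k] += 1
        n = self.counts[k]
        old = self.prototypes[k]
        self.prototypes[k] = [m + (x - m) / n for m, x in zip(old, feature)]


def contrastive_loss(feature, k, bank, clutter_bank, temperature=0.1):
    """Assumed softmax-style loss for one feature annotated as keypoint k.

    The positive is the prototype of keypoint k; the negatives are the other
    keypoint prototypes plus the clutter (non-keypoint) features.
    """
    f = l2_normalize(feature)
    negatives = [p for i, p in enumerate(bank.prototypes) if i != k]
    negatives += clutter_bank
    pos = dot(f, l2_normalize(bank.prototypes[k])) / temperature
    logits = [pos] + [dot(f, l2_normalize(n)) / temperature for n in negatives]
    # Numerically stable log-sum-exp over all logits.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - pos


# Usage: two keypoints in a 2-D feature space and one clutter feature.
bank = KeypointBank(num_keypoints=2, dim=2)
bank.update(0, [1.0, 0.0])
bank.update(1, [0.0, 1.0])
clutter_bank = [[-1.0, 0.0]]

loss_match = contrastive_loss([1.0, 0.0], 0, bank, clutter_bank)
loss_mismatch = contrastive_loss([0.0, 1.0], 0, bank, clutter_bank)
```

In this toy setup a feature aligned with its own prototype yields a lower loss than one aligned with a different keypoint's prototype, which is the behavior a loss of this kind is meant to encourage. Because each keypoint is scored against its own prototype, the sketch also reflects the abstract's point that keypoints are represented and detected independently.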

Comments:	Accepted to WACV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2009.14115 [cs.CV]
	(or arXiv:2009.14115v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.14115

Submission history

From: Adam Kortylewski
[v1] Tue, 29 Sep 2020 16:00:43 UTC (5,413 KB)
[v2] Wed, 30 Sep 2020 01:32:46 UTC (5,414 KB)
[v3] Mon, 23 Nov 2020 16:22:35 UTC (7,991 KB)
[v4] Mon, 5 Dec 2022 08:56:16 UTC (3,567 KB)
