Positive Unlabeled Contrastive Learning

Acharya, Anish; Sanghavi, Sujay; Jing, Li; Bhushanam, Bhargav; Choudhary, Dhruv; Rabbat, Michael; Dhillon, Inderjit

arXiv:2206.01206 (cs)

[Submitted on 1 Jun 2022 (v1), last revised 28 Mar 2024 (this version, v3)]

Abstract: Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting, where the task is to learn a binary classifier given only a few labeled positive samples and (often) a large number of unlabeled samples, each of which could be positive or negative.
We first propose a simple extension of the standard InfoNCE family of contrastive losses to the PU setting, and show that it learns superior representations compared to existing unsupervised and supervised approaches. We then develop a simple methodology to pseudo-label the unlabeled samples using a new PU-specific clustering scheme; these pseudo-labels can then be used to train the final (positive vs. negative) classifier. Our method handily outperforms state-of-the-art PU methods on several standard PU benchmark datasets, while not requiring a priori knowledge of any class prior (a common assumption in other PU methods). We also provide a simple theoretical analysis that motivates our methods.
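
The abstract does not spell out the form of the PU contrastive loss, so the following is only an illustrative sketch of one plausible reading, not the authors' implementation. It assumes a SimCLR-style InfoNCE objective over two augmented views per sample, in which an unlabeled anchor is attracted only to its own other view, while a labeled positive anchor is additionally attracted to every other labeled positive in the batch. The function name `pu_info_nce`, the `temperature` default, and the mask construction are all assumptions made for illustration.

```python
import numpy as np

def pu_info_nce(z1, z2, is_labeled_pos, temperature=0.5):
    """Hypothetical PU-aware InfoNCE loss (illustrative sketch only).

    z1, z2:          (n, d) embeddings of two augmented views of the same n samples.
    is_labeled_pos:  (n,) boolean mask, True where the sample is a labeled positive.
    """
    is_labeled_pos = np.asarray(is_labeled_pos, dtype=bool)
    n = z1.shape[0]

    # Stack both views and project onto the unit sphere (cosine similarity).
    z = np.concatenate([z1, z2], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = z @ z.T / temperature
    np.fill_diagonal(sim, -np.inf)  # an anchor is never contrasted with itself

    # Numerically stable log-softmax over all other views in the batch.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(
        np.exp(sim - row_max).sum(axis=1, keepdims=True)
    )

    # Positive-pair mask: every anchor is paired with its other view.
    idx = np.arange(2 * n)
    pair = np.zeros((2 * n, 2 * n), dtype=bool)
    pair[idx, (idx + n) % (2 * n)] = True

    # Assumed PU extension: labeled positives also attract one another.
    lab = np.concatenate([is_labeled_pos, is_labeled_pos])
    pair |= np.outer(lab, lab)
    np.fill_diagonal(pair, False)

    # Average the negative log-probability over each anchor's positive set.
    per_anchor = -np.where(pair, log_prob, 0.0).sum(axis=1) / pair.sum(axis=1)
    return per_anchor.mean()


# Usage with random embeddings: 8 samples, the first 3 are labeled positives.
rng = np.random.default_rng(0)
views_a = rng.normal(size=(8, 16))
views_b = rng.normal(size=(8, 16))
labeled = np.array([True, True, True, False, False, False, False, False])
loss = pu_info_nce(views_a, views_b, labeled)
```

With no labeled positives in the batch, this sketch reduces to the ordinary self-supervised InfoNCE loss, which is the sense in which it "extends" the standard objective.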

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.01206 [cs.LG]
	(or arXiv:2206.01206v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.01206

Submission history

From: Anish Acharya
[v1] Wed, 1 Jun 2022 20:16:32 UTC (2,299 KB)
[v2] Tue, 15 Aug 2023 11:13:59 UTC (4,314 KB)
[v3] Thu, 28 Mar 2024 23:25:14 UTC (2,299 KB)
