Learning from Binary Labels with Instance-Dependent Corruption

Menon, Aditya Krishna; van Rooyen, Brendan; Natarajan, Nagarajan

Computer Science > Machine Learning

arXiv:1605.00751 (cs)

[Submitted on 3 May 2016 (v1), last revised 4 May 2016 (this version, v2)]

Title: Learning from Binary Labels with Instance-Dependent Corruption

Authors: Aditya Krishna Menon, Brendan van Rooyen, Nagarajan Natarajan

Abstract: Suppose we have a sample of instances paired with binary labels corrupted by arbitrary instance- and label-dependent noise. With sufficiently many such samples, can we optimally classify and rank instances with respect to the noise-free distribution? We provide a theoretical analysis of this question, with three main contributions. First, we prove that for instance-dependent noise, any algorithm that is consistent for classification on the noisy distribution is also consistent on the clean distribution. Second, we prove that for a broad class of instance- and label-dependent noise, a similar consistency result holds for the area under the ROC curve. Third, for the latter noise model, when the noise-free class-probability function belongs to the generalised linear model family, we show that the Isotron can efficiently and provably learn from the corrupted sample.
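
The first contribution can be illustrated with a small numerical sketch. It is not taken from the paper: the class-probability function `eta`, the flip-probability function `rho`, and the helper names are all hypothetical choices made for illustration. The assumption is that each label is flipped with a probability rho(x) that depends only on the instance and stays strictly below 1/2. Under that assumption the noisy class-probability satisfies noisy_eta(x) - 1/2 = (1 - 2 rho(x)) (eta(x) - 1/2), so thresholding at 1/2 gives the same classifier on the noisy and clean distributions.

```python
import math
import random


def eta(x):
    """Hypothetical clean class-probability P(Y = 1 | X = x)."""
    return 1.0 / (1.0 + math.exp(-3.0 * x))


def rho(x):
    """Hypothetical instance-dependent flip probability, always below 1/2."""
    return 0.45 * math.exp(-x * x)


def noisy_eta(x):
    """Class-probability of the corrupted label.

    The observed label is 1 if the clean label was 1 and was not flipped,
    or if the clean label was 0 and was flipped.
    """
    return (1.0 - rho(x)) * eta(x) + rho(x) * (1.0 - eta(x))


def bayes_predictions_agree(xs):
    """Check that thresholding eta and noisy_eta at 1/2 gives identical labels."""
    return all((eta(x) > 0.5) == (noisy_eta(x) > 0.5) for x in xs)


random.seed(0)
xs = [random.uniform(-3.0, 3.0) for _ in range(10_000)]
print(bayes_predictions_agree(xs))
```

The sketch only compares the two Bayes-optimal decision rules at sampled points; it does not train a classifier, and it does not cover the label-dependent noise model or the Isotron result described in the abstract.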

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1605.00751 [cs.LG]
	(or arXiv:1605.00751v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1605.00751

Submission history

From: Aditya Menon
[v1] Tue, 3 May 2016 04:47:02 UTC (364 KB)
[v2] Wed, 4 May 2016 04:59:21 UTC (361 KB)

