Sequential Learning without Feedback

Hanawal, Manjesh; Szepesvari, Csaba; Saligrama, Venkatesh

Computer Science > Machine Learning

arXiv:1610.05394 (cs)

[Submitted on 18 Oct 2016]

Title:Sequential Learning without Feedback

Authors:Manjesh Hanawal, Csaba Szepesvari, Venkatesh Saligrama

View PDF

Abstract:In many security and healthcare systems a sequence of features/sensors/tests are used for detection and diagnosis. Each test outputs a prediction of the latent state, and carries with it inherent costs. Our objective is to {\it learn} strategies for selecting tests to optimize accuracy \& costs. Unfortunately it is often impossible to acquire in-situ ground truth annotations and we are left with the problem of unsupervised sensor selection (USS). We pose USS as a version of stochastic partial monitoring problem with an {\it unusual} reward structure (even noisy annotations are unavailable). Unsurprisingly no learner can achieve sublinear regret without further assumptions. To this end we propose the notion of weak-dominance. This is a condition on the joint probability distribution of test outputs and latent state and says that whenever a test is accurate on an example, a later test in the sequence is likely to be accurate as well. We empirically verify that weak dominance holds on real datasets and prove that it is a maximal condition for achieving sublinear regret. We reduce USS to a special case of multi-armed bandit problem with side information and develop polynomial time algorithms that achieve sublinear regret.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1610.05394 [cs.LG]
	(or arXiv:1610.05394v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1610.05394

Submission history

From: Venkatesh Saligrama [view email]
[v1] Tue, 18 Oct 2016 01:15:57 UTC (228 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2016-10

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Manjesh Kumar Hanawal
Csaba Szepesvári
Venkatesh Saligrama

export BibTeX citation

Computer Science > Machine Learning

Title:Sequential Learning without Feedback

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sequential Learning without Feedback

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators