Randomized Confidence Bounds for Stochastic Partial Monitoring

Heuillet, Maxime; Ahmad, Ola; Durand, Audrey

Computer Science > Machine Learning

arXiv:2402.05002 (cs)

[Submitted on 7 Feb 2024 (v1), last revised 15 May 2024 (this version, v2)]

Title:Randomized Confidence Bounds for Stochastic Partial Monitoring

Authors:Maxime Heuillet, Ola Ahmad, Audrey Durand

View PDF

Abstract:The partial monitoring (PM) framework provides a theoretical formulation of sequential learning problems with incomplete feedback. On each round, a learning agent plays an action while the environment simultaneously chooses an outcome. The agent then observes a feedback signal that is only partially informative about the (unobserved) outcome. The agent leverages the received feedback signals to select actions that minimize the (unobserved) cumulative loss. In contextual PM, the outcomes depend on some side information that is observable by the agent before selecting the action on each round. In this paper, we consider the contextual and non-contextual PM settings with stochastic outcomes. We introduce a new class of PM strategies based on the randomization of deterministic confidence bounds. We also extend regret guarantees to settings where existing stochastic strategies are not applicable. Our experiments show that the proposed RandCBP and RandCBPsidestar strategies have favorable performance against state-of-the-art baselines in multiple PM games. To advocate for the adoption of the PM framework, we design a use case on the real-world problem of monitoring the error rate of any deployed classification system.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.05002 [cs.LG]
	(or arXiv:2402.05002v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.05002

Submission history

From: Maxime Heuillet [view email]
[v1] Wed, 7 Feb 2024 16:18:59 UTC (636 KB)
[v2] Wed, 15 May 2024 20:10:05 UTC (788 KB)

Computer Science > Machine Learning

Title:Randomized Confidence Bounds for Stochastic Partial Monitoring

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Randomized Confidence Bounds for Stochastic Partial Monitoring

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators