Online Learning with Unknown Constraints

Sridharan, Karthik; Yoo, Seung Won Wilson

arXiv:2403.04033 (cs)

[Submitted on 6 Mar 2024]


Abstract: We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight while simultaneously satisfying the safety constraint with high probability on each round. We provide a general meta-algorithm that leverages an online regression oracle to estimate the unknown safety constraint, and converts the predictions of an online learning oracle to predictions that adhere to the unknown safety constraint. On the theoretical side, our algorithm's regret can be bounded by the regret of the online regression and online learning oracles, the eluder dimension of the model class containing the unknown safety constraint, and a novel complexity measure that captures the difficulty of safe learning. We complement our result with an asymptotic lower bound that shows that the aforementioned complexity measure is necessary. When the constraints are linear, we instantiate our result to provide a concrete algorithm with $\sqrt{T}$ regret using a scaling transformation that balances optimistic exploration with pessimistic constraint satisfaction.
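To make the idea of pessimistic constraint satisfaction by scaling more concrete, here is a minimal Python sketch. It is not the paper's algorithm, and it covers only the pessimistic side, not the optimistic exploration. It assumes a single linear constraint of the form <theta, x> <= budget with budget > 0 (so the zero action is always safe), and it assumes the unknown theta lies in a Euclidean ball of a given radius around an estimate theta_hat, which is a simplification of the confidence sets a regression oracle would typically yield. The names pessimistic_value and scale_to_safe are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def pessimistic_value(action: Sequence[float],
                      theta_hat: Sequence[float],
                      radius: float) -> float:
    """Largest value of <theta, action> over all theta with
    ||theta - theta_hat||_2 <= radius (assumed confidence ball)."""
    inner = sum(t * a for t, a in zip(theta_hat, action))
    norm = math.sqrt(sum(a * a for a in action))
    return inner + radius * norm


def scale_to_safe(action: Sequence[float],
                  theta_hat: Sequence[float],
                  radius: float,
                  budget: float) -> Tuple[List[float], float]:
    """Shrink `action` toward the origin until the worst-case constraint
    value is at most `budget`. Returns the scaled action and the factor."""
    if budget <= 0:
        raise ValueError("this sketch assumes budget > 0 so that 0 is safe")
    worst = pessimistic_value(action, theta_hat, radius)
    # The worst-case value scales linearly with a nonnegative factor,
    # so the largest safe factor has a closed form.
    gamma = 1.0 if worst <= budget else budget / worst
    return [gamma * a for a in action], gamma


# Hypothetical numbers: the oracle proposes an action that may be unsafe.
proposed = [2.0, 0.0]
safe_action, gamma = scale_to_safe(proposed, theta_hat=[1.0, 0.0],
                                   radius=0.5, budget=1.0)
```

Because the worst-case constraint value is positively homogeneous in the action, the scaled action sits exactly on the pessimistic boundary whenever the proposal is not already certified safe. As the confidence radius shrinks with more feedback, the scaling factor approaches the one that would be used if theta were known.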

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as:	arXiv:2403.04033 [cs.LG]
	(or arXiv:2403.04033v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.04033

Submission history

From: Seung Won Yoo
[v1] Wed, 6 Mar 2024 20:23:59 UTC (474 KB)
